Links: | BOTTOM |
|
reference predict_h11814 (Jan 22, 2001 06:49:59) reference pred_h11814 (Jan 22, 2001 06:50:05) PPhdr from: pagomez@cnb.uam.es PPhdr resp: MAIL PPhdr orig: HTML PPhdr want: HTML PPhdr password(###) prediction of: - threading (TOPITS)- return msf format ret topits hssp ret topits strip ret topits own ret html ret html perline=60 # default: single protein sequence description=curso1 RLSWYDPDFQARLTRSNSKCQGQLEV YLKDGWHMVC SQSWGRSSKQWEDPSQASKVCQRLNCGVPLSLGPFLV TYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE
------------------------------------------------------------- Pattern-ID: ASN_GLYCOSYLATION PS00001 PDOC00001 Pattern-DE: N-glycosylation site Pattern: N[^P][ST][^P] 92 NCSH Pattern-ID: PKC_PHOSPHO_SITE PS00005 PDOC00005 Pattern-DE: Protein kinase C phosphorylation site Pattern: [ST].[RK] 43 SSK Pattern-ID: CK2_PHOSPHO_SITE PS00006 PDOC00006 Pattern-DE: Casein kinase II phosphorylation site Pattern: [ST].{2}[DE] 3 SWYD 96 SRND 107 TCLE Pattern-ID: MYRISTYL PS00008 PDOC00008 Pattern-DE: N-myristoylation site Pattern: G[^EDRKHPFYW].{2}[STAGCN][^P] 63 GVPLSL 85 GQLGSF
Identities computed with respect to: (query) prot Colored by: consensus/70% and property
HSP processing: ranked
1 [ . . . . : . 60 prot RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL 1 PD012755 QLSWEDPGLQMRLSGSGCRCQGQLEVQMSSRWHPVCSQSWGGVSPYWKNPRQAGKVCQQL 2 PD002499 ------PKGQIRLVGGSSPCEGRVEVHMRGQWGTVCDSNWTRSEAQWE--AQWEVVCREL consensus/100% ......PthQhRLstuss.CpGplEV.hpstWt.VCspsWstsp..Wc..tQhthVCppL consensus/90% ......PthQhRLstuss.CpGplEV.hpstWt.VCspsWstsp..Wc..tQhthVCppL consensus/80% ......PthQhRLstuss.CpGplEV.hpstWt.VCspsWstsp..Wc..tQhthVCppL consensus/70% ......PthQhRLstuss.CpGplEV.hpstWt.VCspsWstsp..Wc..tQhthVCppL 61 . . . 1 ] 110 prot NCGVPLSLGPFLVTYTPQSSIICYGQXXSFSNCSHSRNDMCHSLGLTCLE 1 PD012755 GCRPPLKLGSAPPDYEPQWAIICHGQ--SFSNCNHSRNDQCHSLGLICLE 2 PD002499 GCGDAVSV------------------------------------------ consensus/100% sCtsslpl.................. ...................... consensus/90% sCtsslpl.................. ...................... consensus/80% sCtsslpl.................. ...................... consensus/70% sCtsslpl.................. ...................... |
--- ------------------------------------------------------------ --- --- Again: these results were obtained based on the domain data- --- base collected by Daniel Kahn and his coworkers in Toulouse. --- --- PLEASE quote: --- F Corpet, J Gouzy, D Kahn (1998). The ProDom database --- of protein domain families. Nucleic Ac Res 26:323-326. --- --- The general WWW page is on: ---- --------------------------------------- --- http://www.toulouse.inra.fr/prodom.html ---- --------------------------------------- --- --- For WWW graphic interfaces to PRODOM, in particular for your --- protein family, follow the following links (each line is ONE --- single link for your protein!!): --- http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom1=PD012755 ==> multiple alignment, consensus, PDB and PROSITE links of domain PD012755 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom2=PD012755 ==> graphical output of all proteins having domain PD012755 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom1=PD002499 ==> multiple alignment, consensus, PDB and PROSITE links of domain PD002499 http://www.toulouse.inra.fr/prodom/cgi-bin/ReqProdomII.pl?id_dom2=PD002499 ==> graphical output of all proteins having domain PD002499 --- --- NOTE: if you want to use the link, make sure the entire line --- is pasted as URL into your browser! --- --- END of PRODOM --- ------------------------------------------------------------
--- ------------------------------------------------------------ --- MAXHOM multiple sequence alignment --- ------------------------------------------------------------ --- --- MAXHOM ALIGNMENT HEADER: ABBREVIATIONS FOR SUMMARY --- ID : identifier of aligned (homologous) protein --- STRID : PDB identifier (only for known structures) --- IDE : percentage of pairwise sequence identity --- WSIM : percentage of weighted similarity --- LALI : number of residues aligned --- NGAP : number of insertions and deletions (indels) --- LGAP : number of residues in all indels --- LSEQ2 : length of aligned sequence --- ACCNUM : SwissProt accession number --- OMIM : OMIM (Online Mendelian Inheritance in Man) ID --- NAME : one-line description of aligned protein --- --- MAXHOM ALIGNMENT HEADER: SUMMARY ID STRID IDE WSIM LALI NGAP LGAP LSEQ2 ACCNUM OMIM NAME cd5_human 100 100 110 0 0 495 P06127 153340 T1/LEU-1) (LYMPHOCYTE ANT cd5_rat 48 60 100 1 1 491 P51882 1) (LYT-1) (LYMPHOCYTE AN cd5_mouse 48 60 103 1 1 494 P13379 1) (LYT-1) (LYMPHOCYTE AN cd5_bovin 46 54 109 1 2 495 P19238 T1/LEU-1) (LYMPHOCYTE ANT cd6_human 34 39 90 3 12 468 P30203 186720 T-CELL DIFFERENTIATION AN wc11_bovin 34 42 76 2 13 1436 P30205 ANTIGEN WC1.1. netr_mouse 28 31 98 2 11 761 O08762 PROTEASE 3) (BSSP-3). netr_human 25 26 105 3 12 875 P56730 NEUROTRYPSIN PRECURSOR (E --- --- MAXHOM ALIGNMENT: IN MSF FORMAT
--- --- Version of database searched for alignment: --- SWISS-PROT release 39.0 (5/00) with 85 249 proteins ---
Identities computed with respect to: (1) predict_h1180 Colored by: consensus/70% and property
1 [ . . . . : . 60 1 predict_h1180 RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL 2 cd5_human RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL 3 cd5_rat ---------QVMLSGSNSKCQGLVEVQM-NGMKTVCSSSWRLSQDLWKNANEASTVCQQL 4 cd5_mouse -------DIQVMLSGSNSKCQGQVEIQMENKWKTVCSSSWRLSQDHSKNAQQASAVCKQL 5 cd5_bovin -LKVEVQGLTMRLSGSGSRCQGRLEVSNGTEWYAVHSQSWGQLSLYQVAPRQFLKLCQEL 6 cd6_human -----------RLTGGADRCEGQVEVHFRGVWNTVCDSEWY--------PSEAKVLCQSL 7 wc11_bovin ---------KLRLRGGDSECSGRVEVWHNGSWGTVCDDSWSLA--------EAEVVCQQL 8 netr_mouse -------DGVIRLAGGKSTHEGRLEVYYKGQWGTVCDDGWTEMNTYVakYGKQSSVNHFD 9 netr_human RLSL---GFPVRLMDGENKKEGRVEVFINGQWGTICDDGWTDKDAayKGPARARTMAYFG consensus/100% ...........hL.tutspppG.lEl...s.h.hlpsptW...........p...hs... consensus/90% ...........hL.tutspppG.lEl...s.h.hlpsptW...........p...hs... consensus/80% ..........hhLttutscppGplEV.htstWthVCspuWt.........tptphlsp.h consensus/70% ..........hRLssusS+CpGplEV.hpstWtsVCspuWt..p....tstpAptlCppL 61 . . . 1 ] 110 1 predict_h1180 NCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE 2 cd5_human NCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE 3 cd5_rat GCGNPLALGHLTLWNRPKNQILCQGPPWSFSNCSTSSLGQCLPLSLVCLE 4 cd5_mouse RCGDPLALGPFPSLNRPQNQVFCQGSPWSISNCNnsSQDQCLPLSLICLE 5 cd5_bovin QCRDPLLLSsyFKEVQFQKLIICHGQLGSFSNCSLNRGRQVDSLALICLE 6 cd6_human GCGTAVERPKGLpsLSGRMYYSCNGEELTLSNCSWRFNncSQSLAARVL- 7 wc11_bovin GCGQALEAVRSAAFGPGNGSiqCGGRESSLWDC----------------- 8 netr_mouse GSNRPIWLDDVSCSGKEVSFIQCSRRQWGRHDCSHRED-----VGLTCYP 9 netr_human EGKGPIHVDNVKCTGNERSLADCIKQDIGRHNCRHSEDaiCDYFG----- consensus/100% tst.sl.hs...........h.C.t...sh.sC................. consensus/90% tst.sl.hs...........h.C.t...sh.sC................. consensus/80% tst.sl.hs.h....p.pt.h.C.tp.huhpsCp.p.......hu..... consensus/70% tCssPltls.hhs.hp.ps.l.C.Gp.hohpNCshpptt.s..luhhsh. |
**************************************************************************** * * * PHD: Profile fed neural network systems from HeiDelberg * * ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ * * * * Prediction of: * * secondary structure, by PHDsec * * solvent accessibility, by PHDacc * * and helical transmembrane regions, by PHDhtm * * * * Author: * * Burkhard Rost * * EMBL, 69012 Heidelberg, Germany * * Internet: Rost@EMBL-Heidelberg.DE * * * * All rights reserved. * * * **************************************************************************** * * * The network systems are described in: * * * * PHDsec: B Rost & C Sander: JMB, 1993, 232, 584-599. * * B Rost & C Sander: Proteins, 1994, 19, 55-72. * * PHDacc: B Rost & C Sander: Proteins, 1994, 20, 216-226. * * PHDhtm: B Rost et al.: Prot. Science, 1995, 4, 521-533. * * * ****************************************************************************
%H: 14.6 | %E: 33.6 | %L: 51.8 |
%A: 1.8 | %C: 7.3 | %D: 4.5 | %E: 2.7 | %F: 2.7 |
%G: 7.3 | %H: 2.7 | %I: 1.8 | %K: 3.6 | %L: 10.9 |
%M: 1.8 | %N: 3.6 | %P: 4.5 | %Q: 8.2 | %R: 5.5 |
%S: 15.4 | %T: 3.6 | %V: 4.5 | %W: 3.6 | %Y: 3.6 |
AA : | amino acid sequence | |
PHD_sec: | PHD predicted secondary structure: H=helix, E=extended (sheet), blank=other (loop) PHD = PHD: Profile network prediction HeiDelberg | |
Rel_sec: | reliability index for PHDsec prediction (0=low to 9=high) Note: for the brief presentation strong predictions marked by '*' | |
SUB_sec: | subset of the PHDsec prediction, for all residues with an expected average accuracy > 82% (tables in header) NOTE: for this subset the following symbols are used: L: is loop (for which above ' ' is used) .: means that no prediction is made for this residue, as the reliability is: Rel < 5 | |
pH_sec: | 'probability' for assigning helix (1=high, 0=low) | |
pE_sec: | 'probability' for assigning strand (1=high, 0=low) | |
pL_sec: | 'probability' for assigning neither helix, nor strand (1=high, 0=low) | |
P_3_acc: | PHD predicted relative solvent accessibility (acc) in 3 states: b = 0-9%, i = 9-36%, e = 36-100%. | |
Rel_acc: | reliability index for PHDacc prediction (0=low to 9=high) Note: for the brief presentation strong predictions marked by '*' | |
SUB_acc: | subset of the PHDacc prediction, for all residues with an expected average correlation > 0.69 (tables in header) NOTE: for this subset the following symbols are used: I: is intermediate (for which above ' ' is used) .: means that no prediction is made for this residue, as the reliability is: Rel < 4 | |
PHD_acc: | PHD predicted relative solvent accessibility (acc) in 10 states: a value of n (=0-9) corresponds to a relative acc. of between n*n % and (n+1)*(n+1) % (e.g. for n=5: 16-25%). |
PHD results (brief)....,....1....,....2....,....3....,....4....,....5....,....6 AA RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL PHD_sec EEEEE EEEE EEEEEEE EEEEEE HHHHHH HHHHHHHHHH Rel_sec * *** ** **** ******* ****** * ***** *** * ******* P_3_acc ebebeeeebebbbeeeebebebebbbbbeeebebbbbebbeebeeeeeeeeebeebbeeb Rel_acc * * * * * *** * ** ....,....11...,....12...,....13...,....14...,....15 AA NCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE PHD_sec EEE EEEEEE EE EEEE Rel_sec **** *** *** ***** **** *********** ** * P_3_acc ebe bbebeebeeeeeeebbbbbebee bbbbbeeeeeeebebbbbbbbe Rel_acc * * * *** * *
PHD results (normal) ....,....1....,....2....,....3....,....4....,....5....,....6 AA RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL PHD_sec EEEEE EEEE EEEEEEE EEEEEE HHHHHH HHHHHHHHHH Rel_sec 905673682798616998786289999517477897423121347860614699998543 SUB_sec L.EEE.LL.EEEE.LLLLLLL.EEEEEE.L.EEEEE........HHH.L..HHHHHHH.. P_3_acc ebebeeeebebbbeeeebebebebbbbbeeebebbbbebbeebeeeeeeeeebeebbeeb Rel_acc 031010101180610110211618090221021485000210013103203251066011 SUB_acc ..........b.b........b.b.b.......bbb................b..bb... ....,....11...,....12...,....13...,....14...,....15 AA NCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE PHD_sec EEE EEEEEE EE EEEE Rel_sec 77883432586334598459998165553113788899875774145639 SUB_sec LLLL....LLL...LLL.EEEEE.LLLL....LLLLLLLLLLL...EE.L P_3_acc ebe bbebeebeeeeeeebbbbbebee bbbbbeeeeeeebebbbbbbbe Rel_acc 12102302020001022100607101202001600111103016560514 SUB_acc ....................b.b.........b..........bbb.b.e
--- --- GLOBE: prediction of protein globularity --- --- nexp = 60 (number of predicted exposed residues) --- nfit = 55 (number of expected exposed residues --- diff = 5.00 (difference nexp-nfit) --- =====> your protein appears as compact, as a globular domain --- --- --- GLOBE: further explanations preliminaryily in: --- http://www.columbia.edu/~rost/Papers/98globe.html --- --- END of GLOBE
--- --- ------------------------------------------------------------ --- TOPITS prediction-based threading --- ------------------------------------------------------------ --- --- TOPITS ALIGNMENTS HEADER: PARAMETERS --- str:seq= 50 : structure (sec str, acc)= 50%, sequence= 50% --- str:seq = 50 : weight structure/sequence,i.e. str= 50%, seq= 50% --- smin = -1.00 : minimal value of alignment metric --- smax = 2.00 : maximal value of alignment metric --- go = 2 : gap open penalty --- ge = 0.2 : gap elongation penalty --- len1 = 110 : length of search sequence, i.e., your protein --- --- TOPITS ALIGNMENTS HEADER: ABBREVIATIONS --- RANK : rank in alignment list, sorted according to z-score --- EALI : alignment score --- LALI : length of alignment --- IDEL : number of residues inserted --- NDEL : number of insertions --- ZALI : alignment zcore; note: hits with z>3 more reliable --- PIDE : percentage of pairwise sequence identity --- LEN2 : length of aligned protein structure --- ID2 : PDB identifier of aligned structure --- NAME2 : name of aligned protein structure --- IFIR : position of first residue of search sequence --- ILAS : position of last residue of search sequence --- JFIR : PDB position of first residue of remote homologue --- JLAS : PDB position of last residue of remote homologue --- --- TOPITS ALIGNMENTS HEADER: ACCURACY --- : Tested on 80 proteins, TOPITS found the --- : correct remote homologue in about 30% of --- : the cases, detection accuracy was higher --- : for higher z-scores (ZALI): --- ZALI>0 : 1st hit correct in 33% of cases --- ZALI>3 : 1st hit correct in 50% of cases --- ZALI>3.5 : 1st hit correct in 60% of cases --- --- TOPITS ALIGNMENTS HEADER: SUMMARY RANK EALI LALI IDEL NDEL ZALI PIDE LEN2 ID2 NAME2 1 42.73 105 27 9 2.74 31 339 1qr7_A OL_ID: 1; 2 42.53 91 32 8 2.71 37 113 1by2 _ID: 1; 3 39.07 105 28 10 2.25 27 452 1ivy_A OL_ID: 1; 4 38.80 102 45 9 2.22 28 202 1tcr_A OL_ID: 1; 5 37.67 93 23 8 2.07 32 618 1qsa_A OL_ID: 1; 6 37.53 109 68 14 2.05 37 364 1a6q _ID: 1; 7 37.47 105 43 12 2.04 34 444 1igt_B OL_ID: 1; 8 36.40 95 49 11 1.90 36 483 1ac5 _ID: 1; 9 36.33 99 23 8 1.89 30 160 1ext_A OL_ID: 1; 10 36.27 107 54 12 1.88 33 228 1nfp XF GENE PRODUCT (NONFLUOR 11 36.27 105 69 17 1.88 40 308 4pah L_ID: 1; 12 35.80 103 67 16 1.82 40 757 1kit L_ID: 1; 13 35.47 98 38 12 1.77 37 650 1lox L_ID: 1; 14 35.40 93 24 11 1.77 31 289 1d5y_A MOL_ID: 1; 15 35.33 108 55 15 1.76 34 639 1cx8_B MOL_ID: 1; 16 35.27 102 27 10 1.75 32 446 1be3_A MOL_ID: 1; 17 35.20 99 16 7 1.74 23 502 1gpm_A MOL_ID: 1; 18 35.20 102 23 4 1.74 20 414 1ajs_A MOL_ID: 1; 19 35.00 104 40 10 1.71 28 746 1lvk L_ID: 1; 20 34.93 94 26 9 1.70 31 267 1abr_B ABRIN-A COMPLEXED WITH TW --- --- TOPITS ALIGNMENTS HEADER: PDB_POSITIONS FOR ALIGNED PAIR RANK PIDE IFIR ILAS JFIR JLAS LALI LEN2 ID2 --- --- TOPITS ALIGNMENTS: SYMBOLS AND EXPLANATIONS --- BLOCK 1 : your protein and its predicted 1D structure, --- : i.e., secondary structure and solvent accessibility --- line 1 : amino acid sequence (one-letter-code) --- line 2 : predicted secondary structure: --- H : helix --- E : strand (extended) --- L : other (no regular secondary structure) --- line 3 : predicted residue relative solvent accessibility --- B : buried, i.e., relative accessibility < 15% --- O : exposed (outside), i.e., relative accessibility >= 15% --- : --- BLOCKS 1-20 : 20 best hits of the prediction-based threading --- ATTENTION : We chose to include all first 20 hit. However, --- ATTENTION : most of them will not constitute true remote --- ATTENTION : homologues. Instead, all hits with a zscore --- ATTENTION : (ZALI) < 3.5 are, at best, rather speculative! --- : for each aligned protein: --- line 1 : amino acids conserved between guide (yours) and the --- : aligned protein (putative homologue) --- line 1 : sequence of aligned protein --- line 3 : secondary structure, taken from DSSP (assignment --- : of secondary structure based on experimental coordinates) --- line 4 : relative solvent accessibility, taken from DSSP --- --- TOPITS ALIGNMENTS 1 - 51 ....:....1....:....2....:....3....:....4....:....5 pred RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPS EEEEE EEEE EEEEEEE EEEEEE HHHHHH HH OBOBOOOOBOBBBOOOOBOBOBOBBBBBOOOBOBBBBOBBOOBOOOOOOOO 1. 1qr7_A 42.73 RlsIHDpeYATRLLALREELKDELevYFEkgWKGLINDPHMDNSFQINDGL LELLLLHHHHHHHHHHHHHHLLLEELLLLLLLLLLLLLLLLLLLLLHHHHH D D RL QG E W VC W D 2. 1by2 42.53 DGDM..RLADGGATNQGRVEIFYRGQWGTVCDNLW........DLT LLLE..EEELLLLLLEEEEEEEELLEEEEEELLLL........LHH S D F A T Q E L D K E P 3. 1ivy_A 39.07 FSYSDDKFYA..TNDTEVAQSNFEA.LQDFFRLF..PEY.KNNKLfespT LLEELLLLLL..LEHHHHHHHHHHH.HHHHHHHL..HHH.LLLLEEELHH L Y DF A SNS V D C S G K DPS 4. 1tcr_A 38.80 LKYYSGdfEAEFSKSNSshLRKASVHWSDSAVYFCAVsfGSGTkaLKDps EEELLLLEEEEEELLLLEEEEELLLLHHHLEEEEEEEEELLLEEEEELLL L D ARL S SK Q QL Y Q W S 5. 1qsa_A 37.67 LMYWNLDNTArlVKSKSKtqAQLARY...AFN...NQWWDLS....VQAT HHHLLLHHHHHHHLLLLHHHHHHHHH...HHH...LLLHHHH....HHHH R D R RS S G L C S GR KQ PS 6. 1a6q 37.53 RTGFLEIDEHMRvdRSGSTAVGVL.ISPQHTYFINCGDSRgrNRkqDHKPS HHHHHHHHHHHHHLLLEELEEEEE.ELLLEEEEEEELLLEEELLELLLLLL RL WY PD R TR N K L LKD C WG SS PS 7. 1igt_B 37.47 RLEwyYPdvKGRFtrDNAKNTLYLQMslkdTAMYYCAryWGqsSAKTTAPs LEEEELLHHLLLEEEELLLLEEEEEELLLHLEEEEEEEEEELLLLLLELLE S Y P F A L SK G L L GW QS S 8. 1ac5 36.40 SyyIPFfnAILNHNksKIDgdLKALlgNGWIDPNTQsiDESNPNFKHLT ELHHHHHHHHHHHHHLLLLLEEEEEEEEELLLHHHHHLLLLLLLHHHHH S R S SKC GQ E D VC G Q 9. 1ext_A 36.33 SFTASENHLRHCLSCSKCrgQVEIstVD.RDTVC....GCRKNQYRH.Y EELLLLELLLLLEELLLLLLLEEEEELL.ELLEE....ELLLLEEEE.E S YD F L V L H K W D 10. 1nfp 36.27 SIYdvAFSEHHIDKSYNDETKLavSLGKQIHVLAteTVVKAAkkWDDSQ LLEEEEEELLLLLLELLLLLELLEEELLEEEEELLHHHHHHHHLLLLLH DP AR N GQ E Y K W VC KQ ED S 11. 4pah 36.27 HPGFKDPVYRARRKqyNYR.HGqvE.YmkKTWGTvcnHIFPLLEkqLEDVS LLLLLLHHHHHHHHHHHLL.LLLLL.LLHHHHHHHLHHHHHHHHHLHHHHH S Y P LTR S QG L Y D V S WG SS E PS 12. 1kit 35.80 MSLygPGHGITLTRQqsGSqgRL.IyvLDRFFlvMssNwgkSSSILeePS EEEELLLLLEELLLLLLLLLLLE.EEEELLLLEEEELLLEEELLEEELEE S Y D RL S QG Y KD QSW R 13. 1lox 35.47 SFYAQD.ALRLWEIISrvQGIMGLYYkdDLEL...QSWCREITEIGL.Q LHHHHH.HHHHHHHHHHHHHHHLLLLLLLHHH...HHHHHHHHLLLL.L RLS RLT L L Q R KQ P 14. 1d5y_A 35.40 RLSKSAVAL..RLTA.....RPILDIALQ..YRFDSQQTFTRakKQfqTPa HHHHHHHHH..HHLL.....LLHHHHHHH..LLLLLHHHHHHHHHHHLLHH LSW D FQ NSK G L VYL G V G K EDP 15. 1cx8_B 35.33 KLswRDQHfqVKDSAQNsk.NGRL.VYLVegykLVHA.NFG.TKKDFEdpV LLLLEEEEEEEELLLLLEL.LLLL.LLLLELLLEEEL.LLL.LLLLLLLLL LS YD D L S C G DG W P 16. 1be3_A 35.27 LstYDEDAVPTL....SPCrgSQICHREDGLPLA.HVAIAVEGPGWAHPD LLLLLLLLLLLL....LLLLLEEEEEEELLLLLE.EEEEELLLLLLLLLH Y F T LE D C W K D 17. 1gpm_A 35.20 RFYGVQFHPEVTHTRQGMR.MLERFVRD..ICQCEALW.TPAKIIDDae LEEEELELLLLLLLLLHHH.HHHHHHHL..LLLLLLLL.LHHHHHHHHH D Q K G C QS S 18. 1ajs_A 35.20 FPFFDSAYQGFASGNLEKDAWAIRYFVSEGFELFCAQSFSKNFGLYNErs EEEEEELLLLLLLLLHHHHLHHHHHHHHLLLLEEEEEELLLLLLLHHHLH R WY PD R C G D S R K D 19. 1lvk 35.00 RYIWYNPD...PKERDSYEC.GEIVSETSDSFTFKTSDGQDRQVKK.DDAN LEEEELLL...LLLLLLLEE.EEEEEELLLEEEEELLLLLEEEEEH.HHLE P L S G L VYL GW S S D 20. 1abr_B 34.93 TIINPKSALVLSAESSSMGGTLTvylrQGWRTGNNTsfVTSISGYSdqA LEEELLLLEEEELLLLLLLLELEELLHHLLEELLLLLEEELEELHHHEE --- --- TOPITS ALIGNMENTS CONTINUED --- 1. 1qr7_A 42.73 LraRKLLLDInsGLP.AAGEFLDMITPqaDLMSWGAIGARTTESQVHREL. HHHHHHHHHHHLLLL.EEEELLLLLLHHHHHLLEEEELHHHLLLHHHHHH. AS VC LN L GP T T S C LG SNC H R 2. 1by2 42.53 TDASVVCRAlnATQAlgSGPIMltGTEASLADC.KSLGWlsNCRHERDA.. HHHHHHHHHLLEEEEELLLLEEELLLLLLHHHL.ELLLLLLLLLHHHEL.. A V Q L G L Y YG LG S CS 3. 1ivy_A 39.07 TLAVLVMQDPSMNlgLAVGNGLSSYEQNDNSLVygLLGnwSscSQNKCNFY HHHHHHLLLLLLLEEEEEELLLLEHHHHHHHHHHLLLLHHHHEELLEELLL SQ S C N VP T S SN 4. 1tcr_A 38.80 sQDSTLcsQIN..VPKTMES..GTFITDATVLDMKAMDSKSNGAIAWSN.. LLLLEEELLLL..LLLLLLL..LEEELLLEEEELLLLLLEEEEEEEEEL.. A K L PL YTPQS S N 5. 1qsa_A 37.67 TIAGKLWDHLEERFPLAYNDLFKRYtpQSYAMAIARQESAWN.PKVKSPVG HHHLLLLLLHHHHLLLLLHHHHHHHHLHHHHHHHHHHHHLLL.LLLELLLL S S QR N SLG FLV P SIIC G C SR D C 6. 1a6q 37.53 SNpsVMIQRVNGSLAVslGDflVSPEpeRsiicwDVMGNEELCdrSrdDlc LLHLEELLEELLLELLLEELHLLELLLELLEELHLLLLHHHHHHHHHLLHH S VC G LG Y P GS S H D 7. 1igt_B 37.47 spLAPVC.GDTTGSSVTLGCLVKGYFPEPVTLTWNS.GSLSSGVHtqSdlS EEELLLL.LLLLLLLEEEEEEEEEEELLLLEEEEHH.HLELLLEEEELLEE A CQ L C L L L YT SS Q G C D 8. 1ac5 36.40 TNAHENCQNleCENILNL...LLSYTRESS.....QKGT.ADcnFNLKDSY HHHHHHHHHHHHHLHHHH...HHHHLLLLL.....LLLL.LLEEEEEEELL Q NC L G C G FSNC S C 9. 1ext_A 36.33 YWSENLFQCFNCSLCLN.GTVHLSCQEKQNTVCTCHAGFfsNCKKSLE..C EEELLEEEEEELLLLLL.EEEEELLELLELLEEEELLLEEHHLLLLLL..L QA KV RL LSL L YT SII G F C H M 10. 1nfp 36.27 QqaAkvRHRlqAKAELslEDYL.SYtaETsiINSNAAGNFDTCLHHVAEMA HHHHHLLEEEHHHHHHHHHHHH.HHHLLLLHHHLLEEELHHHHHHHHHHHH SQ C RL V LSLG F V YTP IC LG FS S 11. 4pah 36.27 SQFLQTCtrLR.PValslgaFRvmYTPEPD.ICHELLGhfSDRSFAQ..FS HHHHHHHHEEE.ELLLLHHLLLELLLLLLL.HHHHHHHLHLLHHHHH..HH S A V L G L L LV Y P I L FSN S D 12. 1kit 35.80 SEADMV..ELQNG.DLLLTARlvNYSPRQQfiTWSLLEafSNIsfEQSDGS EEEEEE..ELLLL.LEEEEEEEEELLLEEEELLLEEEEELLLLLEELLLLL A K Q G P SL VT T Q S I GQL F NC D 13. 1lox 35.47 QGAQK..Q....GFPTSLQshFVTmtGQHSSIHLGQLDWFtnctpTTKDAT LLHHH..H....LLLLLLLLHHHHHLHHHHHHHLLHHHHLLHLLLLLLLLL S PL LG F VT I G S S CS D 14. 1d5y_A 35.40 arRSPEWSAFGIRPPLRLGEffVT.LEDTPLI..GVTQSYS.CSLEqsDFR HHHLLLEELLLLLLLLLLLLLEEE.ELLEEEE..EEEEELL.LLHHHHHHH S V R V SL Y Q S LG F H 15. 1cx8_B 35.33 VNGSIVIVrfAEKvaESLNAivLIYMDQtsFFGHAHLGtfPSFNHTQFPPS LLLLEEEEEHHHHHHHHLLLLEEEELLLLLELELLLLLLLLLLLLLLLLLL A CG LSL T Q ICYG LG C HS DM 16. 1be3_A 35.27 DnaNAIIGHYDcgAHLslASIAATnsFQTFNICygLLGAHFVCDhsIDDMM HHHHHHHLEEELHHHLLHHHHHHHLEEEEEEEELEEEEEEEEELLLHHHHH Q KV L GV S L C G L 17. 1gpm_A 35.20 eqdDKVILGLSGGVDSSVTAMLLHRAIGKNLTCvgLL.RLNEAEQVLDMFG HHLLEEEEELLLLHHHHHHHHHHHHHHHHHEEEELLL.LLLHHHHHHHHHL SQ K R P G V TP G SR 18. 1ajs_A 35.20 sQMQKI.VRVTWSNPPAQGARIVARtpELFHEWTGNVKTMADRILsrSELR HHHHHH.HHLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q GV LS G FLV P I Y Q F RN 19. 1lvk 35.00 NQRNPI..KFD.GveLSygLFLVAVNPFKRIPIYTqvDIFK..GRRRNEVa EELLLH..HHL.LLHLLLLLEEEEELLLLLLLLLLHHHHLL..LLLHHHLL Q S VC L YT SI NC S 20. 1abr_B 34.93 AQGSNvcDSNKKEQQWAL......YT.DGSI...RSVQNTNNCLTSKD... EELLEELLLLLHHHLEEE......LL.LLLE...EELLEEEEEEEELL... --- --- TOPITS ALIGNMENTS CONTINUED --- 1. 1qr7_A 42.73 ..ASGLSC ..HLLLLL G C 2. 1by2 42.53 ....GVVCTN ....EEEELL L C 3. 1ivy_A 39.07 YDNKDLECVT LLLLLHHHHH TC 4. 1tcr_A 38.80 ..QTSFTCQD ..LLLLLHHH S GL 5. 1qsa_A 37.67 GAS.GL LLE.LL C TCL 6. 1a6q 37.53 cNEVVDTCL HHHHHHHHH S T 7. 1igt_B 37.47 SSSVTVT EEEEEEL S G 8. 1ac5 36.40 YPSCGM LLLLLL C L L E 9. 1ext_A 36.33 CTKLCLPQIE LHHHHLLLLL L L C E 10. 1nfp 36.27 AQGLnlFCFE HHHLLEEELL GL L 11. 4pah 36.27 SQEIGLASL HHHHHHHHL H L T 12. 1kit 35.80 SHFLLFT LEEEEEE L 13. 1lox 35.47 TLETVMATLP LHHHHHHHLL H 14. 1d5y_A 35.40 RHEM HHHH S GL 15. 1cx8_B 35.33 SRSSGLPNIP LLLLLLLLLL L 16. 1be3_A 35.27 MFVL HHHH GL 17. 1gpm_A 35.20 GDHFGL LLLLLL L 18. 1ajs_A 35.20 RARL HHHH H 19. 1lvk 35.00 ahIFAISDVA LLHHHHHHHH H G T L 20. 1abr_B 34.93 .HKQGSTIL .LLLLLEEE --- --- TOPITS ALIGNMENTS END ---
Identities computed with respect to: (1) predict_h1180 Colored by: consensus/70% and property
1 [ . . . . : . 60 1 predict_h1180 RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPSQASKVCQRL 2 1qr7_A RlsIHDpeYATRLLALREELKDELevYFEkgWKGLINDPHMDNSFQINDGLraRKLLLDI 3 1by2 -----DGDM--RLADGGATNQGRVEIFYRGQWGTVCDNLW--------DLTDASVVCRAl 4 1ivy_A -FSYSDDKFYA--TNDTEVAQSNFEA-LQDFFRLF--PEY-KNNKLfespTLAVLVMQDP 5 1tcr_A -LKYYSGdfEAEFSKSNSshLRKASVHWSDSAVYFCAVsfGSGTkaLKDpsQDSTLcsQI 6 1qsa_A -LMYWNLDNTArlVKSKSKtqAQLARY---AFN---NQWWDLS----VQATIAGKLWDHL 7 1a6q RTGFLEIDEHMRvdRSGSTAVGVL-ISPQHTYFINCGDSRgrNRkqDHKPSNpsVMIQRV 8 1igt_B RLEwyYPdvKGRFtrDNAKNTLYLQMslkdTAMYYCAryWGqsSAKTTAPspLAPVC-GD 9 1ac5 --SyyIPFfnAILNHNksKIDgdLKALlgNGWIDPNTQsiDESNPNFKHLTNAHENCQNl 10 1ext_A --SFTASENHLRHCLSCSKCrgQVEIstVD-RDTVC----GCRKNQYRH-YWSENLFQCF 11 1nfp --SIYdvAFSEHHIDKSYNDETKLavSLGKQIHVLAteTVVKAAkkWDDSQqaAkvRHRl 12 4pah HPGFKDPVYRARRKqyNYR-HGqvE-YmkKTWGTvcnHIFPLLEkqLEDVSQFLQTCtrL 13 1kit -MSLygPGHGITLTRQqsGSqgRL-IyvLDRFFlvMssNwgkSSSILeePSEADMV--EL 14 1lox --SFYAQD-ALRLWEIISrvQGIMGLYYkdDLEL---QSWCREITEIGL-QGAQK--Q-- 15 1d5y_A RLSKSAVAL--RLTA-----RPILDIALQ--YRFDSQQTFTRakKQfqTParRSPEWSAF 16 1cx8_B KLswRDQHfqVKDSAQNsk-NGRL-VYLVegykLVHA-NFG-TKKDFEdpVNGSIVIVrf 17 1be3_A -LstYDEDAVPTL----SPCrgSQICHREDGLPLA-HVAIAVEGPGWAHPDnaNAIIGHY 18 1gpm_A --RFYGVQFHPEVTHTRQGMR-MLERFVRD--ICQCEALW-TPAKIIDDaeqdDKVILGL 19 1ajs_A -FPFFDSAYQGFASGNLEKDAWAIRYFVSEGFELFCAQSFSKNFGLYNErsQMQKI-VRV 20 1lvk RYIWYNPD---PKERDSYEC-GEIVSETSDSFTFKTSDGQDRQVKK-DDANQRNPI--KF 21 1abr_B --TIINPKSALVLSAESSSMGGTLTvylrQGWRTGNNTsfVTSISGYSdqAQGSNvcDSN consensus/100% ............................................................ consensus/90% ...h.t.t....h.t........h.h.h.t.h.h.....h.......tp....t....t. consensus/80% ..thhs.t...hh.t.ttt.ts.l.h.htpth.h..t..hs.t...hpp.tttt.hh.th consensus/70% ..shhssphthphstpppp.psplthhhpcthhhhtsp.aspsttthpcsspss.lh.ph 61 . . . 1 ] 110 1 predict_h1180 NCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMCHSLGLTCLE 2 1qr7_A nsGLP-AAGEFLDMITPqaDLMSWGAIGARTTESQVHREL--ASGLSC-- 3 1by2 nATQAlgSGPIMltGTEASLADC-KSLGWlsNCRHERDA-----GVVCTN 4 1ivy_A SMNlgLAVGNGLSSYEQNDNSLVygLLGnwSscSQNKCNFYDNKDLECVT 5 1tcr_A N--VPKTMES--GTFITDATVLDMKAMDSKSNGAIAWSN---QTSFTCQD 6 1qsa_A EERFPLAYNDLFKRYtpQSYAMAIARQESAWN-PKVKSPVGAS-GL---- 7 1a6q NGSLAVslGDflVSPEpeRsiicwDVMGNEELCdrSrdDlcNEVVDTCL- 8 1igt_B TTGSSVTLGCLVKGYFPEPVTLTWNS-GSLSSGVHtqSdlSSSVTVT--- 9 1ac5 eCENILNL---LLSYTRESS-----QKGT-ADcnFNLKDSYPSCGM---- 10 1ext_A NCSLCLN-GTVHLSCQEKQNTVCTCHAGFfsNCKKSLE--CTKLCLPQIE 11 1nfp qAKAELslEDYL-SYtaETsiINSNAAGNFDTCLHHVAEMAQGLnlFCFE 12 4pah R-PValslgaFRvmYTPEPD-ICHELLGhfSDRSFAQ--FSQEIGLASL- 13 1kit QNG-DLLLTARlvNYSPRQQfiTWSLLEafSNIsfEQSDGSHFLLFT--- 14 1lox --GFPTSLQshFVTmtGQHSSIHLGQLDWFtnctpTTKDATLETVMATLP 15 1d5y_A GIRPPLRLGEffVT-LEDTPLI--GVTQSYS-CSLEqsDFRHEM------ 16 1cx8_B AEKvaESLNAivLIYMDQtsFFGHAHLGtfPSFNHTQFPPSRSSGLPNIP 17 1be3_A DcgAHLslASIAATnsFQTFNICygLLGAHFVCDhsIDDMMFVL------ 18 1gpm_A SGGVDSSVTAMLLHRAIGKNLTCvgLL-RLNEAEQVLDMFGDHFGL---- 19 1ajs_A TWSNPPAQGARIVARtpELFHEWTGNVKTMADRILsrSELRARL------ 20 1lvk D-GveLSygLFLVAVNPFKRIPIYTqvDIFK--GRRRNEVahIFAISDVA 21 1abr_B KKEQQWAL------YT-DGSI---RSVQNTNNCLTSKD---HKQGSTIL- consensus/100% .................................................. consensus/90% t.t...s.t..h.....t......t.ht.h.......t............ consensus/80% p.t..hsht.hhhtht.pt.....t.htthsst.ht.p....t.s..... consensus/70% p.t.shuhsshhhshp.ppshhshs.hsshsstshspsth.tphsh.... |
HSSP HOMOLOGY DERIVED SECONDARY STRUCTURE OF PROTEINS , VERSION 1.0 1991 PDBID predict_h11814 DATE file generated on 22-Jan-01 SEQBASE RELEASE 38.0 OF EMBL/SWISS-PROT WITH 80000 SEQUENCES PARAMETER SMIN: -1.0 SMAX: 2.0 PARAMETER gap-open: 2.0 gap-elongation: 0.2 PARAMETER conservation weights: NO PARAMETER InDels in secondary structure allowed: YES PARAMETER alignments sorted according to :ZSCORE THRESHOLD according to: ALL REFERENCE Sander C., Schneider R. : Database of homology-derived protein structures. Proteins, 9:56-68 (1991). CONTACT e-mail (INTERNET) Schneider@EMBL-Heidelberg.DE or Sander@EMBL-Heidelberg.DE / fax +49-6221-387306 AVAILABLE Free academic use. Commercial users must apply for license. AVAILABLE No inclusion in other databanks without permission. HEADER predict_h11814.phdRdb COMPND SOURCE AUTHOR SEQLENGTH 110 NCHAIN 1 chain(s) in predict_h11814 data set KCHAIN 1 chain(s) used here ; chain(s) : A NALIGN 20 NOTATION : ID: EMBL/SWISSPROT identifier of the aligned (homologous) protein NOTATION : STRID: if the 3-D structure of the aligned protein is known, then STRID is the Protein Data Bank identifier as taken NOTATION : from the database reference or DR-line of the EMBL/SWISSPROT entry NOTATION : %IDE: percentage of residue identity of the alignment NOTATION : %SIM (%WSIM): (weighted) similarity of the alignment NOTATION : IFIR/ILAS: first and last residue of the alignment in the test sequence NOTATION : JFIR/JLAS: first and last residue of the alignment in the alignend protein NOTATION : LALI: length of the alignment excluding insertions and deletions NOTATION : NGAP: number of insertions and deletions in the alignment NOTATION : LGAP: total length of all insertions and deletions NOTATION : LSEQ2: length of the entire sequence of the aligned protein NOTATION : ACCESSION: SwissProt accession number NOTATION : PROTEIN: one-line description of aligned protein NOTATION : SeqNo,PDBNo,AA,STRUCTURE,BP1,BP2,ACC: sequential and PDB residue numbers, amino acid (lower case = Cys), secondary NOTATION : structure, bridge partners, solvent exposure as in DSSP (Kabsch and Sander, Biopolymers 22, 2577-2637(1983) NOTATION : VAR: sequence variability on a scale of 0-100 as derived from the NALIGN alignments NOTATION : pair of lower case characters (AvaK) in the alignend sequence bracket a point of INSERTION IN THIS sequence NOTATION : dots (....) in the alignend SEQUENCE INDICATE POINTS of deletion in this sequence NOTATION : SEQUENCE PROFILE: relative frequency of an amino acid type at each position. Asx and Glx are in their NOTATION : acid/amide form in proportion to their database frequencies NOTATION : NOCC: number of aligned sequences spanning this position (including the test sequence) NOTATION : NDEL: number of sequences with a deletion in the test protein at this position NOTATION : NINS: number of sequences with an insertion in the test protein at this position NOTATION : ENTROPY: entropy measure of sequence variability at this position NOTATION : RELENT: relative entropy, i.e. entropy normalized to the range 0-100 NOTATION : WEIGHT: conservation weight ## PROTEINS : EMBL/SWISSPROT identifier and alignment statistics NR. ID STRID %IDE %SIM IFIR ILAS JFIR JLAS LALI NGAP LGAP LSEQ2 ACCESSION PROTEIN 1 : 1qr7_A 0.31 0.00 1 108 46 174 105 9 27 339 MOL_ID: 1; 2 : 1by2 0.37 0.00 6 110 4 108 91 8 32 113 MOL_ID: 1; 3 : 1ivy_A 0.27 0.00 2 110 106 230 105 10 28 452 MOL_ID: 1; 4 : 1tcr_A 0.28 0.00 2 110 47 186 102 9 45 202 MOL_ID: 1; 5 : 1qsa_A 0.32 0.00 2 106 391 494 93 8 23 618 MOL_ID: 1; 6 : 1a6q 0.37 0.00 1 109 101 275 109 14 68 364 MOL_ID: 1; 7 : 1igt_B 0.34 0.00 1 107 44 189 105 12 43 444 MOL_ID: 1; 8 : 1ac5 0.36 0.00 3 106 176 310 95 11 49 483 MOL_ID: 1; 9 : 1ext_A 0.30 0.00 3 110 47 159 99 8 23 160 MOL_ID: 1; 10 : 1nfp 0.33 0.00 3 110 40 199 107 12 54 228 LUXF GENE PRODUCT (NONFLUORESCENT FLAVOPR 11 : 4pah 0.40 0.00 1 109 30 195 105 17 69 308 MOL_ID: 1; 12 : 1kit 0.40 0.00 2 107 512 676 103 16 67 757 MOL_ID: 1; 13 : 1lox 0.37 0.00 3 110 447 570 98 12 38 650 MOL_ID: 1; 14 : 1d5y_A 0.31 0.00 1 104 56 157 93 11 24 289 MOL_ID: 1; 15 : 1cx8_B 0.34 0.00 1 110 56 212 108 15 55 639 MOL_ID: 1; 16 : 1be3_A 0.32 0.00 2 104 219 338 102 10 27 446 MOL_ID: 1; 17 : 1gpm_A 0.23 0.00 3 106 172 281 99 7 16 502 MOL_ID: 1; 18 : 1ajs_A 0.20 0.00 2 104 218 341 102 4 23 414 MOL_ID: 1; 19 : 1lvk 0.29 0.00 1 110 32 161 104 10 40 746 MOL_ID: 1; 20 : 1abr_B 0.31 0.00 3 109 102 208 94 9 26 267 ABRIN-A COMPLEXED WITH TWO SUGAR CHAINS ## ALIGNMENTS 1 - 20 SeqNo PDBNo AA STRUCTURE BP1 BP2 ACC NOCC VAR ....:....1....:....2....:....3....:....4....:....5....:....6....:....7 1 1 R 0 0 200 8 15 R RR H RK R 2 2 L E 0 0 0 14 26 l FLLTL PM LLL FY 3 3 S E 0 0 63 20 35 s SKMGESSSGSSSssRPIT 4 4 W E 0 0 0 20 29 I YYYFwyFIFLFKwtFFWI 5 5 Y E 0 0 79 20 40 H SYWLyyTYKyYSRYYFYI 6 6 D E 0 0 58 21 35 DDDSNEYIAdDgAADDGDNN 7 7 P 0 0 66 21 42 pGDGLIPPSvPPQVQEVSPP 8 8 D 0 0 79 21 38 eDKdDDdFEAVGDAHDQADK 9 9 F 0 0 0 19 42 YMFfNEvfNFYH.LfAFY.S 10 10 Q E 0 0 71 18 48 A.YETHKnHSRGA.qVHQ.A 11 11 A E 0 0 0 18 42 T.AAAMGALEAIL.VPPG.L 12 12 R E 0 0 0 20 42 RR.ErRRIRHRTRRKTEFPV 13 13 L E 0 0 0 20 38 LL.FlvFLHHRLLLDLVAKL 14 14 T 0 0 69 20 46 LATSVdtNCIKTWTS.TSES 15 15 R 0 0 121 20 45 ADNKKRrHLDqREAA.HGRA 16 16 S 0 0 63 19 46 LGDSSSDNSKyQI.Q.TNDE 17 17 N 0 0 76 19 47 RGTNKGNkCSNqI.N.RLSS 18 18 S 0 0 0 20 38 EAESSSAsSYYsS.sSQEYS 19 19 K 0 0 73 20 41 ETVsKTKKKNRGr.kPGKES 20 20 C 0 0 0 18 52 LNAhtANICD.Sv..CMDCM 21 21 Q 0 0 71 20 43 KQQLqVTDrEHqQRNrRA.G 22 22 G 0 0 0 20 35 DGSRAGLggTGgGPGg.WGG 23 23 Q E 0 0 71 21 49 ERNKQVYdQKqRIIRSMAET 24 24 L E 0 0 0 21 22 LVFALLLLVLvLMLLQLIIL 25 25 E E 0 0 0 18 41 eEESA.QKEaE.GD.IERVT 26 26 V E 0 0 0 20 40 vIAVRIMAIv.ILIVCRYSv 27 27 Y E 0 0 0 20 42 YF.HYSsLsSYyYAYHFFEy 28 28 L E 0 0 0 20 38 FYLW.PlltLmvYLLRVVTl 29 29 K E 0 0 100 20 46 ERQS.QkgVGkLkQVERSSr 30 30 D 0 0 79 19 26 kGDD.HdNDKKDd.eDDEDQ 31 31 G 0 0 41 18 40 gQFSATTG.QTRD.gG.GSG 32 32 W E 0 0 0 20 30 WWFAFYAWRIWFLYyL.FFW 33 33 H E 0 0 66 21 52 KGRVNFMIDHGFERkPIETR 34 34 M E 0 0 0 20 44 GTLY.IYDTVTlLFLLCLFT 35 35 V E 0 0 0 19 48 LVFF.NYPVLvv.DVAQFKG 36 36 C E 0 0 0 17 40 IC.C.CCNCAcM.SH.CCTN 37 37 S E 0 0 0 18 40 ND.ANGAT.tns.QAHEASN 38 38 Q 0 0 97 19 42 DNPVQDrQ.eHsQQ.VAQDT 39 39 S 0 0 0 20 47 PLEsWSys.TINSTNALSGs 40 40 W 0 0 0 20 32 HWYfWRWi.VFwWFFIWFQf 41 41 G 0 0 41 18 39 M..GDgGDGVPgCTGA.SDV 42 42 R 0 0 89 19 47 D.KSLrqECKLkRR.VTKRT 43 43 S H 0 0 0 20 42 N.NGSNsSRALSEaTEPNQS 44 44 S H 0 0 63 19 49 S.NT.RSNKAESIkKGAFVI 45 45 K H 0 0 100 19 39 F.Kk.kAPNkkSTKKPKGKS 46 46 Q H 0 0 71 19 45 Q.La.qKNQkqIEQDGILKG 47 47 W H 0 0 111 18 34 I.fL.DTFYWLLIfFWIY.Y 48 48 E H 0 0 95 20 40 N.eKVHTKRDEeGqEADNDS 49 49 D 0 0 79 21 34 DDsDQKAHHDDeLTdHDEDd 50 50 P H 0 0 66 19 38 GLppAPPL.SVP.PpParAq 51 51 S H 0 0 63 21 45 LTTsTSsTYQSSQaVDesNA 52 52 Q H 0 0 97 21 41 rDLQINpNWqQEGrNnqQQQ 53 53 A H 0 0 0 21 41 aAADApLASaFAARGadMRG 54 54 S H 0 0 46 21 43 RSVSGsAHEALDQSSNDQNS 55 55 K H 0 0 100 21 45 KVLTKVPENkQMKPIAKKPN 56 56 V H 0 0 0 20 28 LVVLLMVNLvTV.EVIVIIv 57 57 C H 0 0 0 17 48 LCMcWICCFRC..WIII..c 58 58 Q H 0 0 71 18 46 LRQsDQ.QQHt.QSVGLV.D 59 59 R H 0 0 89 20 45 DADQHRGNCRrE.ArHGRKS 60 60 L 0 0 0 20 33 IlPILVDlFlLL.FfYLVFN 61 61 N 0 0 76 20 39 nnSNENTeNqRQ.GADSTDK 62 62 C 0 0 0 17 53 sAM.EGTCCA.N.IEcGW.K 63 63 G 0 0 41 20 38 GTN.RSGESKPGGRKgGSGE 64 64 V 0 0 35 20 45 LQlVFLSNLAV.FPvAVNvQ 65 65 P 0 0 0 21 40 PAgPPASICEaDPPaHDPeQ 66 66 L E 0 0 0 20 35 .lLKLVVLLLlLTLELSPLW 67 67 S E 0 0 46 21 36 AgATAsTNNssLSRSsSASA 68 68 L E 0 0 0 20 30 ASVMYlLL.llLLLLlVQyL 69 69 G 0 0 41 19 27 GGGENGG.GEgTQGNATGg. 70 70 P 0 0 66 19 42 EPNSDDC.TDaAsEASAAL. 71 71 F 0 0 0 18 40 FIG.LfL.VYFRhfiIMRF. 72 72 L 0 0 59 19 29 LML.FlVLHLRlFfvALIL. 73 73 V 0 0 69 19 38 DlSGKVKLL.vvVVLALVV. 74 74 T 0 0 69 20 44 MtSTRSGSSSmNTTITHAA. 75 75 Y 0 0 179 20 44 IGYFYPYYCYYYm.YnRRVY 76 76 T 0 0 69 21 41 TTEItEFTQtTStLMsAtNT 77 77 P 0 0 66 20 41 PEQTppPREaPPGEDFIpP. 78 78 Q 0 0 97 21 35 qANDQeEEKEERQDQQGEFD 79 79 S E 0 0 0 21 46 aSDASRPSQTPQHTtTKLKG 80 80 S E 0 0 0 21 49 DLNTYsVSNsDQSPsFNFRS 81 81 I E 0 0 0 19 44 LASVAiT.Ti.fSLFNLHII 82 82 I E 0 0 0 19 35 MDLLMiL.VIIiIIFITEP. 83 83 C E 0 0 0 18 48 SCVDAcT.CNCTH.GCCWI. 84 84 Y E 0 0 108 17 47 W.yMIwW.TSHWL.HyvTY. 85 85 G 0 0 0 20 38 GKgKADN.CNESGGAggGTR 86 86 Q 0 0 71 21 49 ASLARVSQHALLQVHLLNqS 87 87 L 0 0 80 20 33 ILLMQM.KAALLLTLLLVvV 88 88 G 0 0 13 20 24 GGGDEGGGGGGEDQGG.KDQ 89 89 S 0 0 0 21 47 AWnSSNSTFNhaWStARTIN 90 90 F E 0 0 0 20 40 RlwKAEL.fFffFYfHLMFT 91 91 S E 0 0 0 21 41 TsSSWESAsDSStSPFNAKN 92 92 N 0 0 0 19 38 TNsNNLSDNTDNn.SVED.N 93 93 C 0 0 0 19 39 ECcG.CGcCCRIcCFCAR.C 94 94 S 0 0 46 21 47 SRSAPdVnKLSstSNDEIGL 95 95 H 0 0 66 21 47 QHQIKrHFKHFfpLHhQLRT 96 96 S 0 0 63 21 44 VENAVStNSHAETETsVsRS 97 97 R 0 0 121 21 44 HRKWKrqLLVQQTqQILrRK 98 98 N 0 0 100 20 43 RDCSSdSKEA.SKsFDDSND 99 99 D 0 0 79 18 31 EANNPDdD.E.DDDPDMEE. 100 100 M 0 0 92 17 38 L.F.VllS.MFGAFPMFLV. 101 101 C 0 0 0 17 47 ..Y.GcSYCASSTRSMGRa. 102 102 H 0 0 66 18 47 ..D.ANSPTQQHLHRFDAhH 103 103 S 0 0 0 20 47 A.NQSESSKGEFEESVHRIK 104 104 L 0 0 0 19 44 S.KT.VVCLLILTMSLFLFQ 105 105 G 0 0 0 18 35 GGDSGVTGCnGLV G G AG 106 106 L E 0 0 0 18 26 LVLFLDVMLlLFM L L IS 107 107 T E 0 0 0 15 41 SVET TT PFATA P ST 108 108 C E 0 0 0 13 41 CCCC C QCS T N DI 109 109 L E 0 0 0 12 31 TVQ L IFL L I VL 110 110 E 0 0 157 9 39 NTD EE P P A ## SEQUENCE PROFILE AND ENTROPY SeqNo PDBNo V L I M F W Y G A P S T C H R K Q E N D NOCC NDEL NINS ENTROPY RELENT WEIGHT 1 1 0 0 0 0 0 0 0 0 0 0 0 0 0 13 75 13 0 0 0 0 8 0 0 0.736 35 1.47 2 2 0 57 0 7 14 0 7 0 0 7 0 7 0 0 0 0 0 0 0 0 14 0 1 1.352 51 1.27 3 3 0 0 5 5 0 0 0 10 0 5 55 5 0 0 5 5 0 5 0 0 20 0 2 1.608 54 1.10 4 4 0 5 15 0 30 20 20 0 0 0 0 5 0 0 0 5 0 0 0 0 20 0 2 1.739 58 1.21 5 5 0 5 5 0 5 5 50 0 0 0 10 5 0 5 5 5 0 0 0 0 20 0 1 1.775 59 1.01 6 6 0 0 5 0 0 0 5 10 14 0 5 0 0 0 0 0 0 5 14 43 21 0 1 1.723 58 1.10 7 7 14 5 5 0 0 0 0 10 0 38 10 0 0 0 0 0 10 5 0 5 21 0 1 1.897 63 0.96 8 8 5 0 0 0 5 0 0 5 14 0 0 0 0 5 0 10 5 10 0 43 21 0 2 1.814 61 1.05 9 9 5 5 0 5 37 0 16 0 5 0 5 0 0 5 0 0 0 5 11 0 19 2 2 1.981 67 0.96 10 10 6 0 0 0 0 0 6 6 17 0 6 6 0 17 6 6 17 6 6 0 18 3 0 2.341 81 0.85 11 11 6 17 6 6 0 0 0 11 33 11 0 6 0 0 0 0 0 6 0 0 18 3 0 1.956 68 0.96 12 12 5 0 5 0 5 0 0 0 0 5 0 10 0 5 50 5 0 10 0 0 20 1 1 1.706 57 0.97 13 13 10 50 0 0 10 0 0 0 5 0 0 0 0 10 5 5 0 0 0 5 20 1 1 1.636 55 1.03 14 14 5 5 5 0 0 5 0 0 5 0 20 30 5 0 0 5 0 5 5 5 20 1 1 2.181 73 0.88 15 15 0 5 0 0 0 0 0 5 20 0 0 0 0 10 25 10 5 5 5 10 20 1 1 2.108 70 0.91 16 16 0 5 5 0 0 0 5 5 0 0 26 5 0 0 0 5 11 5 11 16 19 2 0 2.202 75 0.88 17 17 0 5 5 0 0 0 0 11 0 0 16 5 5 0 11 11 5 0 26 0 19 2 2 2.129 72 0.87 18 18 0 0 0 0 0 0 15 0 10 0 55 0 0 0 0 0 5 15 0 0 20 1 1 1.278 43 1.03 19 19 5 0 0 0 0 0 0 10 0 5 10 10 0 0 10 35 0 10 5 0 20 1 2 1.968 66 0.97 20 20 6 6 6 11 0 0 0 0 11 0 6 6 22 6 0 0 0 0 11 11 18 3 1 2.274 79 0.78 21 21 5 5 0 0 0 0 0 5 5 0 0 5 0 5 20 5 30 5 5 5 20 1 3 2.181 73 0.94 22 22 0 5 0 0 0 5 0 60 5 5 5 5 0 0 5 0 0 0 0 5 20 1 1 1.505 50 1.10 23 23 5 0 10 5 0 0 5 0 5 0 5 5 0 0 14 10 19 10 5 5 21 0 1 2.425 81 0.83 24 24 14 57 10 5 5 0 0 0 5 0 0 0 0 0 0 0 5 0 0 0 21 0 0 1.402 47 1.34 25 25 6 0 6 0 0 0 0 6 11 0 6 6 0 0 6 6 6 39 0 6 18 3 2 2.057 71 0.99 26 26 30 5 25 5 0 0 5 0 10 0 5 0 5 0 10 0 0 0 0 0 20 1 1 1.917 64 1.01 27 27 0 5 0 0 15 0 40 0 5 0 20 0 0 10 0 0 0 5 0 0 20 1 3 1.653 55 0.97 28 28 15 40 0 5 5 5 10 0 0 5 0 10 0 0 5 0 0 0 0 0 20 1 3 1.861 62 1.04 29 29 10 5 0 0 0 0 0 10 0 0 15 0 0 0 15 20 15 10 0 0 20 1 2 2.016 67 0.89 30 30 0 0 0 0 0 0 0 5 0 0 0 0 0 5 0 16 5 11 5 53 19 2 2 1.486 50 1.28 31 31 0 0 0 0 6 0 0 39 6 0 11 17 0 0 6 0 11 0 0 6 18 3 0 1.796 62 0.99 32 32 0 10 5 0 25 30 15 0 10 0 0 0 0 0 5 0 0 0 0 0 20 1 1 1.752 58 1.19 33 33 5 0 10 5 10 0 0 10 0 5 0 5 0 10 14 10 0 10 5 5 21 0 0 2.491 83 0.78 34 34 5 30 5 5 10 0 10 5 0 0 0 20 5 0 0 0 0 0 0 5 20 1 1 2.042 68 0.93 35 35 32 11 0 0 16 0 5 5 5 5 0 0 0 0 0 5 5 0 5 5 19 2 1 2.132 72 0.84 36 36 0 0 6 6 0 0 0 0 6 0 6 6 53 6 0 0 0 0 12 0 17 4 0 1.588 56 1.00 37 37 0 0 0 0 0 0 0 6 22 0 17 11 0 6 0 0 6 6 22 6 18 3 3 2.014 70 1.01 38 38 11 0 0 0 0 0 0 0 5 5 5 5 0 5 5 0 32 5 5 16 19 2 1 2.132 72 0.97 39 39 0 10 5 0 0 5 5 5 5 5 35 10 0 0 0 0 0 5 10 0 20 1 3 2.107 70 0.87 40 40 5 0 10 0 30 35 5 0 0 0 0 0 0 5 5 0 5 0 0 0 20 1 1 1.708 57 1.15 41 41 11 0 0 6 0 0 0 39 6 6 6 6 6 0 0 0 0 0 0 17 18 3 1 1.874 65 1.03 42 42 5 11 0 0 0 0 0 0 0 0 5 11 5 0 26 21 5 5 0 5 19 2 2 2.083 71 0.86 43 43 0 5 0 0 0 0 0 5 10 5 30 5 0 0 5 0 5 10 20 0 20 1 1 2.042 68 0.96 44 44 5 0 11 0 5 0 0 5 11 0 21 5 0 0 5 16 0 5 11 0 19 2 0 2.260 77 0.84 45 45 0 0 0 0 5 0 0 5 5 11 11 5 0 0 0 53 0 0 5 0 19 2 4 1.587 54 1.03 46 46 0 11 11 0 0 0 0 11 5 0 0 0 0 0 0 16 32 5 5 5 19 2 0 1.986 67 0.91 47 47 0 17 17 0 22 17 17 0 0 0 0 6 0 0 0 0 0 0 0 6 18 3 2 1.850 64 1.12 48 48 5 0 0 0 0 0 0 5 5 0 5 5 0 5 5 10 5 25 10 15 20 1 1 2.290 76 1.00 49 49 0 5 0 0 0 0 0 0 5 0 5 5 0 14 0 5 5 10 0 48 21 0 3 1.725 58 1.12 50 50 5 11 0 0 0 0 0 5 16 47 5 0 0 0 5 0 5 0 0 0 19 2 3 1.657 56 1.04 51 51 5 5 0 0 0 0 5 0 10 0 33 19 0 0 0 0 10 5 5 5 21 0 2 2.000 67 0.91 52 52 0 5 5 0 0 5 0 5 0 5 0 0 0 0 10 0 38 5 19 5 21 0 4 1.922 64 0.98 53 53 0 5 0 5 5 0 0 10 48 5 5 0 0 0 10 0 0 0 0 10 21 0 1 1.750 58 0.99 54 54 5 5 0 0 0 0 0 5 10 0 33 0 0 5 5 0 10 5 10 10 21 0 0 2.132 71 0.94 55 55 10 5 5 5 0 0 0 0 5 14 0 5 0 0 0 33 5 5 10 0 21 0 1 2.107 70 0.90 56 56 45 20 15 5 0 0 0 0 0 0 0 5 0 0 0 0 0 5 5 0 20 1 1 1.565 52 1.23 57 57 0 6 24 6 6 12 0 0 0 0 0 0 41 0 6 0 0 0 0 0 17 4 1 1.624 57 0.85 58 58 11 11 0 0 0 0 0 6 0 0 11 6 0 6 6 0 33 0 0 11 18 3 1 1.985 69 0.88 59 59 0 0 0 0 0 0 0 10 10 0 5 0 5 10 30 5 5 5 5 10 20 1 1 2.181 73 0.91 60 60 10 40 10 0 20 0 5 0 0 5 0 0 0 0 0 0 0 0 5 5 20 1 3 1.748 58 1.13 61 61 0 0 0 0 0 0 0 5 5 0 10 10 0 0 5 5 10 10 30 10 20 1 1 2.112 70 1.03 62 62 0 0 6 6 0 6 0 12 12 0 6 6 24 0 0 6 0 12 6 0 17 4 1 2.262 80 0.78 63 63 0 0 0 0 0 0 0 40 0 5 15 5 0 0 10 10 0 10 5 0 20 1 0 1.791 60 1.04 64 64 30 20 0 0 10 0 0 0 10 5 5 0 0 0 0 0 10 0 10 0 20 1 3 1.904 64 0.91 65 65 0 0 5 0 0 0 0 5 19 33 5 0 5 5 0 0 5 10 0 10 21 0 1 2.000 67 1.00 66 66 10 60 0 0 0 5 0 0 0 5 5 5 0 0 0 5 0 5 0 0 20 1 1 1.435 48 1.09 67 67 0 5 0 0 0 0 0 5 24 0 43 10 0 0 5 0 0 0 10 0 21 0 4 1.588 53 1.08 68 68 10 60 0 5 0 0 10 0 5 0 5 0 0 0 0 0 5 0 0 0 20 1 1 1.366 46 1.20 69 69 0 0 0 0 0 0 0 58 5 0 0 11 0 0 0 0 5 11 11 0 19 2 1 1.337 45 1.25 70 70 0 5 0 0 0 0 0 0 26 11 16 5 5 0 0 0 0 11 5 16 19 2 1 2.028 69 0.96 71 71 6 11 17 6 33 0 6 6 0 0 0 0 0 6 11 0 0 0 0 0 18 3 3 1.956 68 1.00 72 72 11 47 5 5 16 0 0 0 5 0 0 0 0 5 5 0 0 0 0 0 19 2 1 1.657 56 1.21 73 73 42 26 0 0 0 0 0 5 5 0 5 0 0 0 0 11 0 0 0 5 19 2 2 1.572 53 1.03 74 74 0 0 5 10 0 0 0 5 10 0 25 30 0 5 5 0 0 0 5 0 20 1 0 1.917 64 0.94 75 75 5 0 5 5 5 0 50 5 0 5 0 0 5 0 10 0 0 0 5 0 20 1 2 1.775 59 0.93 76 76 0 5 5 5 5 0 0 0 5 0 10 48 0 0 0 0 5 10 5 0 21 0 3 1.816 61 0.99 77 77 0 0 5 0 5 0 0 5 5 45 0 5 0 0 5 0 5 15 0 5 20 1 1 1.842 61 0.97 78 78 0 0 0 0 5 0 0 5 5 0 0 0 0 0 5 5 29 29 5 14 21 0 1 1.864 62 1.10 79 79 0 5 0 0 0 0 0 5 10 10 19 19 0 5 5 10 10 0 0 5 21 0 1 2.252 75 0.90 80 80 5 5 0 0 10 0 5 0 0 5 33 5 0 0 5 0 5 0 14 10 21 0 2 2.107 70 0.83 81 81 5 16 26 0 11 0 0 0 11 0 11 11 0 5 0 0 0 0 5 0 19 2 1 2.056 70 0.92 82 82 5 16 42 11 5 0 0 0 0 5 0 5 0 0 0 0 0 5 0 5 19 2 1 1.822 62 1.10 83 83 6 0 6 0 0 6 0 6 6 0 6 11 39 6 0 0 0 0 6 6 18 3 0 2.057 71 0.85 84 84 6 6 6 6 0 24 24 0 0 0 6 12 0 12 0 0 0 0 0 0 17 4 4 2.018 71 0.86 85 85 0 0 0 0 0 0 0 40 10 0 5 5 5 0 5 10 0 5 10 5 20 1 0 1.956 65 1.04 86 86 10 24 0 0 0 0 0 0 14 0 14 0 0 10 5 0 19 0 5 0 21 0 1 1.951 65 0.84 87 87 15 45 5 10 0 0 0 0 10 0 0 5 0 0 0 5 5 0 0 0 20 1 0 1.704 57 1.14 88 88 0 0 0 0 0 0 0 60 0 0 0 0 0 0 0 5 10 10 0 15 20 1 0 1.201 40 1.30 89 89 0 0 5 0 5 10 0 0 14 0 24 14 0 5 5 0 0 0 19 0 21 0 4 2.017 67 0.87 90 90 0 15 0 5 40 5 5 0 5 0 0 5 0 5 5 5 0 5 0 0 20 1 2 1.999 67 1.00 91 91 0 0 0 0 5 5 0 0 10 5 43 10 0 0 0 5 0 5 10 5 21 0 1 1.905 64 0.99 92 92 5 5 0 0 0 0 0 0 0 0 16 11 0 0 0 0 0 5 42 16 19 2 1 1.649 56 1.05 93 93 0 0 5 0 5 0 0 11 5 0 0 0 58 0 11 0 0 5 0 0 19 2 2 1.410 48 1.02 94 94 5 10 5 0 0 0 0 5 5 5 29 5 0 0 5 5 0 5 10 10 21 0 2 2.335 78 0.87 95 95 0 10 5 0 14 0 0 0 0 5 0 5 0 29 10 10 14 0 0 0 21 0 2 2.021 67 0.86 96 96 14 0 0 0 0 0 0 0 10 0 29 14 0 5 5 0 0 14 10 0 21 0 2 1.930 64 0.92 97 97 5 14 5 0 0 5 0 0 0 0 0 5 0 5 24 14 24 0 0 0 21 0 2 1.964 66 0.92 98 98 0 0 0 0 5 0 0 0 5 0 30 0 5 0 5 10 0 5 10 25 20 1 0 1.917 64 0.95 99 99 0 0 0 6 0 0 0 0 6 11 0 0 0 0 0 0 0 22 11 44 18 3 1 1.504 52 1.18 100 100 12 24 0 18 24 0 0 6 6 6 6 0 0 0 0 0 0 0 0 0 17 4 1 1.905 67 1.04 101 101 0 0 0 6 0 0 12 12 12 0 24 6 18 0 12 0 0 0 0 0 17 4 1 1.987 70 0.87 102 102 0 6 0 0 6 0 0 0 11 6 6 6 0 28 6 0 11 0 6 11 18 3 0 2.212 77 0.87 103 103 5 0 5 0 5 0 0 5 5 0 25 0 0 5 5 10 5 20 5 0 20 1 0 2.247 75 0.86 104 104 11 32 5 5 11 0 0 0 0 0 11 11 5 0 0 5 5 0 0 0 19 2 0 2.087 71 0.93 105 105 11 6 0 0 0 0 0 50 6 0 6 6 6 0 0 0 0 0 6 6 18 0 1 1.715 59 1.09 106 106 11 50 6 11 11 0 0 0 0 0 6 0 0 0 0 0 0 0 0 6 18 0 0 1.561 54 1.27 107 107 7 0 0 0 7 0 0 0 13 13 13 40 0 0 0 0 0 7 0 0 15 0 0 1.714 63 0.99 108 108 0 0 8 0 0 0 0 0 0 0 8 8 54 0 0 0 8 0 8 8 13 0 0 1.517 59 0.98 109 109 17 42 17 0 8 0 0 0 0 0 0 8 0 0 0 0 8 0 0 0 12 0 0 1.583 64 1.17 110 110 0 0 0 0 0 0 0 0 11 22 0 11 0 0 0 0 0 33 11 11 9 0 0 1.677 76 1.02 ## INSERTION LIST AliNo IPOS JPOS Len Sequence 1 2 48 7 lLVVIGPCs 1 7 60 4 pVAAKe 1 25 82 4 eIVMRv 1 30 91 5 kPRTTVg 1 52 118 1 rIa 1 61 128 1 nDs 1 78 145 2 qYLa 2 60 49 3 lGFEn 2 66 58 7 lGRAAFGQg 2 73 72 5 lDEVQCt 2 90 93 1 lKs 3 47 146 3 fLTGe 3 49 151 6 sYAGIYIp 3 64 172 1 lQg 3 84 193 5 yFAYYHg 3 89 203 2 nRLw 3 92 208 5 sLQTHCc 4 8 54 8 dPVVQGVNGf 4 19 73 1 sFh 4 39 94 7 sGFASALTf 4 45 107 15 kVIVLPYIQNPEPAVYa 4 50 127 1 pRs 4 57 135 6 cLFTDFDs 5 12 402 5 rSEWANl 5 20 415 1 tEq 5 76 462 5 tSGKEIp 6 13 114 8 vMSEKKHGAd 6 41 149 3 gLLCr 6 45 156 5 kVHFFTq 6 53 169 11 pLEKERIQNAGGs 6 67 194 2 sRAl 6 71 200 13 fDYKCVHGKGPTEQl 6 77 219 5 pEVHDIe 6 80 227 6 sEEDDQFi 6 82 235 2 iLAc 6 83 238 3 cDGIw 6 94 252 2 dFVr 6 97 257 4 rLEVTd 6 100 264 3 lEKVc 7 4 48 11 wVAYISNGGGSTy 7 8 63 1 dTv 7 14 70 2 tISr 7 27 85 1 sRl 7 29 88 2 kSEd 7 38 99 8 rHGGYYAMDy 7 42 111 6 qGTTVTVs 7 51 126 2 sVYp 7 96 171 5 tFPAVLq 7 99 179 3 dLYTl 8 4 178 3 yAGQy 8 9 186 1 fAn 8 17 195 1 kFs 8 22 201 3 gDTYd 8 28 210 1 lIg 8 39 222 12 sLSYLPFAMEKKLi 8 60 255 15 lINSASTDEAAHFSYQe 8 93 294 4 cLNMYn 9 21 66 3 rKEMg 9 27 75 2 sSCt 9 90 133 9 fLRENECVSCs 10 6 44 1 dVv 10 25 64 2 aPFv 10 37 78 2 tSPe 10 45 88 7 kYGMPLLFk 10 52 102 12 qKRIELLNHYQAAa 10 55 117 7 kFNVDIANv 10 60 129 11 lMLFVNVNDNPTq 10 67 147 2 sIYl 10 76 157 1 tQa 10 80 162 3 sIDEi 10 105 190 5 nNKVDFl 11 15 45 5 qFADIAy 11 23 57 4 qPIPRv 11 28 65 3 mEEEk 11 35 75 12 vFKTLKSLYKTHAc 11 36 88 3 cYEYn 11 45 100 10 kYCGFHEDNIPq 11 58 123 2 tGFr 11 65 131 2 aGLl 11 67 135 4 sSRDFl 11 69 141 2 gGLa 11 73 147 13 vFHCTQYIRHGSKPm 11 89 175 3 hVPLf 12 5 516 7 yGNASVNPg 12 17 535 2 qNIs 12 21 541 1 qNg 12 27 547 3 yPAIv 12 34 557 1 lNv 12 37 561 7 sIYSDDGGs 12 40 571 2 wQTg 12 41 574 9 gSTLPIPFRWk 12 48 590 2 eTLe 12 72 613 8 lDFNQIVNGv 12 81 630 6 fLSKDGGi 12 89 644 5 aNNANVf 12 94 654 10 sTGTVDASITRf 13 19 463 1 rYv 13 29 474 6 kTDEAVRd 13 70 511 5 sVAQACh 13 75 521 5 mCIFTCt 13 91 542 3 tWVPn 13 92 546 2 nAPc 13 94 550 5 tMRLPPp 14 43 90 1 aFk 14 47 95 1 fAq 14 51 100 2 aLYr 14 71 122 6 fTMPEHKf 14 97 150 1 qIs 15 3 59 2 sKVw 15 9 67 3 fVKIq 15 18 79 5 sVIIVDk 15 30 94 3 eNPGg 15 32 99 11 yVAYSKAATVTGk 15 49 125 3 dLYTp 15 59 138 5 rAGKITf 15 64 148 2 vANa 15 71 157 1 iGv 15 79 166 9 tKFPIVNAELs 15 89 185 7 tGDPYTPGf 16 3 221 1 sGt 16 21 236 2 rFTg 16 52 268 5 nVALQVa 16 62 283 4 cTYGGg 16 67 292 2 sSPl 16 75 302 4 nKLCQs 16 84 315 3 yADTg 16 95 329 1 hMs 17 50 216 5 aVARIRe 17 52 223 2 qVGd 17 84 257 4 vFVDNg 18 50 267 18 rVGNLTVVAKEPDSILRVLs 18 76 310 3 tLSDp 18 96 333 1 sMr 19 64 88 4 vEDMSe 19 68 96 23 yLNEPAVFHNLRVRYNQDLIYTYSg 19 86 137 2 qEMv 19 101 152 1 aPh 20 26 126 4 vQTNEy 20 28 132 1 lMr 20 39 144 1 sPf 20 49 155 3 dLCMq 20 56 165 4 vWMADc //
================================================== MAXHOM-STRIP ===================================================== test sequence : /home/phd/server/work/predict_h11814.phdDssp list name : /home/phd/server/work/TOPITS_DB.list last name was : /data/dssp/1lst.dssp seq_length : 110 alignments : 2697 sort-mode : ZSCORE weights 1 : NO weights 2 : NO smin : -1.00 smax : 2.00 maplow : 0.00 maphigh : 0.00 epsilon : 0.00 gamma : 0.00 gap_open : 2 gap_elongation : 0.2 INDEL in sec-struc of SEQ 1: YES INDEL in sec-struc of SEQ 2: YES NBEST alignments : 1 secondary structure alignment: NO =================================================== SUMMARY =========================================================== IAL VAL LEN IDEL NDEL ZSCORE %IDEN STRHOM LEN2 RMS SIGMA NAME 1 42.73 105 27 9 2.74 0.31 0.41 339 -1.00 0.000 1qr7_A MOL_ID: 1; . 2 42.53 91 32 8 2.71 0.37 0.55 113 -1.00 0.000 1by2 MOL_ID: 1; . 3 39.07 105 28 10 2.25 0.27 0.36 452 -1.00 0.000 1ivy_A MOL_ID: 1; . 4 38.80 102 45 9 2.22 0.28 0.44 202 -1.00 0.000 1tcr_A MOL_ID: 1; . 5 37.67 93 23 8 2.07 0.32 0.25 618 -1.00 0.000 1qsa_A MOL_ID: 1; . 6 37.53 109 68 14 2.05 0.37 0.32 364 -1.00 0.000 1a6q MOL_ID: 1; . 7 37.47 105 43 12 2.04 0.34 0.43 444 -1.00 0.000 1igt_B MOL_ID: 1; . 8 36.40 95 49 11 1.90 0.36 0.41 483 -1.00 0.000 1ac5 MOL_ID: 1; . 9 36.33 99 23 8 1.89 0.30 0.42 160 -1.00 0.000 1ext_A MOL_ID: 1; . 10 36.27 107 54 12 1.88 0.33 0.36 228 -1.00 0.000 1nfp LUXF GENE PRODUCT (NONFLUORESCENT FLAVOPROTEIN) COMPLEXED . 11 36.27 105 69 17 1.88 0.40 0.28 308 -1.00 0.000 4pah MOL_ID: 1; . 12 35.80 103 67 16 1.82 0.40 0.54 757 -1.00 0.000 1kit MOL_ID: 1; . 13 35.47 98 38 12 1.77 0.37 0.25 650 -1.00 0.000 1lox MOL_ID: 1; . 14 35.40 93 24 11 1.77 0.31 0.30 289 -1.00 0.000 1d5y_A MOL_ID: 1; . 15 35.33 108 55 15 1.76 0.34 0.45 639 -1.00 0.000 1cx8_B MOL_ID: 1; . 16 35.27 102 27 10 1.75 0.32 0.39 446 -1.00 0.000 1be3_A MOL_ID: 1; . 17 35.20 99 16 7 1.74 0.23 0.38 502 -1.00 0.000 1gpm_A MOL_ID: 1; . 18 35.20 102 23 4 1.74 0.20 0.35 414 -1.00 0.000 1ajs_A MOL_ID: 1; . 19 35.00 104 40 10 1.71 0.29 0.40 746 -1.00 0.000 1lvk MOL_ID: 1; . 20 34.93 94 26 9 1.70 0.31 0.47 267 -1.00 0.000 1abr_B ABRIN-A COMPLEXED WITH TWO SUGAR CHAINS . ==================================== ALIGNMENTS =================================== 1 - 51 ....:....1....:....2....:....3....:....4....:....5 pred RLSWYDPDFQARLTRSNSKCQGQLEVYLKDGWHMVCSQSWGRSSKQWEDPS EEEEE EEEE EEEEEEE EEEEEE HHHHHH HH OBOBOOOOBOBBBOOOOBOBOBOBBBBBOOOBOBBBBOBBOOBOOOOOOOO LLLLLLLLHHHHHLLLLLLLLLHHHHHHLLLLLEELLLLLLLLLLLLLLLL RLS DP RL LEVY GW S Q D 1. 1qr7_A 42.73 RlsIHDpeYATRLLALREELKDELevYFEkgWKGLINDPHMDNSFQINDGL LELLLLHHHHHHHHHHHHHHLLLEELLLLLLLLLLLLLLLLLLLLLHHHHH D D RL QG E W VC W D 2. 1by2 42.53 DGDM..RLADGGATNQGRVEIFYRGQWGTVCDNLW........DLT LLLE..EEELLLLLLEEEEEEEELLEEEEEELLLL........LHH S D F A T Q E L D K E P 3. 1ivy_A 39.07 FSYSDDKFYA..TNDTEVAQSNFEA.LQDFFRLF..PEY.KNNKLfespT LLEELLLLLL..LEHHHHHHHHHHH.HHHHHHHL..HHH.LLLLEEELHH L Y DF A SNS V D C S G K DPS 4. 1tcr_A 38.80 LKYYSGdfEAEFSKSNSshLRKASVHWSDSAVYFCAVsfGSGTkaLKDps EEELLLLEEEEEELLLLEEEEELLLLHHHLEEEEEEEEELLLEEEEELLL L D ARL S SK Q QL Y Q W S 5. 1qsa_A 37.67 LMYWNLDNTArlVKSKSKtqAQLARY...AFN...NQWWDLS....VQAT HHHLLLHHHHHHHLLLLHHHHHHHHH...HHH...LLLHHHH....HHHH R D R RS S G L C S GR KQ PS 6. 1a6q 37.53 RTGFLEIDEHMRvdRSGSTAVGVL.ISPQHTYFINCGDSRgrNRkqDHKPS HHHHHHHHHHHHHLLLEELEEEEE.ELLLEEEEEEELLLEEELLELLLLLL RL WY PD R TR N K L LKD C WG SS PS 7. 1igt_B 37.47 RLEwyYPdvKGRFtrDNAKNTLYLQMslkdTAMYYCAryWGqsSAKTTAPs LEEEELLHHLLLEEEELLLLEEEEEELLLHLEEEEEEEEEELLLLLLELLE S Y P F A L SK G L L GW QS S 8. 1ac5 36.40 SyyIPFfnAILNHNksKIDgdLKALlgNGWIDPNTQsiDESNPNFKHLT ELHHHHHHHHHHHHHLLLLLEEEEEEEEELLLHHHHHLLLLLLLHHHHH S R S SKC GQ E D VC G Q 9. 1ext_A 36.33 SFTASENHLRHCLSCSKCrgQVEIstVD.RDTVC....GCRKNQYRH.Y EELLLLELLLLLEELLLLLLLEEEEELL.ELLEE....ELLLLEEEE.E S YD F L V L H K W D 10. 1nfp 36.27 SIYdvAFSEHHIDKSYNDETKLavSLGKQIHVLAteTVVKAAkkWDDSQ LLEEEEEELLLLLLELLLLLELLEEELLEEEEELLHHHHHHHHLLLLLH DP AR N GQ E Y K W VC KQ ED S 11. 4pah 36.27 HPGFKDPVYRARRKqyNYR.HGqvE.YmkKTWGTvcnHIFPLLEkqLEDVS LLLLLLHHHHHHHHHHHLL.LLLLL.LLHHHHHHHLHHHHHHHHHLHHHHH S Y P LTR S QG L Y D V S WG SS E PS 12. 1kit 35.80 MSLygPGHGITLTRQqsGSqgRL.IyvLDRFFlvMssNwgkSSSILeePS EEEELLLLLEELLLLLLLLLLLE.EEEELLLLEEEELLLEEELLEEELEE S Y D RL S QG Y KD QSW R 13. 1lox 35.47 SFYAQD.ALRLWEIISrvQGIMGLYYkdDLEL...QSWCREITEIGL.Q LHHHHH.HHHHHHHHHHHHHHHLLLLLLLHHH...HHHHHHHHLLLL.L RLS RLT L L Q R KQ P 14. 1d5y_A 35.40 RLSKSAVAL..RLTA.....RPILDIALQ..YRFDSQQTFTRakKQfqTPa HHHHHHHHH..HHLL.....LLHHHHHHH..LLLLLHHHHHHHHHHHLLHH LSW D FQ NSK G L VYL G V G K EDP 15. 1cx8_B 35.33 KLswRDQHfqVKDSAQNsk.NGRL.VYLVegykLVHA.NFG.TKKDFEdpV LLLLEEEEEEEELLLLLEL.LLLL.LLLLELLLEEEL.LLL.LLLLLLLLL LS YD D L S C G DG W P 16. 1be3_A 35.27 LstYDEDAVPTL....SPCrgSQICHREDGLPLA.HVAIAVEGPGWAHPD LLLLLLLLLLLL....LLLLLEEEEEEELLLLLE.EEEEELLLLLLLLLH Y F T LE D C W K D 17. 1gpm_A 35.20 RFYGVQFHPEVTHTRQGMR.MLERFVRD..ICQCEALW.TPAKIIDDae LEEEELELLLLLLLLLHHH.HHHHHHHL..LLLLLLLL.LHHHHHHHHH D Q K G C QS S 18. 1ajs_A 35.20 FPFFDSAYQGFASGNLEKDAWAIRYFVSEGFELFCAQSFSKNFGLYNErs EEEEEELLLLLLLLLHHHHLHHHHHHHHLLLLEEEEEELLLLLLLHHHLH R WY PD R C G D S R K D 19. 1lvk 35.00 RYIWYNPD...PKERDSYEC.GEIVSETSDSFTFKTSDGQDRQVKK.DDAN LEEEELLL...LLLLLLLEE.EEEEEELLLEEEEELLLLLEEEEEH.HHLE P L S G L VYL GW S S D 20. 1abr_B 34.93 TIINPKSALVLSAESSSMGGTLTvylrQGWRTGNNTsfVTSISGYSdqA LEEELLLLEEEELLLLLLLLELEELLHHLLEELLLLLEEELEELHHHEE ================================== ALIGNMENTS ================================== 51 - 101 ....:....1....:....2....:....3....:....4....:....5 pred SQASKVCQRLNCGVPLSLGPFLVTYTPQSSIICYGQLGSFSNCSHSRNDMC HHHHHHHHH EEE EEEEEE EE OOBOOBBOOBOBOOBBOBOOBOOOOOOOBBBBBOBOOOBBBBBOOOOOOOB LLHHHHHHHHLLLLLLLLLLHHHLLLLLLLHEHHLLHLLHLLLLLLLLHHH A K N G P G FL TPQ G G S 1. 1qr7_A 42.73 LraRKLLLDInsGLP.AAGEFLDMITPqaDLMSWGAIGARTTESQVHREL. HHHHHHHHHHHLLLL.EEEELLLLLLHHHHHLLEEEELHHHLLLHHHHHH. AS VC LN L GP T T S C LG SNC H R 2. 1by2 42.53 TDASVVCRAlnATQAlgSGPIMltGTEASLADC.KSLGWlsNCRHERDA.. HHHHHHHHHLLEEEEELLLLEEELLLLLLHHHL.ELLLLLLLLLHHHEL.. A V Q L G L Y YG LG S CS 3. 1ivy_A 39.07 TLAVLVMQDPSMNlgLAVGNGLSSYEQNDNSLVygLLGnwSscSQNKCNFY HHHHHHLLLLLLLEEEEEELLLLEHHHHHHHHHHLLLLHHHHEELLEELLL SQ S C N VP T S SN 4. 1tcr_A 38.80 sQDSTLcsQIN..VPKTMES..GTFITDATVLDMKAMDSKSNGAIAWSN.. LLLLEEELLLL..LLLLLLL..LEEELLLEEEELLLLLLEEEEEEEEEL.. A K L PL YTPQS S N 5. 1qsa_A 37.67 TIAGKLWDHLEERFPLAYNDLFKRYtpQSYAMAIARQESAWN.PKVKSPVG HHHLLLLLLHHHHLLLLLHHHHHHHHLHHHHHHHHHHHHLLL.LLLELLLL S S QR N SLG FLV P SIIC G C SR D C 6. 1a6q 37.53 SNpsVMIQRVNGSLAVslGDflVSPEpeRsiicwDVMGNEELCdrSrdDlc LLHLEELLEELLLELLLEELHLLELLLELLEELHLLLLHHHHHHHHHLLHH S VC G LG Y P GS S H D 7. 1igt_B 37.47 spLAPVC.GDTTGSSVTLGCLVKGYFPEPVTLTWNS.GSLSSGVHtqSdlS EEELLLL.LLLLLLLEEEEEEEEEEELLLLEEEEHH.HLELLLEEEELLEE A CQ L C L L L YT SS Q G C D 8. 1ac5 36.40 TNAHENCQNleCENILNL...LLSYTRESS.....QKGT.ADcnFNLKDSY HHHHHHHHHHHHHLHHHH...HHHHLLLLL.....LLLL.LLEEEEEEELL Q NC L G C G FSNC S C 9. 1ext_A 36.33 YWSENLFQCFNCSLCLN.GTVHLSCQEKQNTVCTCHAGFfsNCKKSLE..C EEELLEEEEEELLLLLL.EEEEELLELLELLEEEELLLEEHHLLLLLL..L QA KV RL LSL L YT SII G F C H M 10. 1nfp 36.27 QqaAkvRHRlqAKAELslEDYL.SYtaETsiINSNAAGNFDTCLHHVAEMA HHHHHLLEEEHHHHHHHHHHHH.HHHLLLLHHHLLEEELHHHHHHHHHHHH SQ C RL V LSLG F V YTP IC LG FS S 11. 4pah 36.27 SQFLQTCtrLR.PValslgaFRvmYTPEPD.ICHELLGhfSDRSFAQ..FS HHHHHHHHEEE.ELLLLHHLLLELLLLLLL.HHHHHHHLHLLHHHHH..HH S A V L G L L LV Y P I L FSN S D 12. 1kit 35.80 SEADMV..ELQNG.DLLLTARlvNYSPRQQfiTWSLLEafSNIsfEQSDGS EEEEEE..ELLLL.LEEEEEEEEELLLEEEELLLEEEEELLLLLEELLLLL A K Q G P SL VT T Q S I GQL F NC D 13. 1lox 35.47 QGAQK..Q....GFPTSLQshFVTmtGQHSSIHLGQLDWFtnctpTTKDAT LLHHH..H....LLLLLLLLHHHHHLHHHHHHHLLHHHHLLHLLLLLLLLL S PL LG F VT I G S S CS D 14. 1d5y_A 35.40 arRSPEWSAFGIRPPLRLGEffVT.LEDTPLI..GVTQSYS.CSLEqsDFR HHHLLLEELLLLLLLLLLLLLEEE.ELLEEEE..EEEEELL.LLHHHHHHH S V R V SL Y Q S LG F H 15. 1cx8_B 35.33 VNGSIVIVrfAEKvaESLNAivLIYMDQtsFFGHAHLGtfPSFNHTQFPPS LLLLEEEEEHHHHHHHHLLLLEEEELLLLLELELLLLLLLLLLLLLLLLLL A CG LSL T Q ICYG LG C HS DM 16. 1be3_A 35.27 DnaNAIIGHYDcgAHLslASIAATnsFQTFNICygLLGAHFVCDhsIDDMM HHHHHHHLEEELHHHLLHHHHHHHLEEEEEEEELEEEEEEEEELLLHHHHH Q KV L GV S L C G L 17. 1gpm_A 35.20 eqdDKVILGLSGGVDSSVTAMLLHRAIGKNLTCvgLL.RLNEAEQVLDMFG HHLLEEEEELLLLHHHHHHHHHHHHHHHHHEEEELLL.LLLHHHHHHHHHL SQ K R P G V TP G SR 18. 1ajs_A 35.20 sQMQKI.VRVTWSNPPAQGARIVARtpELFHEWTGNVKTMADRILsrSELR HHHHHH.HHLLLLLLLLHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q GV LS G FLV P I Y Q F RN 19. 1lvk 35.00 NQRNPI..KFD.GveLSygLFLVAVNPFKRIPIYTqvDIFK..GRRRNEVa EELLLH..HHL.LLHLLLLLEEEEELLLLLLLLLLHHHHLL..LLLHHHLL Q S VC L YT SI NC S 20. 1abr_B 34.93 AQGSNvcDSNKKEQQWAL......YT.DGSI...RSVQNTNNCLTSKD... EELLEELLLLLHHHLEEE......LL.LLLE...EELLEEEEEEEELL... ================================== ALIGNMENTS ================================== 101 - 110 ....:....1....:....2....:....3....:....4....:....5 pred CHSLGLTCLE EEEE BOBBBBBBBO HHHHLLLHHH GL C 1. 1qr7_A 42.73 ..ASGLSC ..HLLLLL G C 2. 1by2 42.53 ....GVVCTN ....EEEELL L C 3. 1ivy_A 39.07 YDNKDLECVT LLLLLHHHHH TC 4. 1tcr_A 38.80 ..QTSFTCQD ..LLLLLHHH S GL 5. 1qsa_A 37.67 GAS.GL LLE.LL C TCL 6. 1a6q 37.53 cNEVVDTCL HHHHHHHHH S T 7. 1igt_B 37.47 SSSVTVT EEEEEEL S G 8. 1ac5 36.40 YPSCGM LLLLLL C L L E 9. 1ext_A 36.33 CTKLCLPQIE LHHHHLLLLL L L C E 10. 1nfp 36.27 AQGLnlFCFE HHHLLEEELL GL L 11. 4pah 36.27 SQEIGLASL HHHHHHHHL H L T 12. 1kit 35.80 SHFLLFT LEEEEEE L 13. 1lox 35.47 TLETVMATLP LHHHHHHHLL H 14. 1d5y_A 35.40 RHEM HHHH S GL 15. 1cx8_B 35.33 SRSSGLPNIP LLLLLLLLLL L 16. 1be3_A 35.27 MFVL HHHH GL 17. 1gpm_A 35.20 GDHFGL LLLLLL L 18. 1ajs_A 35.20 RARL HHHH H 19. 1lvk 35.00 ahIFAISDVA LLHHHHHHHH H G T L 20. 1abr_B 34.93 .HKQGSTIL .LLLLLEEE 3 71 36
# TOPITS (Threading One-D Predictions Into Three-D Structures) # -------------------------------------------------------------------------------- # FORMAT begin # FORMAT general: - lines starting with hashes contain comments or PARAMETERS # FORMAT general: - columns are delimited by tabs # FORMAT general: - the data are given in BLOCKS, each introduced by a line # FORMAT general: beginning with a hash and a keyword # FORMAT parameters: '# PARA:tab keyword =tab value tab (further-information)' # FORMAT notation: '# NOTATION:tab keyword tab explanation' # FORMAT info: '# INFO:tab text' # FORMAT blocks 0: '# BLOCK keyword' # FORMAT blocks 1: column names (tab delimited) # FORMAT blocks n>1: column data (tab delimited) # FORMAT file end: '//' marks the end of a complete file # FORMAT end # -------------------------------------------------------------------------------- # PARA begin # PARA TOPITS HEADER: PARAMETERS # PARA: len1 = 110 # PARA: nali = 2697 # PARA: listName = /home/phd/server/work/TOPITS_DB.list # PARA: sortMode = ZSCORE # PARA: weight1 = NO # PARA: weight2 = NO # PARA: smin = -1.00 # PARA: smax = 2.00 # PARA: gapOpen = 2 # PARA: gapElon = 0.2 # PARA: indel1 = YES # PARA: indel2 = YES # PARA: threshold = ALL # PARA: str:seq = 50 (i.e. str= 50%, seq= 50%) # PARA end # -------------------------------------------------------------------------------- # NOTATION begin # NOTATION TOPITS HEADER: ABBREVIATIONS PARAMETERS # NOTATION: len1 : length of search sequence, i.e., your protein # NOTATION: nali : number of alignments in file # NOTATION: listName : fold library used for threading # NOTATION: sortMode : mode of ranking the hits # NOTATION: weight1 : YES if guide sequence weighted by residue conservation # NOTATION: weight2 : YES if aligned sequence weighted by residue conservation # NOTATION: smin : minimal value of alignment metric # NOTATION: smax : maximal value of alignment metric # NOTATION: gapOpen : gap open penalty # NOTATION: gapElon : gap elongation penalty # NOTATION: indel1 : YES if insertions in sec str regions allowed for guide seq # NOTATION: indel2 : YES if insertions in sec str regions allowed for aligned seq # NOTATION: threshold : hits above this threshold included (ALL means no threshold) # NOTATION: str:seq : weight structure:sequence # NOTATION TOPITS HEADER: ABBREVIATIONS SUMMARY # NOTATION: id2 : PDB identifier of aligned structure (1pdbC -> C = chain id) # NOTATION: pide : percentage of pairwise sequence identity # NOTATION: lali : length of alignment # NOTATION: ngap : number of insertions # NOTATION: lgap : number of residues inserted # NOTATION: len2 : length of aligned protein structure # NOTATION: Eali : alignment score # NOTATION: Zali : alignment zcore; note: hits with z>3 more reliable # NOTATION: strh : secondary str identity between guide and aligned protein # NOTATION: ifir : position of first residue of search sequence # NOTATION: ilas : position of last residue of search sequence # NOTATION: jfir : pos of first res of remote homologue (e.g. DSSP number) # NOTATION: jlas : pos of last res of remote homologue (e.g. DSSP number) # NOTATION: name : name of aligned protein structure # NOTATION end # -------------------------------------------------------------------------------- # INFO begin # INFO TOPITS HEADER: ACCURACY # INFO: Tested on 80 proteins, TOPITS found the correct remote homologue in about # INFO: 30%of the cases. Detection accuracy was higher for higher z-scores: # INFO: ZALI>0 => 1st hit correct in 33% of cases # INFO: ZALI>3 => 1st hit correct in 50% of cases # INFO: ZALI>3.5 => 1st hit correct in 60% of cases # INFO end # -------------------------------------------------------------------------------- # BLOCK TOPITS HEADER: SUMMARY rank id2 pide lali ngap lgap len2 Eali Zali strh ifir ilas jfir jlas name 1 1qr7A 31 105 9 27 339 42.73 2.74 41 1 108 46 174 1qr7_A MOL_ID: 1; . 2 1by2 37 91 8 32 113 42.53 2.71 55 6 110 4 108 1by2 MOL_ID: 1; . 3 1ivyA 27 105 10 28 452 39.07 2.25 36 2 110 106 230 1ivy_A MOL_ID: 1; . 4 1tcrA 28 102 9 45 202 38.80 2.22 44 2 110 47 186 1tcr_A MOL_ID: 1; . 5 1qsaA 32 93 8 23 618 37.67 2.07 25 2 106 391 494 1qsa_A MOL_ID: 1; . 6 1a6q 37 109 14 68 364 37.53 2.05 32 1 109 101 275 1a6q MOL_ID: 1; . 7 1igtB 34 105 12 43 444 37.47 2.04 43 1 107 44 189 1igt_B MOL_ID: 1; . 8 1ac5 36 95 11 49 483 36.40 1.90 41 3 106 176 310 1ac5 MOL_ID: 1; . 9 1extA 30 99 8 23 160 36.33 1.89 42 3 110 47 159 1ext_A MOL_ID: 1; . 10 1nfp 33 107 12 54 228 36.27 1.88 36 3 110 40 199 1nfp LUXF GENE PRODUCT (NONFLUORESCENT FLAVOPROTEIN) COMPLEXED . 11 4pah 40 105 17 69 308 36.27 1.88 28 1 109 30 195 4pah MOL_ID: 1; . 12 1kit 40 103 16 67 757 35.80 1.82 54 2 107 512 676 1kit MOL_ID: 1; . 13 1lox 37 98 12 38 650 35.47 1.77 25 3 110 447 570 1lox MOL_ID: 1; . 14 1d5yA 31 93 11 24 289 35.40 1.77 30 1 104 56 157 1d5y_A MOL_ID: 1; . 15 1cx8B 34 108 15 55 639 35.33 1.76 45 1 110 56 212 1cx8_B MOL_ID: 1; . 16 1be3A 32 102 10 27 446 35.27 1.75 39 2 104 219 338 1be3_A MOL_ID: 1; . 17 1gpmA 23 99 7 16 502 35.20 1.74 38 3 106 172 281 1gpm_A MOL_ID: 1; . 18 1ajsA 20 102 4 23 414 35.20 1.74 35 2 104 218 341 1ajs_A MOL_ID: 1; . 19 1lvk 29 104 10 40 746 35.00 1.71 40 1 110 32 161 1lvk MOL_ID: 1; . 20 1abrB 31 94 9 26 267 34.93 1.70 47 3 109 102 208 1abr_B ABRIN-A COMPLEXED WITH TWO SUGAR CHAINS . //
END of results for file predict_h11814
Links: | TOP |
|