Chromosome 7

CaJ7.0066(CaO19.7067)

InitTermntaa
CDS 841778745232761093

< < <> > >

SPTrEMBL

Acc#GeneLocusDBOrganismProductScoreE-valueIdentitiesPositivesGaps
P89105CTR9CTR9_YEASTSPSaccharomyces cerevisiae (Baker's yeast)CTR9 protein.524e-147332/1029 (32%)541/1029 (52%)40/1029 (3%)
O14409CDP1TrEMBLSaccharomyces cerevisiae (Baker's yeast)CDP1P.524e-147332/1029 (32%)536/1029 (52%)40/1029 (3%)
O42668TPR1TPR1_SCHPOSPSchizosaccharomyces pombe (Fission yeast)Tetratricopeptide repeat protein 1.3477e-94264/994 (26%)456/994 (45%)16/994 (1%)
Q15015KIAA0155TrEMBLHomo sapiens (Human)Hypothetical protein KIAA0155.1915e-47206/860 (23%)363/860 (42%)10/860 (1%)
Q62018SH2BP1TrEMBLMus musculus (Mouse)Phosphoprotein.1914e-47206/860 (23%)363/860 (42%)10/860 (1%)
Q8C9Y3SH2BP1TrEMBLMus musculus (Mouse)TPR-containing (Fragment).1854e-45199/821 (24%)344/821 (41%)10/821 (1%)
Q8T5I730E5.3TrEMBLAnopheles gambiae (African malaria mosquito)Putative TPR-containing phosphoprotein.1822e-44200/867 (23%)354/867 (40%)17/867 (1%)
Q9W0H4CG2469TrEMBLDrosophila melanogaster (Fruit fly)CG2469 protein (LD24034p).1761e-42207/888 (23%)370/888 (41%)23/888 (2%)
Q8BRD1
TrEMBLMus musculus (Mouse)TPR-containing.1722e-41187/757 (24%)317/757 (41%)10/757 (1%)
Q03560B0464.2YKD1_CAEELSPCaenorhabditis elegansHypothetical protein B0464.2 in chromosome III.1592e-37168/783 (21%)320/783 (40%)27/783 (3%)
Q8VYL2AT2G06210TrEMBLArabidopsis thaliana (Mouse-ear cress)Putative TPR repeat nuclear phosphoprotein.1516e-35184/779 (23%)313/779 (40%)38/779 (4%)
Q8BND9SH2BP1TrEMBLMus musculus (Mouse)TPR-containing (Fragment).1322e-29149/577 (25%)246/577 (42%)8/577 (1%)
Q8S8H1AT2G06210TrEMBLArabidopsis thaliana (Mouse-ear cress)Putative TPR repeat nuclear phosphoprotein.1262e-27192/917 (20%)347/917 (37%)70/917 (7%)
Q8H9E4
TrEMBLOryza sativa (japonica cultivar-group)Putative TPR-containing nuclear phosphoprotein.1254e-27196/921 (21%)352/921 (38%)60/921 (6%)
Q9W1T9CG9899TrEMBLDrosophila melanogaster (Fruit fly)CG9899 protein.98.65e-19142/662 (21%)256/662 (38%)6/662 (0%)
O96549DG1071TrEMBLDictyostelium discoideum (Slime mold)Developmental protein DG1071 (Fragment).79.72e-1385/323 (26%)148/323 (45%)9/323 (2%)
Q8TJS3MA3704TrEMBLMethanosarcina acetivoransTPR-domain containing protein.64.31e-08143/663 (21%)267/663 (40%)33/663 (4%)
Q61371TG737TG37_MOUSESPMus musculus (Mouse)Recessive polycystic kidney disease protein Tg737 (TgN(Imorpk)737Rpw).61.28e-0886/398 (21%)159/398 (39%)24/398 (6%)
Q921J5TGN737RPWTrEMBLMus musculus (Mouse)Similar to transgene insert site 737, insertional mutation, polycystic kidney disease.61.28e-0886/398 (21%)159/398 (39%)24/398 (6%)
Q8C9W7SH2BP1TrEMBLMus musculus (Mouse)TPR-containing (Fragment).61.28e-0859/198 (29%)84/198 (42%)5/198 (2%)
Q9AHL1LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHK6LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
O51228BB0210TrEMBLBorrelia burgdorferi (Lyme disease spirochete)Surface-located membrane protein 1 (LMP1).60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHK7LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHK9LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHL2LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHK4LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.60.12e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHL0LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.59.72e-0798/451 (21%)173/451 (38%)20/451 (4%)
Q9AHK8LMP1TrEMBLBorrelia burgdorferi (Lyme disease spirochete)LMP1.58.94e-0798/451 (21%)172/451 (38%)20/451 (4%)
Q8N719
TrEMBLHomo sapiens (Human)Probe hTg737 (Polycystic kidney disease, autosomal recessive, in).57.89e-0751/212 (24%)89/212 (41%)15/212 (7%)

Pfam

Scores for sequence family classification (score includes all domains):

Model       Description                                 Score    E-value  N 

--------    -----------                                 -----    ------- ---

TPR         TPR Domain                                   99.4    7.2e-26  11

UPF0150     Uncharacterised protein family (UPF0150)     -9.2        9.7   1

FliS        Flagellar protein FliS                      -32.0          1   1

Vicilin_N   Vicilin N terminal region                   -78.0        7.1   1

Myosin_tail Myosin tail                                -613.7       0.36   1



Parsed for domains:

Model       Domain  seq-f seq-t    hmm-f hmm-t      score  E-value

--------    ------- ----- -----    ----- -----      -----  -------

TPR           1/11    112   145 ..     1    34 []     4.1       11

TPR           2/11    200   233 ..     1    34 []    10.7      1.9

TPR           3/11    234   268 ..     1    34 []     3.7       13

TPR           4/11    270   303 ..     1    34 []    23.5   0.0051

TPR           5/11    404   437 ..     1    34 []    17.7     0.28

TPR           6/11    438   471 ..     1    34 []     0.0       36

TPR           7/11    526   559 ..     1    34 []     9.7      2.5

TPR           8/11    565   599 ..     1    34 []    13.9     0.78

UPF0150       1/1     637   699 ..     1    68 []    -9.2      9.7

TPR           9/11    738   771 ..     1    34 []     2.8       17

TPR          10/11    805   838 ..     1    34 []    17.6     0.29

TPR          11/11    841   874 ..     1    34 []     5.9        7

FliS          1/1     863   989 ..     1   133 []   -32.0        1

Vicilin_N     1/1     934  1049 ..     1   183 []   -78.0      7.1

Myosin_tail   1/1     374  1110 ..     1   864 []  -613.7     0.36




CDS Sequence

ATGCAAGAAC CAACTGACAT CTCGTATTAC ATTGGGAAAG AATCTGCCGA ATCTTTAAAT 
TTGTTGGATG TTCCGGTTGG AGGTGGACAA ATTGTTTCCA TCGATTTGCG AAATGAATTA 
TCGGATGACC CATCTGAATT AATTCAGTTT TTAACTGATC AACAAACTGA GAAACAGTAC 
TGGATTATAG CAGCAAGTGG GTATGCAAAA TTGGGGAAAT TGAAAGAGCT GTTGGAATTC 
ATCAATGCAG CTTTAAAATT GGATTATTTT AATGAAAATG ATAAAAAGTC CTTTGAAAGT 
TTCATTATTT GGCTCTTAGT GAAAAATGTT TATTTGGGGA TTGACAAGGA TAACAACTTG 
AATCTTGCTA AAAAAGAAAT ATCAAAATTA AATTTTAAAA TCCAAACTGA CAGTGAAACA 
AGTACACTGA TTAGTACATC AAATTTACTA AGTCTGGCAA TTTTGTACTT GTACGAATCG 
AAAGATGATG ATGCCATAGA CATATTTGAC CGTATTTTAA GAATTGATCC AAACAACTGT 
TTTGCATTAT TAGGGAAAGC TCAATCAGTA TTGAATAAAA CGAAAAACTA TTCGCATGCA 
TTGAAATTGT ATCAGCAAGT TTTGATTTTG AATCCGTTAA TGAAGCCAGA CCCAAGATTA 
GGTATTGGAT TGTGTTTTTG GTTTTTAAAA GACGATAAAA TGGCAATACA GGCATGGGAA 
AGAAGCTTAC AATTAGACCC TACCAATGTG AAACTGAGAA TATTCTTAAA CCTAGCAAAA 
TTTCATACCA CATTTACTAA TTCATTGAGT GATGAAGAAT TTTTGGACAA CTATAAAAAT 
TGTTTGCAAG AGTTAAGCAA GCTCAAGAGT TTGAATGCTA ATGATACTAC AGTGACTTTA 
GCATTGTGTT CATATTTATT CTCTAAAGGT GACTACAACA CAGTGATAAA GATTGTGGAA 
AAAATTGTCA AGGGTATCAC AGGATCTGAC AATCTCAAAA AGTTTAGTAC ATTTTCCAGA 
ATCACCAAAT ATGAATCGAA TGCCTTATCT CAGTGTGCTA CATGGTTAGC AAGAATAGAG 
TTTGCAAGAG GGAATTTTAC ACAATCTTCA AAGTACTTTC AAGAAGCGAT AAAGTTAAAT 
GAAACAAATA TTGTAGCCAA ATTGGGATTG GGTCAGTCGC AATACAATCG TGGCTCAATT 
GAAGAGGCCA GCTTAACTTT TGAGAGTATT TTACGAAGCA ACGTCAAATG CCTTGAAGTT 
AACTATTCTT TAGGTGTCTT GTATTCCAAG CAAAACTCAA GAAGTAAAAA AGAGTTGGCT 
ATCCAGGTGT TAGAAAGGTA CATACGATTA TCAAACAACC GTGGCTTATC CTCGAATGAA 
GAAGAGTTTG TGTTGAACAA AGAACCAGTA GCTTTGAACG CATACTTGAT TTTAAGTCAA 
TTGTACGAGG CAAAGGGAGA TATGACCCAG GCATTAACAT ACTTGAACAA AGCTGTTGAA 
GCAAGGAGAC AAGTCGAAAA AGATGTTCCA TTGGAAGTCT ACAATAACAT CGGTGTTTTC 
CAGTTTACCA AACAAAACTA TGACAGTGCC CTTGAGAACT TTACGACTGC TTTAGGTAAA 
TTGGATGGCC GTGATTTCAA GTCACCAGAT GGTGATACGT TGGTTGATTT ACCACAGGAT 
TTGAGAACAT CTTTAACCTA CAATTTAGCT AGAACTAAAG AAATTTCGAA CCAAAAAGAT 
GCATTGGAGA CATATGAACA ATTACTTACA GAATGCCCTC ATTATTTCTC GGCTAAATTG 
AGAATTTTAT TTTTGAATTG TATAACTGAG GGGATAACCA AAGAGGAGAT TCGTGATGAG 
ATTGAACTGC TTCTAGATCT AAATGCATCT GATTTGGAAG TGAGGTCGTT TTATGGATGG 
TTCATCAAAA ACTTTGGTAA AAAGTTACAC ATGCCTTCAG ATGCCGATAC TAAATTACAG 
AAAGATACCT TAGTTGAGTT TGACCTGCAT GATTGCTACG CATTGATATC TTTAGCTAAC 
ATTTATTGTA TCATGGCCAG AGATACTAAG GGGGCAGACG AGAAAAAGAA GAAATATTAT 
CTCAGAGCCA TTGAGCTTTT CACCAAAGTG TTGTCCTTGG ACTCCAAAAA TGTTTACGCT 
GCACAAGGTT TGGCGATTAC TTATATTGAA AATAAGCAAT TAAACAAAGG TTTGGATATT 
TTGAGAAAAA TTAGAGATTC TTTAAACGAT ATTTCAGTTT ACTTAAATTT AGGTCACGTA 
TTGTGTGATT TAAAGCAATT TGGTAAGGCC ATAGAAAATT ATGAATTGGC TTTGACAAGA 
TACACTGATG GGAAAGATGC CAAGATTTTG TCGTTCTTAG GACGTGTATG GTACTTGAGA 
GGGAACGCTG AACTGAGTTT GCCGTACTTG AAGAAAGCCT TGGGGTATGC ACAAGCTGCA 
TTAGATGCAG CTAGATCTAC ATCAACTGCG GCACTTGCAT TCAACATTTC GTTTGTGCAG 
TTTCAGATTG CTGATTTTAT AACCAAACAG CCAGTTAATG AGAGAAATAT AGAGGATATT 
GAGAGTGCAA TTGAAGGTTT GAATAAAGCC ATAGATATTC TAACACAATT AGCATCGGAT 
GAAGAAAAAC ACCCACCTTA TCCAAGAGAA GAGTTGAGAG GTAGAGCAAA CTTGGGTACT 
AGTACCTTGT TGTCCAGATT AGCCAATGCG TTGGAAGAAA CTAAAGAAAA CAATGCTGAA 
ATAGAAGAAA AGATCCAAAA GGCTAAACAG ATCAGATTGG ATGAAGAACA AGCCAGGTTG 
AAGGAAGAAG AGGAAAGATT GAATAAGTTG AAGGAGAAAG AGTTGGAGAT GAGCAAACAA 
AGAATGCTCT TACAAGAACA AGCACAAAAA TGGGCTGAAG AAAACAGTGC TAGTGTTGGC 
GTCAGCGATA ATGAGGAAGA TGACGATAAA TTATTCGACG AAGAATCTGC TCAAAAAGAA 
AACAAGAGAA AGAAAGGAGG TAGTAGTAAG GGAAAGAAAG GCAAAGGCAG GAAAAAGAAA 
GGAAATATCA TTGATGATAG TGAAGAAGAG CCAGAGAAAA ATATAACTGA TGACTCTGAA 
GACGAAGCAA ACGGAAACTC TAACGGCAAA AGAAAAGCAG CTGATGATGC TGGTGGTAAG 
AAAAAGAAGA AACCACTTTC ATCAGAGTTT ATACAAGATA GTGAAGAAGA GTTGGAAGAC 
GATGACTTGT TTGGTGATAA TGATGATGAT GAGTAG

AA Sequence

MQEPTDISYY IGKESAESLN LLDVPVGGGQ IVSIDLRNEL SDDPSELIQF LTDQQTEKQY 
WIIAASGYAK LGKLKELLEF INAALKLDYF NENDKKSFES FIIWLLVKNV YLGIDKDNNL 
NLAKKEISKL NFKIQTDSET STLISTSNLL SLAILYLYES KDDDAIDIFD RILRIDPNNC 
FALLGKAQSV LNKTKNYSHA LKLYQQVLIL NPLMKPDPRL GIGLCFWFLK DDKMAIQAWE 
RSLQLDPTNV KLRIFLNLAK FHTTFTNSLS DEEFLDNYKN CLQELSKLKS LNANDTTVTL 
ALCSYLFSKG DYNTVIKIVE KIVKGITGSD NLKKFSTFSR ITKYESNALS QCATWLARIE 
FARGNFTQSS KYFQEAIKLN ETNIVAKLGL GQSQYNRGSI EEASLTFESI LRSNVKCLEV 
NYSLGVLYSK QNSRSKKELA IQVLERYIRL SNNRGLSSNE EEFVLNKEPV ALNAYLILSQ 
LYEAKGDMTQ ALTYLNKAVE ARRQVEKDVP LEVYNNIGVF QFTKQNYDSA LENFTTALGK 
LDGRDFKSPD GDTLVDLPQD LRTSLTYNLA RTKEISNQKD ALETYEQLLT ECPHYFSAKL 
RILFLNCITE GITKEEIRDE IELLLDLNAS DLEVRSFYGW FIKNFGKKLH MPSDADTKLQ 
KDTLVEFDLH DCYALISLAN IYCIMARDTK GADEKKKKYY LRAIELFTKV LSLDSKNVYA 
AQGLAITYIE NKQLNKGLDI LRKIRDSLND ISVYLNLGHV LCDLKQFGKA IENYELALTR 
YTDGKDAKIL SFLGRVWYLR GNAELSLPYL KKALGYAQAA LDAARSTSTA ALAFNISFVQ 
FQIADFITKQ PVNERNIEDI ESAIEGLNKA IDILTQLASD EEKHPPYPRE ELRGRANLGT 
STLLSRLANA LEETKENNAE IEEKIQKAKQ IRLDEEQARL KEEEERLNKL KEKELEMSKQ 
RMLLQEQAQK WAEENSASVG VSDNEEDDDK LFDEESAQKE NKRKKGGSSK GKKGKGRKKK 
GNIIDDSEEE PEKNITDDSE DEANGNSNGK RKAADDAGGK KKKKPLSSEF IQDSEEELED 
DDLFGDNDDD E*