Chromosome 7

CaJ7.0291 (CaO19.6460)

       Init     Term     nt     aa
CDS    534972   538247   3276   1093
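
The nt field can be cross-checked against the coordinates. The sketch below assumes the fused digits split into Init = 534972 and Term = 538247, and that both coordinates are 1-based and inclusive; neither assumption is stated in the record.

```python
# Assumed reading of the coordinate fields (not stated explicitly in the record).
init, term = 534972, 538247

# Inclusive 1-based coordinates: the span length is term - init + 1.
span_nt = term - init + 1

# Number of complete codons in the span, counting the stop codon.
codons = span_nt // 3

print(span_nt, codons)
```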

SPTrEMBL

Acc# | Gene | Locus | DB | Organism | Product | Score | E-value | Identities | Positives | Gaps
------ | ---- | ----- | -- | -------- | ------- | ----- | ------- | ---------- | --------- | ----
P46463 | PEX1 | PEX1_PICPA | SP | Pichia pastoris (Yeast) | Peroxisome biosynthesis protein PAS1 (Peroxin-1). | 850 | 0.0 | 498/1172 (42%) | 692/1172 (59%) | 81/1172 (6%)
Q9UVU6 | PEX1 |  | TrEMBL | Pichia angusta (Yeast) (Hansenula polymorpha) | Peroxin-1. | 758 | 0.0 | 461/1133 (40%) | 649/1133 (57%) | 42/1133 (3%)
Q9UV06 | PEX1 |  | TrEMBL | Yarrowia lipolytica (Candida lipolytica) | Peroxisome assembly protein Pex1p. | 693 | 0.0 | 433/1077 (40%) | 609/1077 (56%) | 53/1077 (4%)
Q9HG04 | PEX1 |  | TrEMBL | Penicillium chrysogenum | Peroxin-1. | 582 | e-165 | 411/1232 (33%) | 609/1232 (49%) | 146/1232 (11%)
P24004 | PEX1 | PEX1_YEAST | SP | Saccharomyces cerevisiae (Baker's yeast) | Peroxisome biosynthesis protein PAS1 (Peroxin-1). | 533 | e-150 | 307/676 (45%) | 408/676 (60%) | 25/676 (3%)
O43933 | PEX1 | PEX1_HUMAN | SP | Homo sapiens (Human) | Peroxisome biogenesis factor 1 (Peroxin-1) (Peroxisome biogenesis disorder protein 1). | 374 | e-102 | 205/478 (42%) | 288/478 (60%) | 13/478 (2%)
Q9FQ60 | PEX1 |  | TrEMBL | Arabidopsis thaliana (Mouse-ear cress) | Peroxisome biogenesis protein PEX1 (Fragment). | 328 | 3e-88 | 207/542 (38%) | 296/542 (54%) | 13/542 (2%)
O74941 | SPCC553.03 |  | TrEMBL | Schizosaccharomyces pombe (Fission yeast) | Putative peroxisome biosynthesis protein, AAA family ATPases. | 328 | 2e-88 | 214/578 (37%) | 304/578 (52%) | 6/578 (1%)
Q9FNP1 |  |  | TrEMBL | Arabidopsis thaliana (Mouse-ear cress) | Similarity to ATPase. | 327 | 8e-88 | 196/474 (41%) | 273/474 (57%) | 13/474 (2%)
Q9CU85 | PEX1 |  | TrEMBL | Mus musculus (Mouse) | 5430414H02Rik protein (Fragment). | 315 | 3e-84 | 155/245 (63%) | 194/245 (79%) | 1/245 (0%)
Q9VUC7 | L(3)70DA |  | TrEMBL | Drosophila melanogaster (Fruit fly) | CG6760 protein (LD43687p). | 276 | 2e-72 | 181/537 (33%) | 284/537 (52%) | 23/537 (4%)
Q9Y090 | L(3)70DA |  | TrEMBL | Drosophila melanogaster (Fruit fly) | L(3)70DA. | 276 | 2e-72 | 181/537 (33%) | 284/537 (52%) | 23/537 (4%)
Q58556 | MJ1156 | YB56_METJA | SP | Methanococcus jannaschii | Cell division cycle protein 48 homolog MJ1156. | 276 | 9e-73 | 206/628 (32%) | 301/628 (47%) | 48/628 (7%)
O28972 | AF1297 |  | TrEMBL | Archaeoglobus fulgidus | Cell division control protein 48 (CDC48-1). | 269 | 1e-70 | 167/467 (35%) | 256/467 (54%) | 5/467 (1%)
Q9YC86 | APE1367 |  | TrEMBL | Aeropyrum pernix | 726AA long hypothetical transitional endoplasmic reticulum ATPase. | 268 | 3e-70 | 167/475 (35%) | 255/475 (53%) | 8/475 (1%)
Q97ZZ9 | SSO0421 |  | TrEMBL | Sulfolobus solfataricus | AAA family ATPase. | 266 | 1e-69 | 183/582 (31%) | 292/582 (50%) | 8/582 (1%)
Q8SSJ5 | ECU01_1230 |  | TrEMBL | Encephalitozoon cuniculi | Protein of the CDC48/PAS1/SEC28 family of ATPases. | 265 | 4e-69 | 167/480 (34%) | 257/480 (53%) | 8/480 (1%)
Q9Y910 | APE2474 |  | TrEMBL | Aeropyrum pernix | 699AA long hypothetical transitional endoplasmic reticulum ATPase. | 264 | 5e-69 | 170/480 (35%) | 257/480 (53%) | 8/480 (1%)
P54609 | CDC48A | C48A_ARATH | SP | Arabidopsis thaliana (Mouse-ear cress) | Cell division control protein 48 homolog A (AtCDC48a). | 263 | 1e-68 | 191/685 (27%) | 324/685 (47%) | 58/685 (8%)
Q976H7 | ST0209 |  | TrEMBL | Sulfolobus tokodaii | Putative SAV protein. | 263 | 1e-68 | 168/481 (34%) | 256/481 (53%) | 9/481 (1%)
Q96372 | CAFP | CC48_CAPAN | SP | Capsicum annuum (Bell pepper) | Cell division cycle protein 48 homolog. | 262 | 2e-68 | 192/688 (27%) | 323/688 (46%) | 57/688 (8%)
P90532 | CDCD |  | TrEMBL | Dictyostelium discoideum (Slime mold) | Cell division cycle protein 48. | 262 | 2e-68 | 185/638 (28%) | 306/638 (47%) | 48/638 (7%)
Q975P4 | ST0376 |  | TrEMBL | Sulfolobus tokodaii | Putative SAV protein. | 262 | 2e-68 | 177/581 (30%) | 291/581 (50%) | 7/581 (1%)
Q9LZF6 | CDC48E | C48E_ARATH | SP | Arabidopsis thaliana (Mouse-ear cress) | Cell division control protein 48 homolog E (AtCDC48e) (Transitional endoplasmic reticulum ATPase E). | 262 | 2e-68 | 181/629 (28%) | 305/629 (48%) | 47/629 (7%)
Q9SCN8 | CDC48D | C48D_ARATH | SP | Arabidopsis thaliana (Mouse-ear cress) | Putative cell division control protein 48 homolog D (AtCDC48d) (Transitional endoplasmic reticulum ATPase D). | 261 | 4e-68 | 182/629 (28%) | 302/629 (48%) | 47/629 (7%)
Q877G7 | SAV2 |  | TrEMBL | Sulfolobus acidocaldarius | AAA family ATPase. | 261 | 3e-68 | 167/474 (35%) | 249/474 (52%) | 6/474 (1%)
O60058 | SPBC56F2.07C |  | TrEMBL | Schizosaccharomyces pombe (Fission yeast) | AAA family ATPase. | 259 | 1e-67 | 160/468 (34%) | 251/468 (53%) | 1/468 (0%)
P54774 | CDC48 | CC48_SOYBN | SP | Glycine max (Soybean) | Cell division cycle protein 48 homolog (Valosin containing protein homolog) (VCP). | 258 | 3e-67 | 180/633 (28%) | 302/633 (47%) | 47/633 (7%)
O44008 | VCP |  | TrEMBL | Trypanosoma brucei | Valosin-containing protein homolog. | 257 | 8e-67 | 182/628 (28%) | 304/628 (48%) | 48/628 (7%)
Q980U9 | SSO0176 |  | TrEMBL | Sulfolobus solfataricus | AAA family ATPase. | 256 | 2e-66 | 184/609 (30%) | 293/609 (48%) | 22/609 (3%)
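
The percentages in the Identities, Positives and Gaps columns appear to be the integer part of the ratio (truncated, not rounded): for example, 81/1172 is about 6.9% but is listed as 6%. A minimal sketch of that calculation follows; the function name `blast_percent` is hypothetical and is not taken from any BLAST library.

```python
def blast_percent(matches: int, length: int) -> int:
    """Integer percentage with the fractional part dropped (floor for non-negative input)."""
    return matches * 100 // length

# Values taken from the first hit (P46463) and from the O43933 and Q9CU85 rows.
print(blast_percent(498, 1172))  # identities, P46463
print(blast_percent(81, 1172))   # gaps, P46463
print(blast_percent(205, 478))   # identities, O43933
print(blast_percent(1, 245))     # gaps, Q9CU85
```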

Pfam

Scores for sequence family classification (score includes all domains):

Model      Description                                  Score    E-value  N 
--------   -----------                                  -----    ------- ---
AAA        ATPase family associated with various cell   322.1    6.5e-93   2
NACHT      NACHT domain                                 -46.1          1   1
SPX        SPX domain                                   -68.3        7.7   1
ABC_tran   ABC transporter                              -69.4        8.3   1
Torsin     Torsin                                      -187.8       0.43   1
Yeast_VAR1 Mitochondrial ribosomal protein (VAR1)      -230.0        5.8   1



Parsed for domains:

Model      Domain  seq-f seq-t    hmm-f hmm-t      score  E-value
--------   ------- ----- -----    ----- -----      -----  -------
ABC_tran     1/1     518   672 ..     1   199 []   -69.4      8.3
NACHT        1/1     519   676 ..     1   210 []   -46.1        1
SPX          1/1     559   677 ..     1   327 []   -68.3      7.7
Torsin       1/1     480   711 ..     1   296 []  -187.8     0.43
AAA          1/2     520   713 ..     1   216 []    41.9  1.4e-08
Yeast_VAR1   1/1     526   837 ..     1   413 []  -230.0      5.8
AAA          2/2     798   982 ..     1   216 []   280.2  2.7e-80
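
As an illustration of how the "Parsed for domains" rows can be read, the sketch below splits each row on whitespace and keeps the hits below an E-value cutoff. The `DomainHit` type, the `parse_domain_line` helper and the 1e-5 cutoff are assumptions made for this example; they are not part of the hmmpfam output or of any library.

```python
from typing import NamedTuple

class DomainHit(NamedTuple):
    model: str
    index: int      # domain number within the model (the "1" in "1/2")
    total: int      # number of domains reported for the model
    seq_from: int   # first matched residue in the protein
    seq_to: int     # last matched residue in the protein
    score: float
    evalue: float

def parse_domain_line(line: str) -> DomainHit:
    """Parse one whitespace-separated row of the domain table."""
    f = line.split()
    index, total = f[1].split("/")
    return DomainHit(f[0], int(index), int(total),
                     int(f[2]), int(f[3]), float(f[8]), float(f[9]))

rows = [
    "ABC_tran     1/1     518   672 ..     1   199 []   -69.4      8.3",
    "NACHT        1/1     519   676 ..     1   210 []   -46.1        1",
    "SPX          1/1     559   677 ..     1   327 []   -68.3      7.7",
    "Torsin       1/1     480   711 ..     1   296 []  -187.8     0.43",
    "AAA          1/2     520   713 ..     1   216 []    41.9  1.4e-08",
    "Yeast_VAR1   1/1     526   837 ..     1   413 []  -230.0      5.8",
    "AAA          2/2     798   982 ..     1   216 []   280.2  2.7e-80",
]

hits = [parse_domain_line(r) for r in rows]

# Illustrative cutoff only; choose a threshold suited to the analysis at hand.
significant = [h for h in hits if h.evalue < 1e-5]
for h in significant:
    print(h.model, h.index, h.seq_from, h.seq_to, h.evalue)
```

With this cutoff only the two AAA domains remain; the other five models all have E-values of 0.43 or higher.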




CDS Sequence

ATGGATAATC ATAAAGCTAG AATATCTTAT AAGACATTAA AGTCTAATTT GGTTAATTTA 
CCATCTAGTT TAACCAATTT ATTATTCACC GCCAACATTC AAGTTCAAGA TGTGATAATA 
GAAATGGTTA CCACCACTTC AAATGGTAAT AAAAAGAACT CTACCACCAC TAAACACTAT 
GCTGGTTGGT CTGGAATGTC ATCCTCTGAT ATTTCTAATT TAGAAATAGA TCCAGTATTT 
GCTCAATCAT TAAATCTTAT TGATAAAACC CCTATCATAG TCAATTTAAA ACTTGGGAAT 
TATGAATCAA CTAATATCAA TTTAGAACCA TTAACAAGTT CTGATTGGGA ATTAGTAGAA 
TTACATGCTC AATCAATTGA AGATAAATTA TTGAGTCAAA CTCGATGTGT GGCCTTAAAT 
CAAGTATTGG TAGTATACCC TAGTGCTACA ACTAGTGCCA AATTATTAGT AACAGATTTA 
GGTAGTACTG ATCATACGTT TGCCAAAATA TCACCGTATT GTGAAATAGC TATAGCTCCT 
AAAGTAAGAG AAAAGGAACA GAAAAATAAT AAGACTGCAA GTTCTAGTAA GAGTATTGCT 
AGTTCAAAAA GTACGAAAAG AACAACATCA TCATCGCTGG AAGATTATTC TGATTTACCC 
CTGGTTTTGA AAAGAGGAAT ATCATTACCA CATAAGTTAT ATAATGTTAA TGAGGCAGTA 
GCAGGTTACT TTGTATATGT GGATATTGAC GATGAATTAC CTCAAGGGTT CAATAGTGAA 
TATGTTGCTG TATCAGTGAT TCCGGGTCCA AATGATAAAA CATCAACAAA AGCATTACAA 
GCAACCGAAG ATGACAACAA CAACAACAAC AACAGCAACA ATAACAACAA AAATTTGAAA 
GAAAATAAAA GAATTATTGC TAAATTAGTA GACTACAAAT CTGGTCCTTC AGGAAATATT 
GGATTATCCC GTAACTTAGC CATAGCATTG AATATTGAAA ATCAAATCGG GAACATTATT 
AGTTTGAAAC CAGCTATTAA AAATTTACCC AAGAGACCAA CAACGTTCAC TATTCATCCT 
TATATTATAC ATACAAAGAA AAAAGAAATC ACTATTAGCA GTAATAAAAA GGAGAATAAA 
TTGGCACAAC AGTTAACAGA AATCATGTAC CCTGGAATTG CATCAATACC CATTACCAAT 
TTTACGAAGA TCCCCATTAT TGCCAATGTT TTACCGTATG GTGGATTATT GAGATTTAGG 
AAAAATGATG AATATAATGC TTGGATTAAA CCATATAATT TAGATCTGAA GAAACCTATA 
AAATTTGAAA TTGGTGATGA AATTTTACGT CCTAGTTCAT TTATAGAACA AGAGATTGAC 
AAACCGACAA CTAATGATAC AGAAGAACAG GAAGCAATTG GATTAGACAA TACAGTTGAA 
GATATTGTTG ATTCATTTAT TACATCAGAT AATACTGGGA CTTTAGTATA TGGGAATTCT 
GGTAGTGGGA AAACTTTATT ATTGAAATTA GTGGCACAAC AATTAAATCA ACAACATGGT 
TATTTCACAA AATATATATC ATGTGACACC ATAATGAATG AAAATTTCCA AAATTTATCT 
AAAAACCATT TTTTCAAATG GATTCAAACT TGTGCATGGA ATAAACCATC AGTTTTGATT 
TTGGATAATA TTGATAAATT AATGAGTGTA GAAATGGAAA ATATGGATGC CACCAAATCT 
AATCAATTGA CAGAATTTTT CATATCTAAT TTAACGAAAA TTCATCATCA ATTAAATTCC 
AATTTATCAA TATTATTATC AGCTAATTCT AAAGATAATA TTAATAAATT ATTATTAGGA 
TCTCATTTAA TTGAAAATTT CCATCATTTA AATCCACCAG ATAAATCATT ACGATTTGAA 
ATTTTAGATA AATATTTAAC TAATAAATTA GGATTAAAAA TTAAGGTTGA TTTAATGGAT 
TTAGTTAGTG AAACTGAAGG TTATTTACCA AATGATTTGA AAATTTTAAG TGATAGAATA 
TATCATGAAG TTTTGTTCAA CAGCACTGAA ACAGAAACAG AAACAGAAAC TGAAGCCACC 
ACCAATGCCG CAGTGACTAG TGAACATATT GAAAAGGCTT TAGCTGGTTA TACGCCATCT 
AATTTACGTG GAGTTAAATT ACAAAAATCA TCGATTAATT GGTCAGATAT AGGAGGATTA 
AAAGAAGCGA AAAACATTTT ATTAGAAACT TTAGAATGGC CCACTAAATA TGCCCCTATA 
TTTGCTAATT GTCCATTACG TTTAAGATCA GGTATTTTAC TATATGGATA CCCTGGGTGT 
GGTAAAACAT TATTAGCCAG TGCTATAGCT GGTCAATGTG GATTAAATTT TATTTCCATC 
AAGGGGCCAG AAATTTTAAA TAAATATATT GGTGCTTCAG AACAAAGTGT TCGAGAACTT 
TTTGAAAGAG CTCAAGCTGC TAAACCATGT ATTTTATTTT TCGATGAATT TGATTCCATT 
GCTCCTAAAA GAGGTCATGA TTCTACTGGG GTGACTGATC GTGTTGTTAA TCAAATGTTG 
ACACAAATGG ATGGGGCTGA AGGATTAGAT GGAGTTTATG TGTTAGCAGC TACTTCAAGA 
CCAGATTTGA TTGATTCAGC ATTATTAAGA CCTGGTAGAT TAGATAAAAG TGTTATATGT 
GATATGCCTA ATTATGAAGA TCGATTAGAT ATTTTACAAA GTATTACTAC AAAGATGGAT 
TTGAGTGATG ATGTGAATTT ACATGAAATT GCTGAAAAAA CTACTGGATT TAGTGGGGCT 
GATATGCAAG GTTTAGGATA TAATGCTTAT TTAAAAGCCG TTCATGTGAC ATTAGAGGAA 
TTATCTCAAA GGGAACAAGA TGAAGCAAAT AATGAAGATG GTAATAATAA TTTTGAAAAA 
AATTCCATAG AATTCTTTCA AGTGGGTAAT TCAGAAAAGA AATTAAAGAC CAATGAAAAA 
ATCCAATTAT TACATCAAAT TCAACAATTT ATGAATCCTA ATAAAGATAA TTCAGATTCA 
GAAAGAAAAG CAGGTCAAGA TGCAATGAAT AAATCAAAAG TTTTAATTAC TCATGAGAAT 
TTTTTGGAAT CATTAAAAGA AACTAAACCA TCAATTTCCC ATTCAGAGAA AATCAAATTA 
ACGAAAATTT ATAAAGAATT TGTTAATGAT CGAGATGGTA ATATGCCTGA TGGGACCCCA 
AGTAATGAAA TTGGAGGAAG AACTACATTA ATGTAA

AA Sequence

MDNHKARISY KTLKSNLVNL PSSLTNLLFT ANIQVQDVII EMVTTTSNGN KKNSTTTKHY 
AGWSGMSSSD ISNLEIDPVF AQSLNLIDKT PIIVNLKLGN YESTNINLEP LTSSDWELVE 
LHAQSIEDKL LSQTRCVALN QVLVVYPSAT TSAKLLVTDL GSTDHTFAKI SPYCEIAIAP 
KVREKEQKNN KTASSSKSIA SSKSTKRTTS SSLEDYSDLP LVLKRGISLP HKLYNVNEAV 
AGYFVYVDID DELPQGFNSE YVAVSVIPGP NDKTSTKALQ ATEDDNNNNN NSNNNNKNLK 
ENKRIIAKLV DYKSGPSGNI GLSRNLAIAL NIENQIGNII SLKPAIKNLP KRPTTFTIHP 
YIIHTKKKEI TISSNKKENK LAQQLTEIMY PGIASIPITN FTKIPIIANV LPYGGLLRFR 
KNDEYNAWIK PYNLDLKKPI KFEIGDEILR PSSFIEQEID KPTTNDTEEQ EAIGLDNTVE 
DIVDSFITSD NTGTLVYGNS GSGKTLLLKL VAQQLNQQHG YFTKYISCDT IMNENFQNLS 
KNHFFKWIQT CAWNKPSVLI LDNIDKLMSV EMENMDATKS NQLTEFFISN LTKIHHQLNS 
NLSILLSANS KDNINKLLLG SHLIENFHHL NPPDKSLRFE ILDKYLTNKL GLKIKVDLMD 
LVSETEGYLP NDLKILSDRI YHEVLFNSTE TETETETEAT TNAAVTSEHI EKALAGYTPS 
NLRGVKLQKS SINWSDIGGL KEAKNILLET LEWPTKYAPI FANCPLRLRS GILLYGYPGC 
GKTLLASAIA GQCGLNFISI KGPEILNKYI GASEQSVREL FERAQAAKPC ILFFDEFDSI 
APKRGHDSTG VTDRVVNQML TQMDGAEGLD GVYVLAATSR PDLIDSALLR PGRLDKSVIC 
DMPNYEDRLD ILQSITTKMD LSDDVNLHEI AEKTTGFSGA DMQGLGYNAY LKAVHVTLEE 
LSQREQDEAN NEDGNNNFEK NSIEFFQVGN SEKKLKTNEK IQLLHQIQQF MNPNKDNSDS 
ERKAGQDAMN KSKVLITHEN FLESLKETKP SISHSEKIKL TKIYKEFVND RDGNMPDGTP 
SNEIGGRTTL M*
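
The AA sequence can be related to the CDS by codon-wise translation. The sketch below assumes the standard genetic code and checks only the first and last rows of the two sequence blocks; the `translate` helper is hypothetical, and the code table used to produce the record is not stated in it.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")

CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

def translate(cds: str) -> str:
    """Translate a DNA string codon by codon; whitespace is ignored, '*' marks a stop."""
    seq = "".join(cds.split()).upper()
    usable = len(seq) - len(seq) % 3  # ignore a trailing partial codon, if any
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))

# First 30 nt and final row of the CDS as printed in the record.
print(translate("ATGGATAATC ATAAAGCTAG AATATCTTAT"))
print(translate("AGTAATGAAA TTGGAGGAAG AACTACATTA ATGTAA"))
```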