1 ID 100K_RAT STANDARD; PRT; 889 AA.
3 DT 01-NOV-1997 (Rel. 35, Created)
4 DT 01-NOV-1997 (Rel. 35, Last sequence update)
5 DT 15-JUL-1999 (Rel. 38, Last annotation update)
6 DE 100 KD PROTEIN (EC 6.3.2.-).
7 OS Rattus norvegicus (Rat).
8 OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Mammalia;
9 OC Eutheria; Rodentia; Sciurognathi; Muridae; Murinae; Rattus.
12 RC STRAIN=WISTAR; TISSUE=TESTIS;
14 RA MUELLER D., REHBEIN M., BAUMEISTER H., RICHTER D.;
15 RT "Molecular characterization of a novel rat protein structurally
16 RT related to poly(A) binding proteins and the 70K protein of the U1
17 RT small nuclear ribonucleoprotein particle (snRNP).";
18 RL Nucleic Acids Res. 20:1471-1475(1992).
21 RA MUELLER D., REHBEIN M., BAUMEISTER H., RICHTER D.;
22 RL Nucleic Acids Res. 20:2624-2624(1992).
23 CC -!- FUNCTION: E3 UBIQUITIN-PROTEIN LIGASE WHICH ACCEPTS UBIQUITIN FROM
24 CC AN E2 UBIQUITIN-CONJUGATING ENZYME IN THE FORM OF A THIOESTER AND
25 CC THEN DIRECTLY TRANSFERS THE UBIQUITIN TO TARGETED SUBSTRATES (BY
26 CC SIMILARITY). THIS PROTEIN MAY BE INVOLVED IN MATURATION AND/OR
27 CC POST-TRANSCRIPTIONAL REGULATION OF MRNA.
28 CC -!- TISSUE SPECIFICITY: HIGHEST LEVELS FOUND IN TESTIS. ALSO PRESENT
29 CC IN LIVER, KIDNEY, LUNG AND BRAIN.
30 CC -!- DEVELOPMENTAL STAGE: IN EARLY POST-NATAL LIFE, EXPRESSION IN
31 CC THE TESTIS INCREASES TO REACH A MAXIMUM AROUND DAY 28.
32 CC -!- MISCELLANEOUS: A CYSTEINE RESIDUE IS REQUIRED FOR
33 CC UBIQUITIN-THIOLESTER FORMATION.
34 CC -!- SIMILARITY: CONTAINS AN HECT-TYPE E3 UBIQUITIN-PROTEIN LIGASE
36 CC -!- SIMILARITY: A CENTRAL REGION (AA 485-514) IS SIMILAR TO THE
37 CC C-TERMINAL DOMAINS OF MAMMALIAN AND YEAST POLY (A) RNA BINDING
39 CC -!- SIMILARITY: THE C-TERMINAL HALF SHOWS HIGH SIMILARITY TO
40 CC DROSOPHILA HYPERPLASMIC DISC PROTEIN AND SOME, TO HUMAN E6-AP.
41 CC -!- SIMILARITY: CONTAINS MIXED-CHARGE DOMAINS SIMILAR TO RNA-BINDING
43 CC --------------------------------------------------------------------------
44 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
45 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
46 CC the European Bioinformatics Institute. There are no restrictions on its
47 CC use by non-profit institutions as long as its content is in no way
48 CC modified and this statement is not removed. Usage by and for commercial
49 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
50 CC or send an email to license@isb-sib.ch).
51 CC --------------------------------------------------------------------------
52 DR EMBL; X64411; CAA45756.1; -.
53 DR PFAM; PF00632; HECT; 1.
54 DR PFAM; PF00658; PABP; 1.
55 KW Ubiquitin conjugation; Ligase.
56 FT DOMAIN 77 88 ASP/GLU-RICH (ACIDIC).
57 FT DOMAIN 127 150 PRO-RICH.
58 FT DOMAIN 420 439 ARG/GLU-RICH (MIXED CHARGE).
59 FT DOMAIN 448 457 ARG/ASP-RICH (MIXED CHARGE).
60 FT DOMAIN 485 514 PABP-LIKE.
61 FT DOMAIN 579 590 ASP/GLU-RICH (ACIDIC).
62 FT DOMAIN 786 889 HECT DOMAIN.
63 FT DOMAIN 827 847 PRO-RICH.
64 FT BINDING 858 858 UBIQUITIN (BY SIMILARITY).
65 SQ SEQUENCE 889 AA; 100368 MW; DD7E6C7A CRC32;
66 MMSARGDFLN YALSLMRSHN DEHSDVLPVL DVCSLKHVAY VFQALIYWIK AMNQQTTLDT
67 PQLERKRTRE LLELGIDNED SEHENDDDTS QSATLNDKDD ESLPAETGQN HPFFRRSDSM
68 TFLGCIPPNP FEVPLAEAIP LADQPHLLQP NARKEDLFGR PSQGLYSSSA GSGKCLVEVT
69 MDRNCLEVLP TKMSYAANLK NVMNMQNRQK KAGEDQSMLA EEADSSKPGP SAHDVAAQLK
70 SSLLAEIGLT ESEGPPLTSF RPQCSFMGMV ISHDMLLGRW RLSLELFGRV FMEDVGAEPG
71 SILTELGGFE VKESKFRREM EKLRNQQSRD LSLEVDRDRD LLIQQTMRQL NNHFGRRCAT
72 TPMAVHRVKV TFKDEPGEGS GVARSFYTAI AQAFLSNEKL PNLDCIQNAN KGTHTSLMQR
73 LRNRGERDRE REREREMRRS SGLRAGSRRD RDRDFRRQLS IDTRPFRPAS EGNPSDDPDP
74 LPAHRQALGE RLYPRVQAMQ PAFASKITGM LLELSPAQLL LLLASEDSLR ARVEEAMELI
75 VAHGRENGAD SILDLGLLDS SEKVQENRKR HGSSRSVVDM DLDDTDDGDD NAPLFYQPGK
76 RGFYTPRPGK NTEARLNCFR NIGRILGLCL LQNELCPITL NRHVIKVLLG RKVNWHDFAF
77 FDPVMYESLR QLILASQSSD ADAVFSAMDL AFAVDLCKEE GGGQVELIPN GVNIPVTPQN
78 VYEYVRKYAE HRMLVVAEQP LHAMRKGLLD VLPKNSLEDL TAEDFRLLVN GCGEVNVQML
79 ISFTSFNDES GENAEKLLQF KRWFWSIVER MSMTERQDLV YFWTSSPSLP ASEEGFQPMP
80 SITIRPPDDQ HLPTANTCIS RLYVPLYSSK QILKQKLLLA IKTKNFGFV
82 ID 104K_THEPA STANDARD; PRT; 924 AA.
84 DT 01-APR-1990 (Rel. 14, Created)
85 DT 01-APR-1990 (Rel. 14, Last sequence update)
86 DT 01-AUG-1992 (Rel. 23, Last annotation update)
87 DE 104 KD MICRONEME-RHOPTRY ANTIGEN.
89 OC Eukaryota; Alveolata; Apicomplexa; Piroplasmida; Theileriidae;
95 RA IAMS K.P., YOUNG J.R., NENE V., DESAI J., WEBSTER P.,
96 RA OLE-MOIYOI O.K., MUSOKE A.J.;
97 RT "Characterisation of the gene encoding a 104-kilodalton microneme-
98 RT rhoptry protein of Theileria parva.";
99 RL Mol. Biochem. Parasitol. 39:47-60(1990).
100 CC -!- SUBCELLULAR LOCATION: IN MICRONEME/RHOPTRY COMPLEXES.
101 CC -!- DEVELOPMENTAL STAGE: SPOROZOITE ANTIGEN.
102 CC --------------------------------------------------------------------------
103 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
104 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
105 CC the European Bioinformatics Institute. There are no restrictions on its
106 CC use by non-profit institutions as long as its content is in no way
107 CC modified and this statement is not removed. Usage by and for commercial
108 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
109 CC or send an email to license@isb-sib.ch).
110 CC --------------------------------------------------------------------------
111 DR EMBL; M29954; AAA18217.1; -.
112 DR PIR; A44945; A44945.
113 KW Antigen; Sporozoite; Repeat.
114 FT DOMAIN 1 19 HYDROPHOBIC.
115 FT DOMAIN 905 924 HYDROPHOBIC.
116 SQ SEQUENCE 924 AA; 103625 MW; 4563AAA0 CRC32;
117 MKFLILLFNI LCLFPVLAAD NHGVGPQGAS GVDPITFDIN SNQTGPAFLT AVEMAGVKYL
118 QVQHGSNVNI HRLVEGNVVI WENASTPLYT GAIVTNNDGP YMAYVEVLGD PNLQFFIKSG
119 DAWVTLSEHE YLAKLQEIRQ AVHIESVFSL NMAFQLENNK YEVETHAKNG ANMVTFIPRN
120 GHICKMVYHK NVRIYKATGN DTVTSVVGFF RGLRLLLINV FSIDDNGMMS NRYFQHVDDK
121 YVPISQKNYE TGIVKLKDYK HAYHPVDLDI KDIDYTMFHL ADATYHEPCF KIIPNTGFCI
122 TKLFDGDQVL YESFNPLIHC INEVHIYDRN NGSIICLHLN YSPPSYKAYL VLKDTGWEAT
123 THPLLEEKIE ELQDQRACEL DVNFISDKDL YVAALTNADL NYTMVTPRPH RDVIRVSDGS
124 EVLWYYEGLD NFLVCAWIYV SDGVASLVHL RIKDRIPANN DIYVLKGDLY WTRITKIQFT
125 QEIKRLVKKS KKKLAPITEE DSDKHDEPPE GPGASGLPPK APGDKEGSEG HKGPSKGSDS
126 SKEGKKPGSG KKPGPAREHK PSKIPTLSKK PSGPKDPKHP RDPKEPRKSK SPRTASPTRR
127 PSPKLPQLSK LPKSTSPRSP PPPTRPSSPE RPEGTKIIKT SKPPSPKPPF DPSFKEKFYD
128 DYSKAASRSK ETKTTVVLDE SFESILKETL PETPGTPFTT PRPVPPKRPR TPESPFEPPK
129 DPDSPSTSPS EFFTPPESKR TRFHETPADT PLPDVTAELF KEPDVTAETK SPDEAMKRPR
130 SPSEYEDTSP GDYPSLPMKR HRLERLRLTT TEMETDPGRM AKDASGKPVK LKRSKSFDDL
131 TTVELAPEPK ASRIVVDDEG TEADDEETHP PEERQKTEVR RRRPPKKPSK SPRPSKPKKP
132 KKPDSAYIPS ILAILVVSLI VGIL
134 ID 108_LYCES STANDARD; PRT; 102 AA.
136 DT 15-JUL-1999 (Rel. 38, Created)
137 DT 15-JUL-1999 (Rel. 38, Last sequence update)
138 DT 15-JUL-1999 (Rel. 38, Last annotation update)
139 DE PROTEIN 108 PRECURSOR.
140 OS Lycopersicon esculentum (Tomato).
141 OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
142 OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
143 OC core eudicots; Asteridae; euasterids I; Solanales; Solanaceae;
146 RP SEQUENCE FROM N.A.
147 RC STRAIN=CV. VF36; TISSUE=ANTHER;
148 RX MEDLINE; 94143497.
149 RA CHEN R., SMITH A.G.;
150 RT "Nucleotide sequence of a stamen- and tapetum-specific gene from
151 RT Lycopersicon esculentum.";
152 RL Plant Physiol. 101:1413-1413(1993).
153 CC -!- TISSUE SPECIFICITY: STAMEN- AND TAPETUM-SPECIFIC.
154 CC -!- SIMILARITY: BELONGS TO THE A9 / FIL1 FAMILY.
155 CC --------------------------------------------------------------------------
156 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
157 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
158 CC the European Bioinformatics Institute. There are no restrictions on its
159 CC use by non-profit institutions as long as its content is in no way
160 CC modified and this statement is not removed. Usage by and for commercial
161 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
162 CC or send an email to license@isb-sib.ch).
163 CC --------------------------------------------------------------------------
164 DR EMBL; Z14088; CAA78466.1; -.
165 DR MENDEL; 8853; LYCes;1133;1.
167 FT SIGNAL 1 30 POTENTIAL.
168 FT CHAIN 31 102 PROTEIN 108.
169 FT DISULFID 41 77 BY SIMILARITY.
170 FT DISULFID 51 66 BY SIMILARITY.
171 FT DISULFID 67 92 BY SIMILARITY.
172 FT DISULFID 79 99 BY SIMILARITY.
173 SQ SEQUENCE 102 AA; 10576 MW; AFA4875A CRC32;
174 MASVKSSSSS SSSSFISLLL LILLVIVLQS QVIECQPQQS CTASLTGLNV CAPFLVPGSP
175 TASTECCNAV QSINHDCMCN TMRIAAQIPA QCNLPPLSCS AN
177 ID 10KD_VIGUN STANDARD; PRT; 75 AA.
179 DT 01-NOV-1990 (Rel. 16, Created)
180 DT 01-NOV-1990 (Rel. 16, Last sequence update)
181 DT 01-FEB-1995 (Rel. 31, Last annotation update)
182 DE 10 KD PROTEIN PRECURSOR (CLONE PSAS10).
183 OS Vigna unguiculata (Cowpea).
184 OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
185 OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
186 OC core eudicots; Rosidae; eurosids I; Fabales; Fabaceae; Papilionoideae;
189 RP SEQUENCE FROM N.A.
191 RX MEDLINE; 91355865.
192 RA ISHIBASHI N., YAMAUCHI D., MINIAMIKAWA T.;
193 RT "Stored mRNA in cotyledons of Vigna unguiculata seeds: nucleotide
194 RT sequence of cloned cDNA for a stored mRNA and induction of its
195 RT synthesis by precocious germination.";
196 RL Plant Mol. Biol. 15:59-64(1990).
197 CC -!- FUNCTION: THIS PROTEIN IS REQUIRED FOR GERMINATION.
198 CC -!- SIMILARITY: BELONGS TO THE GAMMA-PUROTHIONIN FAMILY.
199 CC --------------------------------------------------------------------------
200 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
201 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
202 CC the European Bioinformatics Institute. There are no restrictions on its
203 CC use by non-profit institutions as long as its content is in no way
204 CC modified and this statement is not removed. Usage by and for commercial
205 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
206 CC or send an email to license@isb-sib.ch).
207 CC --------------------------------------------------------------------------
208 DR EMBL; X16877; CAA34760.1; -.
209 DR PIR; S11156; S11156.
210 DR HSSP; P45639; 1CHL.
211 DR PFAM; PF00304; Gamma-thionin; 1.
212 DR PROSITE; PS00940; GAMMA_THIONIN; 1.
213 KW Germination; Signal.
214 FT SIGNAL 1 ? POTENTIAL.
215 FT CHAIN ? 75 10 KD PROTEIN.
216 FT DISULFID 31 75 BY SIMILARITY.
217 FT DISULFID 42 63 BY SIMILARITY.
218 FT DISULFID 48 69 BY SIMILARITY.
219 FT DISULFID 52 71 BY SIMILARITY.
220 SQ SEQUENCE 75 AA; 8523 MW; AFF911AB CRC32;
221 MEKKSIAGLC FLFLVLFVAQ EVVVQSEAKT CENLVDTYRG PCFTTGSCDD HCKNKEHLLS
224 ID 110K_PLAKN STANDARD; PRT; 296 AA.
226 DT 01-JAN-1990 (Rel. 13, Created)
227 DT 01-JAN-1990 (Rel. 13, Last sequence update)
228 DT 01-FEB-1994 (Rel. 28, Last annotation update)
229 DE 110 KD ANTIGEN (PK110) (FRAGMENT).
230 OS Plasmodium knowlesi.
231 OC Eukaryota; Alveolata; Apicomplexa; Haemosporida; Plasmodium.
233 RP SEQUENCE FROM N.A.
234 RX MEDLINE; 88039002.
235 RA PERLER F.B., MOON A.M., QIANG B.Q., MEDA M., DALTON M., CARD C.,
236 RA SCHMIDT-ULLRICH R., WALLACH D., LYNCH J., DONELSON J.E.;
237 RT "Cloning and characterization of an abundant Plasmodium knowlesi
238 RT antigen which cross reacts with Gambian sera.";
239 RL Mol. Biochem. Parasitol. 25:185-193(1987).
240 CC --------------------------------------------------------------------------
241 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
242 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
243 CC the European Bioinformatics Institute. There are no restrictions on its
244 CC use by non-profit institutions as long as its content is in no way
245 CC modified and this statement is not removed. Usage by and for commercial
246 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
247 CC or send an email to license@isb-sib.ch).
248 CC --------------------------------------------------------------------------
249 DR EMBL; M19152; AAA29471.1; -.
250 DR PIR; A54527; A54527.
251 KW Malaria; Antigen; Repeat.
253 FT DOMAIN 131 296 13.5 X 12 AA TANDEM REPEATS OF E-E-T-Q-K-
255 SQ SEQUENCE 296 AA; 34077 MW; 666F88DF CRC32;
256 FNSNMLRGSV CEEDVSLMTS IDNMIEEIDF YEKEIYKGSH SGGVIKGMDY DLEDDENDED
257 EMTEQMVEEV ADHITQDMID EVAHHVLDNI THDMAHMEEI VHGLSGDVTQ IKEIVQKVNV
258 AVEKVKHIVE TEETQKTVEP EQIEETQNTV EPEQTEETQK TVEPEQTEET QNTVEPEQIE
259 ETQKTVEPEQ TEEAQKTVEP EQTEETQKTV EPEQTEETQK TVEPEQTEET QKTVEPEQTE
260 ETQKTVEPEQ TEETQKTVEP EQTEETQKTV EPEQTEETQN TVEPEPTQET QNTVEP
262 ID 11S3_HELAN STANDARD; PRT; 493 AA.
264 DT 01-NOV-1990 (Rel. 16, Created)
265 DT 01-NOV-1990 (Rel. 16, Last sequence update)
266 DT 01-FEB-1994 (Rel. 28, Last annotation update)
267 DE 11S GLOBULIN SEED STORAGE PROTEIN G3 PRECURSOR (HELIANTHININ G3).
269 OS Helianthus annuus (Common sunflower).
270 OC Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
271 OC euphyllophytes; Spermatophyta; Magnoliophyta; eudicotyledons;
272 OC core eudicots; Asteridae; euasterids II; Asterales; Asteraceae;
275 RP SEQUENCE FROM N.A.
276 RX MEDLINE; 89232734.
277 RA VONDER HARR R.A., ALLEN R.D., COHEN E.A., NESSLER C.L., THOMAS T.L.;
278 RT "Organization of the sunflower 11S storage protein gene family.";
279 RL Gene 74:433-443(1988).
280 CC -!- FUNCTION: THIS IS A SEED STORAGE PROTEIN.
281 CC -!- SUBUNIT: HEXAMER; EACH SUBUNIT IS COMPOSED OF AN ACIDIC AND A
282 CC BASIC CHAIN DERIVED FROM A SINGLE PRECURSOR AND LINKED BY A
284 CC -!- SIMILARITY: BELONGS TO THE 11S SEED STORAGE PROTEINS (GLOBULINS)
286 CC --------------------------------------------------------------------------
287 CC This SWISS-PROT entry is copyright. It is produced through a collaboration
288 CC between the Swiss Institute of Bioinformatics and the EMBL outstation -
289 CC the European Bioinformatics Institute. There are no restrictions on its
290 CC use by non-profit institutions as long as its content is in no way
291 CC modified and this statement is not removed. Usage by and for commercial
292 CC entities requires a license agreement (See http://www.isb-sib.ch/announce/
293 CC or send an email to license@isb-sib.ch).
294 CC --------------------------------------------------------------------------
295 DR EMBL; M28832; AAA33374.1; -.
296 DR PIR; JA0089; JA0089.
297 DR PFAM; PF00190; Seedstore_11s; 1.
298 DR PROSITE; PS00305; 11S_SEED_STORAGE; 1.
299 KW Seed storage protein; Multigene family; Signal.
301 FT CHAIN 21 305 ACIDIC CHAIN.
302 FT CHAIN 306 493 BASIC CHAIN.
303 FT DISULFID 103 312 INTERCHAIN (ACIDIC-BASIC) (POTENTIAL).
304 FT DOMAIN 23 35 GLN-RICH.
305 FT DOMAIN 111 127 GLN/GLY-RICH.
306 FT DOMAIN 191 297 GLN-RICH.
307 SQ SEQUENCE 493 AA; 55687 MW; E79DEAAE CRC32;
308 MASKATLLLA FTLLFATCIA RHQQRQQQQN QCQLQNIEAL EPIEVIQAEA GVTEIWDAYD
309 QQFQCAWSIL FDTGFNLVAF SCLPTSTPLF WPSSREGVIL PGCRRTYEYS QEQQFSGEGG
310 RRGGGEGTFR TVIRKLENLK EGDVVAIPTG TAHWLHNDGN TELVVVFLDT QNHENQLDEN
311 QRRFFLAGNP QAQAQSQQQQ QRQPRQQSPQ RQRQRQRQGQ GQNAGNIFNG FTPELIAQSF
312 NVDQETAQKL QGQNDQRGHI VNVGQDLQIV RPPQDRRSPR QQQEQATSPR QQQEQQQGRR
313 GGWSNGVEET ICSMKFKVNI DNPSQADFVN PQAGSIANLN SFKFPILEHL RLSVERGELR
314 PNAIQSPHWT INAHNLLYVT EGALRVQIVD NQGNSVFDNE LREGQVVVIP QNFAVIKRAN
315 EQGSRWVSFK TNDNAMIANL AGRVSASAAS PLTLWANRYQ LSREEAQQLK FSQRETVLFA