1 ID 7LES_DROME STANDARD; PRT; 2554 AA.
3 DT 01-JAN-1990 (REL. 13, CREATED)
4 DT 01-JAN-1990 (REL. 13, LAST SEQUENCE UPDATE)
5 DT 01-NOV-1995 (REL. 32, LAST ANNOTATION UPDATE)
6 DE SEVENLESS PROTEIN (EC 2.7.1.112).
8 OS DROSOPHILA MELANOGASTER (FRUIT FLY).
9 OC EUKARYOTA; METAZOA; ARTHROPODA; INSECTA; DIPTERA.
14 RA BASLER K., HAFEN E.;
15 RL CELL 54:299-311(1988).
20 RA BOWTELL D.L.L., SIMON M.A., RUBIN G.M.;
21 RL GENES DEV. 2:620-634(1988).
23 RP IDENTIFICATION OF FN-III REPEATS.
25 RA NORTON P.A., HYNES R.O., RESS D.J.G.;
26 RL CELL 61:15-16(1990).
27 CC -!- FUNCTION: RECEPTOR FOR AN EXTRACELLULAR SIGNAL REQUIRED TO
28 CC INSTRUCT A CELL TO DIFFERENTIATE INTO A R7 PHOTORECEPTOR. THE
29 CC LIGAND FOR SEV IS THE BOSS (BRIDE OF SEVENLESS) PROTEIN ON THE
30 CC SURFACE OF THE NEIGHBORING R8 CELL.
31 CC -!- CATALYTIC ACTIVITY: ATP + A PROTEIN TYROSINE = ADP +
32 CC PROTEIN TYROSINE PHOSPHATE.
33 CC -!- SUBUNIT: MAY FORM A COMPLEX WITH DRK AND SOS.
34 CC -!- SIMILARITY: BELONGS TO THE INSULIN RECEPTOR FAMILY OF TYROSINE-
35 CC PROTEIN KINASES.
36 CC -!- SIMILARITY: CONTAINS SEVEN FIBRONECTIN TYPE III-LIKE DOMAINS.
37 CC -!- CAUTION: UNCLEAR WHETHER THE POTENTIAL MEMBRANE SPANNING REGION
38 CC NEAR THE N-TERMINUS IS PRESENT AS A TRANSMEMBRANE DOMAIN IN THE
39 CC NATIVE PROTEIN OR SERVES AS A CLEAVED SIGNAL SEQUENCE.
40 DR EMBL; X13666; G8579; ALT_INIT.
41 DR EMBL; J03158; G158419; -.
42 DR PIR; A28912; TVFF7L.
43 DR FLYBASE; FBGN0003366; SEV.
44 DR PROSITE; PS00107; PROTEIN_KINASE_ATP.
45 DR PROSITE; PS00109; PROTEIN_KINASE_TYR.
46 DR PROSITE; PS00239; RECEPTOR_TYR_KIN_II.
47 DR PROSITE; PS50011; PROTEIN_KINASE_DOM.
48 KW TRANSFERASE; TYROSINE-PROTEIN KINASE; TRANSMEMBRANE; ATP-BINDING;
49 KW PHOSPHORYLATION; RECEPTOR; VISION; REPEAT.
50 FT DOMAIN 1 2123 EXTRACELLULAR (POTENTIAL).
51 FT TRANSMEM 102 122 POTENTIAL.
52 FT TRANSMEM 2124 2147 POTENTIAL.
53 FT DOMAIN 2148 2554 CYTOPLASMIC (POTENTIAL).
54 FT DOMAIN 311 431 FIBRONECTIN TYPE-III.
55 FT DOMAIN 436 528 FIBRONECTIN TYPE-III.
56 FT DOMAIN 822 921 FIBRONECTIN TYPE-III.
57 FT DOMAIN 1298 1392 FIBRONECTIN TYPE-III.
58 FT DOMAIN 1680 1794 FIBRONECTIN TYPE-III.
59 FT DOMAIN 1797 1897 FIBRONECTIN TYPE-III.
60 FT DOMAIN 1898 1988 FIBRONECTIN TYPE-III.
61 FT DOMAIN 2038 2046 POLY-ARG.
62 FT DOMAIN 2209 2485 PROTEIN KINASE.
63 FT NP_BIND 2215 2223 ATP (BY SIMILARITY).
64 FT BINDING 2242 2242 ATP (BY SIMILARITY).
65 FT MUTAGEN 2242 2242 K->M: INACTIVATES THE PROTEIN.
66 FT MOD_RES 2380 2380 PHOSPHORYLATION (AUTO-) (BY SIMILARITY).
67 FT CARBOHYD 30 30 POTENTIAL.
68 FT CARBOHYD 129 129 POTENTIAL.
69 FT CARBOHYD 481 481 POTENTIAL.
70 FT CARBOHYD 505 505 POTENTIAL.
71 FT CARBOHYD 617 617 POTENTIAL.
72 FT CARBOHYD 647 647 POTENTIAL.
73 FT CARBOHYD 966 966 POTENTIAL.
74 FT CARBOHYD 1228 1228 POTENTIAL.
75 FT CARBOHYD 1313 1313 POTENTIAL.
76 FT CARBOHYD 1353 1353 POTENTIAL.
77 FT CARBOHYD 1550 1550 POTENTIAL.
78 FT CARBOHYD 1557 1557 POTENTIAL.
79 FT CARBOHYD 1639 1639 POTENTIAL.
80 FT CARBOHYD 1725 1725 POTENTIAL.
81 FT CARBOHYD 1756 1756 POTENTIAL.
82 FT CARBOHYD 1804 1804 POTENTIAL.
83 FT CARBOHYD 1889 1889 POTENTIAL.
84 FT CARBOHYD 1947 1947 POTENTIAL.
85 FT CARBOHYD 2073 2073 POTENTIAL.
86 FT VARIANT 392 392 M -> V.
87 FT VARIANT 1668 1668 A -> V.
88 FT VARIANT 1703 1703 N -> H.
89 FT VARIANT 1730 1730 R -> K.
90 FT VARIANT 1731 1731 G -> E.
91 FT VARIANT 1741 1741 V -> M.
92 FT VARIANT 2271 2271 R -> C.
93 FT CONFLICT 1823 1823 E -> Q (IN REF. 2).
94 SQ SEQUENCE 2554 AA; 287107 MW; 1143D891 CRC32;
95 MTMFWQQNVD HQSDEQDKQA KGAAPTKRLN ISFNVKIAVN VNTKMTTTHI NQQAPGTSSS
96 SSNSQNASPS KIVVRQQSSS FDLRQQLARL GRQLASGQDG HGGISTILII NLLLLILLSI
97 CCDVCRSHNY TVHQSPEPVS KDQMRLLRPK LDSDVVEKVA IWHKHAAAAP PSIVEGIAIS
98 SRPQSTMAHH PDDRDRDRDP SEEQHGVDER MVLERVTRDC VQRCIVEEDL FLDEFGIQCE
99 KADNGEKCYK TRCTKGCAQW YRALKELESC QEACLSLQFY PYDMPCIGAC EMAQRDYWHL
100 QRLAISHLVE RTQPQLERAP RADGQSTPLT IRWAMHFPEH YLASRPFNIQ YQFVDHHGEE
101 LDLEQEDQDA SGETGSSAWF NLADYDCDEY YMCEILEALI PYTQYRFRFE LPFGENRDEV
102 LYSPATPAYQ TPPEGAPISA PVIEHLMGLD DSHLAVHWHP GRFTNGPIEG YRLRLSSSEG
103 NATSEQLVPA GRGSYIFSQL QAGTNYTLAL SMINKQGEGP VAKGFVQTHS ARNEKPAKDL
104 TESVLLVGRR AVMWQSLEPA GENSMIYQSQ EELADIAWSK REQQLWLLNV HGELRSLKFE
105 SGQMVSPAQQ LKLDLGNISS GRWVPRRLSF DWLHHRLYFA MESPERNQSS FQIISTDLLG
106 ESAQKVGESF DLPVEQLEVD ALNGWIFWRN EESLWRQDLH GRMIHRLLRI RQPGWFLVQP
107 QHFIIHLMLP QEGKFLEISY DGGFKHPLPL PPPSNGAGNG PASSHWQSFA LLGRSLLLPD
108 SGQLILVEQQ GQAASPSASW PLKNLPDCWA VILLVPESQP LTSAGGKPHS LKALLGAQAA
109 KISWKEPERN PYQSADAARS WSYELEVLDV ASQSAFSIRN IRGPIFGLQR LQPDNLYQLR
110 VRAINVDGEP GEWTEPLAAR TWPLGPHRLR WASRQGSVIH TNELGEGLEV QQEQLERLPG
111 PMTMVNESVG YYVTGDGLLH CINLVHSQWG CPISEPLQHV GSVTYDWRGG RVYWTDLARN
112 CVVRMDPWSG SRELLPVFEA NFLALDPRQG HLYYATSSQL SRHGSTPDEA VTYYRVNGLE
113 GSIASFVLDT QQDQLFWLVK GSGALRLYRA PLTAGGDSLQ MIQQIKGVFQ AVPDSLQLLR
114 PLGALLWLER SGRRARLVRL AAPLDVMELP TPDQASPASA LQLLDPQPLP PRDEGVIPMT
115 VLPDSVRLDD GHWDDFHVRW QPSTSGGNHS VSYRLLLEFG QRLQTLDLST PFARLTQLPQ
116 AQLQLKISIT PRTAWRSGDT TRVQLTTPPV APSQPRRLRV FVERLATALQ EANVSAVLRW
117 DAPEQGQEAP MQALEYHISC WVGSELHEEL RLNQSALEAR VEHLQPDQTY HFQVEARVAA
118 TGAAAGAASH ALHVAPEVQA VPRVLYANAE FIGELDLDTR NRRRLVHTAS PVEHLVGIEG
119 EQRLLWVNEH VELLTHVPGS APAKLARMRA EVLALAVDWI QRIVYWAELD ATAPQAAIIY
120 RLDLCNFEGK ILQGERVWST PRGRLLKDLV ALPQAQSLIW LEYEQGSPRN GSLRGRNLTD
121 GSELEWATVQ PLIRLHAGSL EPGSETLNLV DNQGKLCVYD VARQLCTASA LRAQLNLLGE
122 DSIAGQLAQD SGYLYAVKNW SIRAYGRRRQ QLEYTVELEP EEVRLLQAHN YQAYPPKNCL
123 LLPSSGGSLL KATDCEEQRC LLNLPMITAS EDCPLPIPGV RYQLNLTLAR GPGSEEHDHG
124 VEPLGQWLLG AGESLNLTDL LPFTRYRVSG ILSSFYQKKL ALPTLVLAPL ELLTASATPS
125 PPRNFSVRVL SPRELEVSWL PPEQLRSESV YYTLHWQQEL DGENVQDRRE WEAHERRLET
126 AGTHRLTGIK PGSGYSLWVQ AHATPTKSNS SERLHVRSFA ELPELQLLEL GPYSLSLTWA
127 GTPDPLGSLQ LECRSSAEQL RRNVAGNHTK MVVEPLQPRT RYQCRLLLGY AATPGAPLYH
128 GTAEVYETLG DAPSQPGKPQ LEHIAEEVFR VTWTAARGNG APIALYNLEA LQARSDIRRR
129 RRRRRRNSGG SLEQLPWAEE PVVVEDQWLD FCNTTELSCI VKSLHSSRLL LFRVRARSLE
130 HGWGPYSEES ERVAEPFVSP EKRGSLVLAI IAPAAIVSSC VLALVLVRKV QKRRLRAKKL
131 LQQSRPSIWS NLSTLQTQQQ LMAVRNRAFS TTLSDADIAL LPQINWSQLK LLRFLGSGAF
132 GEVYEGQLKT EDSEEPQRVA IKSLRKGASE FAELLQEAQL MSNFKHENIV RLVGICFDTE
133 SISLIMEHME AGDLLSYLRA ARATSTQEPQ PTAGLSLSEL LAMCIDVANG CSYLEDMHFV
134 HRDLACRNCL VTESTGSTDR RRTVKIGDFG LARDIYKSDY YRKEGEGLLP VRWMSPESLV
135 DGLFTTQSDV WAFGVLCWEI LTLGQQPYAA RNNFEVLAHV KEGGRLQQPP MCTEKLYSLL
136 LLCWRTDPWE RPSFRRCYNT LHAISTDLRR TQMASATADT VVSCSRPEFK VRFDGQPLEE
137 HREHNERPED ENLTLREVPL KDKQLYANEG VSRL