MDC proteins and DNAS encoding the same

MDC proteins and DNAS encoding the same
US5552526

The present invention provide a gene present in a commonly deleted region of a chromosome in breast and ovarian cancers and encoding a novel protein, the protein ("mdc protein") encoded by the gene, a method for the diagnosis of cancer by using an antibody combinable to the protein, and others.

A detailed genetic map of human chromosome 17 was constructed to analyze the chromosome in breast and ovarian cancer tissues, and a gene encoding a novel protein was cloned and its structure was determined. As a result of gene analysis using DNA probes derived from the gene, a gene mutation was confirmed in breast cancer tissues. Moreover, a transformant carrying a plasmid containing the gene was grown to obtain the mdc protein. Furthermore, a monoclonal antibody was prepared by using the protein as antigen.

PTO Wrapper PDF
Dossier Espace Google

Patent 5552526
Priority May 14 1993
Filed May 13 1994
Issued Sep 03 1996
Expiry May 13 2014
Inventors Nakamura, …
Assg.orig Cancer Ins… Eisai Co.,… EISAI CO ,…
Assg.curr EISAI R&D …
Entity Large
Referenced by 9
References 0
Maint.: all paid

BACKGROUND OF THE IN…
DISCLOSURE OF THE IN…
BRIEF DESCRIPTION OF…
SUMMARY OF THE INVEN…
DETAILED DESCRIPTION…
EXAMPLES
EXAMPLE 1
EXAMPLE 2
EXAMPLE 3
EXAMPLE 4
EXAMPLE 5
EXAMPLE 6
EXAMPLE 7
EXAMPLE 8
EXAMPLE 9
EXAMPLE 10
EXAMPLE 11
EXAMPLE 12
EXAMPLE 13
EXAMPLE 14

4. An isolated mdc protein which comprises a protein subtantially equivalent to one comprising the protein represented by SEQ ID NO:4.

3. An isolated mdc protein which comprises a protein substantially equivalent to one comprising the protein represented by SEQ ID NO:3.

2. An isolated mdc protein which comprises a protein substantially equivalent to one comprising the protein represented by SEQ ID NO:2.

1. An isolated mdc protein which comprises a protein substantially equivalent to one comprising the protein represented by SEQ ID NO:1.

5. An isolated mdc protein which comprises the protein represented by a member selected from the group consisting of SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4.

6. The mdc protein of claim 5, wherein the protein is represented by a member selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3 and SEQ ID NO:4.

7. The mdc protein of claim 5, wherein the protein is represented by a member selected from the group consisting of SEQ ID NO:1 and SEQ ID NO:4.

8. The mdc protein of claim 5, wherein the protein comprises amino acid numbers 107-769 of SEQ ID NO:4.

9. The mdc protein of claim 5, wherein the protein comprises amino acid numbers 107-594 of SEQ ID NO:4.

10. The mdc protein of claim 5, wherein the protein comprises amino acid numbers 1-495 of SEQ ID NO:3.

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to MDC proteins, DNAs encoding the same, and gene analysis methods using the DNAs. The present invention can be utilized in such fields as medical treatment and diagnosis.

2. Description of the Related Art

The opinion that mutations in cellular proteins play an important role in the onset of cancer has been known for long. Recent advancement in genetic engineering enables analysis of gene mutations in tumor cells, and has brought about a marked progress in the field of cancer research.

Up to this time, the analysis and identification of oncogenes have made such progress that the number thereof has amounted to several tens. On the other hand, attention has been focused on tumor suppressor genes for these several years. The tumor suppressor genes which have been discovered thus far include the Rb gene for retinoblastoma (Friend, S. H. et al., Proc. Natl. Acad. Sci. USA, 84, 9095, 1987), the p53 gene (Lane, D. P. et al., Nature, 278, 261, 1979) and the APC gene (Kenneth, W. K. et al., Science, 253, 661, 991) for colorectal tumor, the WT1 gene for Wilms' tumor (Call, K. M. et al., Cell, 60, 509, 1990), and the like. In the case of the p53 gene, some families are known to be inheriting mutations in the gene ["Li-Fraumeni syndrome" (Makin, D. et al., Science, 250, 1233, 1990; Srivastava, S. et al., Nature, 348, 747, 1990)]. Moreover, it is becoming increasingly clear that defects in multiple genes, and not in a single gene, contribute to the progression of the malignant phenotype of cancer, and it is believed that there exist much more unidentified oncogenes and tumor suppressor genes. The discovery and elucidation of them are expected by not only investigators and clinicians, but also common people in all the world.

Breast cancer is classified into hereditary (familial) breast cancer and nonhereditary (sporadic) breast cancer, and hereditary breast cancer is classified into early-onset and late-onset diseases according to the age of onset. It has been revealed by linkage analyses that, at least early-onset familial, breast cancer linked to a very small region on chromosome 17 (Hall, J. M. et al., Science, 250, 1684-1689, 1990). Moreover, it has been shown that hereditary ovarian cancer is also linked to the same region (Narod, S. A. et al., Lancet, 338, 82-83, 1991).

Accordingly, it is believed that a tumor suppressor gene is present in this region and protein deficiency or mutation induced by an allelic deletion or mutation of the gene is one of the causes of breast and ovarian cancers.

It is believed that in the onset of common (sporadic) breast cancer as well, the occurence of an acquired mutation or allelic deletion of the gene in this region results in protein mutation or deficiency and this causes the transformation of a normal cell to a breast cancer (Sato et al., Cancer Res., 51, 5794-5799, 1991). Consequently, isolation of the causative gene present in this region and identification of the protein encoded by the gene are expected as an urgent problem to not only physicians and investigators in all the world, but also common people, particularly women in Europe and America where there are numerous patients with breast cancer.

The present invention provides novel proteins involved in breast and ovarian cancers, DNAs encoding them, and methods for the testing and diagnosis of cancer by using them.

The present inventors disclose a novel gene encoding a 524-amino acid protein which was isolated from chromosomal region 17q21.3 where a tumor suppressor gene(s) for breast and ovarian cancers is thought to be present (Nature genetics, 5, 151-157, 1993; this paper is refered in Nature genetics, 5, No. 2, 101-102, 1993).

DISCLOSURE OF THE INVENTION

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing the positions on chromosome 17 to which 342 cosmid clones hybridize. Clone names are designated by clone numbers alone.

FIG. 2 is a diagram showing partial deletions on chromosome 17q in ovarian cancers. Solid circles represent the loss of heterozygosity (LOH) and open circles represent the retention of both alleles. Two commonly deleted regions are designated by sidelines.

FIG. 3 is a diagram showing partial deletions on chromosome 17q in breast cancers. Solid circles represent the loss of heterozygosity (LOH) and open circles represent the retention of both alleles. Two commonly deleted regions are designated by sidelines.

FIGS. 4a, 4b, and 4c is a diagram showing the process starting with markers on chromosome 17q21.3 and leading to the isolation of the gene, as well as the regions where genomic rearrangements occurred in tumor tissues (hatched boxes). Clone names are designated by clone numbers alone.

FIGS. 5a, 5b 6 and 7 are diagrams showing the detection of genomic rearrangements in breast cancers by Southern-blot analysis. Symbols N and T represent DNAs from normal tissue and tumor tissue, respectively.

FIG. 8 is a graph showing a working curve for determining the concentration of the MDC protein by ELISA using a monoclonal antibody and a rabbit polyclonal antibody.

SUMMARY OF THE INVENTION

The present inventors constructed a multitude of cosmid clones having DNA fragments of human chromosome 17 introduced thereinto. Then, each of the multitude of cosmid clones was localized throughout the chromosome by fluorescent in-situ hybridization (FISH; Inazawa et al., Genomics, 10, 1075-1078, 1991). The cosmid clones (cosmid markers), localized on the chromosome, enabled construction of a high-resolution physical map of human chromosome 17. The clone names of the cosmids as probes, i.e., the probe names, their detailed map positions and diagrammatical summary of the mapping are shown in Tables 1-3 and FIG. 1, respectively. In FIG. 1, clone names are designated by clone numbers alone.

TABLE 1

__________________________________________________________________________

Probe Locus

Chromosomal

Probe Locus

Chromosomal

No.

name symbol

localization

No.

name symbol

localization

__________________________________________________________________________

1 cCI17-1 17q21.1

67 cCI17-501 17q21.3

2 cCI17-7 17q22 68 cCI17-502 17p11.2

3 cCI17-11 17p11.2

69 cCI17-504 17q24

4 cCI17-24 17q21.1-q21.2

70 cCI17-505

D17S544

17p12-p11.1

5 cCI17-25 17q12 71 cCI17-506

D17S545

17q21

6 cCI17-28 17q21.3

72 cCI17-507 17q21.3

7 cCI17-32 17q11.2

73 cCI17-508

D17S546

17q25.1-q25.2

8 cCI17-35 17q21.3

74 cCI17-509 17q24

9 cCI17-44 17q23.1

75 cCI17-510 17q23

10 cCI17-50 17q23.1

76 cCI17-511 17q25.1-q25.2

11 cCI17-57 17q21 77 cCI17-513

D17S548

17q11.2

12 cCI17-63 17q21.3

78 cCI17-514 17q25.1

13 cCI17-90 17q12-q21.1

79 cCI17-515 17q22

14 cCI17-95 17q23.1

80 cCI17-516

D17S550

17q25.1

15 cCI17-96 17q21.3

81 cCI17-517 17q21.3

16 cCI17-97 17q21.31

82 cCI17-518 17q25.3

17 cCI17-315

D17S521

17q25.1-q25.2

83 cCI17-519

D17S551

17q25.2-q25.3

18 cCI17-316 17q12-q21.1

84 cCI17-520 17q11.2

19 cCI17-317 17q11.2

85 cCI17-521 17q25.1-q25.2

20 cCI17-321 Centromere

86 cCI17-523 17q22

21 cCI17-403 17q21.2-q21.3

87 cCI17-524 17q21.1-q21.2

22 cCI17-412 17q11.2

88 cCI17-525 17p13.1

23 cCI17-415 17q21.1-q21.2

89 cCI17-526 17q11.2-q12

24 cCI17-422 17q22 90 cCI17-527 17q21.3

25 cCI17-425 17q11.2

91 cCI17-528 17q22

26 cCI17-428 17q23.1

92 cCI17-529

D17S552

17q25.1-q25.2

27 cCI17-451 17q21.1-q21.2

93 cCI17-530 17q23

28 cCI17-452

D17S524

17q25 94 cCI17-532 17p11.2

29 cCI17-453

D17S525

17p13 95 cCI17-533 17q21.3

30 cCI17-454

D17S526

17q23 96 cCI17-535 17q12-q21.1

31 cCI17-456

D17S527

17q23.1-q23.2

97 cCI17-536 17p11.2

32 cCI17-457 17q11.2

98 cCI17-539 17q21.3

33 cCI17-458

D17S528

17q21.1-q21.2

99 cCI17-540 17q25

34 cCI17-460

D17S529

17q11.2

100

cCI17-541 17q21.3

35 cCI17-462 17q23 101

cCI17-542 17q21.3

36 cCI17-463 17q21 102

cCI17-543 17q11.2-q12

37 cCI17-464 17q24.3-q25.1

103

cCI17-544 17q25.1

38 cCI17-465

D17S531

17q25.1-q25.2

104

cCI17-545 17q25.1

39 cCI17-466 17q25.1-q25.2

105

cCI17-546 17q24.3-q25.1

40 cCI17-467 17q25 106

cCI17-547 17q21.3

41 cCI17-468

D17S532

17q11.2

107

cCI17-548 17q23

42 cCI17-469

D17S533

17q25.2-q25.3

108

cCI17-549 17q25.3

43 cCI17-471 17q13.3-p13.2

109

cCI17-550 17q23

44 cCI17-473

D17S534

17q11.2

110

cCI17-551 17q25.1-q25.2

45 cCI17-475

D17S535

17q11.2

111

cCI17-552 17q12

46 cCI17-477 17q21.3

112

cCI17-553 17q23

47 cCI17-479 17q21.3

113

cCI17-554 17q25.1

48 cCI17-480 17q25.1-q25.2

114

cCI17-557 17q25.1-q25.2

49 cCI17-482

D17S536

17q11.2

115

cCI17-559 17q24.3-q25.1

50 cCI17-483 17p13 116

cCI17-560 17q25.1-q25.2

51 cCI17-484

D17S537

17p13.1

117

cCI17-561 17q25

52 cCI17-485 17q12 118

cCI17-562 17q11.2

53 cCI17-486 17q25.1

119

cCI17-563 17q25.1

54 cCI17-487

D17S538

17q25.1

120

cCI17-564 17q25.1

55 cCI17-488

D17S539

17p13.2-p13.1

121

cCI17-565 17q23

56 cCI17-489

D17S540

17q23 122

cCI17-567 17q21.3

57 cCI17-490 17q11.2

123

cCI17-568 17q25

58 cCI17-491 17p13.1

124

cCI17-569 17q12

59 cCI17-492

D17S542

17q11.2

125

cCI17-570 17q11.1

60 cCI17-493 17q25.1-q25.2

126

cCI17-571 17p13

61 cCI17-494 17q22 127

cCI17-572 17q25.1

62 cCI17-495 17q25 128

cCI17-573 17q25.3

63 cCI17-497 17q11.2

129

cCI17-574 17q12-q21.2

64 cCI17-498 17p11.2

130

cCI17-576 17q21.1-q21.2

65 cCI17-499 17q21.1-q21.2

131

cCI17-577 17q25.1

66 cCI17-500 17p.12 132

cCI17-578 17q11.2-q12

__________________________________________________________________________

TABLE 2

__________________________________________________________________________

Probe Locus

Chromosomal

Probe Locus

Chromosomal

No.

name symbol

localization

No.

name symbol

localization

__________________________________________________________________________

133

cCI17-579 17q25.1

198

cCI17-652 17q22

134

cCI17-581 17q11.2

199

cCI17-653 17q22

135

cCI17-582 17q21.3

200

cCI17-654 17p13

136

cCI17-583 17q12-q21.1

201

cCI17-655 17q23

137

cCI17-584 17q21.3

202

cCI17-656 17q25.1

138

cCI17-586 17p13 203

cCI17-657 17p13

139

cCI17-587 17p13 204

cCI17-658 17q21.3

140

cCI17-588 17p13.1

205

cCI17-659 17q25.1

141

cCI17-590 17q12 206

cCI17-660 17q25.1

142

cCI17-591 17q24 207

cCI17-662 17p12

143

cCI17-592 17q21.3

208

cCI17-663 17q25.1

144

cCI17-593 17q25.1-q25.2

209

cCI17-664 17q25.1-q25.2

145

cCI17-594 17q25.2-q25.3

210

cCI17-665 17q23

146

cCI17-595 17q25.1-q25.2

211

cCI17-666 17q23.1

147

cCI17-596 17q11.2

212

cCI17-667 17q24

148

cCI17-597 17q25.3

213

cCI17-668 17q22

149

cCI17-598 17q12-q21.1

214

cCI17-669 17p13

150

cCI17-599 17q12 215

cCI17-670 17q21.3

151

cCI17-600 17q23 216

cCI17-671 17q11.2

152

cCI17-601 17q21.1-q21.2

217

cCI17-672 17q25.1-q25.2

153

cCI17-602 17q11.2-q12

218

cCI17-673 17q12-q21.1

154

cCI17-603 17p11.2

219

cCI17-674 17q21.3

155

cCI17-604 17q23 220

cCI17-675 17q21.3

156

cCI17-605 17q21.1-q21.2

221

cCI17-676 17q23

157

cCI17-606 17p13 222

cCI17-677 17q12-q21.1

158

cCI17-607 17q25 223

cCI17-678 17q23

159

cCI17-608 17p11.2

224

cCI17-679 17q23.1

160

cCI17-609 17q21.3

225

cCI17-680 17p13

161

cCI17-610 17q12-q21.1

226

cCI17-681 17p11.1-p11.2

162

cCI17-611 17q22 227

cCI17-683 17q11.2

163

cCI17-512 17q21.3

228

cCI17-684 17q25.1-q25.2

164

cCI17-613 17q25.1

229

cCI17-685 17p13

165

cCI17-614 17q21.3

230

cCI17-687 17q12

166

cCI17-615 17q21.1

231

cCI17-688 17p11.2

167

cCI17-616 17q25.1

232

cCI17-690 17q11.2-q12

168

cCI17-617 17q21.3

233

cCI17-691 17q25.1

169

cCI17-618 17q23.1

234

cCI17-692 17q23

170

cCI17-619 17q21.3

235

cCI17-693 17p11.2

171

cCI17-621 17q25.1-q25.2

236

cCI17-694 17p11.2

172

cCI17-622 17q12 237

cCI17-695 17p11.2

173

cCI17-623 17q25.1-q25.2

238

cCI17-696 17q23.3

174

cCI17-624 17p13 239

cCI17-697 17q25

175

cCI17-625 17q23 240

cCI17-698 17q11.2

176

cCI17-626 17q23 241

cCI17-699 17q23

177

cCI17-627 17p13 242

cCI17-700 17q23

178

cCI17-628 17q23 243

cCI17-701 17q21.3

179

cCI17-630 17q11.2

244

cCI17-702 17q25.2-q25.3

180

cCI17-631 17p11.2

245

cCI17-703 17p13

181

cCI17-662 17q22 246

cCI17-704 17q23

182

cCI17-633 17q12 247

cCI17-705

D17S554

17p11.2

183

cCI17-634 17q21.3

248

cCI17-706

D17S555

17q12

184

cCI17-636 17p13 249

cCI17-707

D17S556

17q25.1-q25.2

185

cCI17-637 17q12 250

cCI17-708 17p13

186

cCI17-638 17p11.2

251

cCI17-709 17p12

187

cCI17-639 17q12 252

cCI17-710

D17S557

17q25.3

188

cCI17-640 17q11.2

253

cCI17-711 17q32.1

189

cCI17-641 17q25.1

254

cCI17-712

D17S558

17p11.2

190

cCI17-642 17q12-q21.1

255

cCI17-713

D17S559

17p13

191

cCI17-643 17q21.3

256

cCI17-714

D17S560

17q25.3

192

cCI17-644 17q23 257

cCI17-715 17q21.3

193

cCI17-645 17p13 258

cCI17-716

D17S561

17p13

194

cCI17-646 17q13 259

cCI17-717 17p13

195

cCI17-647 17q25.1-q25.2

260

cCI17-719 17q25

196

cCI17-650 17q12 261

cCI17-731 17q23

197

cCI17-651 17q25.1

262

cCI17-722

D17S563

17q25.2-q25.3

__________________________________________________________________________

TABLE 3

__________________________________________________________________________

Probe Locus

Chromosomal

Probe Locus

Chromosomal

No.

name symbol

localization

No.

name symbol

localization

__________________________________________________________________________

263

cCI17-723 17p13 304

cCI17-834 17q11.2-q12

264

cCI17-724

D17S564

17p11.2

305

cCI17-835 17q21.3

265

cCI17-726 17q25 306

cCI17-841 17p12

266

cCI17-727

D17S566

17p13 307

cCI17-1005 17q21.3

267

cCI17-728

D17S567

17p12 308

cCI17-1008 17q21.3

268

cCI17-729

D17S568

17q11.2

309

cCI17-1016 17q23.1

269

cCI17-730 17q21.3

310

cCI17-1018 17q21.2-21.3

270

cCI17-732

D17S570

17p13.2

311

cCI17-1019 17q23.1

271

cCI17-733 17q25.1

312

cCI17-1024 17q12

272

cCI17-735

D17S572

17q25.3

313

cCI17-1029 17q11.2

273

cCI17-736

D17S573

17q21.3

314

cCI17-1030 17q22

274

cCI17-737

D17S557

17q25.2-q25.3

315

cCI17-1031 17q11.2

275

cCI17-739

D17S575

17q25.1

316

cCI17-1032 17q23.1-23.2

276

cCI17-741 17q25.3

317

cCI17-1049 17q21.3

277

cCI17-742 17q25 318

cCI17-1055 17q21.3

278

cCI17-743 17q23.3

319

cCI17-1059 17q21.1-q21.2

279

cCI17-744 17q23 320

cCI17-1063 17q12

280

cCI17-745

D17S577

17p13 321

cCI17-1073 17q11.2

281

cCI17-801 17q11.2-q12

322

cCI17-1079 17q12

282

cCI17-802 17p11.2

323

cCI17-1082 17q22

283

cCI17-808 17q25.1-q25.2

324

cCI17-1094 17q21.1

284

cCI17-809 17q23 325

cCI17-1101 17q12

285

cCI17-810 17p13.2-p13.1

326

cCI17-1103 17q11.2

286

cCI17-812 17q25.1

327

cCI17-1106 17q11.2

287

cCI17-813 17q23 328

cCI17-1702 17q21.1-q21.2

288

cCI17-814 17p11.2

329

cCI17-1705 17q21.2-q21.3

289

cCI17-815 17q24 330

cCI17-1706 17q21.1-q21.2

290

cCI17-816 17q23 331

cCI17-1707 17q21.2-q21.3

291

cCI17-817 17q23 332

cCI17-1709 17q12

292

cCI17-818 17p11.2

333

cCI17-1710 17q21.3

293

cCI17-820 17q12 334

cCI17-1711 17q12

294

cCI17-821 17p13 335

cCI17-1715 17q12

295

cCI17-822 17q11.1

336

cCI17-1717 17q21.3

296

cCI17-823 17q12 337

cCI17-1719 17q11.2

297

cCI17-825 17p11.2

338

cCI17-1720 17q24.3-q25.1

298

cCI17-826 17q11.2

339

cCI17-1722 17q23

299

cCI17-827 17p11.2

340

cCI17-1723 17q21.3

300

cCI17-828 17p11.2

341

cCI17-1724 17q11.2

301

cCI17-831 17q25.1-q25.2

342

cCI17-1725 17q21.1

302

cCI17-832 17q11.2

343

pCMM86 17q23

303

cCI17-833 17q23

__________________________________________________________________________

From among these markers, ones exhibiting restriction fragment length polymorphism (RFLP) in which the lengths of restriction fragments vary with the individual, namely RFLP markers, were selected. The selected marker clones, the restriction enzymes used, and the particular lengths of several fragments detected thereby are shown in Tables 4-6.

TABLE 4

__________________________________________________________________________

Probe Locus Allele size

Chromosomal

No.

name symbol

Enzyme (frequency)

localization

__________________________________________________________________________

2 cCI17-7

D17S860

PvuII 3.0 kb(0.33)

1.8 + 1.2 kb(0.67)

16 cCI17-97

D17S861

PstI 8.2 kb(0.92)

17q21.3

4.7 + 3.5 kb(0.08)

17 cCI17-315

D17S521

TaqI 2.0 kb(0.67)

17q25.1-q25.2

1.8 kb(0.33)

18 cCI17-316

D17S862

MspI 3.1 kb(0.33)

17q12-q21.1

2.7 kb(0.67)

19 cCI17-317

D17S522

TaqI 2.6-3.9 kb 4 alleles VNTR, 60%

17q11.2

heterozygosity also polymorphic with

MspI, PstI, PvuII

29 cCI17-453

D17S525

BglII 5.8-7.5 kb 4 alleles VNTR, 50%

17p13

heterozygosity also polymorphic with

EcoRI, TaqI, PstI, PvuII, MapI

42 cCI17-469

D17S533

MspI 2.0-2.6 kb 5 alleles VNTR, 83%

17q25.2-q25.3

heterozvgosity also polymolphic with

EcoRI, TaqI, PvuII

54 cCI17-487

D17S538

EcoRI 5.8 kb(0.75)

17q25.1

3.3 kb(0.25)

56 cCI17-489

D17S540

MspI 3.3 kb(0.25)

17q23

2.1 kb(0.50)

TaqI 1.5 kb(0.50)

1.35 kb(0.50)

PvuII 1.2 kb(0.50)

0.7 kb(0.50)

58 cCI17-491

D17S863

TaqI 3.6 kb(0.75)

17p13.1

3.3 kb(0.25)

59 cCI17-492

D17S542

BglII 2.1 kb(0.40)

17q11.2

1.4 kb(0.60)

61 cCI17-494

D17S865

EcoRI 10.3 kb(0.92)

7.8 kb(0.008)

70 cCI17-505

D17S544

MspI 3.1 kb(0.58)

17p12-p11.1

3.0 kb(0.42)

TaqI 4.1 kb(0.67)

2.7 + 1.4 kb(0.33)

71 cCI17-506

D17S545

MspI 3.0 kb(0.33)

17q21

2.6 kb(0.67)

73 cCI17-508

D17S546

MspI 4.6 kb(0.50)

17q25.1-q25.2

4.0 kb(0.50)

80 cCI17-516

D17S550

TaqI 4.1 kb(0.25)

17q25.1

2.4 + 1.7 kb(0.75)

PvuII 3.4 kb(0.83)

2.2 kb(0.17)

88 cCI17-525

D17SB66

MspI 2.7 kb(0.42)

2.3 kb(0.58)

118

cCI17-562

D17S5867

TaqI 3.5 kb(0.42)

3.2 kb(0.58)

PvuII 7.1 kb(0.92)

6.6 kb(0.08)

137

cCI17-584

D17S868

MspI 3.8 kb(0.25)

3.6 kb(0.75)

166

cCI17-615

D17S869

PstI 5.2 kb(0.42)

4.7 kb(0.58)

243

cCI17-701

D17S870

TaqI 1.7-2.5 kb 6 alleles VNTR, 67%

17q21.3

heterozygosity also polymorphic with

MspI, PstI, PvuII, RsaI

244

cCI17-702

D17S871

MspI 4.1 kp(0.83)

17q25.2-q25.3

3.4 kb(0.17)

RsaI 5.2 kb(0.83)

4.1 kb(0.17)

BglII 6.6 kb(0.83)

5.6 kb(0.17)

PvuII 2.9 kb(0.83)

2.2 kb(0.17)

__________________________________________________________________________

TABLE 5

__________________________________________________________________________

Probe Locus Allele size

Chromosomal

No.

name symbol

Enzyme (frequency)

localization

__________________________________________________________________________

245

cCI17-703

D17S877

TaqI 2.6-3.8 kb 4 alleles VNTR, 50%

17p13

heterozygosity also polymorphic with

MspI, RsaI, PstI, PvuII

247

cCI17-705

D17S554

PstI 4.3 kb(0.50)

17p11.2

2.3 + 2.0 kb(0.50)

250

cCI17-708

D17S878

Pvu11 2.6-9.0 kb 10 alleles VNTR, 87%

17p13

heterozygosity also polymorphic with

MspI, TaqI, BglII, PstI, EcoRI

252

cCI17-710

D17S557

MspI 2.0-2.6 kb 5 alleles VNTR, 100%

17q25.3

heterozygosity also polymorphic with

RsaI, TaqI, PstI, PvuII, EcoRI

254

cCI17-712

D17S558

MspI 3.1 kb(0.58)

17p11.2

2.9 kb(0.42)

TaqI 6.6 kb(0.67)

4.3 + 2.3 kb(0.33)

PvuII 7.1 kb(0.50)

3.9 + 3.2 kb(0.50)

255

cCI17-713

D17S559

MspI 2.2-2.8 kb 3 alleles VNTR, 50%

17p13

heterozygosity also polymorphic with

PstI

256

cCI17-714

D17S560

RsaI 4.5 kb(0.58)

17q25.3

4.3 kb(0.42)

TaqI 3.8 kb(0.75)

2.8 kb(0.25)

BglII 3.8 kb(0.58)

3.5 kb(0.42)

PvuII 2.6 kb(0.58)

2.4 kb(0.42)

1.5 kb(0.58)

1.4 kb(0.42)

257

cCI17-715

D17S872

Pst1 3.3 kb(0.17)

17q21.3

3.0 kb(0.83)

EcoR1 3.6 kb(0.87)

3.3 kb(0.13)

258

cCI17-716

D17S561

TaqI 2.4kb(0.87)

17p13

1.3 + 1.1 kb(0.13)

261

cCI17-721

D17S864

RsaI 2.9 kb(0.25)

17q22-q23

1.6 kb(0.75)

BglII 4.4 kb(0.83)

3.9 kb(0.17)

262

cCI17-722

D17S563

MspI 4.1 kb(0.83)

17q25.2-q25.3

3.4 kb(0.17)

RsaI 5.2 kb(0.83)

4.1 kb(0.17)

Bg1II 6.6 kb(0.83)

5.6 kb(0.17)

PvuII 2.9 kb(0.83)

2.2 kb(0.17)

EcoRI 13.0 kb(0.75)

12.5 kb(0.25)

263

cCI17-723

D17SB73

MspI 3.0 kb(0.33)

17p13

1.7 kb(0.67)

RsaI 0.8 kb(0.70)

0.5 kb(0.30)

TaqI 3.6 kb(0.33)

1.9 kb(0.67)

PstI 5.8 kb(0.50)

5.3 kb(0.50)

PvuII 4.6 kb(0.58)

4.2 kb(0.42)

266

cCI17-727

D17S566

PvuII 2.6-9.0 kb 10 alleles VNTR, 87%

17p13

heterozygosity also polymorphic with

MspI, TaqI, Bag1II, PstI, EcoRI

268

cCI17-729

D17S568

MspI 4.6 kb(0.58)

17q11.2

2.6 kb(0.42)

__________________________________________________________________________

TABLE 6

__________________________________________________________________________

Probe Locus Allele size

Chromosomal

No.

name symbol

Enzyme (frequency)

localization

__________________________________________________________________________

269

cCI17-730

D17S874

MspI 2.2-3.5 kb 4 alleles VNTR, 83%

17q21.3

heterozygosity also polymorphic with

TaqI, Bg1II, PstI, PvuII

270

cCI17-732

D17S570

RsaI 3.2 kb(0.50)

17p13.2

2.7 kb(0.50)

Bg1II 8.5 kb(0.50)

3.2 kb(0.50)

PstI 2.5 kb(0.58)

1.7 kb(0.42)

PvuII 4.2 kb(0.50)

4.1 kb(0.50)

271

cCI17-733

D17S875

MspI 3.4 kb(0.75)

17q25.1

2.6 kb(0.25)

272

cCI17-735

D17S572

MspI 4.1 kb(0.83)

17q25.3

3.4 kb(0.17)

RsaI 5.2 kb(0.83)

4.1 kb(0.17)

PvuII 2.9 kb(0.83)

2.2 kb(0.17)

273

cCI17-736

D17S573

TaqI 1.7-2.5 kb 7 alleles VNTR, 100%

17q21.3

heterozygosity also polymorphic with

MspI, RsaI, PstI, PvuII

275

cCI17-739

D17S575

MspI 3.3 kb(0.33)

17q25.1

2.4 kb(0.67)

278

cCI17-743

D17S876

TaqI 4.3 kb(0.17)

2.8 kb(0.83)

__________________________________________________________________________

RFLP markers are characterized in that they can be used to distinguish between two alleles inherited from parents by the difference in polymorphism ("informative") [however, they are indistinguishable when both of them have the same polymorphic pattern ("not informative")]. If such a difference in polymorphic pattern between two alleles ("heterozygosity") exists in normal tissues and the loss of heterozygosity (LOH) is detected in tumor tissues, this implies the allelic deletion in the RFLP marker site on a specific chromosome of tumor tissues. It is generally believed that the inactivation of tumor suppressor genes on both alleles, as caused by the deletion of one allele and the mutation in the other, may lead to malignant transformation. Thus, it is assumed that a tumor suppressor gene is present in a region commonly deleted in many cancers.

Using the detailed chromosome map and RFLP markers thus obtained, the present inventors examined about 300 breast cancers and about 100 ovarian cancers for LOH in chromosome 17. As a result, it was revealed that, in informative cases, a region (of 2.4 cM) lying between cosmid markers cCI17-701 and cCI17-730 located in the neighborhood of 17q21 was deleted with high frequency.

FIG. 2 shows partial deletions on chromosome 17q in ovarian cancers. Solid circles represent the loss of heterozygosity (LOH) and open circles represent the retention of both alleles. Two commonly deleted regions are designated by sidelines.

FIG. 3 shows partial deletions on chromosome 17q in breast cancers. Solid circles represent the loss of heterozygosity (LOH) and open circles represent the retention of both alleles. Two commonly deleted regions are designated by sidelines.

One of the commonly deleted region partially overlapped with the region in which the presence of a causative gene was suggested by linkage analyses of families affected with hereditary breast cancer. When 650 cases of sporadic breast cancer were examined for somatic rearrangements by Southern-blot analysis using cosmids located to the overlapping region as probes, it was revealed that a partial region in the DNA of cosmid clone cCI17-904, which had been selected as described above, detected amplification. On closer examination of this alterations, it was found that segments each having about 6-9 kb were connected with each other to form an abnormal repetition consisting of about 4-6 copies. Moreover, a gene encoding a novel protein was isolated by screening cDNA (DNA having a complementary base sequence reverse-transcribed from messenger RNA) libraries by using, as probe, a restriction fragment of this cosmid clone having a sequence which was conserved among other species. When the sequence structure of this gene was determined and the presence or absence of genomic alterations of this gene in breast cancers was examined, a distinct gene mutation was identified. These results have revealed that deficiency or mutation in this protein and the allelic deletion or mutation of the DNA encoding it deeply participate in the onset of breast and ovarian cancers.

FIG. 4 shows the above-described process starting with a group of markers and leading to the isolation of the gene, as well as the regions where genomic rearrangements occurred in tumor tissues. Clone names are designated by clone numbers alone.

The present invention is very important in that it can provide methods and materials for solving difficult problems (such as risk diagnosis, early finding, course watching, determination of a treatment plan, and estimation of prognosis) concerning at least a part of breast and ovarian cancers, for example, by examining the presence or absence of deficiency or mutation in the protein of the present invention or the presence or absence of the allelic deletion or mutation of the gene encoding it, and thereby bring about a marked advance in the technology in this field.

Specifically, the present invention provides (1) an MDC protein which comprises the whole or part of the protein represented by SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4, or which consists of a protein substantially equivalent to one comprising the whole or part of the protein represented by SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4, (2) a DNA which comprises the whole or part of the DNA represented by SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:9, or which consists of a DNA substantially equivalent to one comprising the whole or part of the DNA represented by SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 or SEQ ID NO:9, (3) a plasmid containing the DNA as set forth in the above (2), a transformant carrying the plasmid, i.e., a transformant transformed with the plasmid, and a process for the production of the MDC protein described above, which comprises the steps of culturing the transformant described above and collecting the resulting expression product, (4) an antibody which can bind to the MDC protein described above as an antigen, and (5) a primer, probe or marker which has a DNA sequence comprising a part of the DNA sequence of the DNA as set forth in the above (2), or a DNA sequence complementary to a part of the DNA sequence of the DNA as set forth in the above (2), and a gene analysis method which comprises the step of hybridizing the primer or probe described above to a DNA to be tested.

The term "MDC protein" in this specification means a protein and a peptide (including a oligopeptide and a polypeptide) involved in the definition of the term, "the MDC protein".

Further scope and applicability of the present invention will become apparent from the detailed description given hereinafter. However, it should be understood that the detailed description and specific examples, while indicating preferred embodiments of the invention, are given by way of illustration only, since various changes and modifications within the spirit and scope of the invention will become apparent to those skilled in the art from this detailed description. The present invention will be specifically described hereinbelow.

DETAILED DESCRIPTION OF THE INVENTION

(1) Isolation of cDNA clones

Cosmid clones having a DNA derived from human chromosome 17 introduced thereinto can be produced, for example, by extracting chromosomal DNA from a human-mouse hybrid cell line containing a single human chromosome17 in a mouse genomic background, and incorporating fragments of the chromosomal DNA into a vector such as pWEX15, according to a method reported by Tokino et al. (Tokino et al., Am. J. Hum. Genet., 48, 258-268, 1991). From among them, clones having an insert derived from the human chromosome can be selected by colony hybridization using the whole human DNA as probe.

The map position of each of the cosmid clones can be determined by FISH. Then, they can be used as markers to construct a high resolution physical chromosome map. Moreover, RFLP markers can be selected on the basis of the fragment length pattern in southern blot analysis (Nakamura et al., Am. J. Hum. Genet., 43, 854-859, 1988). If this map and these RFLP markers are utilized to examine DNAs obtained from the tumor tissues of cancer patients for LOH (loss of heterozygosity), the commonly deleted region on the chromosome in the tumor tissues can be localized to a very small region near q21 of chromosome 17.

Southern-blot analysis of the DNAs from tumor tissues by using a cosmid clone, whose hybridizable portion is present in this localized region, as probe makes it possible to select clones having a DNA sequence associated with genomic alterations in the tumor tissues. Moreover, Southern-blot analysis of the chromosomal DNAs of various mammals by using restriction fragments of the cosmid clone as probes makes it possible to select a fragment containing a DNA sequence conserved among other species and involved in fundamental cellular functions. DNA sequences encoding important proteins are often conserved among other species. In fact, many of the hitherto isolated genes for hereditary diseases are conserved among other species (Call, K. M. et al., Cell, 60, 509-520, 1990).

If the DNA fragment thus obtained is used as probe, the cDNA of the gene present in a localized region near q21 of human chromosome 17 can be cloned. The base sequence of this cDNA can be determined by a conventional manner (Maniatis, J. et al., Molecular Cloning 2nd. ed., Cold Spring Harbor Laboratory Press, N.Y. 1989).

In order to confirm that the DNA clones thus obtained are clones of the desired causative gene, their sequences may be used to examine the presence or absence of genomic alterations in cancer patients and the incidence of genomic alterations according to the SSCP method (Orita, M. et al., Genomics, 5, 874-879, 1984; Orita, M. et al., Cell, 60, 509-520, 1990), the RNase protection method (Winter, E., Perucho, M. et al., Proc. Natl. Acad. Sci. USA, 82, 7575-7579, 1985; Myers, R. M. et al., Science, 230, 1242-1246, 1985) and other methods.

(2) Confirmation of the whole structure of the gene

It has been confirmed that the DNA sequences of two cDNAs obtained by the above-described procedure are novel and are those of the DNAs represented by SEQ ID NO:6 and SEQ ID NO:7. The corresponding amino acid sequences have also been identified as those of the proteins represented by SEQ ID NO:2 and SEQ ID NO:3. Moreover, 5'-RACE and RT-PCR have revealed the DNA sequence of the DNA represented by SEQ ID NO:8, and the amino acid sequence of the protein represented by SEQ ID NO:4 has been deduced as one corresponding to the DNA sequence. Furthermore, with regard to genomic DNA, the structure of the DNA represented by SEQ ID NO:9 including introns and exons has been revealed by analyzing the base sequence of the original cosmid clone cCI17-904 and comparing it with the base sequence of the isolated cDNA clone to determine the intron-exon junctions.

By the present inventors, proteins comprising the whole or part of the amino acid sequence of the protein represented by SEQ ID NO:1, which is an amino acid sequence common to all of the above-described proteins, are named MDC proteins and will hereinafter be referred to as MDC proteins.

The term "a part of the protein" means, for example, a polypeptide having or comprising an amino acid sequence consisting of a continuous, at least three amino acids which is described in SEQ ID NO:1. The amino acid sequence consists of preferably at least three to five amino acids, still more preferably at least eight or at least eight to ten amino acids, and most preferably at least eleven to twenty amino acids. It is to be understood that polypeptides each having or comprising an amino acid sequence consisting of a continuous, more than 20 amino acids which is described in SEQ ID NO:1 can also be used.

As used herein, the term "substantially equivalent" means that, in proteins comprising the whole or part of the amino acid sequence of the protein represented by, for example, SEQ ID NO:1, their amino acid sequences are attended with the replacement, deletion and/or insertion of one or more amino acids, but they can produce an equal effect in research and diagnosis using the proteins comprising the whole or part of the amino acid sequence of the protein represented by, for example, SEQ ID NO:1. Such equivalents also fall within the scope of the present invention and also called as MDC proteins.

The DNA sequence common to all DNAs encoding MDC proteins is one of the DNA represented by SEQ ID NO:5.

A DNA in accordance with the present invention can be utilized in gene analysis and diagnosis. That is, a primer or probe comprising a part of the DNA sequence of the DNA according to the present invention, or comprising a DNA sequence complementary to a part of the DNA sequence of the DNA according to the present invention is used in gene analysis and diagnosis.

Part of the DNA sequence consists of at least six bases, preferably at least 8 bases, still more preferably 10-12 bases and particularly preferably about 15-25 bases. That is, the oligonucleotide used as primer or probe comprises at least six bases derived from the DNA sequence of the DNA according to the present invention or derived from the DNA sequence complementary to the DNA sequence of the DNA according to the present invention, and, if necessary, other base(s).

In connection with the DNAs of the present invention, the term "substantially equivalent" has the same meaning as described above for the proteins, except that their base sequences are attended with the replacement, deletion and/or insertion of one or more bases.

The introduction of replacement, deletion and insertion mutations into a particular base sequence can be accomplished according to any of conventional methods including those described in F. M. Ausubel et al., "Current Protocols in Molecular Biology", 1987, Chapter 8.

The MDC protein encoded by the DNA according to the present invention, i.e., the MDC protein according to the present invention, can be utilized by using it as an epitope to prepare an antibody. This antibody can be used in experimental and diagnostic reagents. The term "epitope" means an antigenic determinant of a polypeptide and is generally composed of at least 5 amino acids. It is well known that a polypeptide composed of 6 amino acids binds with an antibody, as disclosed in, for example, Published Japanese Translation of International Patent Application No. 60-500684.

(3) Recombinant expression vectors and transformants generated therewith

A transformant can be obtained by incorporating a DNA encoding human MDC protein, which has been obtained by the above-described procedure, or a fragment thereof into a suitable vector and introducing this vector into suitable host cells. By culturing this transformant with a conventional manner, large amounts of human MDC protein can be obtained from the culture. More specifically, a recombinant expression vector can be produced by linking a DNA encoding a human MDC protein or a fragment thereof on the downstream side of the promoter of a vector suited for its expression according to a well-known method using restriction enzymes and DNA ligase. Usable vectors include, for example, plasmids pRB322 and pUC18 derived from Escherichia coli, plasmid pUB110 derived from Bacillus subtilis, plasmid pRB15 derived from yeast, phage vectors λgt10 and λgt11, and vector SV40 derived from an animal virus. However, no particular limitation is placed on the type of vector used, so long as it can be replicated and amplified in the host. Similarly, no particular limitation is placed on the promoter and terminator, so long as they are compatible with the host used for the expression of the DNA base sequence encoding the human MDC protein. They may be used in any suitable combination depending on the host. The DNA used can be any of DNAs encoding human MDC protein. It is not limited to the base sequences represented by SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 and SEQ ID NO:9, but can be any of DNAs in which a part of the base sequence has undergone replacement, deletion, insertion or a combination thereof, whether intentionally or not. In addition, chemically synthesized DNAs can also be used.

A transformant is generated by introducing the recombinant expression vector thus obtained into a host according to the competent cell method (J. Mol. Biol., 53, 154, 1970), the protoplast method (Proc. Natl. Acad. Sci. USA, 75, 1929, 1978), the calcium phosphate method (Science, 221, 551, 1983), the in vitro packaging method (Proc. Natl. Acad. Sci. USA, 72, 581, 1975) or the virus vector method (Cell, 37, 1053, 1984). The host used can be Escherichia coli, Bacillus subtilis, yeast or animal cells, and the resulting transformant is grown in a suitable medium depending on the host. Usually, the transformant is grown at a temperature of 20° to 45°C and a pH of 5 to 8, optionally with aeration and stirring. Separation and purification of the MDC protein from the culture may be carried out using a suitable combination of well-known separation and purification techniques. These well-known techniques include salting-out, solvent precipitation, dialysis, gel filtration, electrophoresis, ion exchange chromatography, affinity chromatography, reverse-phase high-performance liquid chromatography and the like.

(4) Preparation of antibodies

Antibodies can be prepared in the usual manner by using an antigen of which epitope part comprises an MDC protein. For example, a polyclonal antibody can be prepared by fully immunizing an animal such as mouse, guinea pig and rabbit through a plurality of subcutaneous, intramuscular, intraperitoneal or intravenous injections of the antigen described above, collecting blood from this animal, and separating serum therefrom. Commercially available adjuvants may also be used.

A monoclonal antibody can be prepared, for example, by immunizing a mouse with the antigen described above, fusing its spleen cells with commercially available mouse myeloma cells to produce a hybridoma, and collecting an antibody from the culture supernatant of the hybridoma or the ascites of a mouse inoculated with the hybridoma.

The MDC protein which is used as antigen or is used to prepare an antigen need not necessarily have the whole amino acid structure described in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4, but may have a partial structure of the amino acid sequence described in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4. Alternatively, the MDC protein may be a variant or derivative of the MDC protein represented by SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3 or SEQ ID NO:4. The antigen may be an MDC protein as such, or a fusion peptide consisting of an MDC protein (incluing peptide) and another peptide. Preparation of the fusion peptide may be carried out according to either biological techniques or chemical synthesis techniques.

These antibodies enable identification and determination of the MDC protein present in human biological specimens and can hence be used as reagents for the diagnosis of cancer, and the like.

The immunological determination of the MDC protein can be made according to any conventional technique. For example, any of the fluorescent antibody technique, the passive agglutination technique and the enzyme antibody technique may be employed.

(5) Gene analysis of human tumor tissues

The biological specimens which can be used for gene analysis include human normal tissues and various types of human tumor tissues, as well as human blood, human body fluids, human secretions and the like. The extraction and preparation of DNA can be carried out, for example, according to the method of Sato et al. (Sato, T. et al., Cancer Res., 50, 7184, 1990).

The presence or absence of mutations of the gene can be analyzed by using, as probes, a restriction fragment of the DNA encoding human MDC protein as provided by the present invention, or by selecting a properly located base sequence from the DNA, synthesizing an oligonucleotide having the selected base sequence and using the oligonucleotide as primer.

These analyses can also detect other alterations, such as insertion and deletion, of the gene in samples.

The base sequences selected for this purpose can be exon portions, intron portions, or junction portions therebetween. It is a matter of course that artificially modified base sequences may be used. When an artificially modified base sequence is used to prepare primer, the corresponding gene mutation can be detected by the gene analysis.

Analyses can be carried out, for example, by amplifying a partial sequence by PCR using two selected sequences as primers and analyzing the base sequence of the amplification product directly, or by incorporating the amplification product into a plasmid in the same manner as that described above, transforming host cells with this plasmid, culturing the transformed cells, and analyzing the base sequence of the clone thus obtained. Alternatively, the presence or absence of particular mutations of the gene in samples can be directly detected by the use of the ligase chain reaction method (Wu et al., Genomics, 4, 560-569, 1989) and, moreover, the mutant sequence specific PCR method (Ruano and Kidd, Nucleic Acid Research, 17, 8392, 1989; C. R. Newton et al., Nucleic Acid Research, 17, 2503-2517, 1989).

Similarly, using probes containing DNA sequences selected or RNA sequences derived therefrom, point mutations can be detected by the SSCP method or the RNase protection method. Moreover, use of these probes also makes it possible to detect mutations of the gene in samples by Southern hybridization and abnormalities in the expression level of the gene in samples by northern hybridization.

Escherichia coli DH5/pBR1 and Escherichia coli XL1-Blue MRF'Kan/pCR-5P2, each carrying a plasmid containing the DNA encoding this MDC protein, and Escherichia coil 490A/cCI 17-904, carrying a cosmid containing the genomic DNA, were deposited with the National Institute of Bioscience and Human-Technology, Agency of Industrial Science and Technology, Ministry of International Trade and Industry on Apr. 28, 1993, Feb. 8, 1994 and Apr. 28, 1993 under accession numbers FERM BP-4286, FERM BP-4555 and FERM BP-4287, respectively.

The MDC proteins and DNAs encoding the MDC proteins according to the present invention are expected to be useful as reagents for cancer research, testing and diagnostic reagents, and therapeutic agents.

EXAMPLES

The present invention will now be described in more detail with reference to the following Examples which should not be considered to limit the scope of the present invention.

EXAMPLE 1

PAC Isolation of Cosmid Clones Specific for Human Chromosome 17 and Construction of a Chromosome Map

A human-mouse hybrid cell line (GM10331) containing a single human chromosome 17 in a mouse genomic background was selected from among hybrid cells produced by fusing human normal cells with cells of an established mouse cell line and cosmid clones specific for human chromosome 17 were isolated according to the method of Tokino et al. (Tokino et al., Am. J. Hum. Genet., 48, 258-268, 1991). The chromosomal DNA of this hybrid cell line was properly digested with restriction enzyme Sau 3AI and the ends of the fragments thus obtained were treated by partial filling-in with dATP and dGTP. Fragments having a size of 35-42 kb were separated therefrom and inserted in cosmid vector pWEX15 which had previously been digested with restriction enzyme Xho I and similarly treated at its ends by partial filling-in with dCTP and dTTP. From among the resulting cosmid clones, clones containing human DNA fragments were selected by colony hybridization using ³2 P-labeled human chromosomal DNA as probe. Thus, 342 cosmid clones specific for human chromosome 17 were isolated.

With regard to each of these cosmid clones specific for human chromosome 17, the location to which its cosmid DNA hybridize on the chromosome was determined by FISH (Inazawa et al., Genomics, 10, 1075-1078, 1991). Thus, a physical chromosome map for chromosome 17 was constructed (see Tables 1-3 and FIG. 1).

Using DNAs obtained from 6 unrelated individuals, the cosmid clones (cosmid markers), the locations on the chromosome to which their cosmid DNA hybridize had been determined, were examined by a known method (Nakamura et al., Am. J. Hum. Genet., 43, 854-859, 1988) in order to see whether RFLP could be detected or not. The restriction enzyme used was Msp I, Taq I, Bgl II, Pst I, Pvu II, Rsa I or Eco RI. As a result, RFLP was detected in 43 clones (see Tables 4-6). That is, these 43 clones were usable as RFLP markers.

EXAMPLE 2

PAC Detection of Commonly Deleted Regions of the Human Chromosome 17q in Ovarian and Breast Cancers

Tumor tissues were obtained from 94 patients with ovarian cancer and 246 patients with breast cancer who underwent surgery. Corresponding normal tissues or peripheral blood samples were also obtained from the respective patients. DNAs were extracted from these tissues or samples according to a known method (Sato et al., Cancer Res., 50, 7184-7189, 1990). Each DNA was digested with suitable restriction enzymes, and the fragments thus obtained were subjected to 1.0% agarose gel electrophoresis and then Southern transferred to a nylon membrane with 0.1N NaOH/0.1M NaCl (Sato et al., Cancer Res., 50, 7184-7189, 1990).

The membranes thus obtained were examined for LOH (loss of heterozygosity) by Southern hybridization (Sato et al., Cancer Res., 50, 7184-7189, 1990) using, as probes, the RFLP markers obtained by the procedure of Example 1 (see Table 7).

TABLE 7

__________________________________________________________________________

Ovarian Cancer

No. of patients tested

Probe

Chromosomal location

Enzyme

serous

mucinous

clear cell

others

__________________________________________________________________________

CI17-316

q12-21.2 MspI 32 15 12 22

CI17-592

q21.3 EcoRI 14 13 9 15

CI17-701

q21.3 TaqI 24 14 13 19

CI17-730

q21.3 TaqI 29 15 13 19

CI17-507

q21.3 MspI 22 14 11 20

CI17-533

q21.3 TaqI 22 13 11 16

CI17-7

q22 PvuII 21 8 9 15

CI17-489

q23 MspI 26 13 11 21

CMM86

q23 TaqI 28 13 10 17

CI17-516

q25.1 TaqI 29 14 14 21

CI17-710

q25.3 TaqI 18 13 10 12

__________________________________________________________________________

Breast Cancer

Ovarian Cancer No. of

allelic losses/

allelic losses/informative cases(%)

patients

informative cases

Probe

serous

mucinous

clear cell

others

tested

(%)

__________________________________________________________________________

CI17-316

6/13(46.2)

0/1(0.0)

0/3(0.0)

1/9(11.1)

85 11/37(29.7)

CI17-592

2/3(66.7)

0/1(0.0)

2/4(50.0)

62 8/18(44.4)

CI17-701

9/15(60.0)

2/12(16.7)

0/7(0.0)

5/12(41.7)

232 48/138(34.8)

CI17-730

6/12(50.0)

0/4(0.0)

2/4(50.0)

237 36/96(37.5)

CI17-507

6/7(85.7)

1/3(33.3)

2/5(40.0)

74 7/25(28.0)

CI17-533

6/11(54.5)

3/9(33.3)

1/7(14.3)

4/9(44.4)

230 25/93(26.9)

CI17-7

4/5(80.0)

0/1(0.0)

0/3(0.0)

1/3(33.3)

86 14/41(34.1)

CI17-489

5/5(100.0)

0/2(0.0)

0/3(0.0)

3/8(37.5)

75 10/31(32.3)

CMM86

6/17(35.3)

1/8(12.5)

0/6(0.0)

1/20(20.0)

79 12/49(24.5)

CI17-516

6/17(35.3)

1/10(10.0)

0/7(0.0)

6/11(54.5)

84 9/31(29.0)

CI17-710

4/8(50.0)

3/8(37.5)

0.6(0.0)

3/7(42.9)

80 13/45(28.9)

__________________________________________________________________________

A total of 84 among 94 ovarian tumors were informative for at least one locus, and 33 (39.3%) of them showed LOH for at least one locus on chromosome 17q. Among 246 breast tumors examined, 214 were informative for at least one locus, and 88 (41.4%) showed LOH for at least one locus on chromosome 17q.

From the above results, the instances which were informative for two or more loci and exhibited both loss of heterozygosity at a locus and retaining of heterozygosity at other locus on chromosome 17q were summarized.

As a result, two commonly deleted regions were found in 8 ovarian cancers (see FIG. 2). One of them was a region lying between markers CI17-316 (17q12-21.1) and CI17-507 (17q21.3), and the other was a region distal to the marker CI17-516 (17q25.1).

Similarly, two commonly deleted regions were found in 35 breast cancers (see FIG. 3). One of them was a region lying between markers CI17-701 (17q21.3) and CI17-730 (17q21.3), which was also found in the ovarian cancers but was more narrowly localized. The other was a region lying on the terminal side of marker CI17-516 (17q25.1), which was also the region where a deletion was observed in the ovarian cancers.

Of the two commonly deleted regions defined by the above-described deletion mapping, the region flanked by markers CI17-701 and CI17-730 was found to lie close to the 17q21 region showing an intimate correlation with the onset of cancer on the basis of the results of linkage mapping studies on hereditary breast cancer and ovarian cancer (Hall et al., Am. J. Hum. Genet., 50, 1235-1242, 1992). The length of this region (i.e., the genetic distance between the two markers) was estimated to be 2.4 cM by linkage analysis (Lathtop et al., Am. J. Hum. Genet., 37, 482-498, 1985; Donis-Keller et al., Cell, 51, 319-337, 1987).

EXAMPLE 3

PAC Isolation of Cosmid Clones Contained in the Minimal Localized Region

Since it has been shown that the region localized on the basis of the results of linkage mapping is a region lying between markers THRA1 and Mfd188 on 17q21 (Hall et al., Am. J. Hum. Genet., 52, 1235-1242, 1992; Bowcock, A. M. et al., Am. J. Hum. Genet., 52, 718-22, 1993), an attempt was made to determine the relative order of these markers and markers CI17-701 and CI17-730 and thereby combine the mapping information obtained by two different strategies. The relative order of the markers was determined by a two-color FISH method newly developed by the present inventors. This method is a modification of FISH in which a highly extended chromosome preparation obtained by synchronization of the cells is used to enhance the degree of fineness and, moreover, probes labeled with fluorescent materials having different colors are used. This method makes it possible to determine the relative order of markers very close to each other.

As a result, it was found that marker Mfd188 lies between markers CI17-701 and CI17-730 and marker THRA1 lies on the centromeric side of CI17-701 (see FIG. 4, a). That is, the region associated with hereditary breast cancer as localized by linkage mapping and the commonly deleted region in sporadic breast cancers as localized by deletion mapping overlapped each other and the overlapping minimal region was flanked by markers CI17-701 and Mfd188 (see FIG. 4, a). When a physical map of this region was constructed by pulsed-field gel electrophoresis, the length of the overlapping region was greatly narrowed down to about 500 kb.

Furthermore, of the cosmid clones obtained by the procedure of Example 1, 37 clones localized to 17q21.3 and three known markers, THRA1, Mfd188 and PPY, were selected and used for fine mapping of this chromosomal region by two-color FISH. As a result, 15 cosmid clones were located in a region flanked by markers CI17-701 and CI17-730. Of these, two cosmid clones, CI17-527 and CI17-904, were found to lie in the above-described overlapping region (see FIG. 4, a and b).

EXAMPLE 4

PAC Detection of Genomic Alterations in Breast Cancers

Of the overlapping region of about 500 kb, about 150 kb has already been covered by four cosmid clones CI17-701, CI17-527, CI17-904 and Mfd188. Accordingly, an attempt was first made to screen restriction (Sac I, Pvu II or Pst I) fragments of the DNAs from the tumor tissues of 650 sporadic breast cancers by Southern-blot analysis using the DNAs of these cosmid clones or fragments thereof as probes and thereby detect gross structural genomic alterations (so-called genomic rearrangements), such as deletion, duplication, amplification and translocation, having occurred in the tumor cells. As a result, when the DNA of CI17-904 or its 9.5 kb Hind III fragment (see FIG. 4, c) was used as probe, genomic rearrangements were detected in the tumor tissues of two breast cancers (see FIG. 5, a and b). These genomic rearrangements occurred only in the tumor tissues, exhibiting extra bands of different size which were not observed in normal tissues. In addition, the intensities of some bands were increased. That is, a gene amplification occurred in a definite DNA region corresponding to (i.e., hybridizable) this probe. In one case among the above-mentioned two breast cancers, no gene amplification was detected when Southern-blot analysis of the Sac I fragments of DNA from the breast cancer tissue was carried out by using the E-H5.2 or Hind6.1 fragment adjacent to the 9.5 kb Hind fragment (see FIG. 4, c) as probe (see FIG. 6, Case 1). This indicates that the gene amplification in this case occurred within the region corresponding to the 9.5 kb Hind III fragment and was a 4- to 5-fold amplification.

For purpose of closer examination, Southern-blot analyses of the Sac I fragments of DNA from the breast cancer tissue were carried out using each of six Sac I fragments derived from the 9.5 kb Hind III fragment, A, B, C, D, E and F (see FIG. 4, c), as probe. As a result, amplified bands of abnormal size were observed at 2.5 kb with probes A and B, at 3.0 kb with probes B, C and D, at 2.5 kb with probes E and F, and at 0.9 kb with probe F (see FIG. 7).

In the other case, gene amplification was detected when Southern-blot analysis of the Sac I fragments of DNA from the breast cancer tissue was carried out by using the E-H5.2 fragment as probe (see FIG. 6, Case 2). However, no gene amplification was detected when the Hind6.1 fragment was used as probe (see FIG. 6, Case 2). In this case, when the E-H5.2 fragment was used as probe, only an amplification was observed without being attended with any band of abnormal size. This indicates that the gene amplification in this case occurred in a segment extending from within the region corresponding to the 9.5 kb HindIII fragment to the outer (telomeric) side of the region corresponding to the E-H5.2 fragment.

EXAMPLE 5

PAC Isolation of cDNA and Determination of Its Structure

In order to isolate an expressed gene in or near the region where genomic rearrangements were detected in the two breast cancers, DNA fragments containing DNA sequences involved in fundamental cellular functions and conserved among other species were selected from DNA fragments of cosmid clone CI17-904. Specifically, each of the DNA fragments of cosmid clone CI17-904 was used as probe in Southern blot hybridization analyses of DNA fragments from cow, pig, mouse, rat and chicken. As a result, the 3.5 kb Hind III-Ksp I fragment (see FIG. 4, c) of cosmid clone CI17-904 hybridized to DNAs from cow, pig, mouse and rat and showed strong conservation.

Using this 3.5 kb Hind III-Ksp I fragment as probe, human cDNA libraries derived from five different organs (i.e., mammary gland, breast cancer cell line, fetal brain, cerebrum and cerebellum) were screened. Thus, the longest cDNA was cloned from the cerebellar cDNA library. This cDNA hybridized to the 3.5 kb Hind III-Ksp I fragment of cosmid clone CI17-904 and a plurality of adjoining restriction Fragments, and extended over a region of more than 20 kb on the chromosome.

Analysis of the base sequence of this cDNA revealed that it consisted of 2923 base pairs (bp) and was a novel DNA base sequence containing a 5'-untranslated region of 27 bp, a coding region of 1575 bp, a 3'-untranslated region of 1306 bp, and a poly(A) tail of 15 bp (see SEQ ID NO:6). The open reading frame contained in this cDNA sequence encoded a novel protein (MDC protein; see SEQ ID NO:2). An in-frame termination codon was present immediately upstream of the first ATG of the open reading frame. A polyadenylation signal, AATAAA, was observed about 20 bp upstream from the polyadenylation site.

EXAMPLE 6

PAC Determination of the Structure of Genomic DNA

In order to clarify the structure of the genomic DNA corresponding to the cDNA obtained in Example 5, cosmid clone CI17-904 was examined to determine the base sequences of portions containing the base sequence of this cDNA and portions surrounding them. Then, the sequences of both were compared to determine the exon-intron junctions. As a result, the sequence structure of a novel DNA containing 25 exons corresponding to the cDNA obtained in Example 5 was clarified (see SEQ ID NO:9). Thus, it was shown that these 25 exons are of relatively small size and present over an about 20 kb region of the chromosome.

EXAMPLE 7

PAC Detection of Alterations in the Exon Structure of the Gene in Breast Cancers

From the structure of the DNA containing exons/introns as clarified in Example 6, it has become apparent that exons 2, 3 and 4 are present in the sequence region of the probe (the 9.5 kb Hind Ill fragment of cosmid clone CI17-904) with which alterations were detected in the tumor tissues of two breast cancers as described in Example 4. More specifically, exon 2 is present in the sequence region of probe E, and exons 3 and 4 are present in the sequence region of probe F (see FIG. 4, c). Accordingly, it is believed that the gene rearrangements involving the 9.5 kb Hind III fragment region as described in Example 4 disrupt the normal exon structure in the region containing the three exons of the gene. In order to confirm this, the chromosomal DNAs from the tumor tissues of the above-described two breast cancers were examined by Southern-blot analysis using probes having DNA sequences corresponding to exons 2, 3 and 4. Thus, amplified bands of abnormal size were observed similarly to the previously described results obtained with probe E or F (see FIG. 7).

EXAMPLE 8

PAC Tissue Specificity of Gene Expression

mRNAs derived from various human tissues (brain, heart, kidney, liver, lung, pancreas, placenta, skeletal muscle, colon, peripheral blood lymphocyte, ovary, small intestine, spleen, testis and thymus) were examined by northern-blot analysis using the cDNA obtained in Example 5 as probe. As a result, the strongest expression was observed in the brain, and a weak expression in the heart, ovary and testis.

Moreover, amplification by RT-PCR (reverse-transcriptase PCR) was performed to detect a weaker expression. Specifically, using random hexamers as primers, single-stranded cDNAs were synthesized from mRNAs derived from various human tissues under the action of reverse transcriptase. Then, PCR amplification from these templates was performed using primers BC09 and BC012 having sequences derived from the sequences of exons 21 and 23, respectively, which had been revealed in Example 6. As a result, a PCR product having the expected size was observed mainly in tissues of the central nervous system (cerebrum, cerebellum and fetal brain) and in endocrine or reproductive organs (testis, ovary, mammary gland, adrenal gland, thymus and pancreas).

The sequences of the primers used are as follows:

______________________________________

BCO9 5'-GCACCTGCCCCGGCAGT-3'

(coding strand, corresponding to base

numbers 1764-1780 of SEQ ID NO:6)

BCO12 5'-CCAGGACAGCCCCAGCGATG-3'

(antisense strand, corresponding to base

numbers 1976-1957 of SEQ ID NO:6)

______________________________________

EXAMPLE 9

PAC Direct Sequencing of mRNA by RT-PCR

mRNAs derived from human fetal brain and human testis were amplified by RT-PCR using primer GMA701 having a sequence derived from the sequence on exon 19 and primer GMB704 having a sequence derived from the sequence on exon 21. Then, the base sequences of the amplified DNAs were directly determined using primer GMA702 or GMB703. As a result, a sequence, wherein 10 bases (base numbers 1512-1521) were deleted from the cerebellar cDNA sequence of SEQ ID NO:6 obtained in Example 5, was found, which revealed the expression of mRNA corresponding to the DNA sequence of SEQ ID NO:7. Both of the fetal brain and testis mRNAs gave the identical result. The open reading frame contained in the cDNA sequence of SEQ ID NO:7 encodes an MDC protein (see SEQ ID NO:3) composed of 670 amino acids.

This seems to be caused by the alternative RNA splicing at the initiation of exon 20 which starts with base number 6083 instead of base number 6078 on the genomic DNA of SEQ ID NO:9. Such a variation of splicing is also known from, for example, a report by Oda et al. [Biochem. Biophys. Res. Commun., 193, 897-904 (1993)]. As a result, the amino acid sequences encoded by the cDNA of SEQ ID NO:6 and the cDNA of SEQ ID NO:7 differ from each other at and after that site (see SEQ ID NO:2 and SEQ ID NO:3). Specifically, the cDNA of SEQ ID NO:6 produces a termination codon within exon 20, whereas the reading frame is shifted in the cDNA of SEQ ID NO:7 so as to cause the open reading frame to continue to a more downstream position.

The sequences of the primers used in PCR and DNA sequencing are as follows:

______________________________________

GMA701 5'-GGCTGCTGATCGCTTCTGCTAC-3'

(coding strand, corresponding to base

numbers 1413-1434 in SEQ ID NO:6)

GMA702 5'-GAGAAGCTGAATGTGGAGGG-3'

(coding strand, corresponding to base

numbers 1435-1456 in SEQ ID NO:6)

GMB703 5'-GTCAGAGCCGTCCGCCAGC-3'

(antisense strand, corresponding to base

numbers 1675-1657 in SEQ ID NO:6)

GMB704 5'-GCCATCCTCCACATAGCTCAGG-3'

(antisense strand, corresponding to base

numbers 1696-1655 in SEQ ID NO:6)

______________________________________

EXAMPLE 10

PAC Amplification of the 5'-Terminal Sequence by RACE

In order to obtain the full-length cDNA represented by SEQ ID NO:7, PCR amplification of the 5'-cDNA terminus (5'-RACE; Frohman, et al., Proc. Natl. Acad. Sci. USA, 85, 8998-9002, 1988; Belyavski, et al., Nucleic Acid Res., 17, 2919-2932, 1988) was performed. Using specific oligomer SGN012 as primer, together with a commercially available synthesis kit, a single-stranded cDNA was synthesized from 2 μg of poly A(+) RNA derived from human brain (manufactured by Clontech). Then, 5'-RACE was performed using a commercially available kit based on the method of Edwards et al. for linking an anchor oligomer to an end of a single-stranded cDNA (Nucleic Acid Res., 19, 5227-5232, 1991). As a result of PCR using the anchor oligomer of the kit and another specific oligomer SGN011 as primers, an amplification product of about 580 bp was detected by electrophoresis.

This amplification product was extracted from the electrophoretic gel, purified, inserted in the Srf I cleavage site of plasmid vector pCR-Script (manufactured by Stratagene), and cloned. Plasmid DNA was purified from each clone and its base sequence was determined. One of the clones, pCR-SP2, had a cDNA insert of 501 bp beginning with ATG, next to the sequence of the anchor oligomer. The base sequence of the insert extending from base number 315 onward coincided exactly with the base sequence of SEQ ID NO:7 extending from base number 45 (the initiation site of exon 2) onward, excepting one base which will be mentioned below. Furthermore, as far as the reading frame is concerned, that of pCR-5P2 beginning with the first ATG corresponded with the polypeptide encoded by the cDNA of SEQ ID NO:7. The N-terminal region of the polypeptide sequence thus obtained encoded a signal peptide comprising a series of hydrophobic amino acids.

RT-PCR was performed in order to confirm that the above 5'-terminal sequence obtained by 5'-RACE was truly linked, on mRNA, to the sequence of SEQ ID NO:7 extending from base number 45 onward. Using random hexamers as primers, single-stranded cDNAs were synthesized from poly A(+) RNAs derived from human brain, fetal brain, ovary and testis (manufactured by Clontech). Then, the cDNA template were amplified by PCR using an oligomer (SGN013) having the first 20-base sequence of pCR-5P2 as sense primer and SGN011 or SGN012 as antisense primer. As a result, the expected amplification product (about 500 bp for SGN013/SGN011 and about 750 bp for SGN013/SGN012) was detected by electrophoresis with every tissue RNA used.

Thus, it was confirmed that the 5'-terminal sequence of pCR-5P2 obtained by 5'-RACE was linked, on mRNA, to the sequence of SEQ ID NO:7 extending from base number 45 onward, resulting in the construction of a cDNA represented by SEQ ID NO:8. The open reading frame of the cDNA of SEQ ID NO:8 encodes an MDC protein composed of 769 amino acids (see SEQ ID NO:4).

The sequences of the specific oligomers used are as follows:

______________________________________

SGN011 5'-GATGTAAGTCAAGTTCCCATCAGAGA-3'

(antisense strand, corresponding to base

numbers 231-206 in SEQ ID NO:7)

SGN012 5'-AACAGCTGGTGGTCGTTGATCACAA-3'

(antisense strand, corresponding to base

numbers 485-461 in SEQ ID NO:7)

SGN013 5'-ATGAGGCTGCTGCGGCGCTG-3'

(coding strand, corresponding to base

numbers 1-20 in SEQ ID NO:8)

______________________________________

The above-mentioned one base in the SEQ ID NO:8 after the initiation site of exon 2, differing from one in the SEQ ID NO:6 or SEQ ID NO:7, is the forth base from the initiation site of exon 2, i.e., the C at the base number 318 in the SEQ ID NO:8. The corresponding base in the SEQ ID NO:6 or the SEQ ID NO:7 is the A at the base number 48. The base C at the base number 318 in the SEQ ID NO:8 codes His at the amino acid number 106 in the SEQ ID NO:4. The base A at the base number 48 in the SEQ ID NO:6 or the SEQ ID NO:7 codes Gln at the amino acid number 7 in the SEQ ID NO:2 or the SEQ ID NO:3. This fact reflects polymorphism.

An amino acid sequence common to these three variant MDC proteins (SEQ ID NO:2, SEQ ID NO:3 and SEQ ID NO:4) is a sequence composed of 488 amino acids (see SEQ ID NO:1), and a DNA sequence encoding this portion is also a common sequence (see SEQ ID NO:5).

EXAMPLE 11

PAC Homology with Known Proteins

The amino acid sequences of MDC proteins showed homology with a family of snake venom hemorrhagic proteins including HR1B (Takeya et al., J. Biol. Chem., 265, 16068-16073, 1990), prorhodostomin (Au et al., Biochem. Biophys. Res. Commun., 181, 585-593, 1991) and protrigramin (Neeper et al., Nucleic Acid Res., 18, 4255, 1990).

They also showed homology with the guinea pig sperm surface protein PH30 (Blobel et al., Nature, 356, 248-252, 1992) and the rat or monkey epididymis protein EAPI (Perry et al., Biochem. J., 286, 671-675, 1992).

The homology of these proteins with the MDC proteins represented by SEQ ID NO:2 (524 amino acids) and. SEQ ID NO:4 (769 amino acids) is indicated by the following "percent identity/number of amino acids in the tested region". The values for SEQ ID NO:2 are given on the left side and those for SEQ ID NO:4 on the right side.

______________________________________

HR1B 32.5/335 32.2/379

prorhodostomin 29.0/420 29.0/420

protrigramin 27.7/430 28.1/438

PH30b 38.1/147 30.8/302

EAP1 (rat) 36.0/364 33.1/475

EAP1 (monkey) 30.4/503 29.9/599

______________________________________

EXAMPLE 12

PAC Generation of Transformants

A DNA fragment encoding a part of the MDC protein represented by SEQ ID NO:2 was amplified from the DNA (SEQ ID NO:6) encoding the MDC protein (SEQ ID NO:2) by PCR using primers SGN006 and SGN008. The sequences of the primers used are as follows.

______________________________________

SGN006 5'-CACAGATCTGGGGGCATATGCTCCCTG-3'

(coding strand, corresponding to base

numbers 766-783 in SEQ ID NO:6)

SGN008 5'-AACAAGCTT CTACTGATGTCTCCCACC-3'

(antisense strand, corresponding to base

numbers 1602-1585 in SEQ ID NO:6; the

under1ine designating a termination codon.)

______________________________________

For purposes of vector construction, the 5'-terminal of these primers are provided with Bgl II and Hind III cleavage site sequences, respectively.

The PCR amplification product was separated by agarose gel electrophoresis and cleaved with Bgl II and Hind Ill. The resulting DNA fragments encoding a part of the MDC protein was combined with vector pMAL-c2 (manufactured by New England Biolabs) which had previously been cleaved with Bam HI and Hind III to construct plasmid pMAL-MDC(C1).

Similarly, the same DNA fragment was combined with vector pQE-13 (manufactured by Diagen) which had previously been cleaved with Bam HI and Hind III to construct plasmid pH6-MDC(C1).

Furthermore, a DNA sequence downstream from the Bam HI cleavage site (base number 1483 in SEQ ID NO:6) was removed from the MDC protein encoding region of pMAL-MDC(C1) by cleaving pMAL-MDC(C1) with Bam HI and Hind III, and recombining it after the formation of blunt ends. This resulted in the construction of plasmid pMAL-MDC(dC1), which mediates expression of a polypeptide with amino acid sequence common to two variant MDC proteins (SEQ ID NO:2 and SEQ ID NO:3).

Since the fragment incorporated into vector pMAL-c2 is expressed as a fusion protein having a maltose-binding protein (MBP) on the N-terminal side, this fusion protein was purified by affinity chromatography using an amylose column. On the other hand, since the fragment incorporated into vector pQE-13 is expressed as a fusion protein having a peptide (His 6) composed of six histidine residues on the N-terminal side, this fusion protein was purified by affinity chromatography using a metal chelate column.

Several transformants were obtained by transforming E. coli JM109 with each of plasmids pMAL-MDC(C1), pMAL-MDC(dC1) and pH6-MDC(C1) and selecting for ampicillin resistance.

EXAMPLE 13

PAC Expression and Purification of Recombinant MDC Proteins

Each of the transformants obtained in Example 12 was grown and the resulting recombinant MDC fusion protein was extracted and purified from the culture.

Specifically, 100 ml of LB medium (1% polypeptone, 0.5% yeast extract, 1% NaCl) was inoculated with each transformant and incubated overnight at 37°C with shaking. The culture was diluted 10-fold with LB medium previously warmed to 37°C and incubated for additional 30-90 minutes to obtain a culture in the logarithmic growth phase. To 1 liter of the culture was added IPTG (isopropyl-β-D-thiogalactopyranoside) so as to give a final concentration of 1 mM. This culture was incubated for 3-4 hours and then centrifuged to collect the cells therefrom.

In the case of transformant of plasmid pMAL-MDC(C1) or pMAL-MDC(dC1), the cells were suspended in 10 ml of a column buffer (20 mM Tris-HCl, pH 7.4, 200 mM NaCl) and disintegrated by sonication. Since the recombinant MDC fusion protein was present in the insoluble fraction of the disintegrated cell suspension, this was separated by centrifugation and dissolved in a denaturing buffer (8M urea, 20 mM Tris-HCl, pH 8.5, 10 mM dithiothreitol). Then, this solution was dialyzed against the column buffer and centrifuged to collect a supernatant soluble fraction. The dialyzed insoluble fraction was further denatured, dialyzed and centrifuged repeatedly to collect additional supernatant soluble fractions. The combined soluble fraction was applied to an amylose column (manufactured by New England Biolabs), which was washed with the column buffer and eluted with the column buffer containing 10 mM maltose. The eluted fractions were analyzed by absorptiometry at 280 nm and SDS-polyacrylamide electrophoresis (with Coomassie Blue staining), and combined into fractions. As a result, a fraction in which the desired MBP (maltose binding protein) fusion protein (about 68 Kd) was detected as a principal band was obtained for each of the transformants generated with plasmids pMAL-MDC(C1) and pMAL-MDC(dC1). The yield was 46.4 mg and 10.0 mg (when an OD₂80 of 1 was taken as 1 mg/ml), respectively. These fusion proteins will hereinafter be referred to as MBP-MDC(C1) and MBP-MDC(dC1), respectively.

Similarly, in the case of transformant of plasmid pH6-MDC(C1), the cells were suspended in 10 ml of a sonication buffer (10 mM sodium phosphate, pH 8.0, 200 mM NaCl) and disintegrated by sonication. Since the recombinant MDC fusion protein was present in the insoluble fraction of the disintegrated cell suspension, this was separated by centrifugation and dissolved in buffer A (6M guanidine hydrochloride, 100 mM NaH₂ PO₄, 10 mM Tris-HCl, PH 8.0). Then, this solution was centrifuged to collect a supernatant soluble fraction, which was applied to a Ni-NTA column (manufactured by Diagen). This column was washed with buffer A and then buffer B (8M urea, 100 mM NaH₂ PO₄, 10 mM Trim-HCl, pH 8.0), and eluted stepwise with buffer C (8M urea, 100 mM NaH₂ PO₄, 10 mM Tris-HCl, pH 6.3), buffer D (8M urea, 100 mM NaH₂ PO₄, 10 mM Tris-HCl, pH 5.9), buffer E (8M urea, 100 mM NaH₂ PO₄, 10 mM Tris-HCl, pH 4.5) and buffer F (6M guanidine hydrochloride, 200 mM acetic acid). The eluted fractions were analyzed by absorptiometry at 280 nm and SDS-polyacrylamide electrophoresis (with Coomassie Blue staining), and combined into fractions. As a result, a fraction in which the desired His6 fusion protein (about 34 Kd) was detected as a single band was obtained from the effluent resulting from elution with buffer F. The yield was 51.9 mg (when an OD₂80 of 1 was taken as 1 mg/ml). This fusion protein will hereinafter be referred to as His6-MDC(C1).

EXAMPLE 14

PAC Preparation of a Monoclonal Antibody and a Rabbit Polyclonal Antibody

The three recombinant fusion proteins, His6-MDC(C1), MBP-MDC(dC1) and MBP-MDC(C1), obtained in Example 13 were used as an immunizing antigen, an antigen for antibody purification and screening, and a standard antigen for measurement, respectively.

An anti-MDC protein specific monoclonal antibody was prepared by immunizing a mouse with His6-MDC(C1). Specifically, a solution of His6-MDC(C1) (500-1000 μg/ml) in 3M urea/PBS was mixed with complete adjuvant at a ratio of 1:1, and this mixture was injected into the peritoneal cavity of a mouse at a dose of 100 μg per animal. This injection was repeated 4-6 times at intervals of 2 weeks. After completion of the immunization, hybridomas were produced by fusing P3U1 cells with B cells in the presence of PEG1500. Then, hybridomas productive of an anti-MDC protein specific antibody were selected by monitoring the antibody titer in the culture supernatant.

In order to measure the antibody titer, a first reaction was effected by adding 100 μl of the culture supernatant to a polystyrene cup having a solid phase formed from the MBP-MDC(dC1) fusion protein obtained in Example 13 (5 μg/ml). After washing, a second reaction was effected by the addition of anti-mouse IgG HRP (horse-raddish peroxidase). After washing, a color reaction (third reaction) was effected by the addition of an enzyme substrate solution [i.e., a mixed solution of hydrogen peroxide and ABTS [2,2'-azino-bis(3-ethylbenzothiazoline-6-sulfonic acid)]], and the produced color was monitored.

The hybridomas were grown on a 96-well multiplate and screened by means of HAT medium. After about 2 weeks, clones reacting specifically with the antigen were selected by measuring the antibody titer in the culture supernatant. As a result of further cloning, 3 clones (G1-SA2-2C8, G2-2F2-3D11 and G2-2D10-3F5) were established as antibody-producing hybridomas. The class and subclass of the antibody produced by each of the established clones was IgG₁ for G1-5A2-2C8, IgG₂b for G2-2F2-3D11, and IgM for G2-2D10-3F5. 3,000,000 cells of each hybridoma were introduced into the peritoneal cavity of a BALB/c mouse to which 0.5 ml of pristane had been administered intraperitoneally about one week before. After 8-10 days, the ascites was collected. From the ascites collected from each animal, an antibody was purified by affinity chromatography using a protein G column.

Similarly, an anti-MDC protein polyclonal antibody was prepared by immunizing a rabbit with an immunizing antigen comprising His6-MDD(C1) obtained in Example 13.

Specifically, like the mouse, a rabbit was immunized with a mixture prepared by mixing a solution of His6-MDC(C1) (500-1000 μg/ml) in 3M urea/PBS with complete adjuvant at a ratio of 1:1. After completion of the immunization, an antiserum was obtained and its antibody titer was measured using a polystyrene cup having a solid phase formed from the MBP-MDC(dC1) fusion protein obtained in Example 13. The antiserum was diluted 500- to 64,000-fold, 100 μl each of the dilutions were added to wells, and their antibody titers were tested with goat anti-rabbit IgG-HRP. Thus, the antibody titer was detectable up to the 64,000-fold dilution. Since no antibody reacting with MBP-MDC(dC1) was present in the serum before immunization, it could be confirmed that an antibody reacting specifically with the protein was produced. Furthermore, this antiserum was purified by affinity chromatography using a protein G column and a Sepharose column having the MBP-MDC(dC1) fusion protein immobilized therein.

A method for the determination of the MDC protein by ELISA using the purified monoclonal antibody and purified rabbit polyclonal antibody obtained in the above-described manner was established.

Specifically, the purified monoclonal antibody derived from a hybridoma (G2-2F2-3D11) was immobilized on a 96-well plate and blocked with BSA (bovine serum albumin). Test solutions containing purified MBP-MDC(C1) at concentrations of 0.156 to 5.00 μg/ml were prepared, added to wells in an amount of 100 μl per well, and reacted at room temperature for an hour. After the wells were washed, a solution (5 μg/ml) of the purified rabbit polyclonal antibody was added in an amount of 100 μl per well and reacted at room temperature for an hour. After the wells were washed, anti-rabbit IgG-HRP (5 μg/ml) was added in an amount of 100 μl per well and reacted at room temperature for an hour. After completion of the reaction, 2 mM sodium azide was added in an amount of 100 μl per well and the absorbances at 405 nm and 490 nm were measured. It was confirmed that the differential absorbances thus obtained were closely correlated with the concentrations of the test solutions, exhibiting an approximately linear relationship in the range of 0 to 2.5 μg/ml (see FIG. 8). This indicates that ELISA using these monoclonal antibody and rabbit polyclonal antibody can be used as a method for the determination of the MDC protein.

__________________________________________________________________________

SEQUENCE LISTING

(1) GENERAL INFORMATION:

(iii) NUMBER OF SEQUENCES: 20

(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 488 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:

LeuLeuSerSerGlnTyrValGluArgHisPheSerArgGluGlyThr

151015

ThrGlnHisSerThrGlyAlaGlyAspHisCysTyrTyrGlnGlyLys

202530

LeuArgGlyAsnProHisSerPheAlaAlaLeuSerThrCysGlnGly

354045

LeuHisGlyValPheSerAspGlyAsnLeuThrTyrIleValGluPro

505560

GlnGluValAlaGlyProTrpGlyAlaProGlnGlyProLeuProHis

65707580

LeuIleTyrArgThrProLeuLeuProAspProLeuGlyCysArgGlu

859095

ProGlyCysLeuPheAlaValProAlaGlnSerAlaProProAsnArg

100105110

ProArgLeuArgArgLysArgGlnValArgArgGlyHisProThrVal

115120125

HisSerGluThrLysTyrValGluLeuIleValIleAsnAspHisGln

130135140

LeuPheGluGlnMetArgGlnSerValValLeuThrSerAsnPheAla

145150155160

LysSerValValAsnLeuAlaAspValIleTyrLysGluGlnLeuAsn

165170175

ThrArgIleValLeuValAlaMetGluThrTrpAlaAspGlyAspLys

180185190

IleGlnValGlnAspAspLeuLeuGluThrLeuAlaArgLeuMetVal

195200205

TyrArgArgGluGlyLeuProGluProSerAsnAlaThrHisLeuPhe

210215220

SerGlyArgThrPheGlnSerThrSerSerGlyAlaAlaTyrValGly

225230235240

GlyIleCysSerLeuSerHisGlyGlyGlyValAsnGluTyrGlyAsn

245250255

MetGlyAlaMetAlaValThrLeuAlaGlnThrLeuGlyGlnAsnLeu

260265270

GlyMetMetTrpAsnLysHisArgSerSerAlaGlyAspCysLysCys

275280285

ProAspIleTrpLeuGlyCysIleMetGluAspThrGlyPheTyrLeu

290295300

ProArgLysPheSerArgCysSerIleAspGluTyrAsnGlnPheLeu

305310315320

GlnGluGlyGlyGlySerCysLeuPheAsnLysProLeuLysLeuLeu

325330335

AspProProGluCysGlyAsnGlyPheValGluAlaGlyGluGluCys

340345350

AspCysGlySerValGlnGluCysSerArgAlaGlyGlyAsnCysCys

355360365

LysLysCysThrLeuThrHisAspAlaMetCysSerAspGlyLeuCys

370375380

CysArgArgCysLysTyrGluProArgGlyValSerCysArgGluAla

385390395400

ValAsnGluCysAspIleAlaGluThrCysThrGlyAspSerSerGln

405410415

CysProProAsnLeuHisLysLeuAspGlyTyrTyrCysAspHisGlu

420425430

GlnGlyArgCysTyrGlyGlyArgCysLysThrArgAspArgGlnCys

435440445

GlnValLeuTrpGlyHisAlaAlaAlaAspArgPheCysTyrGluLys

450455460

LeuAsnValGluGlyThrGluArgGlySerCysGlyArgLysGlySer

465470475480

GlyTrpValGlnCysSerLysGln

485488

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 524 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2:

MetCysTrpLeuSerHisGlnLeuLeuSerSerGlnTyrValGluArg

151015

HisPheSerArgGluGlyThrThrGlnHisSerThrGlyAlaGlyAsp

202530

HisCysTyrTyrGlnGlyLysLeuArgGlyAsnProHisSerPheAla

354045

AlaLeuSerThrCysGlnGlyLeuHisGlyValPheSerAspGlyAsn

505560

LeuThrTyrIleValGluProGlnGluValAlaGlyProTrpGlyAla

65707580

ProGlnGlyProLeuProHisLeuIleTyrArgThrProLeuLeuPro

859095

AspProLeuGlyCysArgGluProGlyCysLeuPheAlaValProAla

100105110

GlnSerAlaProProAsnArgProArgLeuArgArgLysArgGlnVal

115120125

ArgArgGlyHisProThrValHisSerGluThrLysTyrValGluLeu

130135140

IleValIleAsnAspHisGlnLeuPheGluGlnMetArgGlnSerVal

145150155160

ValLeuThrSerAsnPheAlaLysSerValValAsnLeuAlaAspVal

165170175

IleTyrLysGluGlnLeuAsnThrArgIleValLeuValAlaMetGlu

180185190

ThrTrpAlaAspGlyAspLysIleGlnValGlnAspAspLeuLeuGlu

195200205

ThrLeuAlaArgLeuMetValTyrArgArgGluGlyLeuProGluPro

210215220

SerAsnAlaThrHisLeuPheSerGlyArgThrPheGlnSerThrSer

225230235240

SerGlyAlaAlaTyrValGlyGlyIleCysSerLeuSerHisGlyGly

245250255

GlyValAsnGluTyrGlyAsnMetGlyAlaMetAlaValThrLeuAla

260265270

GlnThrLeuGlyGlnAsnLeuGlyMetMetTrpAsnLysHisArgSer

275280285

SerAlaGlyAspCysLysCysProAspIleTrpLeuGlyCysIleMet

290295300

GluAspThrGlyPheTyrLeuProArgLysPheSerArgCysSerIle

305310315320

AspGluTyrAsnGlnPheLeuGlnGluGlyGlyGlySerCysLeuPhe

325330335

AsnLysProLeuLysLeuLeuAspProProGluCysGlyAsnGlyPhe

340345350

ValGluAlaGlyGluGluCysAspCysGlySerValGlnGluCysSer

355360365

ArgAlaGlyGlyAsnCysCysLysLysCysThrLeuThrHisAspAla

370375380

MetCysSerAspGlyLeuCysCysArgArgCysLysTyrGluProArg

385390395400

GlyValSerCysArgGluAlaValAsnGluCysAspIleAlaGluThr

405410415

CysThrGlyAspSerSerGlnCysProProAsnLeuHisLysLeuAsp

420425430

GlyTyrTyrCysAspHisGluGlnGlyArgCysTyrGlyGlyArgCys

435440445

LysThrArgAspArgGlnCysGlnValLeuTrpGlyHisAlaAlaAla

450455460

AspArgPheCysTyrGluLysLeuAsnValGluGlyThrGluArgGly

465470475480

SerCysGlyArgLysGlySerGlyTrpValGlnCysSerLysGlnPro

485490495

GlnGlnGlyArgAlaValTrpLeuProProLeuCysGlnHisLeuTrp

500505510

SerSerSerAlaArgGlyProGlyGlyArgHisGln

515520524

(2) INFORMATION FOR SEQ ID NO: 3:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 670 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3:

MetCysTrpLeuSerHisGlnLeuLeuSerSerGlnTyrValGluArg

151015

HisPheSerArgGluGlyThrThrGlnHisSerThrGlyAlaGlyAsp

202530

HisCysTyrTyrGlnGlyLysLeuArgGlyAsnProHisSerPheAla

354045

AlaLeuSerThrCysGlnGlyLeuHisGlyValPheSerAspGlyAsn

505560

LeuThrTyrIleValGluProGlnGluValAlaGlyProTrpGlyAla

65707580

ProGlnGlyProLeuProHisLeuIleTyrArgThrProLeuLeuPro

859095

AspProLeuGlyCysArgGluProGlyCysLeuPheAlaValProAla

100105110

GlnSerAlaProProAsnArgProArgLeuArgArgLysArgGlnVal

115120125

ArgArgGlyHisProThrValHisSerGluThrLysTyrValGluLeu

130135140

IleValIleAsnAspHisGlnLeuPheGluGlnMetArgGlnSerVal

145150155160

ValLeuThrSerAsnPheAlaLysSerValValAsnLeuAlaAspVal

165170175

IleTyrLysGluGlnLeuAsnThrArgIleValLeuValAlaMetGlu

180185190

ThrTrpAlaAspGlyAspLysIleGlnValGlnAspAspLeuLeuGlu

195200205

ThrLeuAlaArgLeuMetValTyrArgArgGluGlyLeuProGluPro

210215220

SerAsnAlaThrHisLeuPheSerGlyArgThrPheGlnSerThrSer

225230235240

SerGlyAlaAlaTyrValGlyGlyIleCysSerLeuSerHisGlyGly

245250255

GlyValAsnGluTyrGlyAsnMetGlyAlaMetAlaValThrLeuAla

260265270

GlnThrLeuGlyGlnAsnLeuGlyMetMetTrpAsnLysHisArgSer

275280285

SerAlaGlyAspCysLysCysProAspIleTrpLeuGlyCysIleMet

290295300

GluAspThrGlyPheTyrLeuProArgLysPheSerArgCysSerIle

305310315320

AspGluTyrAsnGlnPheLeuGlnGluGlyGlyGlySerCysLeuPhe

325330335

AsnLysProLeuLysLeuLeuAspProProGluCysGlyAsnGlyPhe

340345350

ValGluAlaGlyGluGluCysAspCysGlySerValGlnGluCysSer

355360365

ArgAlaGlyGlyAsnCysCysLysLysCysThrLeuThrHisAspAla

370375380

MetCysSerAspGlyLeuCysCysArgArgCysLysTyrGluProArg

385390395400

GlyValSerCysArgGluAlaValAsnGluCysAspIleAlaGluThr

405410415

CysThrGlyAspSerSerGlnCysProProAsnLeuHisLysLeuAsp

420425430

GlyTyrTyrCysAspHisGluGlnGlyArgCysTyrGlyGlyArgCys

435440445

LysThrArgAspArgGlnCysGlnValLeuTrpGlyHisAlaAlaAla

450455460

AspArgPheCysTyrGluLysLeuAsnValGluGlyThrGluArgGly

465470475480

SerCysGlyArgLysGlySerGlyTrpValGlnCysSerLysGlnAsp

485490495

ValLeuCysGlyPheLeuLeuCysValAsnIleSerGlyAlaProArg

500505510

LeuGlyAspLeuValGlyAspIleSerSerValThrPheTyrHisGln

515520525

GlyLysGluLeuAspCysArgGlyGlyHisValGlnLeuAlaAspGly

530535540

SerAspLeuSerTyrValGluAspGlyThrAlaCysGlyProAsnMet

545550555560

LeuCysLeuAspHisArgCysLeuProAlaSerAlaPheAsnPheSer

565570575

ThrCysProGlySerGlyGluArgArgIleCysSerHisHisGlyVal

580585590

CysSerAsnGluGlyLysCysIleCysGlnProAspTrpThrGlyLys

595600605

AspCysSerIleHisAsnProLeuProThrSerProProThrGlyGlu

610615620

ThrGluArgTyrLysGlyProSerGlyThrAsnIleIleIleGlySer

625630635640

IleAlaGlyAlaValLeuValAlaAlaIleValLeuGlyGlyThrGly

645650655

TrpGlyPheLysAsnIleArgArgGlyArgSerGlyGlyAla

660665670

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 769 amino acids

(B) TYPE: amino acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: protein

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

MetArgLeuLeuArgArgTrpAlaPheAlaAlaLeuLeuLeuSerLeu

151015

LeuProThrProGlyLeuGlyThrGlnGlyProAlaGlyAlaLeuArg

202530

TrpGlyGlyLeuProGlnLeuGlyGlyProGlyAlaProGluValThr

354045

GluProSerArgLeuValArgGluSerSerGlyGlyGluValArgLys

505560

GlnGlnLeuAspThrArgValArgGlnGluProProGlyGlyProPro

65707580

ValHisLeuAlaGlnValSerPheValIleProAlaPheAsnSerAsn

859095

PheThrLeuAspLeuGluLeuAsnHisHisLeuLeuSerSerGlnTyr

100105110

ValGluArgHisPheSerArgGluGlyThrThrGlnHisSerThrGly

115120125

AlaGlyAspHisCysTyrTyrGlnGlyLysLeuArgGlyAsnProHis

130135140

SerPheAlaAlaLeuSerThrCysGlnGlyLeuHisGlyValPheSer

145150155160

AspGlyAsnLeuThrTyrIleValGluProGlnGluValAlaGlyPro

165170175

TrpGlyAlaProGlnGlyProLeuProHisLeuIleTyrArgThrPro

180185190

LeuLeuProAspProLeuGlyCysArgGluProGlyCysLeuPheAla

195200205

ValProAlaGlnSerAlaProProAsnArgProArgLeuArgArgLys

210215220

ArgGlnValArgArgGlyHisProThrValHisSerGluThrLysTyr

225230235240

ValGluLeuIleValIleAsnAspHisGlnLeuPheGluGlnMetArg

245250255

GlnSerValValLeuThrSerAsnPheAlaLysSerValValAsnLeu

260265270

AlaAspValIleTyrLysGluGlnLeuAsnThrArgIleValLeuVal

275280285

AlaMetGluThrTrpAlaAspGlyAspLysIleGlnValGlnAspAsp

290295300

LeuLeuGluThrLeuAlaArgLeuMetValTyrArgArgGluGlyLeu

305310315320

ProGluProSerAsnAlaThrHisLeuPheSerGlyArgThrPheGln

325330335

SerThrSerSerGlyAlaAlaTyrValGlyGlyIleCysSerLeuSer

340345350

HisGlyGlyGlyValAsnGluTyrGlyAsnMetGlyAlaMetAlaVal

355360365

ThrLeuAlaGlnThrLeuGlyGlnAsnLeuGlyMetMetTrpAsnLys

370375380

HisArgSerSerAlaGlyAspCysLysCysProAspIleTrpLeuGly

385390395400

CysIleMetGluAspThrGlyPheTyrLeuProArgLysPheSerArg

405410415

CysSerIleAspGluTyrAsnGlnPheLeuGlnGluGlyGlyGlySer

420425430

CysLeuPheAsnLysProLeuLysLeuLeuAspProProGluCysGly

435440445

AsnGlyPheValGluAlaGlyGluGluCysAspCysGlySerValGln

450455460

GluCysSerArgAlaGlyGlyAsnCysCysLysLysCysThrLeuThr

465470475480

HisAspAlaMetCysSerAspGlyLeuCysCysArgArgCysLysTyr

485490495

GluProArgGlyValSerCysArgGluAlaValAsnGluCysAspIle

500505510

AlaGluThrCysThrGlyAspSerSerGlnCysProProAsnLeuHis

515520525

LysLeuAspGlyTyrTyrCysAspHisGluGlnGlyArgCysTyrGly

530535540

GlyArgCysLysThrArgAspArgGlnCysGlnValLeuTrpGlyHis

545550555560

AlaAlaAlaAspArgPheCysTyrGluLysLeuAsnValGluGlyThr

565570575

GluArgGlySerCysGlyArgLysGlySerGlyTrpValGlnCysSer

580585590

LysGlnAspValLeuCysGlyPheLeuLeuCysValAsnIleSerGly

595600605

AlaProArgLeuGlyAspLeuValGlyAspIleSerSerValThrPhe

610615620

TyrHisGlnGlyLysGluLeuAspCysArgGlyGlyHisValGlnLeu

625630635640

AlaAspGlySerAspLeuSerTyrValGluAspGlyThrAlaCysGly

645650655

ProAsnMetLeuCysLeuAspHisArgCysLeuProAlaSerAlaPhe

660665670

AsnPheSerThrCysProGlySerGlyGluArgArgIleCysSerHis

675680685

HisGlyValCysSerAsnGluGlyLysCysIleCysGlnProAspTrp

690695700

ThrGlyLysAspCysSerIleHisAsnProLeuProThrSerProPro

705710715720

ThrGlyGluThrGluArgTyrLysGlyProSerGlyThrAsnIleIle

725730735

IleGlySerIleAlaGlyAlaValLeuValAlaAlaIleValLeuGly

740745750

GlyThrGlyTrpGlyPheLysAsnIleArgArgGlyArgSerGlyGly

755760765

Ala

769

(2) INFORMATION FOR SEQ ID NO: 5:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1464 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..1464

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

CTCCTCTCCTCGCAATACGTGGAGCGCCACTTCAGCCGGGAGGGGACA48

LeuLeuSerSerGlnTyrValGluArgHisPheSerArgGluGlyThr

151015

ACCCAGCACAGCACCGGGGCTGGAGACCACTGCTACTACCAGGGGAAG96

ThrGlnHisSerThrGlyAlaGlyAspHisCysTyrTyrGlnGlyLys

202530

CTCCGGGGGAACCCGCACTCCTTCGCCGCCCTCTCCACCTGCCAGGGG144

LeuArgGlyAsnProHisSerPheAlaAlaLeuSerThrCysGlnGly

354045

CTGCATGGGGTCTTCTCTGATGGGAACTTGACTTACATCGTGGAGCCC192

LeuHisGlyValPheSerAspGlyAsnLeuThrTyrIleValGluPro

505560

CAAGAGGTGGCTGGACCTTGGGGAGCCCCTCAGGGACCCCTTCCCCAC240

GlnGluValAlaGlyProTrpGlyAlaProGlnGlyProLeuProHis

65707580

CTCATTTACCGGACCCCTCTCCTCCCAGATCCCCTCGGATGCAGGGAA288

LeuIleTyrArgThrProLeuLeuProAspProLeuGlyCysArgGlu

859095

CCAGGCTGCCTGTTTGCTGTGCCTGCCCAGTCGGCTCCTCCAAACCGG336

ProGlyCysLeuPheAlaValProAlaGlnSerAlaProProAsnArg

100105110

CCGAGGCTGAGAAGGAAAAGGCAGGTCCGCCGGGGCCACCCTACAGTG384

ProArgLeuArgArgLysArgGlnValArgArgGlyHisProThrVal

115120125

CACAGTGAAACCAAGTATGTGGAGCTAATTGTGATCAACGACCACCAG432

HisSerGluThrLysTyrValGluLeuIleValIleAsnAspHisGln

130135140

CTGTTCGAGCAGATGCGACAGTCGGTGGTCCTCACCAGCAACTTTGCC480

LeuPheGluGlnMetArgGlnSerValValLeuThrSerAsnPheAla

145150155160

AAGTCCGTGGTGAACCTGGCCGATGTGATATACAAGGAGCAGCTCAAC528

LysSerValValAsnLeuAlaAspValIleTyrLysGluGlnLeuAsn

165170175

ACTCGCATCGTCCTGGTTGCCATGGAAACATGGGCAGATGGGGACAAG576

ThrArgIleValLeuValAlaMetGluThrTrpAlaAspGlyAspLys

180185190

ATCCAGGTGCAGGATGACCTCCTGGAGACCCTGGCCCGGCTCATGGTC624

IleGlnValGlnAspAspLeuLeuGluThrLeuAlaArgLeuMetVal

195200205

TACCGACGGGAGGGTCTGCCTGAGCCCAGTAATGCCACCCACCTCTTC672

TyrArgArgGluGlyLeuProGluProSerAsnAlaThrHisLeuPhe

210215220

TCGGGCAGGACCTTCCAGAGCACGAGCAGCGGGGCAGCCTACGTGGGG720

SerGlyArgThrPheGlnSerThrSerSerGlyAlaAlaTyrValGly

225230235240

GGCATATGCTCCCTGTCCCATGGCGGGGGTGTGAACGAGTACGGCAAC768

GlyIleCysSerLeuSerHisGlyGlyGlyValAsnGluTyrGlyAsn

245250255

ATGGGGGCGATGGCCGTGACCCTTGCCCAGACGCTGGGACAGAACCTG816

MetGlyAlaMetAlaValThrLeuAlaGlnThrLeuGlyGlnAsnLeu

260265270

GGCATGATGTGGAACAAACACCGGAGCTCGGCAGGGGACTGCAAGTGT864

GlyMetMetTrpAsnLysHisArgSerSerAlaGlyAspCysLysCys

275280285

CCAGACATCTGGCTGGGCTGCATCATGGAGGACACTGGGTTCTACCTG912

ProAspIleTrpLeuGlyCysIleMetGluAspThrGlyPheTyrLeu

290295300

CCCCGCAAGTTCTCTCGCTGCAGCATCGACGAGTACAACCAGTTTCTG960

ProArgLysPheSerArgCysSerIleAspGluTyrAsnGlnPheLeu

305310315320

CAGGAGGGTGGTGGCAGCTGCCTCTTCAACAAGCCCCTCAAGCTCCTG1008

GlnGluGlyGlyGlySerCysLeuPheAsnLysProLeuLysLeuLeu

325330335

GACCCCCCAGAGTGCGGGAACGGCTTCGTGGAGGCAGGGGAGGAGTGC1056

AspProProGluCysGlyAsnGlyPheValGluAlaGlyGluGluCys

340345350

GACTGCGGCTCGGTGCAGGAGTGCAGCCGCGCAGGTGGCAACTGCTGC1104

AspCysGlySerValGlnGluCysSerArgAlaGlyGlyAsnCysCys

355360365

AAGAAATGCACCCTGACTCACGACGCCATGTGCAGCGACGGGCTCTGC1152

LysLysCysThrLeuThrHisAspAlaMetCysSerAspGlyLeuCys

370375380

TGTCGCCGCTGCAAGTACGAACCACGGGGTGTGTCCTGCCGAGAGGCC1200

CysArgArgCysLysTyrGluProArgGlyValSerCysArgGluAla

385390395400

GTGAACGAGTGCGACATCGCGGAGACCTGCACCGGGGACTCTAGCCAG1248

ValAsnGluCysAspIleAlaGluThrCysThrGlyAspSerSerGln

405410415

TGCCCGCCTAACCTGCACAAGCTGGACGGTTACTACTGTGACCATGAG1296

CysProProAsnLeuHisLysLeuAspGlyTyrTyrCysAspHisGlu

420425430

CAGGGCCGCTGCTACGGAGGTCGCTGCAAAACCCGGGACCGGCAGTGC1344

GlnGlyArgCysTyrGlyGlyArgCysLysThrArgAspArgGlnCys

435440445

CAGGTTCTTTGGGGCCATGCGGCTGCTGATCGCTTCTGCTACGAGAAG1392

GlnValLeuTrpGlyHisAlaAlaAlaAspArgPheCysTyrGluLys

450455460

CTGAATGTGGAGGGGACGGAGCGTGGGAGCTGTGGGCGCAAGGGATCC1440

LeuAsnValGluGlyThrGluArgGlySerCysGlyArgLysGlySer

465470475480

GGCTGGGTCCAGTGCAGTAAGCAG1464

GlyTrpValGlnCysSerLysGln

485

(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2923 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..27

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 1600..2923

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 28..1599

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

GCGTTTACTGGCAAACCGCATTTGTAAATGTGCTGGCTGAGCCACCAACTC51

MetCysTrpLeuSerHisGlnLeu

CTCTCCTCGCAATACGTGGAGCGCCACTTCAGCCGGGAGGGGACAACC99

LeuSerSerGlnTyrValGluArgHisPheSerArgGluGlyThrThr

101520

CAGCACAGCACCGGGGCTGGAGACCACTGCTACTACCAGGGGAAGCTC147

GlnHisSerThrGlyAlaGlyAsphisCysTyrTyrGlnGlyLysLeu

25303540

CGGGGGAACCCGCACTCCTTCGCCGCCCTCTCCACCTGCCAGGGGCTG195

ArgGlyAsnProHisSerPheAlaAlaLeuSerThrCysGlnGlyLeu

455055

CATGGGGTCTTCTCTGATGGGAACTTGACTTACATCGTGGAGCCCCAA243

HisGlyValPheSerAspGlyAsnLeuThrTyrIleValGluProGln

606570

GAGGTGGCTGGACCTTGGGGAGCCCCTCAGGGACCCCTTCCCCACCTC291

GluValAlaGlyProTrpGlyAlaProGlnGlyProLeuProHisLeu

758085

ATTTACCGGACCCCTCTCCTCCCAGATCCCCTCGGATGCAGGGAACCA339

IleTyrArgThrProLeuLeuProAspProLeuGlyCysArgGluPro

9095100

GGCTGCCTGTTTGCTGTGCCTGCCCAGTCGGCTCCTCCAAACCGGCCG387

GlyCysLeuPheAlaValProAlaGlnSerAlaProProAsnArgPro

105110115120

AGGCTGAGAAGGAAAAGGCAGGTCCGCCGGGGCCACCCTACAGTGCAC435

ArgLeuArgArgLysArgGlnValArgArgGlyHisProThrValHis

125130135

AGTGAAACCAAGTATGTGGAGCTAATTGTGATCAACGACCACCAGCTG483

SerGluThrLysTyrValGluLeuIleValIleAsnAspHisGlnLeu

140145150

TTCGAGCAGATGCGACAGTCGGTGGTCCTCACCAGCAACTTTGCCAAG531

PheGluGlnMetArgGlnSerValValLeuThrSerAsnPheAlaLys

155160165

TCCGTGGTGAACCTGGCCGATGTGATATACAAGGAGCAGCTCAACACT579

SerValValAsnLeuAlaAspValIleTyrLysGluGlnLeuAsnThr

170175180

CGCATCGTCCTGGTTGCCATGGAAACATGGGCAGATGGGGACAAGATC627

ArgIleValLeuValAlaMetGluThrTrpAlaAspGlyAspLysIle

185190195200

CAGGTGCAGGATGACCTCCTGGAGACCCTGGCCCGGCTCATGGTCTAC675

GlnValGlnAspAspLeuLeuGluThrLeuAlaArgLeuMetValTyr

205210215

CGACGGGAGGGTCTGCCTGAGCCCAGTAATGCCACCCACCTCTTCTCG723

ArgArgGluGlyLeuProGluProSerAsnAlaThrHisLeuPheSer

220225230

GGCAGGACCTTCCAGAGCACGAGCAGCGGGGCAGCCTACGTGGGGGGC771

GlyArgThrPheGlnSerThrSerSerGlyAlaAlaTyrValGlyGly

235240245

ATATGCTCCCTGTCCCATGGCGGGGGTGTGAACGAGTACGGCAACATG819

IleCysSerLeuSerHisGlyGlyGlyValAsnGluTyrGlyAsnMet

250255260

GGGGCGATGGCCGTGACCCTTGCCCAGACGCTGGGACAGAACCTGGGC867

GlyAlaMetAlaValThrLeuAlaGlnThrLeuGlyGlnAsnLeuGly

265270275280

ATGATGTGGAACAAACACCGGAGCTCGGCAGGGGACTGCAAGTGTCCA915

MetMetTrpAsnLysHisArgSerSerAlaGlyAspCysLysCysPro

285290295

GACATCTGGCTGGGCTGCATCATGGAGGACACTGGGTTCTACCTGCCC963

AspIleTrpLeuGlyCysIleMetGluAspThrGlyPheTyrLeuPro

300305310

CGCAAGTTCTCTCGCTGCAGCATCGACGAGTACAACCAGTTTCTGCAG1011

ArgLysPheSerArgCysSerIleAspGluTyrAsnGlnPheLeuGln

315320325

GAGGGTGGTGGCAGCTGCCTCTTCAACAAGCCCCTCAAGCTCCTGGAC1059

GluGlyGlyGlySerCysLeuPheAsnLysProLeuLysLeuLeuAsp

330335340

CCCCCAGAGTGCGGGAACGGCTTCGTGGAGGCAGGGGAGGAGTGCGAC1107

ProProGluCysGlyAsnGlyPheValGluAlaGlyGluGluCysAsp

345350355360

TGCGGCTCGGTGCAGGAGTGCAGCCGCGCAGGTGGCAACTGCTGCAAG1155

CysGlySerValGlnGluCysSerArgAlaGlyGlyAsnCysCysLys

365370375

AAATGCACCCTGACTCACGACGCCATGTGCAGCGACGGGCTCTGCTGT1203

LysCysThrLeuThrHisAspAlaMetCysSerAspGlyLeuCysCys

380385390

CGCCGCTGCAAGTACGAACCACGGGGTGTGTCCTGCCGAGAGGCCGTG1251

ArgArgCysLysTyrGluProArgGlyValSerCysArgGluAlaVal

395400405

AACGAGTGCGACATCGCGGAGACCTGCACCGGGGACTCTAGCCAGTGC1299

AsnGluCysAspIleAlaGluThrCysThrGlyAspSerSerGlnCys

410415420

CCGCCTAACCTGCACAAGCTGGACGGTTACTACTGTGACCATGAGCAG1347

ProProAsnLeuHisLysLeuAspGlyTyrTyrCysAspHisGluGln

425430435440

GGCCGCTGCTACGGAGGTCGCTGCAAAACCCGGGACCGGCAGTGCCAG1395

GlyArgCysTyrGlyGlyArgCysLysThrArgAspArgGlnCysGln

445450455

GTTCTTTGGGGCCATGCGGCTGCTGATCGCTTCTGCTACGAGAAGCTG1443

ValLeuTrpGlyHisAlaAlaAlaAspArgPheCysTyrGluLysLeu

460465470

AATGTGGAGGGGACGGAGCGTGGGAGCTGTGGGCGCAAGGGATCCGGC1491

AsnValGluGlyThrGluArgGlySerCysGlyArgLysGlySerGly

475480485

TGGGTCCAGTGCAGTAAGCAGCCCCAACAGGGACGTGCTGTGTGGCTT1539

TrpValGlnCysSerLysGlnProGlnGlnGlyArgAlaValTrpLeu

490495500

CCTCCTCTGTGTCAACATCTCTGGAGCTCCTCGGCTAGGGGACCTGGT1587

ProProLeuCysGlnHisLeuTrpSerSerSerAlaArgGlyProGly

505510515520

GGGAGACATCAGTAGTGTCACCTTCTACCACCAGGGCAAGGAGCTGGACTGC1639

GlyArgHisGln

524

AGGGGAGGCCACGTGCAGCTGGCGGACGGCTCTGACCTGAGCTATGTGGAGGATGGCACA1699

GCCTGCGGGCCTAACATGTTGTGCCTGGACCATCGCTGCCTGCCAGCTTCTGCCTTCAAC1759

TTCAGCACCTGCCCCGGCAGTGGGGAGCGCCGGATTTGCTCCCACCACGGGGTCTGCAGC1819

AATGAAGGGAAGTGCATCTGTCAGCCAGACTGGACAGGCAAAGACTGCAGTATCCATAAC1879

CCCCTGCCCACGTCCCCACCCACGGGGGAGACGGAGAGATATAAAGGTCCCAGCGGCACC1939

AACATCATCATTGGCTCCATCGCTGGGGCTGTCCTGGTTGCAGCCATCGTCCTGGGCGGC1999

ACGGGCTGGGGATTTAAAAACATTCGCCGAGGAAGGTCCGGAGGGGCCTAAGTGCCACCC2059

TCCTCCCTCCAAGCCTGGCACCCACCGTCTCGGCCCTGAACCACGAGGCTGCCCCCATCC2119

AGCCACGGAGGGAGGCACCATGCAAATGTCTTCCAGGTCCAAACCCTTCAACTCCTGGCT2179

CCGCAGGGGTTTGGGTGGGGGCTGTGGCCCTGCCCTTGGCACCACCAGGGTGGACCAGGC2239

CTGGAGGGCACTTCCTCCACAGTCCCCCACCCACCTCCTGCGGCTCAGCCTTGCACACCC2299

ACTGCCCCGTGTGAATGTAGCTTCCACCTCATGGATTGCCACAGCTCAACTCGGGGGCAC2359

CTGGAGGGATGCCCCCAGGCAGCCACCAGTGGACCTAGCCTGGATGGCCCCTCCTTGCAA2419

CCAGGCAGCTGAGACCAGGGTCTTATCTCTCTGGGACCTAGGGGGACGGGGCTGACATCT2479

ACATTTTTTAAAACTGAATCTTAATCGATGAATGTAAACTCGGGGGTGCTGGGGCCAGGG2539

CAGATGTGGGGATGTTTTGACATTTACAGGAGGCCCCGGAGAAACTGAGGTATGGCCATG2599

CCCTAGACCCTCCCCAAGGATGACCACACCCGAAGTCCTGTCACTGAGCACAGTCAGGGG2659

CTGGGCATCCCAGCTTGCCCCCGCTTAGCCCCGCTGAGCTTGGAGGAAGTATGAGTGCTG2719

ATTCAAACCAAAGCTGCCTGTGCCATGCCCAAGGCCTAGGTTATGGGTACGGCAACCACA2779

TGTCCCAGATCGTCTCCAATTCGAAAACAACCGTCCTGCTGTCCCTGTCAGGACACATGG2839

ATTTTGGCAGGGCGGGGGGGGGTTCTAGAAAATATAGGTTCCTATAATAAAATGGCACCT2899

TCCCCCTTTAAAAAAAAAAAAAAA2923

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2913 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(ix) FEATURE:

(A) NAME/KEY: 5'UTR

(B) LOCATION: 1..27

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 2038..2913

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 28..2037

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

GCGTTTACTGGCAAACCGCATTTGTAAATGTGCTGGCTGAGCCACCAACTC51

MetCysTrpLeuSerHisGlnLeu

CTCTCCTCGCAATACGTGGAGCGCCACTTCAGCCGGGAGGGGACAACC99

LeuSerSerGlnTyrValGluArgHisPheSerArgGluGlyThrThr

101520

CAGCACAGCACCGGGGCTGGAGACCACTGCTACTACCAGGGGAAGCTC147

GlnHisSerThrGlyAlaGlyAspHisCysTyrTyrGlnGlyLysLeu

25303540

CGGGGGAACCCGCACTCCTTCGCCGCCCTCTCCACCTGCCAGGGGCTG195

ArgGlyAsnProHisSerPheAlaAlaLeuSerThrCysGlnGlyLeu

455055

CATGGGGTCTTCTCTGATGGGAACTTGACTTACATCGTGGAGCCCCAA243

HisGlyValPheSerAspGlyAsnLeuThrTyrIleValGluProGln

606570

GAGGTGGCTGGACCTTGGGGAGCCCCTCAGGGACCCCTTCCCCACCTC291

GluValAlaGlyProTrpGlyAlaProGlnGlyProLeuProHisLeu

758085

ATTTACCGGACCCCTCTCCTCCCAGATCCCCTCGGATGCAGGGAACCA339

IleTyrArgThrProLeuLeuProAspProLeuGlyCysArgGluPro

9095100

GGCTGCCTGTTTGCTGTGCCTGCCCAGTCGGCTCCTCCAAACCGGCCG387

GlyCysLeuPheAlaValProAlaGlnSerAlaProProAsnArgPro

105110115120

AGGCTGAGAAGGAAAAGGCAGGTCCGCCGGGGCCACCCTACAGTGCAC435

ArgLeuArgArgLysArgGlnValArgArgGlyHisProThrValHis

125130135

AGTGAAACCAAGTATGTGGAGCTAATTGTGATCAACGACCACCAGCTG483

SerGluThrLysTyrValGluLeuIleValIleAsnAspHisGlnLeu

140145150

TTCGAGCAGATGCGACAGTCGGTGGTCCTCACCAGCAACTTTGCCAAG531

PheGluGlnMetArgGlnSerValValLeuThrSerAsnPheAlaLys

155160165

TCCGTGGTGAACCTGGCCGATGTGATATACAAGGAGCAGCTCAACACT579

SerValValAsnLeuAlaAspValIleTyrLysGluGlnLeuAsnThr

170175180

CGCATCGTCCTGGTTGCCATGGAAACATGGGCAGATGGGGACAAGATC627

ArgIleValLeuValAlaMetGluThrTrpAlaAspGlyAspLysIle

185190195200

CAGGTGCAGGATGACCTCCTGGAGACCCTGGCCCGGCTCATGGTCTAC675

GlnValGlnAspAspLeuLeuGluThrLeuAlaArgLeuMetValTyr

205210215

CGACGGGAGGGTCTGCCTGAGCCCAGTAATGCCACCCACCTCTTCTCG723

ArgArgGluGlyLeuProGluProSerAsnAlaThrHisLeuPheSer

220225230

GGCAGGACCTTCCAGAGCACGAGCAGCGGGGCAGCCTACGTGGGGGGC771

GlyArgThrPheGlnSerThrSerSerGlyAlaAlaTyrValGlyGly

235240245

ATATGCTCCCTGTCCCATGGCGGGGGTGTGAACGAGTACGGCAACATG819

IleCysSerLeuSerHisGlyGlyGlyValAsnGluTyrGlyAsnMet

250255260

GGGGCGATGGCCGTGACCCTTGCCCAGACGCTGGGACAGAACCTGGGC867

GlyAlaMetAlaValThrLeuAlaGlnThrLeuGlyGlnAsnLeuGly

265270275280

ATGATGTGGAACAAACACCGGAGCTCGGCAGGGGACTGCAAGTGTCCA915

MetMetTrpAsnLysHisArgSerSerAlaGlyAspCysLysCysPro

285290295

GACATCTGGCTGGGCTGCATCATGGAGGACACTGGGTTCTACCTGCCC963

AspIleTrpLeuGlyCysIleMetGluAspThrGlyPheTyrLeuPro

300305310

CGCAAGTTCTCTCGCTGCAGCATCGACGAGTACAACCAGTTTCTGCAG1011

ArgLysPheSerArgCysSerIleAspGluTyrAsnGlnPheLeuGln

315320325

GAGGGTGGTGGCAGCTGCCTCTTCAACAAGCCCCTCAAGCTCCTGGAC1059

GluGlyGlyGlySerCysLeuPheAsnLysProLeuLysLeuLeuAsp

330335340

CCCCCAGAGTGCGGGAACGGCTTCGTGGAGGCAGGGGAGGAGTGCGAC1107

ProProGluCysGlyAsnGlyPheValGluAlaGlyGluGluCysAsp

345350355360

TGCGGCTCGGTGCAGGAGTGCAGCCGCGCAGGTGGCAACTGCTGCAAG1155

CysGlySerValGlnGluCysSerArgAlaGlyGlyAsnCysCysLys

365370375

AAATGCACCCTGACTCACGACGCCATGTGCAGCGACGGGCTCTGCTGT1203

LysCysThrLeuThrHisAspAlaMetCysSerAspGlyLeuCysCys

380385390

CGCCGCTGCAAGTACGAACCACGGGGTGTGTCCTGCCGAGAGGCCGTG1251

ArgArgCysLysTyrGluProArgGlyValSerCysArgGluAlaVal

395400405

AACGAGTGCGACATCGCGGAGACCTGCACCGGGGACTCTAGCCAGTGC1299

AsnGluCysAspIleAlaGluThrCysThrGlyAspSerSerGlnCys

410415420

CCGCCTAACCTGCACAAGCTGGACGGTTACTACTGTGACCATGAGCAG1347

ProProAsnLeuHisLysLeuAspGlyTyrTyrCysAspHisGluGln

425430435440

GGCCGCTGCTACGGAGGTCGCTGCAAAACCCGGGACCGGCAGTGCCAG1395

GlyArgCysTyrGlyGlyArgCysLysThrArgAspArgGlnCysGln

445450455

GTTCTTTGGGGCCATGCGGCTGCTGATCGCTTCTGCTACGAGAAGCTG1443

ValLeuTrpGlyHisAlaAlaAlaAspArgPheCysTyrGluLysLeu

460465470

AATGTGGAGGGGACGGAGCGTGGGAGCTGTGGGCGCAAGGGATCCGGC1491

AsnValGluGlyThrGluArgGlySerCysGlyArgLysGlySerGly

475480485

TGGGTCCAGTGCAGTAAGCAGGACGTGCTGTGTGGCTTCCTCCTCTGT1539

TrpValGlnCysSerLysGlnAspValLeuCysGlyPheLeuLeuCys

490495500

GTCAACATCTCTGGAGCTCCTCGGCTAGGGGACCTGGTGGGAGACATC1587

ValAsnIleSerGlyAlaProArgLeuGlyAspLeuValGlyAspIle

505510515520

AGTAGTGTCACCTTCTACCACCAGGGCAAGGAGCTGGACTGCAGGGGA1635

SerSerValThrPheTyrHisGlnGlyLysGluLeuAspCysArgGly

525530535

GGCCACGTGCAGCTGGCGGACGGCTCTGACCTGAGCTATGTGGAGGAT1683

GlyHisValGlnLeuAlaAspGlySerAspLeuSerTyrValGluAsp

540545550

GGCACAGCCTGCGGGCCTAACATGTTGTGCCTGGACCATCGCTGCCTG1731

GlyThrAlaCysGlyProAsnMetLeuCysLeuAspHisArgCysLeu

555560565

CCAGCTTCTGCCTTCAACTTCAGCACCTGCCCCGGCAGTGGGGAGCGC1779

ProAlaSerAlaPheAsnPheSerThrCysProGlySerGlyGluArg

570575580

CGGATTTGCTCCCACCACGGGGTCTGCAGCAATGAAGGGAAGTGCATC1827

ArgIleCysSerHisHisGlyValCysSerAsnGluGlyLysCysIle

585590595600

TGTCAGCCAGACTGGACAGGCAAAGACTGCAGTATCCATAACCCCCTG1875

CysGlnProAspTrpThrGlyLysAspCysSerIleHisAsnProLeu

605610615

CCCACGTCCCCACCCACGGGGGAGACGGAGAGATATAAAGGTCCCAGC1923

ProThrSerProProThrGlyGluThrGluArgTyrLysGlyProSer

620625630

GGCACCAACATCATCATTGGCTCCATCGCTGGGGCTGTCCTGGTTGCA1971

GlyThrAsnIleIleIleGlySerIleAlaGlyAlaValLeuValAla

635640645

GCCTACGTCCTGGGCGGCACGGGCTGGGGATTTAAAAACATTCGCCGA2019

AlaIleValLeuGlyGlyThrGlyTrpGlyPheLysAsnIleArgArg

650655660

GGAAGGTCCGGAGGGGCCTAAGTGCCACCCTCCTCCCTCCAAGCCTGG2067

GlyArgSerGlyGlyAla

665670

CACCCACCGTCTCGGCCCTGAACCACGAGGCTGCCCCCATCCAGCCACGGAGGGAGGCAC2127

CATGCAAATGTCTTCCAGGTCCAAACCCTTCAACTCCTGGCTCCGCAGGGGTTTGGGTGG2187

GGGCTGTGGCCCTGCCCTTGGCACCACCAGGGTGGACCAGGCCTGGAGGGCACTTCCTCC2247

ACAGTCCCCCACCCACCTCCTGCGGCTCAGCCTTGCACACCCACTGCCCCGTGTGAATGT2307

AGCTTCCACCTCATGGATTGCCACAGCTCAACTCGGGGGCACCTGGAGGGATGCCCCCAG2367

GCAGCCACCAGTGGACCTAGCCTGGATGGCCCCTCCTTGCAACCAGGCAGCTGAGACCAG2427

GGTCTTATCTCTCTGGGACCTAGGGGGACGGGGCTGACATCTACATTTTTTAAAACTGAA2487

TCTTAATCGATGAATGTAAACTCGGGGGTGCTGGGGCCAGGGCAGATGTGGGGATGTTTT2547

GACATTTACAGGAGGCCCCGGAGAAACTGAGGTATGGCCATGCCCTAGACCCTCCCCAAG2607

GATGACCACACCCGAAGTCCTGTCACTGAGCACAGTCAGGGGCTGGGCATCCCAGCTTGC2667

CCCCGCTTAGCCCCGCTGAGCTTGGAGGAAGTATGAGTGCTGATTCAAACCAAAGCTGCC2727

TGTGCCATGCCCAAGGCCTAGGTTATGGGTACGGCAACCACATGTCCCAGATCGTCTCCA2787

ATTCGAAAACAACCGTCCTGCTGTCCCTGTCAGGACACATGGATTTTGGCAGGGCGGGGG2847

GGGGTTCTAGAAAATATAGGTTCCTATAATAAAATGGCACCTTCCCCCTTTAAAAAAAAA2907

AAAAAA2913

(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3183 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: cDNA to mRNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human fetal brain cDNA library

(ix) FEATURE:

(A) NAME/KEY: 3'UTR

(B) LOCATION: 2308..3183

(ix) FEATURE:

(A) NAME/KEY: CDS

(B) LOCATION: 1..2307

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

ATGAGGCTGCTGCGGCGCTGGGCGTTCGCGGCTCTGCTGCTGTCGCTG48

MetArgLeuLeuArgArgTrpAlaPheAlaAlaLeuLeuLeuSerLeu

151015

CTCCCCACGCCCGGTCTTGGGACCCAAGGTCCTGCTGGAGCTCTGCGA96

LeuProThrProGlyLeuGlyThrGlnGlyProAlaGlyAlaLeuArg

202530

TGGGGGGGCTTACCCCAGCTGGGAGGCCCAGGAGCCCCTGAGGTCACG144

TrpGlyGlyLeuProGlnLeuGlyGlyProGlyAlaProGluValThr

354045

GAACCCAGCCGTCTGGTTAGGGAGAGCTCCGGGGGAGAGGTCCGAAAG192

GluProSerArgLeuValArgGluSerSerGlyGlyGluValArgLys

505560

CAGCAGCTGGACACAAGGGTCCGCCAGGAGCCACCAGGGGGCCCGCCT240

GlnGlnLeuAspThrArgValArgGlnGluProProGlyGlyProPro

65707580

GTCCATCTGGCCCAGGTGAGTTTCGTCATCCCAGCCTTCAACTCAAAC288

ValHisLeuAlaGlnValSerPheValIleProAlaPheAsnSerAsn

859095

TTCACCCTGGACCTGGAGCTGAACCACCACCTCCTCTCCTCGCAATAC336

PheThrLeuAspLeuGluLeuAsnHisHisLeuLeuSerSerGlnTyr

100105110

GTGGAGCGCCACTTCAGCCGGGAGGGGACAACCCAGCACAGCACCGGG384

ValGluArgHisPheSerArgGluGlyThrThrGlnHisSerThrGly

115120125

GCTGGAGACCACTGCTACTACCAGGGGAAGCTCCGGGGGAACCCGCAC432

AlaGlyAspHisCysTyrTyrGlnGlyLysLeuArgGlyAsnProHis

130135140

TCCTTCGCCGCCCTCTCCACCTGCCAGGGGCTGCATGGGGTCTTCTCT480

SerPheAlaAlaLeuSerThrCysGlnGlyLeuHisGlyValPheSer

145150155160

GATGGGAACTTGACTTACATCGTGGAGCCCCAAGAGGTGGCTGGACCT528

AspGlyAsnLeuThrTyrIleValGluProGlnGluValAlaGlyPro

165170175

TGGGGAGCCCCTCAGGGACCCCTTCCCCACCTCATTTACCGGACCCCT576

TrpGlyAlaProGlnGlyProLeuProHisLeuIleTyrArgThrPro

180185190

CTCCTCCCAGATCCCCTCGGATGCAGGGAACCAGGCTGCCTGTTTGCT624

LeuLeuProAspProLeuGlyCysArgGluProGlyCysLeuPheAla

195200205

GTGCCTGCCCAGTCGGCTCCTCCAAACCGGCCGAGGCTGAGAAGGAAA672

ValProAlaGlnSerAlaProProAsnArgProArgLeuArgArgLys

210215220

AGGCAGGTCCGCCGGGGCCACCCTACAGTGCACAGTGAAACCAAGTAT720

ArgGlnValArgArgGlyHisProThrValHisSerGluThrLysTyr

225230235240

GTGGAGCTAATTGTGATCAACGACCACCAGCTGTTCGAGCAGATGCGA768

ValGluLeuIleValIleAsnAspHisGlnLeuPheGluGlnMetArg

245250255

CAGTCGGTGGTCCTCACCAGCAACTTTGCCAAGTCCGTGGTGAACCTG816

GlnSerValValLeuThrSerAsnPheAlaLysSerValValAsnLeu

260265270

GCCGATGTGATATACAAGGAGCAGCTCAACACTCGCATCGTCCTGGTT864

AlaAspValIleTyrLysGluGlnLeuAsnThrArgIleValLeuVal

275280285

GCCATGGAAACATGGGCAGATGGGGACAAGATCCAGGTGCAGGATGAC912

AlaMetGluThrTrpAlaAspGlyAspLysIleGlnValGlnAspAsp

290295300

CTCCTGGAGACCCTGGCCCGGCTCATGGTCTACCGACGGGAGGGTCTG960

LeuLeuGluThrLeuAlaArgLeuMetValTyrArgArgGluGlyLeu

305310315320

CCTGAGCCCAGTAATGCCACCCACCTCTTCTCGGGCAGGACCTTCCAG1008

ProGluProSerAsnAlaThrHisLeuPheSerGlyArgThrPheGln

325330335

AGCACGAGCAGCGGGGCAGCCTACGTGGGGGGCATATGCTCCCTGTCC1056

SerThrSerSerGlyAlaAlaTyrValGlyGlyIleCysSerLeuSer

340345350

CATGGCGGGGGTGTGAACGAGTACGGCAACATGGGGGCGATGGCCGTG1104

HisGlyGlyGlyValAsnGluTyrGlyAsnMetGlyAlaMetAlaVal

355360365

ACCCTTGCCCAGACGCTGGGACAGAACCTGGGCATGATGTGGAACAAA1152

ThrLeuAlaGlnThrLeuGlyGlnAsnLeuGlyMetMetTrpAsnLys

370375380

CACCGGAGCTCGGCAGGGGACTGCAAGTGTCCAGACATCTGGCTGGGC1200

HisArgSerSerAlaGlyAspCysLysCysProAspIleTrpLeuGly

385390395400

TGCATCATGGAGGACACTGGGTTCTACCTGCCCCGCAAGTTCTCTCGC1248

CysIleMetGluAspThrGlyPheTyrLeuProArgLysPheSerArg

405410415

TGCAGCATCGACGAGTACAACCAGTTTCTGCAGGAGGGTGGTGGCAGC1296

CysSerIleAspGluTyrAsnGlnPheLeuGlnGluGlyGlyGlySer

420425430

TGCCTCTTCAACAAGCCCCTCAAGCTCCTGGACCCCCCAGAGTGCGGG1344

CysLeuPheAsnLysProLeuLysLeuLeuAspProProGluCysGly

435440445

AACGGCTTCGTGGAGGCAGGGGAGGAGTGCGACTGCGGCTCGGTGCAG1392

AsnGlyPheValGluAlaGlyGluGluCysAspCysGlySerValGln

450455460

GAGTGCAGCCGCGCAGGTGGCAACTGCTGCAAGAAATGCACCCTGACT1440

GluCysSerArgAlaGlyGlyAsnCysCysLysLysCysThrLeuThr

465470475480

CACGACGCCATGTGCAGCGACGGGCTCTGCTGTCGCCGCTGCAAGTAC1488

HisAspAlaMetCysSerAspGlyLeuCysCysArgArgCysLysTyr

485490495

GAACCACGGGGTGTGTCCTGCCGAGAGGCCGTGAACGAGTGCGACATC1536

GluProArgGlyValSerCysArgGluAlaValAsnGluCysAspIle

500505510

GCGGAGACCTGCACCGGGGACTCTAGCCAGTGCCCGCCTAACCTGCAC1584

AlaGluThrCysThrGlyAspSerSerGlnCysProProAsnLeuHis

515520525

AAGCTGGACGGTTACTACTGTGACCATGAGCAGGGCCGCTGCTACGGA1632

LysLeuAspGlyTyrTyrCysAspHisGluGlnGlyArgCysTyrGly

530535540

GGTCGCTGCAAAACCCGGGACCGGCAGTGCCAGGTTCTTTGGGGCCAT1680

GlyArgCysLysThrArgAspArgGlnCysGlnValLeuTrpGlyHis

545550555560

GCGGCTGCTGATCGCTTCTGCTACGAGAAGCTGAATGTGGAGGGGACG1728

AlaAlaAlaAspArgPheCysTyrGluLysLeuAsnValGluGlyThr

565570575

GAGCGTGGGAGCTGTGGGCGCAAGGGATCCGGCTGGGTCCAGTGCAGT1776

GluArgGlySerCysGlyArgLysGlySerGlyTrpValGlnCysSer

580585590

AAGCAGGACGTGCTGTGTGGCTTCCTCCTCTGTGTCAACATCTCTGGA1824

LysGlnAspValLeuCysGlyPheLeuLeuCysValAsnIleSerGly

595600605

GCTCCTCGGCTAGGGGACCTGGTGGGAGACATCAGTAGTGTCACCTTC1872

AlaProArgLeuGlyAspLeuValGlyAspIleSerSerValThrPhe

610615620

TACCACCAGGGCAAGGAGCTGGACTGCAGGGGAGGCCACGTGCAGCTG1920

TyrHisGlnGlyLysGluLeuAspCysArgGlyGlyHisValGlnLeu

625630635640

GCGGACGGCTCTGACCTGAGCTATGTGGAGGATGGCACAGCCTGCGGG1968

AlaAspGlySerAspLeuSerTyrValGluAspGlyThrAlaCysGly

645650655

CCTAACATGTTGTGCCTGGACCATCGCTGCCTGCCAGCTTCTGCCTTC2016

ProAsnMetLeuCysLeuAspHisArgCysLeuProAlaSerAlaPhe

660665670

AACTTCAGCACCTGCCCCGGCAGTGGGGAGCGCCGGATTTGCTCCCAC2064

AsnPheSerThrCysProGlySerGlyGluArgArgIleCysSerHis

675680685

CACGGGGTCTGCAGCAATGAAGGGAAGTGCATCTGTCAGCCAGACTGG2112

HisGlyValCysSerAsnGluGlyLysCysIleCysGlnProAspTrp

690695700

ACAGGCAAAGACTGCAGTATCCATAACCCCCTGCCCACGTCCCCACCC2160

ThrGlyLysAspCysSerIleHisAsnProLeuProThrSerProPro

705710715720

ACGGGGGAGACGGAGAGATATAAAGGTCCCAGCGGCACCAACATCATC2208

ThrGlyGluThrGluArgTyrLysGlyProSerGlyThrAsnIleIle

725730735

ATTGGCTCCATCGCTGGGGCTGTCCTGGTTGCAGCCATCGTCCTGGGC2256

IleGlySerIleAlaGlyAlaValLeuValAlaAlaIleValLeuGly

740745750

GGCACGGGCTGGGGATTTAAAAACATTCGCCGAGGAAGGTCCGGAGGG2304

GlyThrGlyTrpGlyPheLysAsnIleArgArgGlyArgSerGlyGly

755760765

GCCTAAGTGCCACCCTCCTCCCTCCAAGCCTGGCACCCACCGTCTCGGCCCTG2357

Ala

769

AACCACGAGGCTGCCCCCATCCAGCCACGGAGGGAGGCACCATGCAAATGTCTTCCAGGT2417

CCAAACCCTTCAACTCCTGGCTCCGCAGGGGTTTGGGTGGGGGCTGTGGCCCTGCCCTTG2477

GCACCACCAGGGTGGACCAGGCCTGGAGGGCACTTCCTCCACAGTCCCCCACCCACCTCC2537

TGCGGCTCAGCCTTGCACACCCACTGCCCCGTGTGAATGTAGCTTCCACCTCATGGATTG2597

CCACAGCTCAACTCGGGGGCACCTGGAGGGATGCCCCCAGGCAGCCACCAGTGGACCTAG2657

CCTGGATGGCCCCTCCTTGCAACCAGGCAGCTGAGACCAGGGTCTTATCTCTCTGGGACC2717

TAGGGGGACGGGGCTGACATCTACATTTTTTAAAACTGAATCTTAATCGATGAATGTAAA2777

CTCGGGGGTGCTGGGGCCAGGGCAGATGTGGGGATGTTTTGACATTTACAGGAGGCCCCG2837

GAGAAACTGAGGTATGGCCATGCCCTAGACCCTCCCCAAGGATGACCACACCCGAAGTCC2897

TGTCACTGAGCACAGTCAGGGGCTGGGCATCCCAGCTTGCCCCCGCTTAGCCCCGCTGAG2957

CTTGGAGGAAGTATGAGTGCTGATTCAAACCAAAGCTGCCTGTGCCATGCCCAAGGCCTA3017

GGTTATGGGTACGGCAACCACATGTCCCAGATCGTCTCCAATTCGAAAACAACCGTCCTG3077

CTGTCCCTGTCAGGACACATGGATTTTGGCAGGGCGGGGGGGGGTTCTAGAAAATATAGG3137

TTCCTATAATAAAATGGCACCTTCCCCCTTTAAAAAAAAAAAAAAA3183

(2) INFORMATION FOR SEQ ID NO: 9:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9278 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(vi) ORIGINAL SOURCE:

(A) ORGANISM: Homo sapiens

(vii) IMMEDIATE SOURCE:

(A) LIBRARY: human DNA cosmid library

(ix) FEATURE:

(A) NAME/KEY: exon 1

(B) LOCATION: 28..44

(ix) FEATURE:

(A) NAME/KEY: exon 2

(B) LOCATION: 308..374

(ix) FEATURE:

(A) NAME/KEY: exon 3

(B) LOCATION: 909..994

(ix) FEATURE:

(A) NAME/KEY: exon 4

(B) LOCATION: 1081..1156

(ix) FEATURE:

(A) NAME/KEY: exon 5

(B) LOCATION: 1591..1657

(ix) FEATURE:

(A) NAME/KEY: exon 6

(B) LOCATION: 1725..1792

(ix) FEATURE:

(A) NAME/KEY: exon 7

(B) LOCATION: 2182..2256

(ix) FEATURE:

(A) NAME/KEY: exon 8

(B) LOCATION: 2339..2410

(ix) FEATURE:

(A) NAME/KEY: exon 9

(B) LOCATION: 2588..2754

(ix) FEATURE:

(A) NAME/KEY: exon 10

(B) LOCATION: 3248..3332

(ix) FEATURE:

(A) NAME/KEY: exon 11

(B) LOCATION: 3445..3535

(ix) FEATURE:

(A) NAME/KEY: exon 12

(B) LOCATION: 3645..3696

(ix) FEATURE:

(A) NAME/KEY: exon 13

(B) LOCATION: 4014..4113

(ix) FEATURE:

(A) NAME/KEY: exon 14

(B) LOCATION: 4196..4267

(ix) FEATURE:

(A) NAME/KEY: exon 15

(B) LOCATION: 4386..4478

(ix) FEATURE:

(A) NAME/KEY: exon 16

(B) LOCATION: 4920..5000

(ix) FEATURE:

(A) NAME/KEY: exon 17

(B) LOCATION: 5347..5397

(ix) FEATURE:

(A) NAME/KEY: exon 18

(B) LOCATION: 5501..5564

(ix) FEATURE:

(A) NAME/KEY: exon 19

(B) LOCATION: 5767..5866

(ix) FEATURE:

(A) NAME/KEY: exon 20

(B) LOCATION: 6073..6202

(ix) FEATURE:

(A) NAME/KEY: exon 21

(B) LOCATION: 6300..6468

(ix) FEATURE:

(A) NAME/KEY: exon 22

(B) LOCATION: 6557..6671

(ix) FEATURE:

(A) NAME/KEY: exon 23

(B) LOCATION: 6756..6846

(ix) FEATURE:

(A) NAME/KEY: exon 24

(B) LOCATION: 7829..7846

(ix) FEATURE:

(A) NAME/KEY: exon 25

(B) LOCATION: 8165..9038

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:

GCGTTTACTGGCAAACCGCATTTGTAAATGTGCTGGCTGAGCCANNNNNNNNNN54

MetCysTrpLeuSerHis

NNNNCCAGGTGAGTTTCGTCATCCAGCCTTCAACTCAAACTTCACCCTGGACCTGGAGCT114

GAACCAGTGAGNGTGGCCTTGAGCCCAAGAGGAAGGGCAGTGGTGGNNNGGGGGAGACAT174

GGCTAGGGCCTGGCTGCTGGGGGTCTGGGGGTTGGGCCTGGCGAGAGGGGACCTGGGTCC234

TGACCTGAGGCGAGCCTAAAGCCCGACCTCACCTCGCCCGTGACCCCCCTTCCTGCTGCC294

CCCTCTGTCTCAGCCAACTCCTCTCCTCGCAATACGTGGAGCGCCACTTC344

GlnLeuLeuSerSerGlnTyrValGluArgHisPhe

1015

AGCCGGGAGGGGACAACCCAGCACAGCACCGTGAGTGCCACTGCTGGGGA394

SerArgGluGlyThrThrGlnHisSerThr

2025

CCGGGGCCGGGGATGGAAGGGAGGTGCTGTTTCTGTGGTTCTGTGGTCACAGGTGTAGGG454

ACAGGTGGCCACTGGAGATGGGGTCCTGGGCCTGGCCCCTCAGCACCTTCCCTCTCTCCC514

GACCCAGGAGGCTCTGAGGGTGGACAGTGGGCAGCTTAGTGCATAGGGCCCTGAAGTCCC574

CTCACTTGGCCCCAGAGCTCTGACCCCCAGCCAGCCCACGTGGGGCCTACAGGGACACTC634

GTTCCGAGCAGGCTGCCAGGATCCNNNNNNNNNNNNATAGATGACGTGAAGGAGGCCCAG694

AGGTTCCTAACCCCAGAGGGCTAGGAACTTGCCCAGGGTGGCACGGCAAATTAGGAGCAC754

CAGCCATCTAGAAACAGGCTCCAGAGCCCCAGGNATACCCAGGGATNGTGGCCACCTGCA814

CACAGGGCAGCTTCAGTGTCCCCCAAAAAGCCTTGAGGCCCATTGGCTGCCCCCGGCCTC874

ATGCCAGCGTTCTGCTCACTGTTCTGCTCCTTAGGGGGCTGGAGACCACTGCTAC929

GlyAlaGlyAspHisCysTyr

3035

TACCAGGGGAAGCTCCGGGGGAACCCGCACTCCTTCGCCGCCCTCTCC977

TyrGlnGlyLysLeuArgGlyAsnProHisSerPheAlaAlaLeuSer

404550

ACCTGCCAGGGGCTGCAGTGAGTATGGGGAGGGGCCGGGCAGCTGGG1024

ThrCysGlnGlyLeuHis

AGAAGCCTCTGGCCCAGGCCTGGGGACGGAGGGGAGCTGCGCCTCTCTCTCCACAGT1081

GGGGTCTTCTCTGATGGGAACTTGACTTACATCGTGGAGCCCCAAGAG1129

GlyValPheSerAspGlyAsnLeuThrTyrIleValGluProGlnGlu

606570

GTGGCTGGACCTTGGGGAGCCCCTCAGGTAAGCCCCACACAACCCCT1176

ValAlaGlyProTrpGlyAlaProGln

7580

TGCCATCCTCTCTGGTGGCCCTGCCAAGCTTGTCCCAACAGCTGTTGCTGCCACCTCTTC1236

CTCCTCCGGCTCCTCCCTCAGTAACCCCAGCCTCACTGCCCTCTTCAGTGACCCCAGCTC1296

TGGTTCCCTCCCTCCTGTGCCCCAGCTCCCCCTGTGCCCCCAGCTCCAATGTCCCATCTG1356

TCCCATAAGTGACCTCCCATTGGGCTCCAATGTCCTTTGCCCCTGTCTCTCAGGGTGCCC1416

CCAGGTCTTGACCCCGGAATCTGAGCATCTGGGAGATCAGATCCGACATGGGAGCTGTGG1476

CCAGTTCTGGGTCACCCCAGGGTGGGGTGGAGGCGAGGGCTGGATCTGGCCCCCGCCAAG1536

TGGCCTGGAGCAGGCCCAGTTGGCACCCCAAGAACTAATTTCCCCTCATTGCAGGGA1593

Gly

CCCCTTCCCCACCTCATTTACCGGACCCCTCTCCTCCCAGATCCCCTC1641

ProLeuProHisLeuIleTyrArgThrProLeuLeuProAspProLeu

859095

GGATGCAGGGAACCAGGTAAGGGAGGGGAGGGGGGGTGGGGAGGGGCCNGGCTGTG1697

GlyCysArgGluProGly

100

CCCCCCTCACCTGCCCCTCCCCGACAGGCTGCCTGTTTGCTGTGCCTGCCCAG1750

CysLeuPheAlaValProAlaGln

105110

TCGGCTCCTCCAAACCGGCCGAGGCTGAGAAGGAAAAGGCAG1792

SerAlaProProAsnArgProArgLeuArgArgLysArgGln

115120125

GTACGGGGGCCCGCACAGACCTCGGGCTGCAGAGACCTCGGGCTGCAGAGAGACCTCGGC1852

CGTGGCCCAGAGCAGGAGGGCACCCTCATCTATGGCTGGGGCGAAGGAAGGCTCAGATGG1912

ATGTGGCTGGGGGCCAGGGACCGTGTCTGGGAGAAGCCCCCACCCCTTCCCTAATGCTGG1972

CATCTACAGAGGCCCCATCCTGGGCAAACCGAGGCTGCCTGCCCTCATTCCAAAGCTGAG2032

GAAGGACAGGACCCTCTGCCAGTGGGGAGCTGGCACTGTCCCTGGCTGGAGTCCAGACCC2092

CCCCATCCCCACCGAGTCTGTTCCTGGCTTGGCCATGAGATCAGTCAGACATGGAAGGGA2152

CTGATTCCAAGTGCCCACCCACCCCCCAGGTCCGCCGGGGCCACCCTACAGTG2205

ValArgArgGlyHisProThrVal

130135

CACAGTGAAACCAAGTATGTGGAGCTAATTGTGATCAACGACCACCAG2253

HisSerGluThrLysTyrValGluLeuIleValIleAsnAspHisGln

140145150

CTGGTGAGTGCCAGGGCAGGGACAGGGCGTGACACTGGGAGGCCCCTGAGGAG2306

Leu

CCTGGCCCTCCTCCCATTCTTCTCTCTCCCAGTTCGAGCAGATGCGACAGTCG2359

PheGluGlnMetArgGlnSer

155

GTGGTCCTCACCAGCAACTTTGCCAAGTCCGTGGTGAACCTGGCCGAT2407

ValValLeuThrSerAsnPheAlaLysSerValValAsnLeuAlaAsp

160165170175

GTGGTAAGCAGCTCTCCCTCCCTCCCTTCCCTCCTCCTCATGCCCCCCCACCC2460

Val

CACCACACACATTAGGGGGCACTGTCAGCCCCTGGCTCCCACTTCCTGGAGAGAACAGAC2520

AGGCCCTCCTCCAGCCCTGGCCCCAACACCCACTCCCACCCTCCAGCCCCCCTCATCTTC2580

TCCCCAGATATACAAGGAGCAGCTCAACACTCGCATCGTCCTGGTTGCC2629

IleTyrLysGluGlnLeuAsnThrArgIleValLeuValAla

180185190

ATGGAAACATGGGCAGATGGGGACAAGATCCAGGTGCAGGATGACCTC2677

MetGluThrTrpAlaAspGlyAspLysIleGlnValGlnAspAspLeu

195200205

CTGGAGACCCTGGCCCGGCTCATGGTCTACCGACGGGAGGGTCTGCCT2725

LeuGluThrLeuAlaArgLeuMetValTyrArgArgGluGlyLeuPro

210215220

GAGCCCAGTAATGCCACCCACCTCTTCTCGTGAGTCCCCCACCCTGCAC2774

GluProSerAsnAlaThrHisLeuPheSer

225230

CTCCTGCCAGCCTCTGCTAGTTGCTACAGTGCTTGGGATTACTTAACACCTGCCCTGTGC2834

TGGCTGCTCCTCTCAGAGTCTGGGGACTGGGCTCACCTTGCACCTGCCACCTACCCCCAG2894

CCACATGCAACAGCTGGGCATCATCCCCTGAATCTGAGGTTGATGCCCTTGTCTTAGCCC2954

TGGTGGTCCTCTTCTGCCTCTCACCTCCCCTTAGTTCTGTCTTTCCCTTCAACTGTCCCN3014

NNNNNNNNNNNAGAGTGAAACTCTGTCTCAAAAGAAAAANAAAANAAAAGAAGAAAAAAA3074

AGAACCCAAGGAGCGGGGGAAGGGTCTTGCCTGGGGTCACCAAGGCTGATGTAAAGGGCC3134

AGGCTCACCTCCTGAGGAAGGACTCTAGTGTGAGGGGCTCCCCAAGGCCCCACCACCACC3194

CGGGGAGCCACAGGGGAGGGCAGAAGCCATCCTGACAGCGCACTCCCTTCCAGGGGC3251

Gly

AGGACCTTCCAGAGCACGAGCAGCGGGGCAGCCTACGTGGGGGGCATA3299

ArgThrPheGlnSerThrSerSerGlyAlaAlaTyrValGlyGlyIle

235240245

TGCTCCCTGTCCCATGGCGGGGGTGTGAACGAGGTGAGCAGTG3342

CysSerLeuSerHisGlyGlyGlyValAsnGlu

250255260

GGGGGACATGGCTGGGGTGGCGGCTGAGGGAAAGGGGCTTAGGGGCACGACGTGCCTGNT3402

TGGAAGATGTAGACATCTGTGCCCCATCTTCCCCACCCCCAGTACGGCAACATG3456

TyrGlyAsnMet

GGGGCGATGGCCGTGACCCTTGCCCAGACGCTGGGACAGAACCTGGGC3504

GlyAlaMetAlaValThrLeuAlaGlnThrLeuGlyGlnAsnLeuGly

265270275280

ATGATGTGGAACAAACACCGGAGCTCGGCAGGTATCCTCCCCCAGAGGCCC3555

MetMetTrpAsnLysHisArgSerSerAlaGly

285290

CCGTGTGGCCCAGCAGCTCTGGAACGGGAGGGTGACAGTGGGAGGGGTGGTCCTTGGCCT3615

CCCTCATATCCGCCTGGCTCACCCCTCAGGGGACTGCAAGTGTCCAGACATC3667

AspCysLysCysProAspIle

295

TGGCTGGGCTGCATCATGGAGGACACTGGGTGAGTTCTTGGGGACAACC3716

TrpLeuGlyCysIleMetGluAspThrGly

300305

GGGGGAAGGTCTTGGGCGAGGGGAGTCTTAGAGCGAGCATTGTTTGGCAGTCTGGACCAG3776

GGGNNNNNNNNNNNNGAACACACCTTCCCTTCCAGGCCGGCTTGCGAGTCCCAGGTTCAA3836

GCGAGGGATGGGAGCGACAAGGGACAAGGCGGAGGATTCTGGTGCAATCCCGGGGCAGAT3896

CCTCCGCCTCCTCGCGATGGTGACGAAGTCCCCCAGTGTACCCCCTCCCCAGCCTTGAGA3956

GGGGTGAGGGTGGGTTGGAGGGGAGCAGCCAGCAGCACCTCCCCTCGCCCTATCCAGG4014

TTCTACCTGCCCCGCAAGTTCTCTCGCTGCAGCATCGACGAGTACAAC4062

PheTyrLeuProArgLysPheSerArgCysSerIleAspGluTyrAsn

310315320

CAGTTTCTGCAGGAGGGTGGTGGCAGCTGCCTCTTCAACAAGCCCCTC4110

GlnPheLeuGlnGluGlyGlyGlySerCysLeuPheAsnLysProLeu

325330335340

AAGGTACCAGCCCCGCGGCGGGGAGCATGGGAGCGGGCCCTGGGCGGGGTCCG4163

Lys

GGCCAGACTCCCGACCTGTCCTCCCGGTCCAGCTCCTGGACCCCCCAGAGTGC4216

LeuLeuAspProProGluCys

345

GGGAACGGCTTCGTGGAGGCAGGGGAGGAGTGCGACTGCGGCTCGGTG4264

GlyAsnGlyPheValGluAlaGlyGluGluCysAspCysGlySerVal

350355360

CAGGTGAGCGGTGGTGCGGGCGCCAGGTGGGGAACCGGGATGCGGGGGTGGGC4317

Gln

365

ACCAGGGAGCGTCTGAGTGGGAGGATTAGGGCTCGCCCGCCTCCTTCCCCTCCTCCCGCG4377

TCCCTCAGGAGTGCAGCCGCGCAGGTGGCAACTGCTGCAAGAAATGCACC4427

GluCysSerArgAlaGlyGlyAsnCysCysLysLysCysThr

370375

CTGACTCACGACGCCATGTGCAGCGACGGGCTCTGCTGTCGCCGCTGC4475

LeuThrHisAspAlaMetCysSerAspGlyLeuCysCysArgArgCys

380385390395

AAGGTAAGCAGGACCGGCCGGGAGGCGGGGCCAGGACGCAGGAGGAGCGATTG4528

Lys

GAGGCCTTCATATAAGGGGTGGGAGCTAGGGAGGGAAGCGGAGCCTTCGGGGACGAAGGC4588

CTCTGGGGCAGGGCTTGATGCGAAGACAGCGCCAATGGGAGCAAGGGCGGGCTGAAGGAT4648

GTTGAAGGCNNNNNNNNNNNNNNCGGACGGGAAGCTCCCAGAATCAAGGAGGGCGGGAAG4708

GTGGGCGGGCTTGGGGCGGTGCTGAGTGCGCTGGGAGCGAGGTGGGGAGCGTTCAAGAGG4768

TGGTGGGAGCAGGGAAATAAGAACAGGCCTAAACGGGGCCCTGGGGAGCTGGAGGGCCCG4828

GGGATGTGGGGGTCCAGAGAGCGGGGGGCCTGGGGAGGGCAGGGCCGAGGCATCCATCCT4888

GCCTGACTCGAGGAGCGCGTCTCTTCCCTAGTACGAACCACGGGGTGTGTCC4940

TyrGluProArgGlyValSer

400

TGCCGAGAGGCCGTGAACGAGTGCGACATCGCGGAGACCTGCACCGGG4988

CysArgGluAlaValAsnGluCysAspIleAlaGluThrCysThrGly

405410415

GACTCTAGCCAGGTCCGCCCGGCCCCGCCGTCTTGTGGAGCCCTGGGCGAGG5040

AspSerSerGln

420

CAACCCCTACCCTTGTCGATTTGGTTTTCCCGGACGAGTGCTCAGCACTCCCCTCCTCTC5100

CACAGCTGGCATCGACCTTCACTGATCAGACTGTTTTCTTATCTGAGAAAGGGGTTCTTC5160

ATGCTCCTGGCCTTGTTCCTTCAATCATTAAACCAGAATGTATCGTCTGGCTGGTATCCC5220

AGCGCCTGGGCCCGGTGNNNNNNNNNNNTACCCAGATTCCTCCTGGGCAGCCCTCAGCTC5280

CAGTCCTGGGCAGCCCTCAGCCCAGTCCTGGGACTGCTCCGCTCAACCCCACCCCTCTCT5340

CCACAGTGCCCGCCTAACCTGCACAAGCTGGACGGTTACTACTGTGAC5388

CysProProAsnLeuHisLysLeuAspGlyTyrTyrCysAsp

425430435

CATGAGCAGGTATGATGGCTGCCCCCTGAGCCTGGGATTCAGGGCAGTC5437

HisGluGln

440

TCTTATCTCCACTCTGACCACTCAGCATCTCCATCCCTTGCCTCTTAATTCTTGGACTCT5497

CAGGGCCGCTGCTACGGAGGTCGCTGCAAAACCCGGGACCGGCAGTGC5545

GlyArgCysTyrGlyGlyArgCysLysThrArgAspArgGlnCys

445450455

CAGGTTCTTTGGGGCCATGGTGAGTCTGCTAGGGCTGGAGTGGGACTCC5594

GlnValLeuTrpGlyHisAla

460

GGAGGAGCCCAGAGCTGAGAAGCTGGGGAGAGTGGGTTCCAGCTGAACAGGCCCCCAAGT5654

GTGTAGCTCCCCAGGATCTCAGGGAGCCCAGGCAGAGTGTGGGAGATGCAGGCCTGAGGT5714

CTTGGGGTGGGTCCTGGGGCACGTGGGGTCACTTGGCATCCTCTCCCCACAGCGGCT5771

Ala

GCTGATCGCTTCTGCTACGAGAAGCTGAATGTGGAGGGGACGGAGCGT5819

AlaAspArgPheCysTyrGluLysLeuAsnValGluGlyThrGluArg

465470475

GGGAGCTGTGGGCGCAAGGGATCCGGCTGGGTCCAGTGCAGTAAGCA5866

GlySerCysGlyArgLysGlySerGlyTrpValGlnCysSerLysGln

480485490495

GTGAGTACTGAGGCTCCCAGAGGGCCTCTCAGCTCCAGGGCAGGTGTGAGACTTTTCAGA5926

GATGGGGTAGTAGGTTCTCCCAGGAGGAGCCTGTCAGTCCCAATGGGCGGGCACGTGGCA5986

AATGAGGTGGCAGGGTGCAGGGTGAGGGCAGATTAGAGTTCAGTAGTTGAGTCTGAGGTC6046

AAACTTGGGGCTCACTGTCTCTATATGCCCCAACAGGGACGTGCTGTGTGG6097

ProGlnGlnGlyArgAlaValTrp

500

CTTCCTCCTCTGTGTCAACATCTCTGGAGCTCCTCGGCTAGGGGACCT6145

LeuProProLeuCysGlnHisLeuTrpSerSerSerAlaArgGlyPro

505510515

GGTGGGAGACATCAGTAGTGTCACCTTCTACCACCAGGGCAAGGAGCTGGACTGC6200

GlyGlyArgHisGln

520

AGGTGCTGACCAGCACCAAAACTCAGGGAGGGGACCTGGCAGCTGTGCTGGGGGTTAGAA6260

GATCTGGGGGCTGGAGGCTGGGCTGTGTCACTTCCCCAGGGGAGGCCACGTGCAGCTGGC6320

GGACGGCTCTGACCTGAGCTATGTGGAGGATGGCACAGCCTGCGGGCCTAACATGTTGTG6380

CCTGGACCATCGCTGCCTGCCAGCTTCTGCCTTCAACTTCAGCACCTGCCCCGGCAGTGG6440

GGAGCGCCGGATTTGCTCCCACCACGGGGTGACTGCCTGGAGCCCGGGATGGCGGGAGAA6500

GCTTACAAGAGGGGACAGGCCCCTGCTCACCTCTCCTGGCCCTGCCCTGCCTCTAGGTCT6560

GCAGCAATGAAGGGAAGTGCATCTGTCAGCCAGACTGGACAGGCAAAGACTGCAGTATCC6620

ATAACCCCCTGCCCACGTCCCCACCCACGGGGGAGACGGAGAGATATAAAGGTGAGGCTG6680

GAGCTGGCCGAGGGGGGTCTGTCTGTCCCGCTCTCTATGCCTGTCCTTGCCAGCTAAGCC6740

CTGCCATCCTCCCAGGTCCCAGCGGCACCAACATCATCATTGGCTCCATCGCTGGGGCTG6800

TCCTGGTTGCAGCCATCGTCCTGGGCGGCACGGGCTGGGGATTTAAGTAAGAGACACACA6860

CACCCTGTGCCCCCTGGCATCCTTGAGGGGGGATCAGAATCCCTACTGGTGGAGCTGAGG6920

GGGCCCTCCCTGAAAGCCCAACTGAACCAGAGCTCACACGTCATAGGTCCAAGTAGCCTG6980

CAGGGCTTAACATTTAGAAACTAGGAGATTTTAGGCTAGATGAGGTGCTCACGCCTGTAA7040

TCCCAGCACTTTGGGAGGCCAAGGCAGGCGGATCACCTGAGGTCAGGAATTCAAGACCAG7100

TCTGGCCAACATGGTGAAACCCGTCTCTATTAAAAATACAAAAATTAGCCAGCCATGGTG7160

GTGCACACCTGTAATCCCAGCTACTTGCGAGGCTGAGGCAGAGAATTGCTTGAACCCGGG7220

AGGTGGAGGTTGCAGTGAGCTGAGATCGCACCATTGCACTCCAGCCTTGGGTGACAGAGC7280

AAGACTGCGTCAAAAAAAAAAAAAAAAAAAAAAAAAAGGAAAGAAAGAGAGAAAGAAAAG7340

AAAAGAGAAAAGAAATCAGGAGATTTTACACTAGCAATTCGGATTTCCAGCTCTGGAAAC7400

ATGAAAAGGTTGAGCCCCAGCGTGCCTCTAAGCATCCCCAAATAGCCACAGAGTGGAGCT7460

GGGCAGGGGCCACCCAAGCCAGGCATGTGTCCTCCAGTCTCCAGTTCCCACCAGCCTATA7520

CTCCTTTGTGCGTGTCTAAGTTTGGGGTCCTTGTGCCTGGTCTTACCCCCCTTAATGTGC7580

AGAGGGAGGAACCCACGGCCCAAGGTCACATGATTGAGTTAGTAGCAGAGTCAGAGCTGG7640

AACCGGGACGCATTTTTGTGGGTGCCCTGGGTAATTCTCCCTGGCCCTTACATTAGTGTC7700

CAGGCCCCGGGGACCCCGGCCCCGCTCTGGGGCAAGGGGTCGCATGGCAGCCAAAGGCCC7760

CTCCCTGAGAGAAGCAAAAGGTCAGATGTCTCCTTTTCCTCTCCCCTTCCACCATCCTCC7820

CCCTGCAGAAACATTCGCCGAGGAAGGTACGACCCGACCCAGCTGGGGGCAGTGTGATGC7880

CGGCCACGTCATCCCTCCCGCTGTCCTTGTCTCCTCCATCTCATTCGTCACCCGCGTTCT7940

GTTGATGGGGTGCGGGGCCGATCCCACCCTGCGTGCCNNNNNNNNNNNNNATCTGTTTTG8000

TCTTCCATATCACCACTGTCTGACCTCCCGCAGATCCCTTCCCTGGCCAGCCTGTGACTT8060

GCCGCCTGCCTCCAGGGCCCAGAACTGAGCTCCGGGGCCCTGCTGGGGGGCTCTCCCCGA8120

GGCCCCTGCTCACGTCCTCCCCTGATGCCCCCTCTCCGTTCCAGGTCCGGAGGGGCCTAA8180

GTGCCACCCTCCTCCCTCCAAGCCTGGCACCCACCGTCTCGGCCCTGAACCACGAGGCTG8240

CCCCCATCCAGCCACGGAGGGAGGCACCATGCAAATGTCTTCCAGGTCCAAACCCTTCAA8300

CTCCTGGCTCCGCAGGGGTTTGGGTGGGGGCTGTGGCCCTGCCCTTGGCACCACCAGGGT8360

GGACCAGGCCTGGAGGGCACTTCCTCCACAGTCCCCCACCCACCTCCTGCGGCTCAGCCT8420

TGCACACCCACTGCCCCGTGTGAATGTAGCTTCCACCTCATGGATTGCCACAGCTCAACT8480

CGGGGGCACCTGGAGGGATGCCCCCAGGCAGCCACCAGTGGACCTAGCCTGGATGGCCCC8540

TCCTTGCAACCAGGCAGCTGAGACCAGGGTCTTATCTCTCTGGGACCTAGGGGGACGGGG8600

CTGACATCTACATTTTTTAAAACTGAATCTTAATCGATGAATGTAAACTCGGGGGTGCTG8660

GGGCCAGGGCAGATGTGGGGATGTTTTGACATTTACAGGAGGCCCCGGAGAAACTGAGGT8720

ATGGCCATGCCCTAGACCCTCCCCAAGGATGACCACACCCGAAGTCCTGTCACTGAGCAC8780

AGTCAGGGGCTGGGCATCCCAGCTTGCCCCCGCTTAGCCCCGCTGAGCTTGGAGGAAGTA8840

TGAGTGCTGATTCAAACCAAAGCTGCCTGTGCCATGCCCAAGGCCTAGGTTATGGGTACG8900

GCAACCACATGTCCCAGATCGTCTCCAATTCGAAAACAACCGTCCTGCTGTCCCTGTCAG8960

GACACATGGATTTTGGCAGGGCGGGGGGGGGTTCTAGAAAATATAGGTTCCTATAATAAA9020

ATGGCACCTTCCCCCTTTNNNNNNNNNNNNNNNGGGATACCTCTGAATATGGGTATCTGG9080

GGCTGGATATGGGTGGGACATGAGACTTCCTGTGACCAGCCACCCTGGCTCCCAGCTCTC9140

TGTATCCTCCTGCCCCGCCCTGGGGGGTGCCTACCCTGGNAGAACCCAGGGAGGAGTGGA9200

GGCTGCCTCTGCCTGGGCCTCCACACAGCATCCTGACATACGCCACCTGGGGTGGGGGTG9260

GGGAGGCAGGGCCAGGAG9278

(2) INFORMATION FOR SEQ ID NO: 10:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10:

GCACCTGCCCCGGCAGT17

(2) INFORMATION FOR SEQ ID NO: 11:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

CCAGGACAGCCCCAGCGATG20

(2) INFORMATION FOR SEQ ID NO: 12:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

GGCTGCTGATCGCTTCTGCTAC22

(2) INFORMATION FOR SEQ ID NO: 13:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

GAGAAGCTGAATGTGGAGGG20

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 19 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14:

GTCAGAGCCGTCCGCCAGC19

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

GCCATCCTCCACATAGCTCAGG22

(2) INFORMATION FOR SEQ ID NO: 16:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 26 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

GATGTAAGTCAAGTTCCCATCAGAGA26

(2) INFORMATION FOR SEQ ID NO: 17:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 25 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17:

AACAGCTGGTGGTCGTTGATCACAA25

(2) INFORMATION FOR SEQ ID NO: 18:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 20 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18:

ATGAGGCTGCTGCGGCGCTG20

(2) INFORMATION FOR SEQ ID NO: 19:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:

CACAGATCTGGGGGCATATGCTCCCTG27

(2) INFORMATION FOR SEQ ID NO: 20:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 27 base pairs

(B) TYPE: nucleic acid

(D) TOPOLOGY: linear

(ii) MOLECULE TYPE: Genomic DNA

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:

AACAAGCTTCTACTGATGTCTCCCACC27

__________________________________________________________________________

INVENTORS:

Nakamura, Yusuke, Emi, Mitsuru

THIS PATENT IS REFERENCED BY THESE PATENTS:

Patent	Priority	Assignee	Title
6395889,	Sep 09 1999	Millennium Pharmaceuticals, Inc	Nucleic acid molecules encoding human protease homologs
6683165,	Apr 13 1999	Genome Therapeutics Corporation	Human gene relating to respiratory diseases and obesity
6787644,	Feb 11 1998	Immunex Corp.	Metalloprotease-disintegrin ADAM23 (SVPH3-17) nucleic acids
7196168,	Feb 11 1998	Immunex Corporation	Metalloprotease-disintegrin ADAM23 (SVPH3-17)
7205146,	Jun 14 2000	Oscient Pharmaceuticals Corporation	Nucleotide and amino acid sequences relating to respiratory diseases and obesity
7282556,	May 15 2001	Emory University	Polynucleotides and polypeptides relating to the modulation of SIRPα-CD47
7399832,	Feb 11 1998	Immunex Corporation	Metalloprotease-disintegrin ADAM23 (SVPH3-17)
7501118,	Apr 13 1999	Oscient Pharmaceuticals Corporation	Human gene relating to respiratory diseases and obesity
7858300,	Oct 19 2001	Centre Hospitalier Regional et Universitaire de Tours	Methods and compositions to evaluate antibody treatment response

THIS PATENT REFERENCES THESE PATENTS:

Patent

Priority

Assignee

Title

ASSIGNMENT RECORDS Assignment records on the USPTO

///////

Executed on	Assignor	Assignee	Conveyance	Frame	Reel	Doc
May 13 1994		Cancer Institute	(assignment on the face of the patent)
May 13 1994		Eisai Co., Ltd.	(assignment on the face of the patent)
May 13 1994	NAKAMURA, YUSUKE	Cancer Institute	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	007276	0147	pdf
May 13 1994	EMI, MITSURU	Cancer Institute	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	007276	0147	pdf
May 13 1994	NAKAMURA, YUSUKE	EISAI CO , LTD	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	007276	0147	pdf
May 13 1994	EMI, MITSURU	EISAI CO , LTD	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	007276	0147	pdf
Nov 20 2006	EISAI CO , LTD	EISAI R&D MANAGEMENT CO , LTD	ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS	018972	0624	pdf

MAINTENANCE FEES AND DATES: Maintenance records on the USPTO

Date	Maintenance Fee Events
Aug 29 1997	ASPN: Payor Number Assigned.
Feb 22 2000	M183: Payment of Maintenance Fee, 4th Year, Large Entity.
Feb 04 2004	M1552: Payment of Maintenance Fee, 8th Year, Large Entity.
Feb 08 2008	M1553: Payment of Maintenance Fee, 12th Year, Large Entity.

Date	Maintenance Schedule
Sep 03 1999	4 years fee payment window open
Mar 03 2000	6 months grace period start (w surcharge)
Sep 03 2000	patent expiry (for year 4)
Sep 03 2002	2 years to revive unintentionally abandoned end. (for year 4)
Sep 03 2003	8 years fee payment window open
Mar 03 2004	6 months grace period start (w surcharge)
Sep 03 2004	patent expiry (for year 8)
Sep 03 2006	2 years to revive unintentionally abandoned end. (for year 8)
Sep 03 2007	12 years fee payment window open
Mar 03 2008	6 months grace period start (w surcharge)
Sep 03 2008	patent expiry (for year 12)
Sep 03 2010	2 years to revive unintentionally abandoned end. (for year 12)