A human gene termed apc is disclosed. Methods and kits are provided for assessing mutations of the apc gene in human tissues and body samples. apc mutations are found in familial adenomatous polyposis patients as well as in sporadic colorectal cancer patients. apc is expressed in most normal tissues. These results suggest that apc is a tumor suppressor.
|
2. A preparation of antibodies which specifically binds to a human apc protein which is the product of a mutant allele found in a tumor, wherein the antibodies do not specifically bind to other human proteins, and wherein the human apc protein is a mutant form of the amino acid sequence shown in SEQ ID NOS:2 and SEQ ID NO: 7, and the mutant allele is a mutant form of the nucleotide sequence shown in SEQ ID NO:1 having the sequence of SEQ ID NO: 7 but for a substitution of Arg→Cys at residue 414.
0. 1. A preparation of antibodies which specifically binds to a human apc (adenomatous polyposis coil) protein having an amino acid sequence as shown in SEQ ID NO:1, 2, or 7, and does not specifically bind to other human proteins.
0. 3. The preparation of
0. 4. The preparation of
0. 5. The preparation of
0. 6. The preparation of
0. 7. The preparation of
0. 8. The preparation of
|
, SEQ ID NO:5am are implicated in colorectal tumorigenesis, we searched for similarities between the two predicted proteins. Bourne has previously noted that MCC has the potential to form alpha helical coiled coils (Nature, Vol. 351, p. 188 (1991). Lupas and colleagues have recently developed a program for predicting coiled coil potential from primary sequence data (Science, Vol. 252, p. 1162 (1991) and we have used their program to analyze both MCC and APC. Analysis of MCC indicated a discontinuous pattern of coiled-coil domains separated by putative “hinge” or “sparer” “spacer” regions similar to those seen in laminin and other intermediate filament proteins. Analysis of the APC sequence revealed two regions in the N-terminal domain which had strong coiled coil-forming potential, and these regions corresponded to those that showed local similarities with myosia and IF proteins on database searching. In addition, one other putative coiled coil region was identified in the central region of APC. The potential for both APC and MCC to form coiled coils is interesting in that such structures often mediate homo- and hetero-oligomerization.
Finally, it had previously been noted that MCC shared a short similarity with the region of the m3 muscarinic acetylcholine receptor (mAChR) known to regulate specificity of G-protein coupling. The APC gene also contained a local similarity to the region of the m3 mAChR (SEQ ID NO:9) that overlapped with the MCC similarity (SEQ ID NO:10) (FIG. 4B). Although the similarities to ral2 (SEQ ID NO:8) (
Each of the six genes described above was expressed in normal colon mucosa, as indicated by their representation in colon cDNA libraries. To study expression of the genes in neoplastic colorectal epithelium, we employed reverse transcription-polymerase chain reaction (PCR) assays. Primers based on the sequences of FER, TB1, TB2, MCC, and APC were each used to design primers for PCR performed with cDNA templates. Each of these genes was found to be expressed in normal colon, in each of ten cell lines derived from colorectal cancers, and in tumor cell lines derived from lung and bladder tumors. The ten colorectal cancer cell lines included eight from patients with sporadic CRC and two from patients with FAP.
This example demonstrates a genetic analysis of the role of the FER gene in FAP and sporadic colorectal cancers.
We considered FER as a candidate because of its proximity to the FAP locus as judged by physical and genetic criteria (see Example 1), and its homology to known tyrosine kinases with oncogenic potential. Primers were designed to PCR-amplify the complete coding sequence of FER from the RNA of two colorectal cancer cell lines derived from FAP patients, cDNA was generated from RNA and used as a template for PCR. The primers used were 5′-AGAAGGATCCCTTGTGCAGTGTGGA-3′ (SEQ ID NO:95) and 5′-GACAGGATCCTGAAGCTGAGTTTG-3′ (SEQ ID NO:96). The underlined nucleotides were altered from the true FER sequence to create BamHI sites. The cell lines used were JW and Difi, both derived from colorectal cancers of FAP patients. (C. Paraskeva, B. G. Buckle, D. Sheer, C. B. Wigley, Int. J. Cancer 34, 49 (1984); M. E. Gross et al., Cancer Res. 51, 1452 (1991). The resultant 2554 basepair fragments were cloned and sequenced in their entirety. The PCR products were cloned in the BamHI site of Bluescript SK (Stratagene) and pools of at least 50 clones were sequenced en masse using T7 polymerase, as described in Nigro et al., Nature 342, 705 (1989).
Only a single conservative amino acid change (GTG→CTG, creating a val to leu substitution at codon 439) was observed. The region surrounding this codon was then amplified from the DNA of individuals without FAP and this substitution was found to be a common polymorphism, not specifically associated with FAP. Based on these results, we considered it unlikely (though still possible) the FER gene was responsible for FAP. To amplify the regions surrounding codon 439, the following primers were used: 5′-TCAGAAAGTGCTGAAGAG-3′ (SEQ ID NO:97) and 5′-GGAATAATTAGGTCTCCAA-3′ (SEQ ID NO:98). PCR products were digested with PstI, which yields a 50 bp fragment if codon 439 is leucine, but 26 and 24 bp fragments if it is valine. The primers used for sequencing were chosen from the FER cDNA sequence in Hao et al., supra.
This example demonstrates the genetic analysis of MCC, TB2, SRP and APC in FAP and sporadic colorectal tumors. Each of these genes is linked and encompassed by contig 3 (see FIG. 1).
Several lines of evidence suggested that this contig was of particular interest. First, at least three of the four genes in this contig were within the deleted region identified in two FAP patients. (See Example 5 infra). Second, allelic deletions of chromosome 5q21 in sporadic cancers appeared to be centered in this region. (Ashton-Rickardt et al., Oncogene, in press; and Miki et al., Japn. J. Cancer Res., in press.) Some tumors exhibited loss of proximal RFLP markers (up to and potentially including the 5′ end of MCC), but no loss of markers distal to MCC. Other tumors exhibited loss of markers distal to and perhaps including the 3′ end of MCC, but no loss of sequences proximal to MCC. This suggested either that different ends of MCC were affected by loss in all such cases, or alternatively, that two genes (one proximal to and perhaps including MCC, the other distal to MCC) were separate targets of deletion. Third, clones from each of the six FAP region genes were used as probes on Southern blots containing tumor DNA from patients with Sporadic CRC. Only two examples of somatic changes were observed in over 200 tumors studied: a rearrangement/deletion whose centrometric end was located within the MCC gene (Kinzler et al., supra) and an 800 bp insertion within the APC gene between nucleotides 4424 and 5584. Fourth, point mutations of MCC were observed in two tumors (Kinzler et al.) supra strongly suggesting that MCC was a target of mutation in at least some sporadic colorectal cancers.
Based on these results, we attempted to search for subtle alterations of contig 3 genes in patients with FAP. We chose to examine MCC and APC, rather than TB2 or SRP, because of the somatic mutations in MCC and APC noted above. To facilitate the identification of subtle alterations, the genomic sequences of MCC and APC exons were determined (see Table I, SEQ ID NO:24-38).
TABLE I
APC EXONS
EXON
NUCLEOTIDES1
EXON BOUNDARY SEQUENCE2
822 to 930
catgatgttatctgtatttacctatagtctaaattataccatctataatgtgcttaatttttag/GGTTCA. . .
(SEQ ID NO: 24)
. . .ACCAAG/gtaacagaagattacaaaccctggtcactaatgccatgactactttgctaag
(SEQ ID NO: 25)
931 to 1039
ggatattaaagtcgtaattttgtttctaaactcatttggcccacag/GTGGAA. . .
(SEQ ID NO: 26)
. . .ATCCAA/gtatgttctctatagtgtacatcgtagtgcatg
(SEQ ID NO: 27)
1310 to 1405
catcattgctcttcaaataacaaagcattatggtttatgttgattttatttttcag/TGCCAG. . .
(SEQ ID NO: 28)
. . .AACTAG/gtaagacaaaaatgttttttaatgacatagacaattactggtg
(SEQ ID NO: 29)
1406 to 1545
tagatgattgtctttttcctcttgccctttttaaattag/GGGGAC. . .
(SEQ ID NO: 30)
. . .AACAAG/gtatgtttttataacatgtatttcttaaggatagctcaggtctga
(SEQ ID NO: 31)
1546 to 1623
gcttggcttcaagttgtctttttaatgatcctctattctgtatttaatttacag/GCTACG. . .
(SEQ ID NO: 32)
. . .CAGCAG/gtactatttagaatttcacctgtttttcttttttctctttttctttgaggcagggtctcactctg
(SEQ ID NO: 33)
1624 to 1740
gcaactagtatgattttatgtataaattaatctaaaattgattaatttgacag/GTTATT. . .
(SEQ ID NO: 34)
. . .AAAAAG/gtacctttgaaaacatttagtactataatatgaatttcatgt
(SEQ ID NO: 35)
1741 to 1955
caactctaattagatgacccatattcagaaacttactag/GAATCA. . .
(SEQ ID NO: 36)
. . .CCACAG/gtatatatagagttttatattacttttaaagtacagaattcatactctcaaaaa
(SEQ ID NO: 37)
1956 to 8973
tcttgatttttatttcag/GCAAAT. . .
(SEQ ID NO: 38)
. . .GGTATTTATGCAAAAAAAAATGTTTTTGT
(SEQ ID NO: 1)
1Relative to predicted translation initiation site
2Small case letters represent introns, large case letters represent exons
The entire 3′ end of the cloned APC cDNA (nt 1956-8973) appeared to be encoded in this exon, as indicated by restriction endonuclease mapping and sequencing of the cloned genomic DNA. The ORF ended at nt 8535. The extreme 3′ end of the APC transcript has not yet been identified.
These sequences were used to design primers for PCR analysis of constitutional DNA from FAP patients.
We first amplified eight exons and surrounding introns of the MCC gene in affected individuals from 90 different FAP kindreds. The PCR products were analyzed by a ribonuclease (RNase) protein assay. In brief, the PCR products were hybridized to in vitro transcribed RNA probes representing the normal genomic sequences. The hybrids were digested with RNase A, which can cleave at single base pair mismatches within DNA-RNA hybrids, and the cleavage products were visualized following denaturing gel electrophoresis. Two separate RNase protection analyses were performed for each exon, one with the sense and one with the antisense strand. Under these conditions, approximately 40% of all mismatches are detectable. Although some amino acid variants of MCC were observed in FAP patients, all such variants were found in a small percentage of normal individuals. These variants were thus unlikely to be responsible for the inheritance of FAP.
We next examined three exons of the A PC APC gene. The three exons examined included those containing nt 822-930, 931-1309, and the first 300 nt of the most distal exon (nt 1956-2256). PCR and RNase protection analysis were performed as described in Kinzler et al. supra, using the primers underlined in Table I (SEQ ID NO:24-38). The primers for nt 1956-2256 were 5′-GCAAATCCTAAGAGAGAACAA-3′ (SEQ ID NO:99) and 5′-GATGGCAAGCTTGAGCCAG-3′ (SEQ ID NO:100).
In 90 kindreds, the RNase protection method was used to screen for mutations and in an additional 13 kindreds, the PCR products were cloned and sequenced to search for mutations not detectable by RNase protection. PCR products were cloned into a Bluescript vector modified as described in T. A. Holton and M. W. Graham, Nucleic Acids Res. 19, 1156 (1991). A minimum of 100 clones were pooled and sequenced. Five variants were detected among the 103 kindreds analyzed. Cloning and subsequent DNA sequencing of the PCR product of patient P21 indicated a C to T transition in codon 413 that resulted in a change from arginine to cysteine. This amino acid variant was not observed in any of 200 DNA samples from individuals without FAP. Cloning and sequencing of the PCR product from patients P24 and P34, who demonstrated the same abnormal RNase protection pattern indicated that both had a C to T transition at codon 801 that resulted in a change from arginine (CGA) to a stop codon (TGA). This change was not present in 200 individuals without FAP. As this point mutation resulted in the predicted loss of the recognition site for the enzyme Taq I, appropriate PCR products could be digested with Taq I to detect the mutation. This allowed us to determine that the stop codon co-segragated with disease phenotype in members of the family of P24. The inheritance of this change in affected members of the pedigree provides additional evidence for the importance of the mutation.
Cloning and sequencing of the PCR product from FAP patient P93 indicated a C to G transversion at codon 279, also resulting in a stop codon (change from TCA to TGA). This mutation was not present in 200 individuals without FAP. Finally, one additional mutation resulting in a serine (TCA) to stop codon (TGA) at codon 712 was detected in a single patient with FAP (patient P60).
The five germline mutations identified are summarized in Table IIA, as well as four others discussed in Example 9.
TABLE IIA
Germline mutations of the APC gene in FAP and GS Patients
EXTRA-
COLO-
NIC
AMINO
PATIENT
NUCLEOTIDE
ACID
DISEASE
CODON
CHANGE
CHANGE
AGE
93
279
TCA->TGA
Ser->Stop
39
Mandi-
bular
Osteoma
24
301
CGA->TGA
Arg->Stop
46
None
34
301
CGA->TGA
Arg->Stop
27
Des-
moid
Tumor
21
413
CGC->TGC
Arg->Cys
24
Mandi-
bular
Osteoma
60
712
TCA->TGA
Ser->Stop
37
Mandi-
bular
Osteoma
3746
243
CAGAG->CAG
splice-
junction
3460
301
CGA->TGA
Arg->Stop
3827
456
CTTTCA->CTTCA
frameshift
3712
500
T->G
Tyr->Stop
* The mutated nucleotides are underlined.
In addition to these germline mutations, we identified several somatic mutations of MCC and APC in sporadic CRC's. Seventeen MCC exons were examined in 90 sporadic colorectal cancers by RNase protection analysis. In each case where an abnormal RNase protection pattern was observed, the corresponding PCR products were cloned and sequenced. This led to the identification of six point mutations (two described previously) (Kinzler et al., supra), each of which was not found in the germline of these patients (Table IIB).
TABLE IIB
Somatic Mutations in Sporadic CRC Patients
NUCLEOTIDE
AMINO ACID
PATIENT
CODON
CHANGE
CHANGE
T35
MCC 12
GAG/gtaaga->
(Splice
GAG/gtaaaa
Donor)
T16
MCC 145
ctcag/GGA->
(Splice
atcag/GGA
Acceptor)
T47
MCC 267
CGG->CTG
Arg->Leu
T81
MCC 490
TCG->TTG
Ser->Leu
T35
MCC 506
CGG->CAG
Arg->Gln
T91
MCC 698
GCT->GTT
Ala->Val
T34
APC 288
CCAGT->CCCAGCCAGT
(Insertion)
T27
APC 331
CGA->TGA
Arg->Stop
T135
APC 437
CAA/gtaa->CAA/gcaa
(Splice Donor)
T20I
APC 1338
CAG->TAG
Gln->Stop
For splice site mutations, the codon nearest to the mutation is listed
The underlined nucleotides were mutant; small case letters represent introns, large case letters represent exons
Four of the mutations resulted in amino acid substitutions and two resulted in the alteration of splice site consensus elements. Mutations at analogous splice site positions in other genes have been shown to alter RNA processing in vivo and in vitro.
Three exons of APC were also evaluated in sporadic tumors. Sixty tumors were screened by RNase protection, and an additional 98 tumors were evaluated by sequencing. The exons examined included nt 822-930, 931-1309, and 1406-1545 (Table I). A total of three mutations were identified, each of which proved to be somatic. Tumor T27 contained a somatic mutation of CGA (arginine) to TGA (stop codon) at codon 33. Tumor T135 contained a GT to GC change at a splice donor site. Tumor T34 contained a 5 bp insertion (CAGCC between codons 288 and 289) resulting in a stop at codon 291 due to a frameshift. p We serendipitously discovered one additional somatic mutation in a colorectal cancer. During our attempt to define the sequences and splice patterns of the MCC and APC gene products in colorectal epithelial cells, we cloned cDNA from the colorectal cancer cell line SW480. The amino acid sequence of the MCC gene from SW480 was identical to that previously found in clones from human brain. The sequence of APC in SW480 cells, however, differed significantly, in that a transition at codon 1338 resulted in a change from glutamine (CAG) to a stop codon (TAG). To determine if this mutation was somatic, we recovered DNA from archival paraffin blocks of the original surgical specimen (T201) from which the tumor cell line was derived 28 years ago.
DNA was purified from paraffin sections as described in S. E. Goelz, S. R. Hamilton, and B. Vogelstein. Biochem. Biophys. Res. Comm. 130, 118 (1985). PCR was performed as described in reference 24, using the primers 5′-GTTCCAGCAGTGTCACAG-3′ (SEQ ID NO:101) and 5′-GGGAGATTTCGCTCCTGA-3′ (SEQ ID NO:102). A PCR product containing codon 1338 was amplified from the archival DNA and used to show that the stop codon represented a somatic mutation present in the original primary tumor and in cell lines derived from the primary and metastatic tumor sites, but not from normal tissue of the patient.
The ten point mutations in the MCC and APC genes so far discovered in sporadic CRCs are summarized in Table IIB. Analysis of the number of mutant and wild-type PCR clones obtained from each of these tumors showed that in eight of the ten cases, the wild-type sequence was present in approximately equal proportions to the mutant. This was confirmed by RFLP analysis using flanking markers from chromosome 5q which demonstrated that only two of the ten tumors (T135 and T201) exhibited an allelic deletion on chromosome 5q. These results are consistent with previous observations showing that 20-40% of sporadic colorectal tumors had allelic deletions of chromosome 5q. Moreover, these data suggest that mutations of 5q21 genes are not limited to those colorectal tumors which contain allelic deletions of this chromosome.
This example characterizes small, nested deletions in DNA from two unrelated FAP patients.
DNA from 40 FAP patients was screened with cosmids that has been mapped into a region near the APC locus to identify small deletions or rearrangements. Two of these cosmids, L5.71 =nd L5.79, hybridized with a 1200 kb NotI fragment in DNAs from most of the FAP patients screened.
The DNA of one FAP patient, 3214, showed only a 940 kb NotI fragment instead of the expected 1200 kb fragment. DNA was analyzed from four other members of the patient's immediate family; the 940 kb fragment was present in her affected mother (4711), but not in the other, unaffected family members. The mother also carried a normal 1200 kb Notl fragment that was transmitted to her two unaffected offspring. These observations indicated that the mutant polyposis allele is on the same chromosome as the 940 kb NotI fragment. A simple interpretation is that APC patients 3214 and 4711 each carry a 260 kb deletion within the APC locus.
If a deletion were present, then other enzymes might also be expected to produce fragments with altered mobilities. Hybridization of L5.79 to NruI-digested DNAs from both affected members of the family revealed a novel NruI fragment of 1300 kb, in addition to the normal 1200 kb NruI fragment. Furthermore, Mlul fragments in patients 3214 and 4711 also showed an increase in size consistent with the deletion of an MluI site. The two chromosome 5 homologs of patient 3214 were segregated in somatic cell hybrid lines; HHW1155 (deletion hybrid) carried the abnormal homolog and HHW1159 (normal hybrid) carried the normal homolog.
Because patient 8214 showed bray a 940 kb NotI fragment, she had not inherited the 1200 kb fragment present in the unaffected father's DNA. This observation suggests that he must be heterozygous for, and have transmitted, either a deletion of the L5.79 probe region or a variant NotI fragment too large to resolve on the gel system. As expected, the hybrid cell line HHW1159, which carries the paternal homolog, revealed no resolved. Not fragment when probed with L5.79. However, probing of HHW1159 DNA with L5.79 following digestion with other enzymes did reveal restriction fragments, demonstrating the presence of DNA homologous to the probe. The father is, therefore, interpreted as heterozygous for a polymorphism at the NotI site, with one chromosome 5 having a 1200 kb NotI fragment and the other having a fragment too large to resolve consistently on the gel. The latter was transmitted to patient 3214.
When double digests were used to order restriction sites within the 1200 kb NotI fragment, L5.71 and L5.79 were beth found to lie on a 550 kb NotI-NruI fragment and, therefore, on the same side of an NruI site in the 1200 kb NotI fragment. To obtain genomic representation of sequences present over the entire 1200 kb Notl fragment, we constructed a library of small-fragment inserts enriched for sequences from this fragment. DNA from the somatic cell hybrid HHW141, which contains about 40% of chromosome 5, was digested with NotI and electrophoresed under pulsed-field gel (PFG) conditions; EcoRI fragments from the 1200 kb region of this gel were cloned into a phage vector. Probe Map30 was isolated from this library. In normal individuals probe Map30 hybridizes to the 1200 kb NotI fragment and to a 200 kb NruI fragment. This latter hybridization places Map30 distal, with respect to the locations of L5.71 and L5.79, to the NruI site of the 550 kb NotI-NruI fragment.
Because Map30 hybridized to the abnormal, 1300 kb Nrul fragment of patient 3214, the locus defined by Map30 lies outside the hypothesized deletion. Furthermore, in normal chromosomes Map30 identified a 200 kb NruI fragment and L5.79 identified a 1200 kb NruI fragment; the hypothesized deletion must, therefore, be removing an NruI site, or sites, lying between Map30 and L5.79, and these two probes must flank the hypothesized deletion. A restriction map of the genomic region, showing placement of these probes, is shown in FIG. 5.
A NotI digest of DNA from another FAP patient, 3824 was probed with L5.79. In addition to the 1200 kb normal NotI fragment, a fragment of approximately 1100 kb was observed, consistent with the presence of a 100 kb deletion in one chromosome 5. In this case, however, digestion with NruI and MluI did not reveal abnormal bands, indicating that if a deletion were present, its boundaries must lie distal to the NruI and MluI sites of the fragments identified by L5.79. Consistent with this expectation, hybridization of Map30 to DNA from patient 3824 identified a 760 kb MluI fragment in addition to the expected 860 kb fragment, supporting the interpretation of a 100 kb deletion in this patient. The two chromosome 5 homologs of patient 3824 were segregated in somatic cell hybrid lines; HHW1291 was found to carry only the abnormal homolog and HHW1290 only the normal homolog.
That the 860 kb Mlul fragment identified by Map30 is distinct from the 830 kb MluI fragment identified previously by L5.79 was demonstrated by hybridization of Map30 and L5.79 to a NotI-MluI double digest of DNA from the hybrid cell (HHW1159) containing the nondeleted chromosome 5 homolog of patient 3214. As previously indicated, this hybrid is interpreted as missing one of the NotI sites that define the 1200 kb fragment. A 620 kb NotI-MluI fragment was seen with probe L5.79, and an 860 kb fragment was seen with Map30. Therefore, the 830 kb MluI fragment recognized by probe L5.79 must contain a NotI site in HHW1159 DNA; because the 860 kb MluI fragment remains intact, it does not carry this NotI site and must be distinct from the 830 kb Mlul fragment.
This example demonstrates the isolation of human sequences which span the region deleted in the two unrelated FAP patients characterized in Example 4.
A strong prediction of the hypothesis that patients 8214 and 3824 carry deletions is that some sequences present on normal chromosome 5 homologs would be missing from the hypothesized deletion homologs. Therefore, to develop genomic probes that might confirm the deletions, as well as to identify genes from the region, YAC clones from a contig seeded by cosmid L5.79 were localized from a library containing seven haploid human genome equivalents (Albertsen et al., Proc. Natl. Acad. Sci. U.S.A., Vol. 87, pp. 4256-4260 (1990)) with respect to the hypothesized deletions. Three clones, YACs 57B8, 310D8, and 183H12, were found to overlap the deleted region.
Importantly, one end of YAC 57B8 (clone AT57) was found to lie within the patient 3214 deletion. Inverse polymerase chain reaction (PCR) defined the end sequences of the insert of YAC 57B8. PCR primers based on one of these end sequences repeatedly failed to amplify DNA from the somatic cell hybrid (HHW1155) carrying the deleted homolog of patient 3214, but did amplify a product of the expected size from the somatic cell hybrid (HHW1159) carrying the normal chromosome 5 homolog. This result supported the interpretation that the abnormal restriction fragments found in the DNA of patient 3214 result from a deletion.
Additional support for the hypothesis of deletion in DNA from patient 3214 came from subcloned fragments of YAC 183H12, which spans the region in question. Y11, an EcoRI fragment cloned from YAC 183H12, hybridized to the normal, 1200 kb NotI fragment of patient 4711, but failed to hybridize to the abnormal, 940 kb Notl fragment of 4711 or to DNA from deletion cell line HHW1155. This result confirmed the deletion in patient 3214.
Two additional EcoRl fragments from YAC 183H12, Y10 and Y14, were localized within the patient 3214 deletion by their failure to hybridize to DNA from HHW1155. Probe Y10 hybridizes to a 150 kb NruI fragment in normal chromosome 5 homologs. Because the 3214 deletion creates the 1300 kb NruI fragment seen with the probes L5.79 and Map30 that flank the deletion, these NruI sites and the 150 kb NruI fragment lying between must be deleted in patient 3214. Furthermore, probe Y10 hybridizes to the same 620 kb Notl-MluI fragment seen with probe L5.79 in normal DNA, indicating its location as L5.79-proximal to the deleted MluI site and placing it between the MluI site and the L5.79-proximal NruI site. The MluI site must, therefore, lie between the NruI sites that define the 150 kb NruI fragment (see FIG. 5).
Probe Y11 also hybridized to the 150 kb NruI fragment in the normal chromosome 5 homolog, but failed to hybridize to the 620 kb Notl-MluI fragment, placing it L5.79-distal to the MluI site, but proximal to the second NruI site. Hybridization to the same (860 kb) MluI fragment as Map30 confirmed the localization of probe Y11 L5.79-distal to the MluI site.
Probe Y14 was shown to be L5.79-distal to both deleted NruI sites by virtue of its hybridization to the same 200 kb NruI fragment of the normal chromosome 5 seen with Map30. Therefore, the order of these EcoRI fragments derived from YAC 183H12 and deleted in patient 3214, with respect to L5.79 and Map30, is L5.79-Y10-Y11-Y14-Map30.
The 100 kb deletion of patient 3824 was confirmed by the failure of aberrant restriction fragments in this DNA to hybridize with probe Y11, combined with positive hybridizations to probes Y10 and/or Y14. Y10 and Y14 each hybridized to the 1100 kb NotI fragment of patient 3824 as well as to the normal 1200 kb NotI fragment, but Y11 hybridized to the 1200 kb fragment only. In the Mlul digest, probe Y14 hybridized to the 860 kb and 760 kb fragments of patient 3824 DNA, but probe Y11 hybridized only to the 860 k13 fragment. We conclude that the basis for the alteration in fragment size in DNA from patient 3824 is, indeed, a deletion. Furthermore, because probes Y10 and Y14 are missing from the deleted 3214 chromosome, but present on the deleted 3824 chromosome, and they have been shown to flank probe Y11, the deletion in patient 3824 must be nested within the patient 3214 deletion.
Probes Y10, Y11, Y14 and Map30 each hybridized to YAC 310D8, indicating that this YAC spanned the patient 3824 deletion and at a minimum, most of the 3214 deletion. The YAC characterizations, therefore, confirmed the presence of deletions in the patients and provided physical representation of the deleted region.
This example demonstrates that the MCC coding sequence maps outside of the region deleted in the two FAP patients characterized in Example 4.
An intriguing FAP candidate gene, MCC, recently was ascertained with cosmid L5.71 and was shown to have undergone mutation in colon carcinomas (Kinzler et al., supra). It was therefore of interest to map this gene with respect to the deletions in APC patients. Hybridization of MCC probes with an overlapping series of YAC clones extending in either direction from L5.71 showed that the 3′ end of MCC must be oriented toward the region of the two APC deletions.
Therefore, two 3′ cDNA clones from MCC were mapped with respect, to the deletions: clone 1CI (bp 2378-4181) and clone 7 (bp 2890-3560). Clone 1CI contains sequences from the C-terminal end of the open reading frame, which stops at nucleotide 2708, as well as 3′ untranslated sequence. Clone 7 contains sequence that is entirely 3′ to the open reading frame. Importantly, the entire 3′ untranslated sequence contained in the cDNA clones consists of a single 2.5 kb exon. These two clones were hybridized to DNAs from the YACs spanning the FAP region. Clone 7 fails to hybridize to YAC 310D8, although it does hybridize to YACs 183H12 and 57B8; the same result was obtained with the cDNA 1CI. Furthermore, these probes did show hybridization to DNAs from both hybrid cell lines (HWW1159 and HWW1155) and the lymphoblastoid cell line from patient 3214, confirming their locations outside the deleted region. Additional mapping experiments suggested that the 3′ end of the MCC cDNA clone contig is likely to be located more than 45 kb from the deletion of patient 3214 and, therefore, more than 100 kb from the deletion of patient 3824.
This example identifies three genes within the deleted region of chromosome 5 in the two unrelated FAP patients characterized in Example 4.
Genomic clones were used to screen cDNA libraries in three separate experiments. One screening was done with a phage clone derived from YAC 310D8 known to span the 260 kb deletion of patient 3214. A large-insert phage library was constructed from this YAC; screening with Y11 identified λ205, which mapped within both deletions. When clone λ205 was used to probe a random-, plus oligo(dT)-, primed fetal brain cDNA library (approximately 300,000 phage), six cDNA clones were isolated and each of them mapped entirely within both deletions. Sequence analysis of these six clones formed a single cDNA contig, but did not reveal an extended open reading frame. One of the six cDNAs was used to isolate more cDNA clones, some of which crossed the L5.71-proximal breakpoint of the 3824 deletion, as indicated by hybridization to both chromosome of this patient. These clones also contained an open reading frame, indicating a transcriptional orientation proximal to distal with respect to L5.71. This gene was named DP1 (deleted in polyposis 1). This gene is identical to TB2 described above.
cDNA walks yielded a cDNA contig of 3.0-3.5 kb, and included two clones containing terminal poly(A) sequences. This size corresponds to the 3.5 kb band seen by Northern analysis. Sequencing of the first 3163 bp of the cDNA contig revealed an open reading frame extending from the first base to nucleotide 631, followed by a 2.5 kb 3′ untranslated region. The sequence surrounding the methionine codon at base 77 conforms to the Kozak consensus of an initiation methionine (Kozak, 1984). Failed attempts to walk farther, coupled with the similarity of the lengths of isolated cDNA and mRNA, suggested that the NH2-terminus of the DP1 protein had been reached. Hybridization to a combination of genomic and YAC DNAs cut with various enzymes indicated the genomic coverage of DP1 to be approximately 30 kb.
Two additional probes for the locus, YS-11 and YS-39, which had been ascertained by screening of a cDNA library with an independent YAC probe identified with MCC sequences adjacent to L5.71, were mapped into the deletion region. YS-39 was shown to be a cDNA identical in sequence to DP1. Partial characterization of YS-11 had shown that 200 bp of DNA sequence at one end was identical to sequence coding for the 19 kb protein of the ribosomal signal recognition particle, SRP19 (Lingelbach et al., supra). Hybridization experiments mapped YS-11 within beth deletions. The sequence of this clone, however, was found to be complex. Although 454 bp of the 1032 bp sequence of YS-11 were identical to the GenBank entry for the SRP19 gene, another 578 bp appended 5′ to the SRP19 sequence was found to consist of previously unreported sequence containing no extended open reading frame. This suggested that YS-11 was either a chimeric clone containing two independent inserts or a clone of an incompletely processed or aberrant message. If YS-11 were a conventional chimeric clone, the independent segments would not be expected to map to the same physical region. The segments resulting from anomalous processing of a continuous transcript, however, would map to a single chromosomal region.
Inverse PCR with primers specific to the two ends of YS-11, the SRP19 ,end and the unidentified region, verified that both sequences map within the YAC 310D8; therefore, YS-11 is most likely a clone of an immature or anomalous mRNA species. Subsequently, both ends were shown to lie with the deleted region of patient 3824, and YS-11 was used to screen for additional cDNA clones.
Of the 14 cDNA clones selected from the fetal brain library, one clone, V5, was of particular interest in that it contained an open reading frame throughout, although it included only a short identity to the first 78 5′ bases of the YS-11 sequence. Following the 78 bp of identical sequence, the two cDNA sequences diverged at an AG. Furthermore, divergence from genomic sequence was also seen after these 78 bp, suggesting the presence of a splice junction, and supporting the view that YS-11 represents an irregular message.
Starting with V5, successive 5′ and 3′ walks were performed; the resulting cDNA contig consisted of more than 100 clones, which defined a new transcript, DP2. Clones walking in the 5′ directions crossed the 3824 deletion breakpoint farthest from L5.71; since its 3′ end is closer to this cosmid than its 5′ end, the transcriptional orientation of DP2 is opposite to that of MCC and DP1.
The third screening approach relied on hybridization with a 120 kb MluI fragment from YAC 57B8. This fragment hybridizes with probe Y11 and completely spans the 100 kb deletion in patient 3824, the fragment was purified on two preparative PFGs, labeled, and used to screen a fetal brain cDNA library. A number of cDNA clones previously identified in the development of the DP1 and DP2 contigs were reascertained. However, 19 new cDNA clones mapped into the patient 3824 deletion. Analysis indicated that these 19 formed a new contig, DP3, containing a large open reading frame.
A clone from the 5′ end of this new cDNA contig hybridized to the same EcoRI fragment as the 3′ end of DP2. Subsequently, the DP2 and DP3 contigs were connected by a single 5′ walking step from DP3, to form the single contig DP2.5. The complete nucleotide sequence of DP2.5 is shown in FIG. 9.
The consensus cDNA sequence of DP2.5 suggests that the entire coding sequence of DP2.5 has been obtained and is 8532 bp long. The most 5′ ATG codon occurs two codons from an in-frame stop and conforms to the Kozak initiation consensus (Kozak, Nucl. Acids. Res., Vol. 12, p. 857-872 1984). The 3′ open reading frame breaks down over the final 1.8 kb, giving multiple stops in all frames. A poly(A) sequence was found in one clone approximately 1 kb into the 3′ untranslated region, associated with a polyadenylation signal 33 bp upstream (position 9530). The open reading frame is almost identical to that identified as APC above.
An alternatively spliced exon at nucleotide 934 of the DP2.5 transcript is of potential interest, it was first discovered by noting that two classes of cDNA had been isolated. The more abundant cDNA class contains a 303 bp exon not included in the other. The presence in vivo of the two transcripts was verified by an exon connection experiment. Primers flanking the alternatively spliced exon were used to amplify, by PCR, cDNA prepared from various adult tissues. Two PCR products that differed in size by approximately 300 bases were amplified from all the tissues tested; the larger product was always more abundant than the smaller.
This example demonstrates the primers used to identify subtle mutations in DP1, SRP19, and DP25.
To obtain DNA sequence adjacent to the exons of the genes DP1, DP2.5, and SRP19, sequencing substrate was obtained by inverse PCR amplification of DNAs from two YACs, 310D8 and 183H12, that span the deletions. Ligation at low concentration cyclized the restriction enzyme-digested YAC DNAs. Oligonucleotides with sequencing tails, designed in inverse orientation at intervals along the cDNAs, primed PCR amplification from the cyclized templates. Comparison of these DNA sequences with the cDNA sequences placed exon boundaries at the divergence points. SRP19 and DP1 were each shown to have five exons. DP2.5 consisted of 15 exons. The sequences of the oligonucleotides synthesized to provide PCR amplification primers for the exons of each of these genes are listed in Table III SEQ ID NO:39-94.
TABLE III
Sequences of Primers Used for SSCP Analyses
Exon
Primer 1
Primer 2
DP1
UP-TCCCCGCCTGCCGCTCTC
RP-GCAGCGGCGGCTCCCGTG
UP-GTGAACGGCTCTCATGCTGC
RP-ACGTGCGGGGAGGAATGGA
UP-ATGATATCTTACCAAATGATATAC
RP-TTATTCCTACTTCTTCTATACAG
UP-TACCCATGCTGGCTCTTTTTC
RP-TGGGGCCATCTTGTTCCTGA
UP-ACATTAGGCACAAAGCTTGCAA
RP-ATCAAGCTCCAGTAAGAAGGTA
SRP19
UP-TGCGGCTCCTGGGTTGTTG
RP-GCCCCTTCCTTTCTGAGGAC
UP-TTTTCTCCTGCCTCTTACTGC
RP-ATGACACCCCCCATTCCCTC
UP-CCACTTAAAGCACATATATTTAGT
RP-GTATGGAAAATAGTGAAGAACC
UP-TTCTTAAGTCCTGTTTTTCTTTTG
RP-TTTAGAACCTTTTTTGTGTTGTG
UP-CTCAGATTATACACTAAGCCTAAC
RP-CATGTCTCTTACAGTAGTACCA
DP2.5
UP-AGGTCCAAGGGTAGCCAAGG*
RP-TAAAAATGGATAAACTACAATTAAAAG
UP-AAATACAGAATCATGTCTTGAAGT
RP-ACACCTAAAGATGACAATTTGAG
UP-TAACTTAGATAGCAGTAATTTCCC*
RP-ACAATAAACTGGAGTACACAAGG
UP-ATAGGTCATTGCTTCTTGCTGAT*
RP-TGAATTTTAATGGATTACCTAGGT
UP-CTTTTTTTGCTTTTACTGATTAACG
RP-TGTAATTCATTTTATTCCTAATACCTC
UP-GGTAGCCATAGTATGATTATTTCT
RP-CTACCTATTTTTATACCCACAAAC
UP-AAGAAAGCCTACACCATTTTTGC
RP-GATCATTCTTAGAACCATCTTGC
UP-ACCTATAGTCTAAATTATACCATC
RP-GTCATGGCATTACTGACCAG
UP-AGTCGTAATTTTGTTTCTAAACTC
RP-TGAAGGACTCCGATTTCACCC*
UP-TCATTCACTCACAGCCTGATGAC*
RP-GCTTTGAAACATGCACTACGAT
UP-AAACATCATTGCTCTTCAAATAAC
RP-TACCATGATTTAAAAATCCACCAG
UP-GATGATTGTCTTTTTCCTCTTTGC
RP-CTGAGCTATCTTAAGAAATCACTG
UP-TTTTAAATGATCCTCTATTCTGTAT
RP-ACAGAGTCAGACCCTCCCTCAAAG
UP-TTTCTATTCTTACTGCTAGCATT
RP-ATACACAGGTAAGAAATTAGGA
UP-TAGATGACCCATATTCTCTTC
RP-CAATTAGGTCTTTTTGAGAGTA
3-A
UP-GTTACTGCATACACATTGTGAC
RP-GCTTTTTGTTTCGTAACATGAAG*
-B
UP-AGTACAAGGATGCCAATATTATG*
RP-ACTTCTATCTTTTTCAGAACGAG*
-C
UP-ATTTGAATACTACAGTGTTACCC*
RP-CTTGTATTCTAATTTGGCATAAGG*
-D
UP-CTGCCCATACACATTCAAACAC*
RP-TGTTTGCGTCTTGCCCATCTT*
-E
UP-AGTCTTAAATATTCAGATGAGCAG*
RP-GTTTCTCTTCATTATATTTTATGCTA*
-F
UP-AAGCCTACCAATTATAGTGAACG*
RP-AGCTGATGACAAAGATGATAATC*
-G
UP-AAGAAACAATACAGACTTATTGTG*
RP-ATGAGTGGGGTCTCCTGAAC*
-H
UPATCTCCCTCCAAAAGTGGTGC*
RP-TCCATCTGGAGTACTTTCTGTG*
-I
UP-AGTAAATGCTGCAGTTCAGAGG*
RP-CCGTGGCATATCATCCCCC*
-J
UP-CCCAGACTGCTTCAAAATTACC*
RP-GAGCCTCATCTGTACTTCTGC*
-K
UP-CCCTCCAAATGAGTTAGCTGC*
RP-TTGTGGTATAGGTTTTACTGGTG*
-L
UP-ACCCAACAAAAATCAGTTAGATG*
RP-GTGGCTGGTAACTTTAGCCTC*
-N
UP-ATGATGTTGACCTTTCCAGGG*
RP-ATTGTGTAACTTTTCATCAGTTGC*
-M
UP-AAAGACATACCAGACAGAGGG*
RP-CTTTTTTGGCATTGCGGAGCT*
-O
UP-AAGATGACCTGTTGCAGGAATG*
RP-GAATCAGACCAAGCTTGTCTAGAT*
-P
UP-CAATAGTAAGTAGTTTACATCAAG*
RP-AAACAGGACTTGTACTGTAGGA*
-Q
UP-CAGCCCCTTCAAGCAAACATC*
RP-GAGGACTTATTCCATTTCTACC*
-R
UP-CAGTCTCCTGGCCGAAACTC*
RP-GTTGACTGGCGTACTAATACAG*
-S
UP-TGGTAATGGAGCCAATAAAAAGG*
RP-TGGGACTTTTCGCCATCCAC*
-T
UP-TGTCTCTATCCACACATTCGTC*
RP-ATGTTTTTCATCCTCACTTTTTGC*
-U
UP-GGAGAAGAACTGGAAGTTCATC*
RP-TTGAATCTTTAATGTTTGGATTTGC*
-V
UP-TCTCCCACAGGTAATACTCCC
RP-GCTACAACTGAATGGGGTACG
-W
UP-CAGGACAAAATAATCCTGTCCC
RP-ATTTTCTTACTTTCATTCTTCCTC
All primers are read in the 5′ to 3′ direction, the first primer in each pair lies 5′ of the exon it amplifies; the second primer lies 3′ of the exon it amplifies. Primers that lie within the exons are identified by an asterisk.
UP represents the -21M13 universal primer sequence;
RP represents the M13 reverse primer sequence.
With the exception of exons 1, 3, 4, 9, and 15 of DP2.5 (see below), the primer sequences were located in intron sequences flanking the exons. The 5′ primer of exon 1 is complementary to the cDNA sequence, but extends just into the 5′ Kozak consensus sequence for the initiator methionine, allowing a survey of the translated sequences. The 5′ primer of exon 3 is actually in the 5′ coding sequences of this exon, as three separate intronic primers simply would not amplify. The 5′ primer of exon 4 just overlaps the 5′ end of this exon, and we thus fail to survey the 19 most 5′ bases of this exon. For exon 9, two overlapping primer sets were used, such that each had one end within the exon. For exon 15, the large 3′ exon of DP2.5, overlapping primer pairs were placed along the length of the exon; each pair amplified a product of 250-400 bases.
This example demonstrates the use of single stranded conformation polymorphism (SSCP) analysis as described by Orita et al. Proc. Natl. Acad. Sci. U.S.A., Vol. 86, pp. 2766-70 (1989) and Genomics, Vol. 5, pp. 874-879 (1989) as applied to DP1, SRP19 and DP2.5.
SSCP analysis identifies most single- or multiple-base changes in DNA fragments up to 400 bases in length. Sequence alterations are detected as shifts in electrophoretic mobility of single-stranded DNA on nondenaturing acrylamide gels; the two complementary strands of a DNA segment usually resolve as two SSCP conformers of distinct mobilities. However, if the sample is from an individual heterozygous for a base-pair variant within the amplified segment, often three or more bands are seen. In some cases, even the sample from a homozygous individual will show multiple bands. Base-pair-change variants are identified by differences in pattern among the DNAs of the sample set.
Exons of the candidate genes were amplified by PCR from the DNAs of 61 unrelated FAP patients and a control set of 12 normal individuals. The five exons from DP1 revealed no unique conformers in the FAP patients, although common conformers were observed with exons 2 and 3 in some individuals of both affected and control sets, indicating the presence of DNA sequence polymorphisms. Likewise, none of the five exons of SRP19 revealed unique conformers in DNA from FAP patients in the test panel.
Testing of exons 1 through 14 and primer sets A through N of exon 15, of the DP2.5 gene, however, revealed variant conformers specific to FAP patients in exons 7, 8, 10, 11, and 15. These variants were in the unrelated patients 3746, 3460, 3827, 3712, and 3751, respectively. The PCR-SSCP procedure was repeated for each of these exons in the five affected individuals and in an expanded set of 48 normal controls. The variant bands were reproducible in the FAP patients but were not observed in any of the control DNA samples. Additional variant conformers in exons 11 and 15 of the DP2.5 gene were seen; however, each of these was found in both the affected and control DNA sets. The five sets of conformers unique to the FAP patients were sequenced to determine the nucleotide changes responsible for their altered mobilities. The normal conformers from the host individuals were sequenced also. Bands were cut from the dried acrylamide gels, and the DNA was eluted. PCR amplification of these DNAs provided template for sequencing.
The sequences of the unique conformers from exons 7, 8, 10, and 11 of DP2.5 revealed dramatic mutations in the DP2.5 gene. The sequence of the new mutation creating the exon 7 conformer in patient 3746 was shown to contain a deletion of two adjacent nucleotides, at positions 730 and 731 in the cDNA sequence (
To confirm the 2-base deletion, the PCR product from FAP patient 3746 and a control DNA were electrophoresed on an acrylamide-urea denaturing gel, along with the products of a sequencing reaction. The sample from patient 3746 showed two bands differing in size by 2 nucleotides, with the larger band identical in mobility to the control sample; this result was independent confirmation that patient, 3746 is heterozygous for a 2 bp deletion.
The unique conformer found in exon 8 of patient 3460 was found to carry a C-T transition, at position 904 in the cDNA sequence of DP2.5 (shown in FIG. 7), which replaced the normal sequence of CGA with TGA. This point mutation, when read in frame, results in a stop codon replacing the normal arginine codon. This single-base change had occurred within the context of a CG dimer, a potential hot spot for mutation (Barker et al., 1984).
The conformer unique to FAP patient 3827 in exon 10 was found to contain a deletion of one nucleotide (1367, 1368, or 1369) when compared to the normal sequence found in the other bands on the SSCP gel. This deletion, occurring within a set of three T's, changed the sequence from CTTTCA to CTTCA; this 1 base frameshift creates a downstream stop within 30 bases. The PCR product amplified from this patient's DNA also was electrophoresed on an acrylamide-urea denaturing gel, along with the PCR product from a control DNA and products from a sequencing reaction. The patient's PCR product showed two bands differing by 1 bp in length, with the larger identical in mobility to the PCR product from the normal DNA; this result confirmed the presence of a 1 bp deletion in patient 3827.
Sequence analysis of the variant conformer of exon 11 from patient 3712 revealed the substitution of a T by a G at position changing the normal tyrosine codon to a stop codon.
The pair of conformers observed in exon 15 of the DP2.5 gene for FAP patient 3751 also was sequenced. These conformers were found to carry a nucleotide substitution of C to G at position 5253, the third base of a valine codon. No amino acid change resulted from this substitution, suggesting that this conformer reflects a genetically silent polymorphism.
The observation of distinct inactivating mutations in the DP2.5 gene in four unrelated patients strongly suggested that DP2.5 is the gene involved in FAP. These mutations are summarized in Table IIA.
This example demonstrates that the mutations identified in the DP2.5 (APC) gene segregate with the FAP phenotype.
Patient 3746, described above as carrying an APC allele with a frameshift mutation, is an affected offspring of two normal parents. Colonoscopy revealed no polyps in either parent nor among the patient's three siblings.
DNA samples from both parents, from the patient's wife, and from their three children were examined. SSCP analysis of DNA from both of the patient's parents displayed the normal pattern of conformers for exon 7, as did DNA from the patient's wife and one of his off-spring. The two other children, however, displayed the same new conformers as their affected father. Testing of the patient and his parents with highly polymorphic VNTR (variable number of tandem repeat) markers showed a 99.98% likelihood that they are his biological parents.
These observations confirmed that this novel conformer, known to reflect a 2 bp deletion mutation in the DP2.5 gene, appeared spontaneously with FAP in this pedigree and was transmitted to two of the children of the affected individual.
This example demonstrates polymorphisms in the APC gene which appear to be unrelated to disease (FAP).
Sequencing of variant conformers found among controls as well as individuals with APC has revealed the following polymorphisms in the APC gene; first, in exon 11, at position 1458, a substitution of T to C creating an RsaI restriction site but no amino acid change; and second, in exon 15, at positions 5037 and 5271, substitutions of A to G and G to T, respectively, neither resulting in amino acid substitutions. These nucleotide polymorphisms in the APC gene sequence may be useful for diagnostic purposes.
This example shows the structure of the APC gene.
The structure of the APC gene is schematically shown in
The continuity of the very large (6.5 kb), most 3′ exon in DP2.5 was shown in two ways. First, inverse PCR with primers spanning the entire length of this exon revealed no divergence of the cDNA sequence from the genomic sequence. Second, PCR amplification with converging primers placed at intervals along the exon generated products of the same size whether amplified from the originally isolated cDNA, cDNA from various tissues, or genomic template. Two forms of exon 9 were found in DP2.5: one is the complete exon; and the other, labeled exon 9A, is the result of a splice into the interior of the exon that deletes bases 934 to 1236 in the mRNA and removes 101 amino acids from the predicted protein (see
This example demonstrates the mapping of the FAP deletions with respect to the APC exons.
Somatic cell hybrids carrying the segregated chromosomes 5 from the 100 kb (HHW1291) and 260 kb (HHW1155) deletion patients were used to determine the distribution of the APC genes exons across the deletions. DNAs from these cell lines were used as template, along with genomic DNA from a normal control, for PCR-based amplification of the APC exons.
PCR analysis of the hybrids from the 260 kb deletion of patient 3214 showed that all but one (exon 1) of the APC exons are removed by this deletion. PCR analysis of the somatic cell hybrid HHW1291, carrying the chromosome 5 homolog with the 100 kb deletion from patient 3824, revealed that exons 1 through 9 are present but exons 10 through 15 are missing. This result placed the deletion breakpoint either between exons 9 and 10 or within exon 10.
This example demonstrates the expression of alternately spliced APC messenger in normal tissues and in cancer cell lines.
Tissues that express the APC gene were identified by PCR amplification of cDNA made to mRNA with primers located within adjacent APC exons. In addition, PCR primers that flank the alternatively spliced exon 9 were chosen so that the expression pattern of both splice forms could be assessed. All tissue types tested (brain, lung, aorta, spleen, heart, kidney, liver, stomach, placenta, and colonic mucosa) and cultured cell lines (lymphoblasts, HL60, and choriocarcinoma) expressed both splice forms of the APC gene. We note, however, that expression by lymphocytes normally residing in some tissues, including colon, prevents unequivocal assessment of expression. The large mRNA, containing the complete exon 9 rather than only exon 9A, appears to be the more abundant message.
Northern analysis of poly(A)-selected RNA from lymphoblasts revealed a single band of approximately 10 kb, consistent with the size of the sequenced cDNA.
This example discusses structural features of the APC protein predicted from the sequence.
The cDNA consensus sequence of APC predicts that the longer, more abundant form of the message codes for a 2842 or 2844 amino acid peptide with a mass of 311.8 kd. This predicted APC peptide was compared with the current data bases of protein and DNA sequences using both Intelligenetics and GCG software packages. No genes with a high degree of amino acid sequence similarity were found. Although many short (approximately 20 amino acid) regions of sequence similarity were uncovered, none was sufficiently strong to reveal which, if any, might represent functional homology. Interestingly, multiple similarities to myosins and keratins did appear. The APC gene also was scanned for sequence motifs of known function; although multiple glycosylation, phosphorylation, and myristoylation sites were seen, their significance is uncertain.
Analysis of the APC peptide sequence did identify features important in considering potential protein structure. Hydropathy plots (Kyte and Doolittle, J. Mol. Biol. Vol. 157, pp. 105-132 (1982)) indicate that the APC protein is notably hydrophilic. No hydrophobic domains suggesting a signal peptide or a membrane-spanning domain were found. Analysis of the first 1000 residues indicates that α-helical rods may form (Cohen and Parry, Trends Biochem, Sci. Vol. 77, pp. 245-248 (1986); there is a scarcity of proline residues and, there are a number of regions containing heptad repeats (apolar-X-X-apolar-X-X-X). Interestingly, in exon 9A, the deleted form of exon 9, two heptad repeat regions are reconnected in the proper heptad repeat frame, deleting the intervening peptide region. After the first 1000 residues, the high proline content of the remainder of the peptide suggests a compact rather than a rod-like structure.
The prominent feature of the second 1000 residues is a 20 amino acid repeat that is iterated seven times with semiregular spacing (Table 4). The intervening sequences between the seven repeat regions contained 114, 116, 151, 205, 107, and 58 amino acids, respectively. Finally, residues 2200-24000 contain a 200 amino acid basic domain.
TABLE IV
Seven Different Versions of the 20-Amino Acid Repeat
Consensus:
F
*
V
E
*
T
P
*
C
F
S
R
*
S
S
L
S
S
L
S
(SEQ ID NO: 147)
1262:
Y
C
V
E
D
T
P
I
C
F
S
R
C
S
S
L
S
S
L
S
(SEQ ID NO: 148)
1376:
H
T
V
Q
E
T
P
L
M
F
S
R
C
T
S
V
S
S
L
D
(SEQ ID NO: 149)
1492:
F
A
T
E
S
T
P
D
G
F
S
C
S
S
S
L
S
A
L
S
(SEQ ID NO: 150)
1643:
Y
C
V
E
G
T
P
I
N
F
S
T
A
T
S
L
S
D
L
T
(SEQ ID NO: 151)
1848:
T
P
I
E
G
T
P
Y
C
F
S
R
N
D
S
L
S
S
L
D
(SEQ ID NO: 152)
1953:
F
A
I
E
N
T
P
V
C
P
S
H
N
S
S
L
S
S
L
S
(SEQ ID NO: 153)
2013:
R
H
V
E
D
T
P
V
C
F
S
R
N
S
S
L
S
S
L
S
(SEQ ID NO: 154)
Numbers denote the first amino acid of each repeat. The consensus sequence at the top reflects a majority amino acid at a given position. In the consensus sequence, “*” indicates “Xaa.”
Vogelstein, Bert, Kinzler, Kenneth W., Markham, Alexander Fred, Groden, Joanna, White, Raymond, Anand, Rakesh, Albertsen, Hans, Carlson, Mary, Hedge, Philip, Joslyn, Geoff, Nakumura, Yusuke, Thliveris, Andrew
Patent | Priority | Assignee | Title |
10155980, | Aug 15 2016 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods for detecting rare sequence variants |
10724088, | Aug 15 2016 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods for detecting rare sequence variants |
10752942, | Oct 09 2015 | ACCURAGEN HOLDINGS LIMITED | Methods and compositions for enrichment of amplification products |
10767222, | Dec 11 2013 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods for detecting rare sequence variants |
11203782, | Mar 29 2018 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods comprising asymmetric barcoding |
11286519, | Dec 11 2013 | ACCURAGEN HOLDINGS LIMITED | Methods and compositions for enrichment of amplification products |
11427866, | May 16 2016 | ACCURAGEN HOLDINGS LIMITED | Method of improved sequencing by strand identification |
11578359, | Oct 09 2015 | ACCURAGEN HOLDINGS LIMITED | Methods and compositions for enrichment of amplification products |
11597973, | Dec 11 2013 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods for detecting rare sequence variants |
11643683, | Aug 15 2016 | ACCURAGEN HOLDINGS LIMITED | Compositions and methods for detecting rare sequence variants |
11859246, | Dec 11 2013 | ACCURAGEN HOLDINGS LIMITED | Methods and compositions for enrichment of amplification products |
Patent | Priority | Assignee | Title |
5098823, | Aug 12 1987 | IMPERIAL CANCER RESEARCH TECHNOLOGY, LTD , A CORP OF THE UNITED KINGDOM | Chromosome-specific nucleic acid probe for familial Polyposis coli |
5137806, | Dec 11 1989 | AstraZeneca UK Limited | Methods and compositions for the detection of sequences in selected DNA molecules |
5244801, | Dec 28 1989 | Process for producing monoclonal antibodies which specifically bind to benign colonic adenomatous polyps of human origin and hybridomas and monoclonal antibodies produced by said process | |
WO8901481, |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Nov 21 2000 | Zeneca Limited | Syngenta Limited | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 022315 | /0519 | |
Oct 24 2001 | The Johns Hopkins University | (assignment on the face of the patent) | / | |||
Oct 24 2001 | Astrazeneca United Kingdom, Ltd. | (assignment on the face of the patent) | / | |||
Oct 24 2001 | Cancer Institute, Japanese, Foundation for Cancer Research | (assignment on the face of the patent) | / | |||
Oct 24 2001 | The University of Utah | (assignment on the face of the patent) | / | |||
Feb 26 2009 | Syngenta Limited | ASTRAZENECA UK LTD | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 022315 | /0560 |
Date | Maintenance Fee Events |
Date | Maintenance Schedule |
Oct 27 2012 | 4 years fee payment window open |
Apr 27 2013 | 6 months grace period start (w surcharge) |
Oct 27 2013 | patent expiry (for year 4) |
Oct 27 2015 | 2 years to revive unintentionally abandoned end. (for year 4) |
Oct 27 2016 | 8 years fee payment window open |
Apr 27 2017 | 6 months grace period start (w surcharge) |
Oct 27 2017 | patent expiry (for year 8) |
Oct 27 2019 | 2 years to revive unintentionally abandoned end. (for year 8) |
Oct 27 2020 | 12 years fee payment window open |
Apr 27 2021 | 6 months grace period start (w surcharge) |
Oct 27 2021 | patent expiry (for year 12) |
Oct 27 2023 | 2 years to revive unintentionally abandoned end. (for year 12) |