Provided herein are adeno-associated virus (aav) compositions that can express a phenylalanine hydroxylase (pah) polypeptide in a cell, thereby restoring the pah gene function. Also provided are methods of use of the aav compositions, and packaging systems for making the aav compositions.

Patent
   11951183
Priority
Feb 01 2018
Filed
Mar 19 2020
Issued
Apr 09 2024
Expiry
Jul 06 2041

TERM.DISCL.
Extension
886 days
Assg.orig
Entity
Small
0
171
currently ok
1. A packaging system for preparation of a replication-defective aav comprising:
(a) a first nucleotide sequence comprising a first transcriptional regulatory element operably linked to a nucleotide sequence encoding one or more aav Rep proteins;
(b) a second nucleotide sequence comprising a second transcriptional regulatory element operably linked to a nucleotide sequence encoding an aav capsid protein; and
(c) a third nucleotide sequence comprising a transfer genome comprising, from 5′ to 3′:
(i) an aav2 5′ inverted terminal repeat (ITR) nucleotide sequence;
(ii) a transcriptional regulatory element comprising, from 5′ to 3′, a human hepatic control region 1 (HCR1) and a human al-antitrypsin (hAAT) promoter;
(iii) a silently altered pah coding sequence having at least 80% sequence identity to the nucleotide sequence of SEQ ID NO: 71, wherein the silently altered pah coding sequence encodes the amino acid sequence set forth in SEQ ID NO: 23; and
(iv) an aav2 3′ ITR nucleotide sequence.
2. A method for producing a replication-defective aav, the method comprising introducing the packaging system of claim 1 into a cell under conditions operative for producing the aav.
3. The packaging system of claim 1, wherein the packaging system comprises a first vector comprising the first nucleotide sequence and the second nucleotide sequence, and a second vector comprising the third nucleotide sequence.
4. The packaging system of claim 3, further comprising a third vector, wherein the third vector is a helper virus vector.
5. The packaging system of claim 4, wherein the helper virus vector is an independent third vector.
6. The packaging system of claim 4, wherein the third vector comprises genes encoding helper virus proteins.
7. The packaging system of claim 4, wherein the helper virus is selected from the group consisting of adenovirus, herpes virus, vaccinia virus, and cytomegalovirus (CMV).
8. The packaging system of claim 7, wherein the helper virus is adenovirus.
9. The packaging system of claim 8, wherein the adenovirus genome comprises one or more adenovirus RNA genes selected from the group consisting of E1, E2, E4 and VA.
10. The packaging system of claim 7, wherein the helper virus is herpes simplex virus (HSV).
11. The packaging system of claim 10, wherein the HSV genome comprises one or more HSV genes selected from the group consisting of UL5/8/52, ICPO, 1CP4, ICP22 and UL30/UL42.
12. The packaging system of claim 4, wherein the first vector and the third vector are contained within a first transfecting plasmid.
13. The packaging system of claim 4, wherein the nucleotide sequences of the second vector and the third vector are contained within a second transfecting plasmid.
14. The packaging system of claim 4, wherein the nucleotide sequences of the first vector and the third vector are cloned into a recombinant helper virus.
15. The packaging system of claim 4, wherein the nucleotide sequences of the second vector and the third vector are cloned into a recombinant helper virus.
16. The packaging system of claim 1, wherein the Rep nucleotide sequence encodes an aav2 Rep protein.
17. The packaging system of claim 16, wherein the aav2 Rep protein is Rep 78/68 or Rep 68/52.
18. The packaging system of claim 16, wherein the aav2 Rep protein comprises an amino acid sequence having at least 70% sequence identity to the amino acid sequence of SEQ ID NO: 22.

This application is a division of U.S. patent application Ser. No. 16/369,879, now U.S. Pat. No. 10,610,606, filed Mar. 29, 2019, which is a continuation of International Patent Application No. PCT/US2019/016351, filed Feb. 1, 2019, which claims priority to U.S. Provisional Patent Application Ser. No. 62/625,150, filed Feb. 1, 2018, the entire disclosures of which are hereby incorporated herein by reference.

Phenylketonuria (PKU) is an autosomal recessive genetic disorder where the majority of cases are caused by mutations in the phenylalanine hydroxylase (PAH) gene. The PAH gene encodes a hepatic enzyme that catalyzes the hydroxylation of L-phenylalanine (Phe) to L-tyrosine (Tyr) upon multimerization. Reduction or loss of PAH activity leads to phenylalanine accumulation and its conversion into phenylpyruvate (also known as phenylketone). This abnormality in phenylalanine metabolism impairs neuronal maturation and the synthesis of myelin, resulting in mental retardation, seizures and other serious medical problems.

Currently, there is no cure for PKU. The standard of care is diet management by minimizing foods that contain high amounts of phenylalanine. Dietary management from birth with a low phenylalanine formula largely prevents the development of the intellectual disability of the disorder. However, even on a low-phenylalanine diet, children still suffer from growth retardation, and adults often have osteoporosis and vitamin deficiencies. Moreover, adherence to life-long dietary treatment is difficult, particularly once children reach school age.

New treatment strategies have recently emerged, including large neutral amino acid (LNAA) supplementation, cofactor tetrahydrobiopterin therapy, enzyme replacement therapy, and genetically modified probiotic therapy. However, these strategies suffer from shortcomings. The LNAA supplementation is suitable only for adults not adhering to a low Phe diet. The cofactor tetrahydrobiopterin can only be used in some mild forms of PKU. Enzyme replacement by administration of a substitute for PAH, e.g., phenylalanine ammonia-lyase (PAL), can lead to immune responses that reduce the efficacy and/or cause side effects. As to genetically modified probiotic therapy, the pathogenicity of PAL-expressing E. coli has been a concern.

Gene therapy provides a unique opportunity to cure PKU. Retroviral vectors, including lentiviral vectors, are capable of integrating nucleic acids into host cell genomes, raising safety concerns due to their non-targeted insertion into the genome. For example, there is a risk of the vector disrupting a tumor suppressor gene or activating an oncogene, thereby causing a malignancy. Indeed, in a clinical trial for treating X-linked severe combined immunodeficiency (SCID) by transducing CD34+ bone marrow precursors with a gammaretroviral vector, four out of ten patients developed leukemia (Hacein-Bey-Abina et al., J Clin Invest. (2008) 118(9):3132-42). Non-integrating vectors, on the other hand, often suffer insufficient expression level or inadequate duration of expression in vivo.

Accordingly, there is a need in the art for improved gene therapy compositions and methods that can efficiently and safely restore PAH gene function in PKU patients.

Provided herein are adeno-associated virus (AAV) compositions that can restore PAH gene function in cells, and methods for using the same to treat diseases associated with reduction of PAH gene function (e.g., PKU). Also provided are packaging systems for making the adeno-associated virus compositions.

Accordingly, in one aspect, the instant disclosure provides a method for expressing a PAH polypeptide in a cell, the method comprising transducing the cell with a replication-defective adeno-associated virus (AAV) comprising:

In certain embodiments, the cell is a hepatocyte, a renal cell, or a cell in the brain, pituitary gland, adrenal gland, pancreas, urinary bladder, gallbladder, colon, small intestine, or breast. In certain embodiments, the cell is in a mammalian subject and the AAV is administered to the subject in an amount effective to transduce the cell in the subject.

In another aspect, the instant disclosure provides a method for treating a subject having a disease or disorder associated with a PAH gene mutation, the method comprising administering to the subject an effective amount of a replication-defective AAV comprising:

In certain embodiments, the disease or disorder is phenylketonuria. In certain embodiments, the subject is a human subject.

In another aspect, the instant disclosure provides a replication-defective adeno-associated virus (AAV) comprising:

The following embodiments apply to each of the foregoing aspects.

In certain embodiments, the PAH coding sequence encodes an amino acid sequence set forth in SEQ ID NO: 23. In certain embodiments, the PAH coding sequence comprises the nucleotide sequence set forth in SEQ ID NO: 24. In certain embodiments, the PAH coding sequence is silently altered. In certain embodiments, the PAH coding sequence comprises the nucleotide sequence set forth in SEQ ID NO: 25.

In certain embodiments, the transcriptional regulatory element is capable of mediating transcription in a hepatocyte, a renal cell, or a cell in the brain, pituitary gland, adrenal gland, pancreas, urinary bladder, gallbladder, colon, small intestine, or breast. In certain embodiments, the transcriptional regulatory element is capable of mediating transcription in a hepatocyte or renal cell. In certain embodiments, the transcriptional regulatory element comprises one of more of the elements selected from the group consisting of a CAG promoter, a human EF-1α promoter, a human hepatic control region 1 (HCR1), a human al-antitrypsin (hAAT) promoter, a hepatic specific regulatory module of the hAAT promoter, an SV40 intron, and a minute virus of mouse (MVM) intron. In certain embodiments, the transcriptional regulatory element comprises a nucleotide sequence at least 90% identical to a sequence selected from the group consisting of SEQ ID NOs: 28-30 and 32-41. In certain embodiments, the transcriptional regulatory element comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-30 and 32-41. In certain embodiments, the transcriptional regulatory element comprises from 5′ to 3′ the nucleotide sequences set forth in SEQ ID NOs: 29, 30, and 31. In certain embodiments, the transcriptional regulatory element comprises the nucleotide sequences set forth in SEQ ID NO: 32.

In certain embodiments, the transfer genome further comprises an intron operably linked to the PAH coding sequence. In certain embodiments, the intron comprises a nucleotide sequence at least 90% identical to the sequence set forth in SEQ ID NO: 31 or 35. In certain embodiments, the intron comprises the nucleotide sequence set forth in SEQ ID NO: 31 or 35. In certain embodiments, the transfer genome comprises from 5′ to 3′: a non-coding exon, the intron, and the PAH coding sequence.

In certain embodiments, the transfer genome further comprises a polyadenylation sequence 3′ to the PAH coding sequence. In certain embodiments, the polyadenylation sequence is an exogenous polyadenylation sequence. In certain embodiments, the exogenous polyadenylation sequence is an SV40 polyadenylation sequence. In certain embodiments, the SV40 polyadenylation sequence comprises a sequence selected from the group consisting of SEQ ID NOs: 42, 43, and 45.

In certain embodiments, the transfer genome comprises a sequence selected from the group consisting of SEQ ID NOs: 46-50, 61, 64, 67, 74, 76, 78, 80, 82, 84, 86, and 89.

In certain embodiments, the transfer genome further comprises a 5′ inverted terminal repeat (5′ ITR) nucleotide sequence 5′ of the genome, and a 3′ inverted terminal repeat (3′ ITR) nucleotide sequence 3′ of the genome. In certain embodiments, the 5′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 18, and the 3′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 19. In certain embodiments, the 5′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 20, and the 3′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 21. In certain embodiments, the 5′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 26, and the 3′ ITR nucleotide sequence has at least 95% sequence identity to SEQ ID NO: 27.

In certain embodiments, the transfer genome comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 51-55, 62, 65, 68, 75, 77, 79, 81, 83, 85, 87, and 90. In certain embodiments, the transfer genome consists of a nucleotide sequence selected from the group consisting of SEQ ID NOs: 51-55, 62, 65, 68, 75, 77, 79, 81, 83, 85, 87, and 90. In certain embodiments, the transfer genome consists of the nucleotide sequence set forth in SEQ ID NO: 52.

In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 95% sequence identity with the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 6, 7, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G.

In certain embodiments, (a) the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G, and the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G;

In certain embodiments, the capsid protein comprises the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 6, 7, 10, 11, 12, 13, 15, 16, or 17.

In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 95% sequence identity with the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 151 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 160 of SEQ ID NO: 2 is D; the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G.

In certain embodiments, (a) the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G, and the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G;

In certain embodiments, the capsid protein comprises the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, or 17.

In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 95% sequence identity with the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 2 of SEQ ID NO: 2 is T; the amino acid in the capsid protein corresponding to amino acid 65 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 68 of SEQ ID NO: 2 is V; the amino acid in the capsid protein corresponding to amino acid 77 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 119 of SEQ ID NO: 2 is L; the amino acid in the capsid protein corresponding to amino acid 151 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 160 of SEQ ID NO: 2 is D; the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G.

In certain embodiments, (a) the amino acid in the capsid protein corresponding to amino acid 2 of SEQ ID NO: 2 is T, and the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q;

In certain embodiments, the capsid protein comprises the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17.

In another aspect, the instant disclosure provides a pharmaceutical composition comprising an AAV disclosed herein.

In another aspect, the instant disclosure provides a packaging system for recombinant preparation of an AAV, wherein the packaging system comprises:

In certain embodiments, the packaging system comprises a first vector comprising the Rep nucleotide sequence and the Cap nucleotide sequence, and a second vector comprising the transfer genome. In certain embodiments, the Rep nucleotide sequence encodes an AAV2 Rep protein. In certain embodiments, the AAV2 Rep protein is 78/68 or Rep 68/52. In certain embodiments, the AAV2 Rep protein comprises an amino acid sequence having a minimum percent sequence identity to the AAV2 Rep amino acid sequence of SEQ ID NO: 22, wherein the minimum percent sequence identity is at least 70% across the length of the amino acid sequence encoding the AAV2 Rep protein.

In certain embodiments, the packaging system further comprises a third vector, wherein the third vector is a helper virus vector. In certain embodiments, the helper virus vector is an independent third vector. In certain embodiments, the helper virus vector is integral with the first vector. In certain embodiments, the helper virus vector is integral with the second vector. In certain embodiments, the third vector comprises genes encoding helper virus proteins.

In certain embodiments, the helper virus is selected from the group consisting of adenovirus, herpes virus, vaccinia virus, and cytomegalovirus (CMV). In certain embodiments, the helper virus is adenovirus. In certain embodiments, the adenovirus genome comprises one or more adenovirus RNA genes selected from the group consisting of E1, E2, E4 and VA. In certain embodiments, the helper virus is herpes simplex virus (HSV). In certain embodiments, the HSV genome comprises one or more of HSV genes selected from the group consisting of UL5/8/52, ICPO, ICP4, ICP22 and UL30/UL42.

In certain embodiments, the first vector and the third vector are contained within a first transfecting plasmid. In certain embodiments, the nucleotides of the second vector and the third vector are contained within a second transfecting plasmid. In certain embodiments, the nucleotides of the first vector and the third vector are cloned into a recombinant helper virus. In certain embodiments, the nucleotides of the second vector and the third vector are cloned into a recombinant helper virus.

In another aspect, the instant disclosure provides a method for recombinant preparation of an AAV, the method comprising introducing a packaging system as described herein into a cell under conditions operative for enclosing the transfer genome or the transfer genome in the capsid to form the AAV.

FIGS. 1A, 1B, 1C, 1D, and 1E are vector maps of the pHMI-hPAH-TC-004, pHMI-hPAH-TC-025, pHMI-hPAH-TC-010, pHMI-hPAH-TC-011, and pHMI-hPAH-TC-012 vectors, respectively.

FIG. 2 is an image of Western blot showing the expression of human PAH from the pCOH-WT-PAH (“WT PAH”), pCOH—CO-PAH (“CO PAH pCOH”), and pHMI-CO-PAH (“CO PAH pHMI”) vectors. 5×105 HEK 293 cells were transfected with 1 μg of vector. Lysate of the cells was collected 48 hours after transfection. The expression of human PAH was detected by Western blotting with an anti-PAH antibody (Sigma HPA031642). The amount of GAPDH protein as detected by an anti-GAPDH antibody (Millipore MAB 374) was shown as a loading control.

FIGS. 3A and 3B are graphs showing the Phe level in the serum of two pah−/− mice (“Mouse H1” and “Mouse H5”) each administered with 5×1013 vector genomes of the rAAV-CBA-mPAH vector packaged in an AAVHSC capsid per kg of body weight intravenously via the tail vein. Serum samples were collected in a time course. The Phe levels were measured with a BioAssay Systems ELISA kit EPHE-100 (FIG. 3A) or mass spectrometry (FIG. 3B).

FIG. 4 is a graph and a table showing the numbers of vector genomes per 106 cells detected in major organs. The rAAV-CBA-mPAH vector packaged in an AAVHSC capsid was injected to pah−/− mice intravenously via the tail vein at a dose of 5×1013 vector genomes per kg of body weight. Organs of the mice were collected 4 weeks after the administration. The numbers of vector genomes per 106 cells were measured by the following method: (1) the weight/volume concentration of the vector genome in a sample was measured by Taqman PCR using a standard curve generated with serial dilutions of the vector plasmid; (2) the mass of a single vector genome was calculated based on the sequence of the vector; (3) the number/volume concentration of the vector genome in the sample was calculated; (4) the weight/volume concentration of genomic DNA in the same sample was measured by Taqman PCR of the apolipoprotein B gene using a standard curve generated with serial dilutions of calculated amounts of genomic DNA isolated from mouse tissues; (5) the number/volume concentration of cell genome in the sample was calculated based on copies of ApoB; and (6) the number of vector genomes per 106 cells was calculated by dividing the number/volume concentration of the vector genome by the number/volume concentration of the cell genome and multiplying the result by 106.

FIGS. 5A and 5B are graphs showing the aspartate aminotransferase (AST) and alanine aminotransferase (ALT) levels in the serum of pah−/− mice administered with the rAAV-CBA-mPAH vector. The rAAV-CBA-mPAH vector packaged in an AAVHSC capsid was injected to pah−/− mice intravenously via the tail vein at a dose of 5×1013 vector genomes per kg of body weight. Serum samples were collected 4 weeks after the administration. The levels of AST (FIG. 5A) and ALT (FIG. 5B) were measured by ELISA using the Sigma MAK055 and Sigma MAK052 kits, respectively.

FIGS. 6A-6H are graphs showing the levels of phenylalanine (FIGS. 6A, 6C, 6E, and 6G) or tyrosine (FIGS. 6B, 6D, 6F, and 6H) in the serum of male (FIGS. 6A, 6B, 6E, and 6F) or female (FIGS. 6C, 6D, 6G, and 6H) mice administered with the indicated doses of the pHMI-hPAH-TC-004, pHMI-hPAH-TC-025, pHMI-hPAH-TC-010, pHMI-hPAH-TC-011, or pHMI-hPAH-TC-012 vector.

FIGS. 6I-6J are graphs showing the long-term efficacy on levels of phenylalanine in the serum of male (FIG. 6I) or female (FIG. 6J) mice administered the indicated doses of the pHMI-hPAH-TC-025 vector.

FIGS. 7A-7D are graphs showing the levels of phenylalanine (FIGS. 7A and 7C) or tyrosine (FIGS. 7B and 7D) in the serum of male (FIGS. 7A and 7B) or female (FIGS. 7C and 7D) mice administered with the indicated doses of the pHMI-hPAH-TC-025 vector.

FIG. 8A, 8B, 8C are vector maps of pHMI-hPAH-TC-009, pHMI-hPAH-TC-013 and pHMI-hPAH-TC-017 vectors, respectively.

FIG. 9A-9B depict the quantification of Western blots of human PAH expression, from the indicated AAV vectors, in Huh7 cells (FIG. 9A) and HEK293 cells (FIG. 9B).

FIG. 10A-10C are graphs showing serum phenylalanine levels in mice that have been administered the indicated AAV vectors. FIG. 10A is a graph showing serum Phe levels over time of male Pah−/− PAHemu2 mice. FIG. 10B is a graph showing serum Phe levels over time of female Pah−/− PAHemu2 mice. FIG. 10C is a graph showing the average baseline serum Phe levels of the male and female mice in the study (55 mice per group; **** indicates p<0.05).

FIGS. 11A, 11B, 11C, 11D, 11E, and 11F are vector maps of pHMI-hPAH-TC-018, pHMI-hPAH-TC-019, pHMI-hPAH-TC-020, pHMI-hPAH-TC-021, pHMI-hPAH-TC-022, and pHMI-hPAH-TC-023 vectors, respectively.

FIG. 12 depicts the quantification of Western blots of human PAH expression from HEK293 cells transfected with the indicated AAV vectors under the control of a CBA promoter.

FIG. 13 is a graph showing serum phenylalanine levels over time of male Pah−/− PAHemu2 mice administered the indicated AAV vectors.

The instant disclosure provided adeno-associated virus (AAV) compositions that can restore PAH gene function in a cell. Also provide are packaging systems for making the adeno-associated virus compositions.

As used herein, the term “replication-defective adeno-associated virus” refers to an AAV comprising a genome lacking Rep and Cap genes.

As used herein, the term “PAH gene” refers to the phenylalanine hydroxylase gene. The human PAH gene is identified by Entrez Gene ID 5053. An exemplary nucleotide sequence of a PAH mRNA is provided as SEQ ID NO: 24. An exemplary amino acid sequence of a PAH polypeptide is provided as SEQ ID NO: 23.

As used herein, the term “transfer genome” refers to a recombinant AAV genome comprising a coding sequence operably linked to an exogenous transcriptional regulatory element that mediates expression of the coding sequence when the transfer genome is introduced into a cell. In certain embodiments, the transfer genome does not integrate in the chromosomal DNA of the cell. The skilled artisan will appreciate that the portion of a transfer genome comprising the transcriptional regulatory element operably linked to a PAH coding sequence can be in the sense or antisense orientation relative to direction of transcription of the PAH coding sequence.

As used herein, the term “Clade F capsid protein” refers to an AAV VP1, VP2, or VP3 capsid protein that has at least 90% identity with the VP1, VP2, or VP3 amino acid sequences set forth, respectively, in amino acids 1-736, 138-736, and 203-736 of SEQ ID NO: 1 herein.

As used herein, the identity between two nucleotide sequences or between two amino acid sequences is determined by the number of identical nucleotides or amino acids in alignment divided by the full length of the longer nucleotide or amino acid sequence.

As used herein, the term “a disease or disorder associated with a PAH gene mutation” refers to any disease or disorder caused by, exacerbated by, or genetically linked with mutation of a PAH gene. In certain embodiments, the disease or disorder associated with a PAH gene mutation is phenylketonuria (PKU).

As used herein, the term “coding sequence” refers to the portion of a complementary DNA (cDNA) that encodes a polypeptide, starting at the start codon and ending at the stop codon. A gene may have one or more coding sequences due to alternative splicing, alternative translation initiation, and variation within the population. A coding sequence may either be wild-type or codon-altered. An exemplary wild-type PAH coding sequence is set forth in SEQ ID NO: 24.

As used herein, the term “silently altered” refers to alteration of a coding sequence or a stuffer-inserted coding sequence of a gene (e.g., by nucleotide substitution) without changing the amino acid sequence of the polypeptide encoded by the coding sequence or stuffer-inserted coding sequence. Such silent alteration is advantageous in that it may increase the translation efficiency of a coding sequence.

In the instant disclosure, nucleotide positions in a PAH gene are specified relative to the first nucleotide of the start codon. The first nucleotide of a start codon is position 1; the nucleotides 5′ to the first nucleotide of the start codon have negative numbers; the nucleotides 3′ to the first nucleotide of the start codon have positive numbers. An exemplary nucleotide 1 of the human PAH gene is nucleotide 5,473 of the NCBI Reference Sequence: NG_008690.1, and an exemplary nucleotide 3 of the human PAH gene is nucleotide 5,475 of the NCBI Reference Sequence: NG_008690.1. The nucleotide adjacently 5′ to the start codon is nucleotide −1.

In the instant disclosure, exons and introns in a PAH gene are specified relative to the exon encompassing the first nucleotide of the start codon, which is nucleotide 5473 of the NCBI Reference Sequence: NG_008690.1. The exon encompassing the first nucleotide of the start codon is exon 1. Exons 3′ to exon 1 are from 5′ to 3′: exon 2, exon 3, etc. Introns 3′ to exon 1 are from 5′ to 3′: intron 1, intron 2, etc. Accordingly, the PAH gene comprises from 5′ to 3′: exon 1, intron 1, exon 2, intron 2, exon 3, etc. An exemplary exon 1 of the human PAH gene is nucleotides 5001-5532 of the NCBI Reference Sequence: NG_008690.1. An exemplary intron 1 of the human PAH gene is nucleotides 5533-9704 of the NCBI Reference Sequence: NG_008690.1.

As used herein, the term “transcriptional regulatory element” or “TRE” refers to a cis-acting nucleotide sequence, for example, a DNA sequence, that regulates (e.g., controls, increases, or reduces) transcription of an operably linked nucleotide sequence by an RNA polymerase to form an RNA molecule. A TRE relies on one or more trans-acting molecules, such as transcription factors, to regulate transcription. Thus, one TRE may regulate transcription in different ways when it is in contact with different trans-acting molecules, for example, when it is in different types of cells. A TRE may comprise one or more promoter elements and/or enhancer elements. A skilled artisan would appreciate that the promoter and enhancer elements in a gene may be close in location, and the term “promoter” may refer to a sequence comprising a promoter element and an enhancer element. Thus, the term “promoter” does not exclude an enhancer element in the sequence. The promoter and enhancer elements do not need to be derived from the same gene or species, and the sequence of each promoter or enhancer element may be either identical or substantially identical to the corresponding endogenous sequence in the genome.

As used herein, the term “operably linked” is used to describe the connection between a TRE and a coding sequence to be transcribed. Typically, gene expression is placed under the control of a TRE comprising one or more promoter and/or enhancer elements. The coding sequence is “operably linked” to the TRE if the transcription of the coding sequence is controlled or influenced by the TRE. The promoter and enhancer elements of the TRE may be in any orientation and/or distance from the coding sequence, as long as the desired transcriptional activity is obtained. In certain embodiments, the TRE is upstream from the coding sequence.

As used herein, the term “polyadenylation sequence” refers to a DNA sequence that when transcribed into RNA constitutes a polyadenylation signal sequence. The polyadenylation sequence can be native (e.g., from the PAH gene) or exogenous. The exogenous polyadenylation sequence can be a mammalian or a viral polyadenylation sequence (e.g., an SV40 polyadenylation sequence).

As used herein, “exogenous polyadenylation sequence” refers to a polyadenylation sequence not identical or substantially identical to the endogenous polyadenylation sequence of a PAH gene (e.g., human PAH gene). In certain embodiments, an exogenous polyadenylation sequence is a polyadenylation sequence of a non-PAH gene in the same species (e.g., human). In certain embodiments, an exogenous polyadenylation sequence is a polyadenylation sequence of a different species (e.g., a virus).

As used herein, the term “effective amount” in the context of the administration of an AAV to a subject refers to the amount of the AAV that achieves a desired prophylactic or therapeutic effect.

In one aspect, provided herein are novel replication-defective AAV compositions useful for expressing PAH polypeptide in cells with reduced or otherwise defective PAH gene function. In certain embodiments, the AAV disclosed herein comprise: an AAV capsid comprising an AAV Clade F capsid protein; and a transfer genome comprising a transcriptional regulatory element operably linked to a PAH coding sequence, allowing for extrachromosomal expression of PAH.

Any AAV Clade F capsid protein or derivative thereof can be used in the AAV compositions disclosed herein. For example, in certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17, wherein: the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G, and the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H, the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A, and the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C. In certain embodiments, the AAV Clade F capsid protein comprises the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 6, 7, 10, 11, 12, 13, 15, 16, or 17.

For example, in certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17, wherein: the amino acid in the capsid protein corresponding to amino acid 151 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 160 of SEQ ID NO: 2 is D; the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G, and the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H, the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A, and the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C. In certain embodiments, the AAV Clade F capsid protein comprises the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, or 17.

For example, in certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the AAV Clade F capsid protein comprises an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17, wherein: the amino acid in the capsid protein corresponding to amino acid 2 of SEQ ID NO: 2 is T; the amino acid in the capsid protein corresponding to amino acid 65 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 68 of SEQ ID NO: 2 is V; the amino acid in the capsid protein corresponding to amino acid 77 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 119 of SEQ ID NO: 2 is L; the amino acid in the capsid protein corresponding to amino acid 151 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 160 of SEQ ID NO: 2 is D; the amino acid in the capsid protein corresponding to amino acid 206 of SEQ ID NO: 2 is C; the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H; the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q; the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A; the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N; the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S; the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I; the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 590 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G or Y; the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M; the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R; the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K; the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C; or, the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 2 of SEQ ID NO: 2 is T, and the amino acid in the capsid protein corresponding to amino acid 312 of SEQ ID NO: 2 is Q. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 65 of SEQ ID NO: 2 is I, and the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is Y. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 77 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 690 of SEQ ID NO: 2 is K. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 119 of SEQ ID NO: 2 is L, and the amino acid in the capsid protein corresponding to amino acid 468 of SEQ ID NO: 2 is S. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 626 of SEQ ID NO: 2 is G, and the amino acid in the capsid protein corresponding to amino acid 718 of SEQ ID NO: 2 is G. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 296 of SEQ ID NO: 2 is H, the amino acid in the capsid protein corresponding to amino acid 464 of SEQ ID NO: 2 is N, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 681 of SEQ ID NO: 2 is M. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 687 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 346 of SEQ ID NO: 2 is A, and the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R. In certain embodiments, the amino acid in the capsid protein corresponding to amino acid 501 of SEQ ID NO: 2 is I, the amino acid in the capsid protein corresponding to amino acid 505 of SEQ ID NO: 2 is R, and the amino acid in the capsid protein corresponding to amino acid 706 of SEQ ID NO: 2 is C. In certain embodiments, the AAV Clade F capsid protein comprises the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17.

In certain embodiments, the AAV capsid comprises two or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 6, 7, 10, 11, 12, 13, 15, 16, or 17; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, or 17; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17. In certain embodiments, the AAV capsid comprises: (a) a Clade F capsid protein having an amino acid sequence consisting of amino acids 203-736 of SEQ ID NO: 2, 3, 4, 6, 7, 10, 11, 12, 13, 15, 16, or 17; (b) a Clade F capsid protein having an amino acid sequence consisting of amino acids 138-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 9, 10, 11, 12, 13, 15, 16, or 17; and (c) a Clade F capsid protein having an amino acid sequence consisting of amino acids 1-736 of SEQ ID NO: 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 15, 16, or 17.

In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 203-736 of SEQ ID NO: 8; (b) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 8′7%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 138-736 of SEQ ID NO: 8; and (c) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 8′7%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 1-736 of SEQ ID NO: 8. In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 8; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 8; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 8. In certain embodiments, the AAV capsid comprises two or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 8; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 8; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 8. In certain embodiments, the AAV capsid comprises: (a) a Clade F capsid protein having an amino acid sequence consisting of amino acids 203-736 of SEQ ID NO: 8; (b) a Clade F capsid protein having an amino acid sequence consisting of amino acids 138-736 of SEQ ID NO: 8; and (c) a Clade F capsid protein having an amino acid sequence consisting of amino acids 1-736 of SEQ ID NO: 8.

In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 203-736 of SEQ ID NO: 11; (b) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 138-736 of SEQ ID NO: 11; and (c) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 1-736 of SEQ ID NO: 11. In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 11; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 11; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 11. In certain embodiments, the AAV capsid comprises two or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 11; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 11; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 11. In certain embodiments, the AAV capsid comprises: (a) a Clade F capsid protein having an amino acid sequence consisting of amino acids 203-736 of SEQ ID NO: 11; (b) a Clade F capsid protein having an amino acid sequence consisting of amino acids 138-736 of SEQ ID NO: 11; and (c) a Clade F capsid protein having an amino acid sequence consisting of amino acids 1-736 of SEQ ID NO: 11.

In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 203-736 of SEQ ID NO: 13; (b) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 138-736 of SEQ ID NO: 13; and (c) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 1-736 of SEQ ID NO: 13. In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 13; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 13; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 13. In certain embodiments, the AAV capsid comprises two or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 13; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 13; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 13. In certain embodiments, the AAV capsid comprises: (a) a Clade F capsid protein having an amino acid sequence consisting of amino acids 203-736 of SEQ ID NO: 13; (b) a Clade F capsid protein having an amino acid sequence consisting of amino acids 138-736 of SEQ ID NO: 13; and (c) a Clade F capsid protein having an amino acid sequence consisting of amino acids 1-736 of SEQ ID NO: 13.

In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 203-736 of SEQ ID NO: 16; (b) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 138-736 of SEQ ID NO: 16; and (c) a Clade F capsid protein comprising an amino acid sequence having at least 80%, 81%, 82%, 83%, 84%, 85%, 86%, 87%, 88%, 89%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity with the sequence of amino acids 1-736 of SEQ ID NO: 16. In certain embodiments, the AAV capsid comprises one or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 16. In certain embodiments, the AAV capsid comprises two or more of: (a) a Clade F capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16; (b) a Clade F capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16; and (c) a Clade F capsid protein comprising the amino acid sequence of amino acids 1-736 of SEQ ID NO: 16. In certain embodiments, the AAV capsid comprises: (a) a Clade F capsid protein having an amino acid sequence consisting of amino acids 203-736 of SEQ ID NO: 16; (b) a Clade F capsid protein having an amino acid sequence consisting of amino acids 138-736 of SEQ ID NO: 16; and (c) a Clade F capsid protein having an amino acid sequence consisting of amino acids 1-736 of SEQ ID NO: 16.

Transfer genomes useful in the AAV compositions disclosed herein generally comprise a transcriptional regulatory element (TRE) operably linked to a PAH coding sequence. In certain embodiments, the transfer genome comprises a 5′ inverted terminal repeat (5′ ITR) nucleotide sequence 5′ of the TRE and PAH coding sequence, and a 3′ inverted terminal repeat (3′ ITR) nucleotide sequence 3′ of the TRE and PAH coding sequence.

In certain embodiments, the PAH coding sequence comprises all or substantially all of a coding sequence of a PAH gene. In certain embodiments, the transfer genome comprises a nucleotide sequence encoding SEQ ID NO: 23 and can optionally further comprise an exogenous polyadenylation sequence 3′ to the PAH coding sequence. In certain embodiments, the nucleotide sequence encoding SEQ ID NO: 23 is wild-type (e.g., having the sequence set forth in SEQ ID NO: 24). In certain embodiments, the nucleotide sequence encoding SEQ ID NO: 23 is codon-altered (e.g., having the sequence set forth in SEQ ID NO: 25). In certain embodiments, the nucleotide sequence encoding SEQ ID NO: 23 is codon-altered (e.g., having the sequence set forth in SEQ ID NO: 69, 70, 71, 72, or 73).

In certain embodiments, the PAH coding sequence encodes a polypeptide comprising all or substantially all of the amino acids sequence of a PAH protein. In certain embodiments, the PAH coding sequence encodes the amino acid sequence of a wild-type PAH protein (e.g., human PAH protein). In certain embodiments, the PAH coding sequence encodes the amino acid sequence of a mutant PAH protein (e.g., human PAH protein), wherein the mutant PAH polypeptide is a functional equivalent of the wild-type PAH polypeptide, i.e., can function as a wild-type PAH polypeptide. In certain embodiments, the functionally equivalent PAH polypeptide further comprises at least one characteristic not found in the wild-type PAH polypeptide, e.g., the ability to stabilize PAH protein (e.g., dimer or tetramer), or the ability to resist protein degradation.

The transfer genome can be used to express PAH in any mammalian cells (e.g., human cells). Thus, the TRE can be active in any mammalian cells (e.g., human cells). In certain embodiments, the TRE is active in a broad range of human cells. Such TREs may comprise constitutive promoter and/or enhancer elements including cytomegalovirus (CMV) promoter/enhancer (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 58), SV40 promoter, chicken beta actin (CBA) promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 59), human elongation factor 1 alpha (EF1α) promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 40), minute virus of mouse (MVM) intron which comprises transcription factor binding sites (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 35), human phosphoglycerate kinase (PGK1) promoter, human ubiquitin C (Ubc) promoter, human beta actin promoter, human neuron-specific enolase (ENO2) promoter, human beta-glucuronidase (GUSB) promoter, a rabbit beta-globin element (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 60), and/or human Methyl-CpG Binding Protein 2 (MeCP2) promoter. Any of these TREs can be combined in any order to drive efficient transcription. For example, a transfer genome may comprise a CMV enhancer, a CBA promoter, and the splice acceptor from exon 3 of the rabbit beta-globin gene, collectively called a CAG promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 28). For example, a transfer genome may comprise a hybrid of CMV enhancer and CBA promoter followed by a splice donor and splice acceptor, collectively called a CASI promoter region (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 63).

Alternatively, the TRE may be a tissue-specific TRE, i.e., it is active in specific tissue(s) and/or organ(s). A tissue-specific TRE comprises one or more tissue-specific promoter and/or enhancer elements, and optionally one or more constitutive promoter and/or enhancer elements. A skilled artisan would appreciate that tissue-specific promoter and/or enhancer elements can be isolated from genes specifically expressed in the tissue by methods well known in the art. In certain embodiments, the TRE is liver-specific (e.g., hepatocyte-specific). Exemplary liver-specific TREs may comprise one or more elements selected from the group consisting of human albumin promoter, human transthyretin (TTR) promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 34), human APOE/C-I hepatic control region (HCR) 1 or 2 (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 29 or 37), human APOH promoter, and human SERPINA1 (hAAT) promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 30 or 38) or a hepatic specific regulatory module thereof (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 33). In certain embodiments, an hAAT promoter region comprises a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100% identical to SEQ ID NO: 66. More liver-specific promoter elements are disclosed in WO 2009/130208 and Kramer et al. (Molecular Therapy (2003) 7, 375-385), which are incorporated by reference herein in their entirety.

In certain embodiments, the TRE is kidney-specific (e.g., renal epithelial cell-specific). Exemplary kidney-specific TREs may comprise one or more elements selected from the group consisting of human nephrin promoter, human parathyroid hormone receptor promoter, human uromodulin promoter, and human SLC12A1 promoter. In certain embodiments, the TRE is brain-specific (e.g., neuron-specific, glial cell-specific, astrocyte-specific, oligodendrocyte-specific, microglia-specific and/or central nervous system-specific). Exemplary brain-specific TREs may comprise one or more elements selected from the group consisting of human glial fibrillary acidic protein (GFAP) promoter and human synapsin 1 (SYN1) promoter. More brain-specific promoter elements are disclosed in WO 2016/100575A1, which is incorporated by reference herein in its entirety.

In certain embodiments, the transfer genome comprises two or more TREs, optionally comprising at least one of the TREs disclosed above. A skilled person in the art would appreciate that any of these TREs can be combined in any order, and combinations of a constitutive TRE and a tissue-specific TRE can drive efficient and tissue-specific transcription. For example, in certain embodiments, the transfer genome comprises a human HCR1 (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 29 or 37) and a human EF-1a promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 40), optionally wherein the human HCR1 is 5′ to the human EF-1a promoter. In certain embodiments, the transfer genome comprises a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth in SEQ ID NO: 41. In certain embodiments, the transfer genome comprises a human HCR1 (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 29 or 37) and a human EF-1α promoter (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 40), optionally wherein the human HCR1 is 5′ to the human EF-1a promoter. In certain embodiments, the transfer genome comprises the sequence set forth in SEQ ID NO: 41.

Similarly, combinations of two or more tissue-specific TREs can drive efficient and tissue-specific transcription. For example, in certain embodiments, the transfer genome comprises a human HCR1 (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 29) and a hAAT promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 30), optionally wherein the human HCR1 is 5′ to the hAAT promoter. In certain embodiments, the transfer genome comprises a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth in SEQ ID NO: 32. In certain embodiments, the transfer genome comprises a human HCR1 (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 29) and a hAAT promoter (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 30), optionally wherein the human HCR1 is 5′ to the hAAT promoter. In certain embodiments, the transfer genome comprises the nucleotide sequence set forth in SEQ ID NO: 32.

In certain embodiments, the transfer genome comprises a hepatic specific regulatory module of hAAT promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 33) and a human TTR promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 34), optionally wherein the hepatic specific regulatory module is 5′ to the human TTR promoter. In certain embodiments, the transfer genome comprises a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth in SEQ ID NO: 36. In certain embodiments, the transfer genome comprises a hepatic specific regulatory module of hAAT promoter (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 33) and a human TTR promoter (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 34), optionally wherein the hepatic specific regulatory module is 5′ to the human TTR promoter. In certain embodiments, the transfer genome comprises the nucleotide sequence set forth in SEQ ID NO: 36.

In certain embodiment, the transfer genome comprises a human HCR1 (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 29 or 37) and a hAAT promoter (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 30 or 38), optionally wherein the human HCR1 is 5′ to the hAAT promoter. In certain embodiments, the transfer genome comprises a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to the sequence set forth in SEQ ID NO: 39. In certain embodiment, the transfer genome comprises a human HCR1 (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 29 or 37) and a hAAT promoter (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 30 or 38), optionally wherein the human HCR1 is 5′ to the hAAT promoter. In certain embodiments, the transfer genome comprises the nucleotide sequence set forth in SEQ ID NO: 39.

In certain embodiments, the transfer vector further comprises an intron 5′ to or inserted in the PAH coding sequence. Such introns can increase transgene expression, for example, by reducing transcriptional silencing and enhancing mRNA export from the nucleus to the cytoplasm. In certain embodiments, the transfer genome comprises from 5′ to 3′: a non-coding exon, an intron, and the PAH coding sequence. In certain embodiments, an intron sequence is inserted in the PAH coding sequence, optionally wherein the intron is inserted at an internucleotide bond that links two native exons. In certain embodiments, the intron is inserted at an internucleotide bond that links native exon 1 and exon 2.

The intron can comprise a native intron sequence of the PAH gene, an intron sequence from a different species or a different gene from the same species, and/or a synthetic intron sequence. A skilled worker will appreciate that synthetic intron sequences can be designed to mediate RNA splicing by introducing any consensus splicing motifs known in the art (e.g., in Sibley et al., (2016) Nature Reviews Genetics, 17, 407-21, which is incorporated by reference herein in its entirety). Exemplary intron sequences are provided in Lu et al. (2013) Molecular Therapy 21(5): 954-63, and Lu et al. (2017) Hum. Gene Ther. 28(1): 125-34, which are incorporated by reference herein in their entirety. In certain embodiments, the transfer genome comprises an SV40 intron (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 31) or a minute virus of mouse (MVM) intron (e.g., comprising a nucleotide sequence at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 35). In certain embodiments, the transfer genome comprises an SV40 intron (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 31) or a minute virus of mouse (MVM) intron (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 35).

In certain embodiments, the transfer genome disclosed herein further comprises a transcription terminator (e.g., a polyadenylation sequence). In certain embodiments, the transcription terminator is 3′ to the PAH coding sequence. The transcription terminator may be any sequence that effectively terminates transcription, and a skilled artisan would appreciate that such sequences can be isolated from any genes that are expressed in the cell in which transcription of the PAH coding sequence is desired. In certain embodiments, the transcription terminator comprises a polyadenylation sequence. In certain embodiments, the polyadenylation sequence is identical or substantially identical to the endogenous polyadenylation sequence of the human PAH gene. In certain embodiments, the polyadenylation sequence is an exogenous polyadenylation sequence. In certain embodiments, the polyadenylation sequence is an SV40 polyadenylation sequence (e.g., comprising the nucleotide sequence set forth in SEQ ID NO: 42, 43, or 45, or a nucleotide sequence complementary thereto). In certain embodiments, the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 43.

In certain embodiments, the transfer genome comprises from 5′ to 3′: a TRE, optionally a non-coding exon and an intron, a PAH coding sequence, and a polyadenylation sequence. In certain embodiments, the TRE has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to any one of SEQ ID NOs: 28-30 and 32-41; the intron has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 31 or 35; the PAH coding sequence has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 25; and/or the polyadenylation sequence has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to any one of SEQ ID NOs: 42, 43, and 45. In certain embodiments, the TRE comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-30 and 32-41; the intron comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 31 and 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; and/or the polyadenylation sequence comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 42, 43, and 45. In certain embodiments, the TRE comprises the sequence set forth in SEQ ID NO: 28; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; and/or the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 42. In certain embodiments, the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 29, and the sequence set forth in SEQ ID NO: 30 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 32); the intron comprises the sequence set forth in SEQ ID NO: 31; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; and/or the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 43. In certain embodiments, the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 33, and the sequence set forth in SEQ ID NO: 34 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 36); the intron comprises the sequence set forth in SEQ ID NO: 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; and/or the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45. In certain embodiments, the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 37, and the sequence set forth in SEQ ID NO: 38 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 39); the intron comprises the sequence set forth in SEQ ID NO: 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; and/or the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45. In certain embodiments, the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 37, and the sequence set forth in SEQ ID NO: 40 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 41); and/or the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45.

In certain embodiments, the transfer genome comprises a sequence at least 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 46, 47, 48, 49, 50, 61, 64, 67, 74, 76, 78, 80, 82, 84, 86, or 89. In certain embodiments, the transfer genome comprises the nucleotide sequence set forth in SEQ ID NO: 46, 47, 48, 49, 50, 61, 64, 67, 74, 76, 78, 80, 82, 84, 86, or 89. In certain embodiments, the transfer genome consists of the nucleotide sequence set forth in SEQ ID NO: 46, 47, 48, 49, 50, 61, 64, 67, 74, 76, 78, 80, 82, 84, 86, or 89. In certain embodiments, the transfer genome comprises the nucleotide sequence set forth in SEQ ID NO: 47. In certain embodiments, the transfer genome consists of the nucleotide sequence set forth in SEQ ID NO: 47.

In certain embodiments, the transfer genomes disclosed herein further comprise a 5′ inverted terminal repeat (5′ ITR) nucleotide sequence 5′ of the TRE, and a 3′ inverted terminal repeat (3′ ITR) nucleotide sequence 3′ of the PAH coding sequence. ITR sequences from any AAV serotype or variant thereof can be used in the transfer genomes disclosed herein. The 5′ and 3′ ITR can be from an AAV of the same serotype or from AAVs of different serotypes. Exemplary ITRs for use in the transfer genomes disclosed herein are set forth in SEQ ID NOs: 18-21, 26, and 27 herein.

In certain embodiments, the 5′ ITR or 3′ ITR is from AAV2. In certain embodiments, both the 5′ ITR and the 3′ ITR are from AAV2. In certain embodiments, the 5′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 18, or the 3′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 19. In certain embodiments, the 5′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 18, and the 3′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 19. In certain embodiments, the transfer genome comprises a nucleotide sequence set forth in any one of SEQ ID NOs: 46-50, 61, 64, 67, 74, 76, 78, 80, 82, 84, 86, and 89, a 5′ ITR nucleotide sequence having the sequence of SEQ ID NO: 18, and a 3′ ITR nucleotide sequence having the sequence of SEQ ID NO: 19.

In certain embodiments, the 5′ ITR or 3′ ITR are from AAV5. In certain embodiments, both the 5′ ITR and 3′ ITR are from AAV5. In certain embodiments, the 5′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 20, or the 3′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 21. In certain embodiments, the 5′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 20, and the 3′ ITR nucleotide sequence has at least 90% (e.g., at least 95%, at least 96%, at least 97%, at least 98%, at least 99%, or 100%) sequence identity to SEQ ID NO: 21. In certain embodiments, the transfer genome comprises a nucleotide sequence set forth in any one of SEQ ID NOs: 46-50, a 5′ ITR nucleotide sequence having the sequence of SEQ ID NO: 20, and a 3′ ITR nucleotide sequence having the sequence of SEQ ID NO: 21.

In certain embodiments, the 5′ ITR nucleotide sequence and the 3′ ITR nucleotide sequence are substantially complementary to each other (e.g., are complementary to each other except for mismatch at 1, 2, 3, 4, or 5 nucleotide positions in the 5′ or 3′ ITR).

In certain embodiments, the 5′ ITR or the 3′ ITR is modified to reduce or abolish resolution by Rep protein (“non-resolvable ITR”). In certain embodiments, the non-resolvable ITR comprises an insertion, deletion, or substitution in the nucleotide sequence of the terminal resolution site. Such modification allows formation of a self-complementary, double-stranded DNA genome of the AAV after the transfer genome is replicated in an infected cell. Exemplary non-resolvable ITR sequences are known in the art (see e.g., those provided in U.S. Pat. Nos. 7,790,154 and 9,783,824, which are incorporated by reference herein in their entirety). In certain embodiments, the 5′ ITR comprises a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 26. In certain embodiments, the 5′ ITR consists of a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 26. In certain embodiments, the 5′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 26. In certain embodiments, the 3′ ITR comprises a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 27. In certain embodiments, the 5′ ITR consists of a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 27. In certain embodiments, the 3′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 27. In certain embodiments, the 5′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 26, and the 3′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 27. In certain embodiments, the 5′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 26, and the 3′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 19.

In certain embodiments, the 3′ ITR is flanked by an additional nucleotide sequence derived from a wild-type AAV2 genomic sequence. In certain embodiments, the 3′ ITR is flanked by an additional 37 bp sequence derived from a wild-type AAV2 sequence that is adjacent to a wild-type AAV2 ITR. See, e.g., Savy et al., Human Gene Therapy Methods (2017) 28(5): 277-289 (which is hereby incorporated by reference herein in its entirety). In certain embodiments, the additional 37 bp sequence is internal to the 3′ ITR. In certain embodiments, the 37 bp sequence consists of the sequence set forth in SEQ ID NO: 56. In certain embodiments, the 3′ ITR comprises a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 57. In certain embodiments, the 3′ ITR comprises the nucleotide sequence set forth in SEQ ID NO: 57. In certain embodiments, the nucleotide sequence of the 3′ ITR consists of a nucleotide sequence at least 95%, 96%, 97%, 98%, or 99% identical to SEQ ID NO: 57. In certain embodiments, the nucleotide sequence of the 3′ ITR consists of the nucleotide sequence set forth in SEQ ID NO: 57.

In certain embodiments, the transfer genome comprises from 5′ to 3′: a 5′ ITR; an internal element comprising from 5′ to 3′: a TRE, optionally a non-coding exon and an intron, a PAH coding sequence, and a polyadenylation sequence, as disclosed herein; a non-resolvable ITR; a nucleotide sequence complementary to the internal element; and a 3′ ITR. Such transfer genome can form a self-complementary, double-stranded DNA genome of the AAV after infection and before replication.

In certain embodiments, the transfer genome comprises from 5′ to 3′: a 5′ ITR, a TRE, optionally a non-coding exon and an intron, a PAH coding sequence, a polyadenylation sequence, and a 3′ ITR. In certain embodiments, the 5′ ITR has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID: 18, 20, or 26; the TRE has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to any one of SEQ ID NOs: 28-30 and 32-41; the intron has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 31 or 35; the PAH coding sequence has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID NO: 25; the polyadenylation sequence has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to any one of SEQ ID NOs: 42, 43, and 45; and/or the 3′ ITR has at least 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99% sequence identity to SEQ ID: 19, 21, or 27. In certain embodiments, the 5′ ITR comprises or consists of a nucleotide sequence selected from the group consisting of SEQ ID NOs: 18, 20, and 26; the TRE comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 28-30 and 32-41; the intron comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 31 and 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; the polyadenylation sequence comprises a nucleotide sequence selected from the group consisting of SEQ ID NOs: 42, 43, and 45; and/or the 3′ ITR comprises or consists of a nucleotide sequence selected from the group consisting of SEQ ID NOs: 19, 21, and 27. In certain embodiments, the 5′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 18; the TRE comprises the sequence set forth in SEQ ID NO: 28; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 42; and/or the 3′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 19. In certain embodiments, the 5′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 26; the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 29, and the sequence set forth in SEQ ID NO: 30 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 32); the intron comprises the sequence set forth in SEQ ID NO: 31; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 43 and/or the 3′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 27. In certain embodiments, the 5′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 18; the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 33, and the sequence set forth in SEQ ID NO: 34 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 36); the intron comprises the sequence set forth in SEQ ID NO: 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45; and/or the 3′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 19. In certain embodiments, the 5′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 18; the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 37, and the sequence set forth in SEQ ID NO: 38 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 39); the intron comprises the sequence set forth in SEQ ID NO: 35; the PAH coding sequence comprises the sequence set forth in SEQ ID NO: 25; the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45; and/or the 3′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 19. In certain embodiments, the 5′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 18; the TRE comprises from 5′ to 3′ the sequence set forth in SEQ ID NO: 37, and the sequence set forth in SEQ ID NO: 40 (e.g., the TRE comprises the sequence set forth in SEQ ID NO: 41); the polyadenylation sequence comprises the sequence set forth in SEQ ID NO: 45; and/or the 3′ ITR comprises or consists of the sequence set forth in SEQ ID NO: 19.

In certain embodiments, the transfer genome comprises a sequence at least 80% (e.g., at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to the sequence set forth in SEQ ID NO: 51, 52, 53, 54, 55, 62, 65, 68, 75, 77, 79, 81, 83, 85, 87, or 90. In certain embodiments, the transfer genome comprises the sequence set forth in SEQ ID NO: 51, 52, 53, 54, 55, 62, 65, 68, 75, 77, 79, 81, 83, 85, 87, or 90. In certain embodiments, the transfer genome comprises the sequence set forth in SEQ ID NO: 52. In certain embodiments, the transfer genome consists of the sequence set forth in SEQ ID NO: 51, 52, 53, 54, 55, 62, 65, 68, 75, 77, 79, 81, 83, 85, 87, or 90. In certain embodiments, the transfer genome consists of the sequence set forth in SEQ ID NO: 52.

In certain embodiments, the replication-defective AAV comprises: (a) an AAV capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27); (b) an AAV capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27; and/or (c) an AAV capsid protein comprising the amino acid sequence of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27.

In certain embodiments, the replication-defective AAV comprises: (a) an AAV capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90; (b) an AAV capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90; and/or (c) an AAV capsid protein comprising the amino acid sequence of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90.

In another aspect, provided herein is a polynucleotide comprising a nucleic acid sequence that is at least 80% (e.g., at least 85%, 90%, 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, or 99%) identical to the nucleic acid sequence set forth in SEQ ID NO: 88, 91, or 92. In certain embodiments, the polynucleotide comprises the nucleic acid sequence set forth in SEQ ID NO: 88, 91, or 92. In certain embodiments, the polynucleotide comprises the nucleic acid sequence set forth in SEQ ID NO: 92. In certain embodiments, the polynucleotide consists of the nucleic acid sequence set forth in SEQ ID NO: 88, 91, or 92. In certain embodiments, the polynucleotide consists of the nucleic acid sequence set forth in SEQ ID NO: 92.

In another aspect, the instant disclosure provides pharmaceutical compositions comprising an AAV as disclosed herein together with a pharmaceutically acceptable excipient, adjuvant, diluent, vehicle or carrier, or a combination thereof. A “pharmaceutically acceptable carrier” includes any material which, when combined with an active ingredient of a composition, allows the ingredient to retain biological activity and without causing disruptive physiological reactions, such as an unintended immune reaction. Pharmaceutically acceptable carriers include water, phosphate buffered saline, emulsions such as oil/water emulsion, and wetting agents. Compositions comprising such carriers are formulated by well-known conventional methods such as those set forth in Remington's Pharmaceutical Sciences, current Ed., Mack Publishing Co., Easton Pa. 18042, USA; A. Gennaro (2000) “Remington: The Science and Practice of Pharmacy”, 20th edition, Lippincott, Williams, & Wilkins; Pharmaceutical Dosage Forms and Drug Delivery Systems (1999) H. C. Ansel et al, 7th ed., Lippincott, Williams, & Wilkins; and Handbook of Pharmaceutical Excipients (2000) A. H. Kibbe et al, 3rd ed. Amer. Pharmaceutical Assoc.

In another aspect, the instant disclosure provides a polynucleotide comprising a coding sequence encoding a human PAH protein or a fragment thereof, wherein the coding sequence has been codon-altered to have less than 100% (e.g., less than 95%, 90%, 85%, 80%, 75%, 70%, 65%, 60%, 55%, or 50%) identical to a wild-type human PAH gene. In certain embodiments, the polynucleotide comprises the sequence set forth in SEQ ID NO: 25. In certain embodiments, the polynucleotide comprises nucleotides 4 to 1359 of the sequence set forth in SEQ ID NO: 25. The polynucleotide can comprise DNA, RNA, modified DNA, modified RNA, or a combination thereof. In certain embodiments, the polynucleotide is an expression vector.

In another aspect, the instant disclosure provides methods for expressing a PAH polypeptide in a cell. The methods generally comprise transducing the cell with a replication-defective AAV as disclosed herein. Such methods are highly efficient at restoring PAH expression. Accordingly, in certain embodiments, the methods disclosed herein involve transducing the cell with a replication-defective AAV as disclosed herein.

The methods disclosed herein can be applied to any cell harboring a mutation in the PAH gene. The skilled worker will appreciate that cells that are active in Phe metabolism are of particular interest. Accordingly, in certain embodiments, the method is applied to cells in the liver, kidney, brain, pituitary gland, adrenal gland, pancreas, urinary bladder, gallbladder, colon, small intestine, or breast. In certain embodiments, the method is applied to hepatocytes and/or renal cells.

The methods disclosed herein can be performed in vitro for research purposes or can be performed ex vivo or in vivo for therapeutic purposes.

In certain embodiments, the cell to be transduced is in a mammalian subject and the AAV is administered to the subject in an amount effective to transduce the cell in the subject. Accordingly, in certain embodiments, the instant disclosure provides a method for treating a subject having a disease or disorder associated with a PAH gene mutation, the method generally comprising administering to the subject an effective amount of a replication-defective AAV as disclosed herein. The subject can be a human subject, a non-human primate subject (e.g., a cynomolgus), or a rodent subject (e.g., a mouse) with a PAH mutation, or a non-human primate subject (e.g., a cynomolgus) or a rodent subject (e.g., a mouse) containing PAH-mutant human liver cells. Suitable mouse subjects include without limitation, mice into which human liver cells (e.g., human hepatocytes) have been engrafted. Any disease or disorder associated with a PAH gene mutation can be treated using the methods disclosed herein. Suitable diseases or disorders include, without limitation, phenylketonuria.

In certain embodiments, the foregoing methods employ a replication-defective AAV comprising: (a) an AAV capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27); (b) an AAV capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27; and/or (c) an AAV capsid protein comprising the amino acid sequence of SEQ ID NO: 16, and a transfer genome comprising 5′ to 3′ following genetic elements: a 5′ ITR element (e.g., the 5′ ITR of SEQ ID NOs: 26), a human HCR1 (e.g., the HCR1 of SEQ ID NOs: 29), an hAAT promoter (e.g., the hAAT promoter of SEQ ID NOs: 30), an SV40 intron (e.g., the SV40 intron of SEQ ID NOs: 31), a silently altered human PAH coding sequence (e.g., the PAH coding sequence of SEQ ID NOs: 25), an SV40 polyadenylation sequence (e.g., the SV40 polyadenylation sequence of SEQ ID NOs: 43), and a 3′ ITR element (e.g., the 3′ ITR of SEQ ID NOs: 27.

In certain embodiments, the foregoing methods employ a replication-defective AAV comprising: (a) an AAV capsid protein comprising the amino acid sequence of amino acids 203-736 of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90; (b) an AAV capsid protein comprising the amino acid sequence of amino acids 138-736 of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90; and/or (c) an AAV capsid protein comprising the amino acid sequence of SEQ ID NO: 16, and a transfer genome comprising the nucleotide sequence set forth in any one of SEQ ID NOs: 24, 25, 46, 47, 48, 49, 50, 51, 52, 53, 54, 55, 61, 62, 64, 65, 67, 68, 74, 75, 76, 77, 78, 79, 80, 81, 82, 83, 84, 85, 86, 87, 89, or 90.

The methods disclosed herein are particularly advantageous in that they are capable of expressing a PAH protein in a cell with high efficiency both in vivo and in vitro. In certain embodiments, the expression level of the PAH protein is at least 10%, 20%, 30%, 40%, 50%, 60%, 70%, 80%, 90%, or 100% of the expression level of the endogenous PAH protein in a cell of the same type that does not have a mutation in the PAH gene. In certain embodiments, the expression level of the PAH protein is at least 1.1, 1.2, 1.3, 1.4, 1.5, 1.6, 1.7, 1.8, 1.9, 2, 3, 4, 5, 6, 7, 8, 9, or 10 fold higher than the expression level of the endogenous PAH protein in a cell of the same type that does not have a mutation in the PAH gene. Any methods of determining the expression level of the PAH protein can be employed including, without limitation, ELISA, Western blotting, immunostaining, and mass spectrometry.

In certain embodiments, transduction of a cell with an AAV composition disclosed herein can be performed as provided herein or by any method of transduction known to one of ordinary skill in the art. In certain embodiments, the cell may be contacted with the AAV at a multiplicity of infection (MOI) of 50,000; 100,000; 150,000; 200,000; 250,000; 300,000; 350,000; 400,000; 450,000; or 500,000, or at any MOI that provides for optimal transduction of the cell.

An AAV composition disclosed herein can be administered to a subject by any appropriate route including, without limitation, intravenous, intraperitoneal, subcutaneous, intramuscular, intranasal, topical or intradermal routes. In certain embodiments, the composition is formulated for administration via intravenous injection or subcutaneous injection.

In another aspect, the instant disclosure provides packaging systems for recombinant preparation of a replication-defective AAV disclosed herein. Such packaging systems generally comprise: a Rep nucleotide sequence encoding one or more AAV Rep proteins; a Cap nucleotide sequence encoding one or more AAV Clade F capsid proteins as disclosed herein; and a transfer genome for expression of the PAH gene as disclosed herein, wherein the packaging system is operative in a cell for enclosing the transfer genome in the capsid to form the AAV.

In certain embodiments, the packaging system comprises a first vector comprising the Rep nucleotide sequence and the Cap nucleotide sequence, and a second vector comprising the transfer genome. As used in the context of a packaging system as described herein, a “vector” refers to a nucleic acid molecule that is a vehicle for introducing nucleic acids into a cell (e.g., a plasmid, a virus, a cosmid, an artificial chromosome, etc.).

Any AAV Rep protein can be employed in the packaging systems disclosed herein. In certain embodiments of the packaging system, the Rep nucleotide sequence encodes an AAV2 Rep protein. Suitable AAV2 Rep proteins include, without limitation, Rep 78/68 or Rep 68/52. In certain embodiments of the packaging system, the nucleotide sequence encoding the AAV2 Rep protein comprises a nucleotide sequence that encodes a protein having a minimum percent sequence identity to the AAV2 Rep amino acid sequence of SEQ ID NO: 22, wherein the minimum percent sequence identity is at least 70% (e.g., at least 75%, at least 80%, at least 85%, at least 90%, at least 95%, at least 98%, at least 99%, or 100%) across the length of the amino acid sequence of the AAV2 Rep protein. In certain embodiments of the packaging system, the AAV2 Rep protein has the amino acid sequence set forth in SEQ ID NO: 22.

In certain embodiments of the packaging system, the packaging system further comprises a third vector, e.g., a helper virus vector. The third vector may be an independent third vector, integral with the first vector, or integral with the second vector. In certain embodiments, the third vector comprises genes encoding helper virus proteins.

In certain embodiments of the packaging system, the helper virus is selected from the group consisting of adenovirus, herpes virus (including herpes simplex virus (HSV)), poxvirus (such as vaccinia virus), cytomegalovirus (CMV), and baculovirus. In certain embodiments of the packaging system, where the helper virus is adenovirus, the adenovirus genome comprises one or more adenovirus RNA genes selected from the group consisting of E1, E2, E4 and VA. In certain embodiments of the packaging system, where the helper virus is HSV, the HSV genome comprises one or more of HSV genes selected from the group consisting of UL5/8/52, ICPO, ICP4, ICP22 and UL30/UL42.

In certain embodiments of the packaging system, the first, second, and/or third vector are contained within one or more transfecting plasmids. In certain embodiments, the first vector and the third vector are contained within a first transfecting plasmid. In certain embodiments the second vector and the third vector are contained within a second transfecting plasmid.

In certain embodiments of the packaging system, the first, second, and/or third vector are contained within one or more recombinant helper viruses. In certain embodiments, the first vector and the third vector are contained within a recombinant helper virus. In certain embodiments, the second vector and the third vector are contained within a recombinant helper virus.

In a further aspect, the disclosure provides a method for recombinant preparation of an AAV as described herein, wherein the method comprises transfecting or transducing a cell with a packaging system as described under conditions operative for enclosing the transfer genome in the capsid to form the AAV as described herein. Exemplary methods for recombinant preparation of an AAV include transient transfection (e.g., with one or more transfection plasmids containing a first, and a second, and optionally a third vector as described herein), viral infection (e.g. with one or more recombinant helper viruses, such as a adenovirus, poxvirus (such as vaccinia virus), herpes virus (including HSV, cytomegalovirus, or baculovirus, containing a first, and a second, and optionally a third vector as described herein), and stable producer cell line transfection or infection (e.g., with a stable producer cell, such as a mammalian or insect cell, containing a Rep nucleotide sequence encoding one or more AAV Rep proteins and/or a Cap nucleotide sequence encoding one or more AAV Clade F capsid proteins as described herein, and with a transfer genome as described herein being delivered in the form of a transfecting plasmid or a recombinant helper virus).

The recombinant AAV vectors disclosed herein mediate highly efficient gene transfer in vitro and in vivo. The following examples demonstrate the efficient restoration of the expression of the PAH gene, which is mutated in certain human diseases, such as phenylketonuria, using an AAV-based vector as disclosed herein. These examples are offered by way of illustration, and not by way of limitation.

This example provides human PAH transfer vectors pHMI-hPAH-TC-004, pHMI-hPAH-TC-025, pHMI-hPAH-TC-010, pHMI-hPAH-TC-011, and pHMI-hPAH-TC-012 for expression of human PAH in a human or mouse cell.

a) pHMI-hPAH-TC-004

PAH transfer vector pHMI-hPAH-TC-004, as shown in FIG. 1A, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a CAG promoter, a silently altered human PAH coding sequence, an SV40 polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 1. This vector is capable of expressing a human PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced.

TABLE 1
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-004
Genetic Element SEQ ID NO
5′ ITR element 18
CAG promoter 28
codon-altered human PAH coding sequence 25
SV40 polyadenylation sequence 42
3′ ITR element 19
Transfer genome (from promoter to polyadenylation 46
sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 51

b) pHMI-hPAH-TC-025

PAH transfer vector pHMI-hPAH-TC-025, as shown in FIG. 1B, comprises 5′ to 3′ the following genetic elements: a truncated 5′ ITR element, a human hepatic control region 1 (HCR1), a human al-antitrypsin (hAAT) promoter, an SV40 intron, a silently altered human PAH coding sequence, an SV40 polyadenylation sequence, and a modified 3′ ITR element. The sequences of these elements are set forth in Table 2. The truncated 5′ ITR allows the vector to form a double-stranded AAV genome after transduction into cells. This vector is capable of expressing a human PAH protein in a human hepatocyte to which the vector is transduced.

TABLE 2
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-025
Genetic Element SEQ ID NO
truncated 5′ ITR element 26
human HCR1 29
human α1-antitrypsin (hAAT) promoter 30
SV40 intron 31
transcriptional regulatory region comprising the human 32
HCR1 and hAAT promoter
codon-altered human PAH coding sequence 25
SV40 polyadenylation sequence 43
modified 3′ ITR element 27
Transfer genome (from HCR1 to polyadenylation 47
sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 52
Full sequence of transfer vector 92

c) pHMI-hPAH-TC-010

PAH transfer vector pHMI-hPAH-TC-010, as shown in FIG. 1C, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a hepatic specific regulatory module of hAAT promoter, a human TTR promoter, a modified minute virus of mouse (MVM) intron, a silently altered human PAH coding sequence, an SV40 polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 3. This vector is capable of expressing a human PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced, particularly at a high level in a hepatocyte.

TABLE 3
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-010
Genetic Element SEQ ID NO
5′ ITR element 18
hepatic specific regulatory module of hAAT promoter 33
human TTR promoter 34
modified minute virus of mouse (MVM) intron 35
transcriptional regulatory region comprising the hepatic 36
specific regulatory module (HSRM) and human TTR
promoter
codon-altered human PAH coding sequence 25
SV40 polyadenylation sequence 45
3′ ITR element 19
Transfer genome (from HSRM to polyadenylation 48
sequence)
Transfer genome(from 5′ ITR to 3′ ITR) 53

d) pHMI-hPAH-TC-011

PAH transfer vector pHMI-hPAH-TC-011, as shown in FIG. 1D, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a human HCR1, a human al-antitrypsin (hAAT) promoter, an modified MVM intron, a silently altered human PAH coding sequence, an SV40 polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 4. This vector is capable of expressing a human PAH protein in a human hepatocyte to which the vector is transduced.

TABLE 4
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-011
Genetic Element SEQ ID NO
truncated 5′ ITR element 18
human HCR1 37
human α1-antitrypsin (hAAT) promoter 38
modified minute virus of mouse (MVM) intron 35
transcriptional regulatory region comprising the human 39
HCR1 and hAAT promoter
codon-altered human PAH coding sequence 25
SV40 polyadenylation sequence 45
modified 3′ ITR element 19
Transfer genome (from HCR1 to polyadenylation 49
sequence)
Transfer genome(from 5′ ITR to 3′ ITR) 54

e) pHMI-hPAH-TC-012

PAH transfer vector pHMI-hPAH-TC-012, as shown in FIG. 1E, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a human hepatic control region 1 (HCR1), a modified human EF-1a promoter, a silently altered human PAH coding sequence, an SV40 polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 5. This vector is capable of expressing a human PAH protein in a human hepatocyte to which the vector is transduced.

TABLE 5
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-012
Genetic Element SEQ ID NO
truncated 5′ ITR element 18
human hepatic control region 1 (HCR1) 37
modified human EF-1α promoter 40
transcriptional regulatory region comprising the human 41
HCR1 and modified human EF-1α promoter
codon-altered human PAH coding sequence 25
SV40 polyadenylation sequence 45
modified 3′ ITR element 19
Transfer genome (from HCR1 to polyadenylation 50
sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 55

The silent alteration significantly improves the expression of the PAH protein, as demonstrated by comparison of expression vectors pCOH-WT-PAH, pCOH—CO-PAH, and pHMI-CO-PAH. The pCOH-WT-PAH vector comprises a CAG promoter operably linked to a wild-type PAH coding sequence set forth in SEQ ID NO: 24. The pCOH—CO-PAH and pHMI-CO-PAH vectors each comprise a CAG promoter operably linked to a codon-altered human PAH coding sequence as set forth in SEQ ID NO: 25. The pCOH—CO-PAH and pHMI-CO-PAH vectors are highly similar. Each vector was transfected in HEK 293 cells which is naturally deficient in PAH. As shown in FIG. 2, VG-GT-CO-PAH (“CO-hPAH”) gave rise to an expression level of human PAH several fold higher than VG-GT-PAH (“WT-hPAH”).

The vectors disclosed herein can be packaged in an AAV clade F capsid, such as an AAVHSC5, AAVHSC7, AAVHSC15 or AAVHSC17 capsid. The packaged viral particles can be administered to a wild-type animal, a PAH deficient animal, or a reconstituted animal having human hepatocytes obtained from a patient with phenylketonuria caused by a PAH mutation. The gene transfer efficiency can be measured by collecting liver samples and quantifying the percentage of PAH-positive cells (e.g., cells that have a unique nucleotide sequence from the vector, cells that express a wild-type PAH protein, or cells with a higher PAH activity than in cells from a control animal not receiving the PAH expression vector). The restoration of phenylalanine metabolism, which indicates the efficacy of the PAH expression vectors, can be assessed by measuring the Phe level in the blood and by observing the coat color of the mouse. Safety of the viral particle administration can be evaluated by measuring the aspartate aminotransferase (AST) and alanine aminotransferase (ALT) levels in serum.

This example provides a mouse PAH transfer vector rAAV-CBA-mPAH that is similar to the human PAH transfer vector pHMI-hPAH-TC-004 described in Example 1 except that a wild-type mouse PAH coding sequence is substituted for the codon-altered human PAH coding sequence. This vector is capable of expressing a mouse PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced.

Briefly, Pah−/− (PAHemu2) mice were housed in clear polycarbonate cages with contact bedding in an isolator. Picolab Mouse Diet 5058 was provided to the animals ad libitum. Spring or tap water acidified with 1N HCl to a targeted pH of 2.5-3.0 was provided ad libitum. Vectors packaged in AAVHSC15 capsid were prepared in PBS (with Ca and Mg), supplemented with 35 mM NaCl, 1% sucrose, and 0.05% Pluronic F-68. The formulation was injected intravenously via the tail vein.

Blood samples were collected every week after the administration of the PAH transfer vector (0 week: prior to administration) by facial vein puncture or tail snip. The samples were allowed to clot at room temperature for at least 30 minutes, centrifuged at ambient temperature at minimum 1000×g for 10 minutes and the serum samples were extracted. Serum samples were stored at −70° C. Serum phenylalanine and tyrosine levels were measured by tandem mass spectrometry.

For collection of tissue samples, the animals underwent cardiac perfusion with saline. Liver (caudate lobe), kidney (left), brain, heart, and muscle (quadriceps) tissues were snap frozen in liquid nitrogen and stored at −70° C. The snap frozen tissues were ground into powder in liquid nitrogen in a mortar and pestle and divided in to aliquots to test for PAH expression for vector genome biodistribution by qPCR.

The safety of the rAAV-CBA-mPAH vector was assessed by measuring the aspartate aminotransferase (AST) and alanine aminotransferase (ALT) levels in the treated animals. Serum samples were collected pre-dose and one week after administration of the viral particles. The levels of AST and ALT were measured by the Sigma MAK055 and Sigma MAK052 ELISA kits.

The pah−/− mice manifested phenylketonuria and had lighter coat color than wild-type mice. As shown in FIGS. 3A and 3B, the administration of the rAAV-CBA-mPAH vector lead to significant reduction of Phe levels in the serum within one week, and the Phe levels remained low for four weeks. The coat color also changed from brown to black within one week.

Expression of mPAH was also observed in tissue samples. As shown in FIG. 4, DNA of the rAAV-CBA-mPAH vector was detectable in many organs, wherein the numbers of viral genomes per 106 cells was the highest in liver, heart, and kidney.

With respect to the safety of the AAV administration, the AST and ALT levels remained low after administration (FIGS. 5A and 5B), suggesting that the rAAV-CBA-mPAH vector was not toxic to the liver.

This example demonstrates that the PAH transfer vectors described in Example 1 effectively reversed the phenotype caused by PAH gene deficiency in a mouse model. The mouse model, AAV packaging and formulation, and methods for examining gene transfer efficiency were identical to the ones described in Example 2.

To examine the efficacy of the five PAH transfer vectors in reversing the phenotypes, a single dose of 2.6×1013 vector genomes per kg of body weight for male mice, or a dose of 6×1013 vector genomes per kg of body weight for female mice. The pah−/− mice manifested increased level of phenylalanine (Phe) and reduced level of tyrosine (Tyr) in the serum. As shown in FIGS. 6A-6H, the administration of any one of the five vectors led to significant reduction of Phe levels and increase of Tyr levels within one week. The efficacy lasted for at least 12 weeks in male mice, and at least 6 weeks in female mice. Other than pHMI-hPAH-TC-004, all the vectors maintained complete reduction of serum Phe levels during the time examined.

To examine the long-term efficacy of pHMI-hPAH-TC-025 in reversing the phenotype caused by PAH gene deficiency, a single dose of 2.6×1013 vector genomes per kg of body weight was administered to male mice, or a single dose of 6×1013 vector genomes per kg of body weight was administered to female mice. As shown in FIGS. 6I and 6J, the administration of the pHMI-hPAH-TC-025 vector led to significant reduction of Phe levels within one week. This reduction persisted for at least 48 weeks in male mice, and at least 46 weeks in female mice. Additionally, within two weeks post administration of the AAV, the coat color of the mice administered with pHMI-hPAH-TC-004 changed from brown to black. An increase of PAH mRNA was observed by ddPCR in the liver samples of these mice collected 4 weeks post injection relative to the mice not administered with AAV vectors. An increase of the PAH enzymatic activity was also detected in liver samples by mass spectrometry.

The efficacy of different doses of the pHMI-hPAH-TC-025 vector was further assessed. A single dose of 2.6×1011, 2.6×1012, or 2.6×1013 vector genomes per kg of body weight was administered to male mice and female mice, and the serum levels of Phe and Tyr were measured. As shown in FIGS. 7A-7D, the dose of 2.6×1013 vector genomes per kg of body weight reduced the Phe levels and increased the Tyr levels more significantly than the two lower doses, and maintained complete reduction of serum Phe levels during the time examined in both male and female subjects.

This example provides human PAH transfer vectors pHMI-hPAH-TC-009, pHMI-hPAH-TC-013, and pHMI-hPAH-TC-017 for expression of human PAH in a human or mouse cell. Vector maps are shown in FIGS. 8A, 8B, and 8C, respectively.

a) pHMI-hPAH-TC-009

PAH transfer vector pHMI-hPAH-TC-009, as shown in FIG. 8A, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a CMV enhancer, a CBA promoter, a rabbit β-globin element, a human PAH coding sequence, a polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 6. This vector is capable of expressing a human PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced.

TABLE 6
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-009
Genetic Element SEQ ID NO
5′ ITR element 18
CMV enhancer 58
CBA promoter 59
Rabbit β-globin element 60
codon-altered human PAH coding sequence 25
Polyadenylation sequence 45
3′ ITR element 19
Transfer genome (from CMV to polyadenylation 61
sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 62

b) pHMI-hPAH-TC-013

PAH transfer vector pHMI-hPAH-TC-013, as shown in FIG. 8B, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, a CASI promoter region (comprising a CMV enhancer, a CASI promoter, and a ubiquitin C enhancer element (hUBC exon)), a human PAH coding sequence, a polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 7. This vector is capable of expressing a human PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced.

TABLE 7
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-013
Genetic Element SEQ ID NO
5′ ITR element 18
CASI promoter region 63
Human PAH coding sequence 25
Polyadenylation sequence 45
3′ ITR element 19
Transfer genome (from promoter region to 64
polyadenylation sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 65

f) pHMI-hPAH-TC-017

PAH transfer vector pHMI-hPAH-TC-017, as shown in FIG. 8C, comprises 5′ to 3′ the following genetic elements: a 5′ ITR element, an hAAT promoter region (comprising an ABMP enhancer (an enhancer region adjacent to a gene on chromosome 9 that expresses highly in liver, 5′ to the ATG), a TTR enhancer, an hAAT promoter, and an MVM intron), a human PAH coding sequence, a polyadenylation sequence, and a 3′ ITR element. The sequences of these elements are set forth in Table 8. This vector is capable of expressing a human PAH protein in a cell (e.g., a human cell or a mouse cell) to which the vector is transduced.

TABLE 8
Genetic elements in human PAH transfer
vector pHMI-hPAH-TC-017
Genetic Element SEQ ID NO
5′ ITR element 18
hAAT promoter region 66
Human PAH coding sequence 25
Polyadenylation sequence 45
3′ ITR element 19
Transfer genome (from promoter region to 67
polyadenylation sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 68

The vectors described in this example were tested for expression in two different cell lines. 5×105 HEK293 cells (kidney; non-liver) and 5×105 Huh7 cells (liver) were transfected with 1 ug each of the following vectors: pHMI-hPAH-TC-004 (PAH-004); pHMI-hPAH-TC-009 (PAH-009); pHMI-hPAH-TC-010 (PAH-010); pHMI-hPAH-TC-011 (PAH-011); pHMI-hPAH-TC-012 (PAH-012); pHMI-hPAH-TC-013 (PAH-013); pHMI-hPAH-TC-025 (LP1); pHMI-hPAH-TC-017 (PAH-017). Lysate of the cells was collected 48 hours after transfection. The expression of human PAH was detected by Western blotting with an anti-PAH antibody (Sigma HPA031642). The amount of GAPDH protein as detected by an anti-GAPDH antibody (Millipore MAB 374) was used as a loading control. PAH expression levels of all vectors were normalized to pHMI-hPAH-TC-004 expression level; data was collected from multiple independent transfections and plotted in FIG. 9. FIG. 9A shows the normalized PAH expression level of the indicated vectors in Huh7 cells. FIG. 9B shows the normalized PAH expression level of the indicated vectors in HEK293 cells.

FIGS. 10A and 10B are graphs showing the serum phenylalanine levels over time of male and female homozygous Pah−/− PAHemu2 mice respectively. Male and female mice were dosed at 2e13 vg/kg and 6e13 vg/kg respectively with pHMI-hPAH-TC-010 (hPAH-TC-010), pHMI-hPAH-TC-025 (hPAH-TC-025), pHMI-hPAH-TC-004 (hPAH-TC-004), pHMI-hPAH-TC-011 (hPAH-TC-011), or pHMI-hPAH-TC-012 (hPAH-TC-012) vectors packaged in AAVHSC15 capsid. Serum samples were collected weekly then biweekly after the administration. Serum phenylalanine concentrations were assessed by LC-MS/MS. FIG. 10C is a graph showing the average baseline serum phenylalanine level for the male and female homozygous Pah−/− PAHemu2 mice in the study. The data represents a total of 55 mice per group.

As shown in FIG. 10, the administration of certain vectors led to significant reduction of Phe levels within one week of administration, and this reduction persisted for at least 45 weeks. FIG. 10 demonstrates that some of the PAH transfer vectors effectively reversed the phenotype caused by PAH gene deficiency in a mouse model. The mouse model, AAV packaging and formulation, and methods for examining gene transfer efficiency were identical to those described in Example 2 herein. The sizes of the AAV vectors were as follows: pHMI-hPAH-TC-010 (hPAH-TC-010): 2391 bp; pHMI-hPAH-TC-025 (hPAH-TC-025): 2351 bp; pHMI-hPAH-TC-004 (hPAH-TC-004): 3781 bp; pHMI-hPAH-TC-011 (hPAH-TC-011): 3158 bp; and pHMI-hPAH-TC-012 (hPAH-TC-012): 3799 bp.

This example examines the effect of PAH gene CpG content on PAH protein expression, using the PAH transfer vectors pHMI-hPAH-TC-018, pHMI-hPAH-TC-019, pHMI-hPAH-TC-020, pHMI-hPAH-TC-021, pHMI-hPAH-TC-022, and pHMI-hPAH-TC-023. Vector maps are shown in FIGS. 11A, 11B, 11C, 11D, 11E, and 11F, respectively. These PAH transfer vectors comprise the sequences and elements set forth in Table 9.

TABLE 9
Genetic elements in PAH transfer vectors
pHMI-hPAH-TC-XXX Vector
SEQ ID NO
Genetic Element −018 −019 −020 −021 −022 −023
5′ ITR element 26 26 26 26 26 26
HCR1 29 29 29 29 29 29
hAAT promoter 30 30 30 30 30 30
SV40 intron 31 31 31 31 31 31
PAH coding 69 70 71 72 73 24
sequence
Late SV40 45 45 45 45 45 45
polyadenylation
sequence
3′ ITR element 27 27 27 27 27 27
Transfer genome 74 76 78 80 82 84
(from HCR1 to
polyadenylation
sequence)
Transfer genome 75 77 79 81 83 85
(from 5′ ITR to 3′
ITR)

The vectors described in this example were tested for expression in HEK293 cells but under the control of a CBA promoter. 5×105 HEK293 cells were transfected with 1 ug each of the following vectors: pHMI-hPAH-TC-004 (PAH-TC-004); pHMI-hPAH-TC-009 (PAH-TC-009); pHMI-hPAH-TC-018 (PAH-TC-018); pHMI-hPAH-TC-019 (PAH-TC-019); pHMI-hPAH-TC-020 (PAH-TC-020); pHMI-hPAH-TC-021 (PAH-TC-021); pHMI-hPAH-TC-022 (PAH-TC-022); pHMI-hPAH-TC-023 (PAH-TC-023). Lysate of the cells was collected 48 hours after transfection. The expression of human PAH was detected by Western blotting with an anti-PAH antibody (Sigma HPA031642). The amount of GAPDH protein as detected by an anti-GAPDH antibody (Millipore MAB 374) was used as a loading control. PAH expression levels of all vectors were normalized to pHMI-hPAH-TC-004 expression level; data was plotted in FIG. 12.

FIG. 13 is a graph showing the serum phenylalanine levels over time of male homozygous Pah−/− PAHemu2 mice. Male mice have been dosed at 2e13 vg/kg with pHMI-hPAH-TC-018 (hPAH-TC-018); pHMI-hPAH-TC-019 (hPAH-TC-019); pHMI-hPAH-TC-020 (hPAH-TC-020); pHMI-hPAH-TC-021 (hPAH-TC-021); pHMI-hPAH-TC-022 (hPAH-TC-022); pHMI-hPAH-TC-023 (hPAH-TC-023); and pHMI-hPAH-TC-025 (hPAH-TC-025) vectors packaged in AAVHSC15 capsid. Serum samples were collected weekly after the administration. Serum phenylalanine concentration was assessed by LC-MS/MS.

As shown in FIG. 13, the administration of certain vectors led to significant reduction of Phe levels within one week of administration, and this reduction persisted for at least 25 weeks. The mouse model, AAV packaging and formulation, and methods for examining gene transfer efficiency were identical to those previously described in Example 2 herein. The CpG content of the vectors were as follows: pHMI-hPAH-TC-018 (hPAH-TC-018): 2; pHMI-hPAH-TC-019 (hPAH-TC-019): 7; pHMI-hPAH-TC-020 (hPAH-TC-020): 22; pHMI-hPAH-TC-021 (hPAH-TC-021): 10; pHMI-hPAH-TC-022 (hPAH-TC-022): 7; pHMI-hPAH-TC-023 (hPAH-TC-023): 23; and pHMI-hPAH-TC-025 (hPAH-TC-025): 60.

This example provides human PAH transfer vectors pHMI-01004 and pHMI-01008 for expression of human PAH in a human or mouse cell. Vector maps are shown in FIGS. 14A and 14B, respectively. These PAH transfer vectors comprise the sequences and elements set forth in Table 10.

TABLE 10
Genetic elements in PAH transfer
vectors pHMI-01004 and pHMI-01008
pHMI-01004 pHMI-01008
Genetic Element SEQ ID NO SEQ ID NO
5′ ITR element 26 26
HCR1 29 29
hAAT promoter region 30 30
SV40 intron 31 31
Human PAH coding sequence 25 25
Polyadenylation sequence 43 43
3′ ITR element 27 57
Transfer genome (from HCR1 to 86 89
polyadenylation sequence)
Transfer genome (from 5′ ITR to 3′ ITR) 87 90
Full sequence of transfer vector 88 91

The invention is not to be limited in scope by the specific embodiments described herein. Indeed, various modifications of the invention in addition to those described will become apparent to those skilled in the art from the foregoing description and accompanying figures. Such modifications are intended to fall within the scope of the appended claims.

All references (e.g., publications or patents or patent applications) cited herein are incorporated herein by reference in their entirety and for all purposes to the same extent as if each individual reference (e.g., publication or patent or patent application) was specifically and individually indicated to be incorporated by reference in its entirety for all purposes. Other embodiments are within the following claims.

Rubin, Hillard, Wright, Jason Boke, Seymour, Albert Barnes, Ahmed, Seemin Seher, Dollive, Serena Nicole

Patent Priority Assignee Title
Patent Priority Assignee Title
10610606, Feb 01 2018 HOMOLOGY MEDICINES, INC Adeno-associated virus compositions for PAH gene transfer and methods of use thereof
5252479, Nov 08 1991 RESEARCH CORPORATION TECHNOLOGIES, INC Safe vector for gene therapy
5474935, May 23 1990 The United States of America as represented by the Department of Health Adeno-associated virus (AAV)-based eucaryotic vectors
5589377, Oct 30 1990 GENCELL, S A Recombinant adeno-associated virus vectors
5622856, Aug 03 1995 Genzyme Corporation High efficiency helper system for AAV vector production
5650309, May 16 1995 Regents of the University of California, The Viral vectors
5753500, Sep 07 1989 The Trustees of Princeton University Helper-free stocks of recombinant adeno-associated virus vectors
5780447, Jun 14 1996 ST JUDE CHILDREN S RESEARCH HOSPITAL Recombinant adeno-associated viral vectors
5895759, Sep 03 1993 Max-Planck Gesellschaft zur Forderung der Wissenschaften E.V. Vector for gene transfer in liver cells
6025195, Jul 15 1995 Max-Planck-Gesellschaft zur Forderung der Wissenschaften e.V. Berlin Liver-specific adenovirus expression vector
6153436, Jan 10 1997 BOARD OF TRUSTEES OF THE UNIVERSITY OF ARKANSAS, THE Method of gene delivery using wildtype adeno associated viral (AAV) vectors with insertions
6156303, Jun 11 1997 Washington, University of Adeno-associated virus (AAV) isolates and AAV vectors derived therefrom
6180613, Apr 13 1994 Yale University AAV-mediated delivery of DNA to cells of the nervous system
6238914, Sep 23 1994 The General Hospital Corporation Use of a non-mammalian DNA virus to express an exogenous gene in a mammalian cell
6268212, Oct 18 1993 Amgen Inc Tissue specific transgene expression
6329181, Aug 07 2000 THOMAS JEFFERSON UNVIERSITY Helper functions for recombinant vector production
6338962, Sep 11 1996 The General Hospital Corporation Use of a non-mammalian DNA virus to express an exogenous gene in a mammalian cell
6387670, Nov 22 1996 Centelion Method for producing recombinant virus
6610906, Jun 09 1999 Michigan, University of the Regents Nucleotide sequences for gene regulation and methods of use thereof
6759237, Nov 05 1998 The Trustees of the University of Pennsylvania Adeno-associated virus serotype 1 nucleic acid sequences, vectors and host cells containing same
6919209, Dec 04 1998 City of Hope Method of genetically modifying very primitive quiescent human hematopoietic stem cells
6924128, Dec 06 1994 Targeted Genetics Corporation Packaging cell lines for generation of high titers of recombinant AAV vectors
6936243, Dec 02 1996 The University of Washington; Cell Genesys, Inc. Adeno-associated viral vector-mediated delivery of DNA to cells of the liver
6936466, Oct 21 1997 Targeted Genetics Corporation Transcriptionally-activated AAV inverted terminal repeats (ITRs) for use with recombinant AAV vectors
6953690, Mar 20 1998 TRUSTEES OF THE UNIVERSITY OF PENNSYLVANIA, THE Compositions and methods for helper-free production of recombinant adeno-associated viruses
6984517, May 28 1998 DEPARTMENT OF HEALTH AND HUMAN SERVICE, UNITED STATES OF AMERICA, AS REPRESENTED BY THE SECRETARY, OF, THE AAV5 vector and uses thereof
7001764, Jun 27 1995 CG ONCOLOGY, INC Compositions comprising tissue specific adenoviral vectors
7022519, Mar 18 1999 The Trustees of the University of Pennsylvania Compositions and methods for helper-free production of recombinant adeno-associated viruses
7056502, Apr 28 2000 The Trustees of the University of Pennsylvania Recombinant aav vectors with AAV5 capsids and AAV5 vectors pseudotyped in heterologous capsids
7091029, Sep 23 2002 The Johns Hopkins University High titer recombinant AAV production
7094604, Jun 05 2002 UNIVIRSITY OF FLORIDA RESEARCH FOUNDATION, INC Production of pseudotyped recombinant AAV virions
7148341, Oct 25 1996 Deutsches Krebsforschungszentrum Stiftung Des Offentlichen Rechts; Transgene S.A. AAV DNA comprising helper virus sequences
7157571, Sep 30 2004 Industrial Technology Research Institute Hepatoma specific chimeric regulatory sequence
7172893, Nov 10 1998 The University of North Carolina at Chapel Hill Virus vectors and methods of making and administering the same
7179903, May 14 2001 MILTENYI BIOTEC B V & CO KG Liver specific transcriptional enhancer
7198951, Dec 17 2001 The Trustees of the University of Pennsylvania Adeno-associated virus (AAV) serotype 9 sequences, vectors containing same, and uses therefor
7220577, Aug 28 2002 UNIVERSITY OF FLORIDA RESEARCH FOUNDATION, INC Modified AAV
7235393, Apr 29 2002 The Trustees of the University of Pennsylvania Method for direct rescue and amplification of integrated viruses from cellular DNA of tissues
7259151, Jun 19 2003 Genzyme Corporation AAV virions with decreased immunoreactivity and uses therefor
7282199, Dec 17 2001 TRUSTEES OF THE UNIVERSITY OF PENNSYLVANIA, THE Adeno-associated virus (AAV) serotype 8 sequences, vectors containing same, and uses therefor
7351813, Jun 20 2000 The Board of Trustees of the Leland Stanford Junior University; University of Washington Liver-specific gene expression cassettes, and methods of use
7465583, Jun 01 2000 UNIVERSITY OF NORTH CAROLINA AT CHAPEL HILL,THE Duplexed parvovirus vectors
7482156, Oct 15 2003 Cell Genesys, Inc Hepatocellular carcinoma specific promoter and uses thereof
7588772, Mar 30 2006 The Board of Trustees of the Leland Stanford Junior University AAV capsid library and AAV capsid proteins
7749492, Jan 05 2001 NATIONWIDE CHILDREN S HOSPITAL, INC AAV vectors and methods
7790154, Jun 01 2000 The University of North Carolina at Chapel Hill Duplexed parvovirus vectors
7906111, Sep 30 2003 TRUSTEES OF THE UNIVERSITY OF PENNSYLVANIA, THE Adeno-associated virus (AAV) clades, sequences, vectors containing same, and uses therefor
8030065, Sep 22 2004 St. Jude Children's Research Hospital Expression of factor IX in gene therapy vectors
8067156, May 31 2001 ROCKEFELLER UNIVERSITY, THE Method for generating replication defective viral vectors that are helper free
8163543, Oct 20 2005 AMSTERDAM MOLECULAR THERAPEUTICS AMT B V AAV vectors produced in insect cells
8168425, Sep 22 2004 St. Jude Children's Research Hospital Expression of factor IX in gene therapy vectors
8241622, Jul 13 2001 University of Iowa Research Foundation Adeno-associated virus vectors with intravector heterologous terminal palindromic sequences
8283151, Apr 29 2005 THE GOVERNMENT OF THE UNITED STATES OF AMERICA AS REPRESENTED BY THE SECRETARY, DEPARTMENT OF HEALTH AND HUMAN SERVICES, C O NATIONAL INSTITUTES OF HEALTH, OFFICE OF TECHNOLOGY TRANSFER Isolation, cloning and characterization of new adeno-associated virus (AAV) serotypes
8298818, Apr 28 2006 UNIVERSITY OF FLORIDA RESEARCH FOUNDATION, INC Self-complementary adeno-associated virus having a truncated CMV-chicken β-actin promoter
8476418, May 28 2009 DEUTSCHES KREBSFORSCHUNGSZENTRUM; Ruprecht-Karls-Universitat Heidelberg Modified AAV capsid polypeptides
8524446, Nov 13 2001 The Trustees of the University of Pennsylvania Method for detecting adeno-associated virus
8628966, Apr 30 2010 City of Hope CD34-derived recombinant adeno-associated vectors for stem cell transduction and systemic therapeutic gene transfer
8632764, Apr 30 2008 The University of North Carolina at Chapel Hill Directed evolution and in vivo panning of virus vectors
8716461, May 24 2004 Vitalant Human parvovirus
8846387, Apr 24 1997 The University of Washington Targeted gene modification by parvoviral vectors
8846389, Sep 11 1996 The United States of America, as represented by the Secretary, Department of Health and Human Services AAV4 vector and uses thereof
8926958, Apr 06 2004 Cedars-Sinai Medical Center Prevention and treatment of vascular disease with recombinant adeno-associated virus vectors encoding apolipoprotein A-I and apolipoprotein A-I milano
8999678, Apr 07 2005 The Trustees of the University of Pennsylvania Method of increasing the function of an AAV vector
8999948, Jun 10 2010 ESTEVE PHARMACEUTICALS, S A Vectors and sequences for the treatment of diseases
9150882, Jan 31 2006 THE BOARDS OF TRUSTEES OF THE LELAND STANFORD JUNIOR UNIVERSITY; The Board of Trustees of the Leland Stanford Junior University Self-complementary parvoviral vectors, and methods for making and using the same
9169299, Aug 24 2011 The Board of Trustees of the Leland Stanford Junior University AAV capsid proteins for nucleic acid transfer
9169494, Jan 12 2010 The University of North Carolina at Chapel Hill Restrictive inverted terminal repeats for viral vectors
9193956, Apr 22 2011 The Regents of the University of California Adeno-associated virus virions with variant capsid and methods of use thereof
9217155, May 28 2008 University of Massachusetts Isolation of novel AAV'S and uses thereof
9222105, Oct 27 2011 The Regents of the University of California Methods and compositions for modification of the HPRT locus
9402919, Mar 04 2011 Intrexon Corporation Vectors conditionally expressing protein
9408904, Jan 28 2010 THE CHILDREN S HOSPITAL OF PHILADELPHIA Scalable manufacturing platform for viral vector purification and viral vectors so purified for use in gene therapy
9409953, Feb 10 2011 The University of North Carolina at Chapel Hill Viral vectors with modified transduction profiles and methods of making and using the same
9441244, Jun 30 2003 Integrative Gene Therapeutics Mutant adeno-associated virus virions and methods of use thereof
9617548, Apr 22 2008 VIB VZW; Life Sciences Research Partners VZW; Universiteit Gent Liver-specific nucleic acid regulatory elements and methods and use thereof
9764045, Jul 08 2009 UCL Business LTD Optimised coding sequence and promoter
9783824, Jan 31 2006 The Board of Trustees of the Leland Stanford Junior University Self-complementary parvoviral vectors, and methods for making and using the same
9840719, Jul 22 2013 THE CHILDREN S HOSPITAL OF PHILADELPHIA Variant AAV and compositions, methods and uses for gene transfer to cells, organs and tissues
9890396, Sep 24 2014 City of Hope Adeno-associated virus vector variants for high efficiency genome editing and methods thereof
20030129203,
20030130221,
20030198620,
20040086485,
20040142416,
20040235174,
20050112765,
20090191597,
20090215879,
20100297084,
20100316623,
20120046349,
20120093772,
20120244127,
20130023033,
20130189225,
20130310443,
20130323226,
20140037585,
20140050701,
20140107185,
20140271550,
20140336245,
20140359799,
20150023924,
20150024467,
20150065562,
20150079038,
20150110762,
20150111955,
20150184197,
20150238550,
20150315610,
20150352228,
20150374803,
20150376240,
20160000887,
20160017295,
20160032319,
20160123990,
20160175365,
20160229904,
20170119906,
20170211094,
20170326256,
EP126544,
EP1497436,
EP161788,
WO2016049230,
WO1996008560,
WO1998009524,
WO1998021349,
WO1998027207,
WO1998028417,
WO1999003981,
WO1999018227,
WO1999055564,
WO1999064569,
WO2000049160,
WO2001036620,
WO2002066611,
WO2003087383,
WO2003093436,
WO2005111220,
WO2006096815,
WO2007019646,
WO2008021140,
WO2009000552,
WO2009043936,
WO2009130208,
WO2009134681,
WO2010124180,
WO2010129021,
WO2011012724,
WO2011038187,
WO2014064277,
WO2014089212,
WO2014193716,
WO2015061491,
WO2015143177,
WO2015164723,
WO2016097218,
WO2016097219,
WO2016100575,
WO2016146757,
WO2017015154,
WO2017100551,
WO2018046737,
WO2018126112,
WO2018126116,
WO2018129586,
WO2019010091,
//////
Executed onAssignorAssigneeConveyanceFrameReelDoc
Jul 03 2019DOLLIVE, SERENA NICOLEHOMOLOGY MEDICINES, INC ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530570979 pdf
Jul 08 2019SEYMOUR, ALBERT BARNESHOMOLOGY MEDICINES, INC ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530570979 pdf
Jul 08 2019WRIGHT, JASON BOKEHOMOLOGY MEDICINES, INC ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530570979 pdf
Jul 16 2019AHMED, SEEMIN SEHERHOMOLOGY MEDICINES, INC ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530570979 pdf
Jul 16 2019RUBIN, HILLARDHOMOLOGY MEDICINES, INC ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS 0530570979 pdf
Mar 19 2020HOMOLOGY MEDICINES, INC.(assignment on the face of the patent)
Date Maintenance Fee Events
Mar 19 2020BIG: Entity status set to Undiscounted (note the period is included in the code).
Mar 30 2020SMAL: Entity status set to Small.


Date Maintenance Schedule
Apr 09 20274 years fee payment window open
Oct 09 20276 months grace period start (w surcharge)
Apr 09 2028patent expiry (for year 4)
Apr 09 20302 years to revive unintentionally abandoned end. (for year 4)
Apr 09 20318 years fee payment window open
Oct 09 20316 months grace period start (w surcharge)
Apr 09 2032patent expiry (for year 8)
Apr 09 20342 years to revive unintentionally abandoned end. (for year 8)
Apr 09 203512 years fee payment window open
Oct 09 20356 months grace period start (w surcharge)
Apr 09 2036patent expiry (for year 12)
Apr 09 20382 years to revive unintentionally abandoned end. (for year 12)