The present invention provides mrna sequences comprising at least one coding region, encoding for at least one epitope of a protein, or of a fragment, variant or derivative thereof, of a virus of the genus rotavirus. Particularly preferred is the protein respectively the protein cleavage product VP8* of rotavirus. The mrna sequence may be used as a vaccine or generally as a pharmaceutical composition for prophylaxis or treatment of rotavirus infections.
|
1. A method for stimulating an anti-rotavirus response in a patient comprising administering an effective amount of an immune stimulating composition comprising a lipid nanoparticle (LNP) comprising an mrna molecule comprising at least one region encoding a polypeptide comprising at least 100 amino acids of a VP8* cleavage product of a rotavirus vp4 protein, wherein the mrna comprises a 5′ Cap and a poly(A) tail of 60 to about 250 adenosine at the 3′ end of the mrna,
wherein the composition is administered by intramuscular injection.
2. The method of
3. The method of
4. The method of
5. The method of
6. The method of
7. The method of
8. The method of
9. The method of
10. The method of
11. The method of
12. The method of
13. The method of
14. The method of
15. The method of
16. The method of
17. The method of
19. The method of
20. The method of
|
This application is a national phase application under 35 U.S.C. § 371 of International Application No. PCT/EP2016/077185, filed Nov. 9, 2016, which claims benefit of European Application No. 15003204.3, filed Nov. 9, 2015, the entire contents of each of which are hereby incorporated by reference.
The present invention relates according to a first aspect to mRNA sequences. According to further aspects the invention relates to compositions, vaccines and kits comprising said mRNA sequences. According to a further aspect of the invention the mRNA sequences may be used for treatment and prophylaxis of rotavirus infections.
Rotavirus infections are the globally leading cause of severe diarrhoea in children younger than five years of age and account for 50% of hospitalisations for childhood diarrhoea. Although rotavirus infections are usually an easily managed disease of childhood, worldwide still more than 450,000 children under five years die from rotavirus infection each year. More than 80% of all rotavirus-related deaths occur in resource-poor countries.
Rotaviruses belong to the family of Reoviridae and have been subdivided into eight species, namely five serological species (rotavirus A to E) and two additional tentative species (rotavirus F and G). These species are commonly referred to as “RV groups”. Three species thereof (A, B and C) can infect humans and animals. The other species D, E, F and G have been identified in animals, mostly in birds. Rotavirus A is responsible for more than 90% of all human infections and is most important for human infection and disease. It is transmitted by the faecal-oral route and targets enterocytes in the villi of the small intestine, leading to cell damage and gastroenteritis.
The virus possess a double stranded, segmented RNA genome that encodes six structural and six non-structural proteins and forms non-enveloped particles with three-layered icosaedral capsids. The six structural proteins (VPs—viral proteins) form the virus particle (virion) and are called VP1, VP2, VP3, VP4, VP6 and VP7. The six non-structural proteins (NSPs) are called NSP1, NSP2, NSP3, NSP4, NSP5 and NSP6 and are important for viral mRNA translation, for genome replication, genome encapsidation and capsid assembly. In addition, non-structural proteins are involved in antagonizing the antiviral host response and in subverting important cellular processes to enable successful virus replication.
Within the species rotavirus A, there are different strains (serotypes or genotypes), which are classified by a dual system based on the structural proteins VP7 and VP4. VP7 and VP4 are components of the outermost protein layer (outer capsid), and both carry neutralizing epitopes. VP7 is a glycoprotein (G) that forms the outer layer or surface of the virion. VP7 determines the G-type of the strain. According to Matthijnssens J. et al. (Arch Virol. 2011; 156(8):1397-1413) there are 27 G-serotypes (G1-G27). VP4 is a surface protein that protrudes as a spike. VP4 is essential for virus-cell interaction and determines host range and virulence of the virus. VP4 is protease sensitive (P) and determines the P-type of the virus. There are 35 P-serotypes (P[1]-P[35]). This dual classification system may also be applied to other rotavirus species.
In humans, around 90% of infections are caused by G1, G2, G3 or G4 and also G9 and G12. With respect to the P-types P[4], P[6] and P[8] are the most prevalent, as described by Zeller et al. (Journal of Clinical Microbiology (2012), vol 50, no. 3, pp: 966-976). Importantly, infection with one virus is not able to induce cross-protection against infection by a different serotype.
While the virus is endemic worldwide with almost every child having been infected by the age of five, rotavirus infection is most problematic in the developing world: the majority of deaths occur in Africa and Asia, of which India is the country most heavily affected.
At present, there are two licensed oral vaccines available, which are both based on live-attenuated forms of the virus. RotaTeq® (Merck) is based on a bovine rotavirus strain engineered to express outer layer proteins from human strains. Rotarix® (GlaxoSmithKline) is based on a live-attenuated human rotavirus strain. Both vaccines are given orally and exhibit high efficacy in the developed world. However, the efficacy of oral rotavirus vaccination is significantly reduced in developing countries. This is likely to be caused by several factors. Firstly, the virus-based vaccine can be inactivated by pre-existing antibodies, e.g. transferred to babies by breastfeeding. Secondly, malnutrition can have a negative impact on the efficacy of oral vaccinations. Furthermore, unrelated infections of the gastrointestinal tract which are more prevalent in developing countries compared to developed countries, might be a major factor in reducing vaccine efficacy.
The Chinese patent application CN 102703475 A describes an approach to obtain recombinant bovine rotavirus strains by introducing a recombinant plasmid containing a NSP4 mutant gene resulting in toxicity weakened recombinant bovine rotavirus, which may be used as attenuated vaccine candidate strains.
The U.S. Pat. No. 4,636,385 discloses a method for producing a live rotavirus vaccine against bovine and human rotaviruses by isolation a strain of live bovine rotavirus which does not hemagglutinate primate red blood cells.
The Korean patent application KR 20020061300 A describes the cloning of VP4 and VP7 genes of black goat rotavirus for the purpose of vaccine development.
The Japanese patent application JP 2000139473 A discloses a plant comprising plant cells into which a gene coding for VP6 protein of rotavirus was incorporated. The plant is described to be useful for the production of a vaccine against rotavirus.
Nevertheless, there is a still an urgent need for providing new and improved vaccines, which are particularly important for developing countries. Preferably, the new and improved vaccine should allow parenteral delivery and thus avoid efficacy reduction induced via oral vaccine delivery. Moreover, the new vaccine should allow cost-effective production. Furthermore, especially for the use in developing countries, there is an urgent need for a temperature stabile rotavirus vaccine which is not dependent on cooling (cold chain) for storage and distribution. Furthermore there is an urgent need of a rotavirus vaccine which is effective against different rotavirus serotypes.
This object is solved by the subject matter of the claims. Particularly, the object underlying the present invention is solved according to a first aspect by an mRNA sequence and by a composition as defined in the claims. According to further aspects of the invention the object is solved by a vaccine, a kit and by a method of treatment or prophylaxis of rotavirus infections as defined in the further claims.
For the sake of clarity and readability the following scientific background information and definitions are provided. Any technical features disclosed thereby can be part of each and every embodiment of the invention. Additional definitions and explanations are provided in the context of this disclosure.
Immune system: The immune system may protect organisms from infection. If a pathogen breaks through a physical barrier of an organism and enters this organism, the innate immune system provides an immediate, but non-specific response. If pathogens evade this innate response, vertebrates possess a second layer of protection, the adaptive immune system. Here, the immune system adapts its response during an infection to improve its recognition of the pathogen. This improved response is then retained after the pathogen has been eliminated, in the form of an immunological memory, and allows the adaptive immune system to mount faster and stronger attacks each time this pathogen is encountered. According to this, the immune system comprises the innate and the adaptive immune system. Each of these two parts contains so called humoral and cellular components.
Immune response: An immune response may typically either be a specific reaction of the adaptive immune system to a particular antigen (so called specific or adaptive immune response) or an unspecific reaction of the innate immune system (so called unspecific or innate immune response).
Adaptive immune system: The adaptive immune system is composed of highly specialized, systemic cells and processes that eliminate or prevent pathogenic growth. The adaptive immune response provides the vertebrate immune system with the ability to recognize and remember specific pathogens (to generate immunity), and to mount stronger attacks each time the pathogen is encountered. The system is highly adaptable because of somatic hypermutation (a process of increased frequency of somatic mutations), and V(D)J recombination (an irreversible genetic recombination of antigen receptor gene segments). This mechanism allows a small number of genes to generate a vast number of different antigen receptors, which are then uniquely expressed on each individual lymphocyte. Because the gene rearrangement leads to an irreversible change in the DNA of each cell, all of the progeny (offspring) of that cell will then inherit genes encoding for the same receptor specificity, including the Memory B cells and Memory T cells that are the keys to long-lived specific immunity. Immune network theory is a theory of how the adaptive immune system works, that is based on interactions between the variable regions of the receptors of T cells, B cells and of molecules made by T cells and B cells that have variable regions.
Adaptive immune response: The adaptive immune response is typically understood to be antigen-specific. Antigen specificity allows for the generation of responses that are tailored to specific antigens, pathogens or pathogen-infected cells. The ability to mount these tailored responses is maintained in the body by “memory cells”. Should a pathogen infect the body more than once, these specific memory cells are used to quickly eliminate it. In this context, the first step of an adaptive immune response is the activation of naïve antigen-specific T cells or different immune cells able to induce an antigen-specific immune response by antigen-presenting cells. This occurs in the lymphoid tissues and organs through which naive T cells are constantly passing. Cell types that can serve as antigen-presenting cells are inter alia dendritic cells, macrophages, and B cells. Each of these cells has a distinct function in eliciting immune responses. Dendritic cells take up antigens by phagocytosis and macropinocytosis and are stimulated by contact with e.g. a foreign antigen to migrate to the local lymphoid tissue, where they differentiate into mature dendritic cells. Macrophages ingest particulate antigens such as bacteria and are induced by infectious agents or other appropriate stimuli to express MHC molecules. The unique ability of B cells to bind and internalize soluble protein antigens via their receptors may also be important to induce T cells. Presenting the antigen on MHC molecules leads to activation of T cells which induces their proliferation and differentiation into armed effector T cells. The most important function of effector T cells is the killing of infected cells by CD8+ cytotoxic T cells and the activation of macrophages by Th1 cells which together make up cell-mediated immunity, and the activation of B cells by both Th2 and Th1 cells to produce different classes of antibody, thus driving the humoral immune response. T cells recognize an antigen by their T cell receptors which do not recognize and bind antigen directly, but instead recognize short peptide fragments e.g. of pathogen-derived protein antigens, which are bound to MHC molecules on the surfaces of other cells.
Cellular immunity/cellular immune response: Cellular immunity relates typically to the activation of macrophages, natural killer cells (NK), antigen-specific cytotoxic T-lymphocytes, and the release of various cytokines in response to an antigen. In a more general way, cellular immunity is not related to antibodies but to the activation of cells of the immune system. A cellular immune response is characterized e.g. by activating antigen-specific cytotoxic T-lymphocytes that are able to induce apoptosis in body cells displaying epitopes of an antigen on their surface, such as virus-infected cells, cells with intracellular bacteria, and cancer cells displaying tumor antigens; activating macrophages and natural killer cells, enabling them to destroy pathogens; and stimulating cells to secrete a variety of cytokines that influence the function of other cells involved in adaptive immune responses and innate immune responses.
Humoral immunity/humoral immune response: Humoral immunity refers typically to antibody production and the accessory processes that may accompany it. A humoral immune response may be typically characterized, e.g., by Th2 activation and cytokine production, germinal center formation and isotype switching, affinity maturation and memory cell generation. Humoral immunity also typically may refer to the effector functions of antibodies, which include pathogen and toxin neutralization, classical complement activation, and opsonin promotion of phagocytosis and pathogen elimination.
Innate immune system: The innate immune system, also known as non-specific immune system, comprises the cells and mechanisms that defend the host from infection by other organisms in a non-specific manner. This means that the cells of the innate system recognize and respond to pathogens in a generic way, but unlike the adaptive immune system, it does not confer long-lasting or protective immunity to the host. The innate immune system may be e.g. activated by ligands of pathogen-associated molecular patterns (PAMP) receptors, e.g. Toll-like receptors (TLRs) or other auxiliary substances such as lipopolysaccharides, TNF-alpha, CD40 ligand, or cytokines, monokines, lymphokines, interleukins or chemokines, immunostimulatory nucleic acids, immunostimulatory RNA (isRNA), CpG-DNA, antibacterial agents, or anti-viral agents. Typically a response of the innate immune system includes recruiting immune cells to sites of infection, through the production of chemical factors, including specialized chemical mediators, called cytokines; activation of the complement cascade; identification and removal of foreign substances present in organs, tissues, the blood and lymph, by specialized white blood cells; activation of the adaptive immune system through a process known as antigen presentation; and/or acting as a physical and chemical barrier to infectious agents.
Adjuvant/adjuvant component: An adjuvant or an adjuvant component in the broadest sense is typically a (e.g. pharmacological or immunological) agent or composition that may modify, e.g. enhance, the efficacy of other agents, such as a drug or vaccine. Conventionally the term refers in the context of the invention to a compound or composition that serves as a carrier or auxiliary substance for immunogens and/or other pharmaceutically active compounds. It is to be interpreted in a broad sense and refers to a broad spectrum of substances that are able to increase the immunogenicity of antigens incorporated into or co-administered with an adjuvant in question. In the context of the present invention an adjuvant will preferably enhance the specific immunogenic effect of the active agents of the present invention. Typically, “adjuvant” or “adjuvant component” has the same meaning and can be used mutually. Adjuvants may be divided, e.g., into immuno potentiators, antigenic delivery systems or even combinations thereof. The term “adjuvant” is typically understood not to comprise agents which confer immunity by themselves. An adjuvant assists the immune system unspecifically to enhance the antigen-specific immune response by e.g. promoting presentation of an antigen to the immune system or induction of an unspecific innate immune response. Furthermore, an adjuvant may preferably e.g. modulate the antigen-specific immune response by e.g. shifting the dominating Th2-based antigen specific response to a more Th1-based antigen specific response or vice versa. Accordingly, an adjuvant may favourably modulate cytokine expression/secretion, antigen presentation, type of immune response etc.
Antigen: The term “antigen” refers typically to a substance which may be recognized by the immune system and may be capable of triggering an antigen-specific immune response, e.g. by formation of antibodies or antigen-specific T-cells as part of an adaptive immune response. An antigen may be a protein or peptide. In this context, the first step of an adaptive immune response is the activation of naïve antigen-specific T cells by antigen-presenting cells. This occurs in the lymphoid tissues and organs through which naïve T cells are constantly passing. The three cell types that can serve as antigen-presenting cells are dendritic cells, macrophages, and B cells. Each of these cells has a distinct function in eliciting immune responses. Tissue dendritic cells take up antigens by phagocytosis and macropinocytosis and are stimulated by infection to migrate to the local lymphoid tissue, where they differentiate into mature dendritic cells. Macrophages ingest particulate antigens such as bacteria and are induced by infectious agents to express MHC class II molecules. The unique ability of B cells to bind and internalize soluble protein antigens via their receptors may be important to induce T cells. By presenting the antigen on MHC molecules leads to activation of T cells which induces their proliferation and differentiation into armed effector T cells. The most important function of effector T cells is the killing of infected cells by CD8+ cytotoxic T cells and the activation of macrophages by Th1 cells which together make up cell-mediated immunity, and the activation of B cells by both Th2 and Th1 cells to produce different classes of antibody, thus driving the humoral immune response. T cells recognize an antigen by their T cell receptors which does not recognize and bind antigen directly, but instead recognize short peptide fragments e.g. of pathogens' protein antigens, which are bound to MHC molecules on the surfaces of other cells.
T cells: T cells fall into two major classes that have different effector functions. The two classes are distinguished by the expression of the cell-surface proteins CD4 and CD8. These two types of T cells differ in the class of MHC molecule that they recognize. There are two classes of MHC molecules—MHC class I and MHC class II molecules—which differ in their structure and expression pattern on tissues of the body. CD4+ T cells bind to a MHC class II molecule and CD8+ T cells to a MHC class I molecule. MHC class I and MHC class II molecules have distinct distributions among cells that reflect the different effector functions of the T cells that recognize them. MHC class I molecules present peptides of cytosolic and nuclear origin e.g. from pathogens, commonly viruses, to CD8+ T cells, which differentiate into cytotoxic T cells that are specialized to kill any cell that they specifically recognize. Almost all cells express MHC class I molecules, although the level of constitutive expression varies from one cell type to the next. But not only pathogenic peptides from viruses are presented by MHC class I molecules, also self-antigens like tumor antigens are presented by them. MHC class I molecules bind peptides from proteins degraded in the cytosol and transported in the endoplasmic reticulum. The CD8+ T cells that recognize MHC class I:peptide complexes at the surface of infected cells are specialized to kill any cells displaying foreign peptides and so rid the body of cells infected with viruses and other cytosolic pathogens. The main function of CD4+ T cells (CD4+ helper T cells) that recognize MHC class II molecules is to activate other effector cells of the immune system. Thus MHC class II molecules are normally found on B lymphocytes, dendritic cells, and macrophages, cells that participate in immune responses, but not on other tissue cells. Macrophages, for example, are activated to kill the intravesicular pathogens they harbour, and B cells to secrete immunoglobulins against foreign molecules. MHC class II molecules are prevented from binding to peptides in the endoplasmic reticulum and thus MHC class II molecules bind peptides from proteins which are degraded in endosomes. They can capture peptides from pathogens that have entered the vesicular system of macrophages, or from antigens internalized by immature dendritic cells or the immunoglobulin receptors of B cells. Pathogens that accumulate in large numbers inside macrophage and dendritic cell vesicles tend to stimulate the differentiation of Th1 cells, whereas extracellular antigens tend to stimulate the production of Th2 cells. Th1 cells activate the microbicidal properties of macrophages and induce B cells to make IgG antibodies that are very effective of opsonising extracellular pathogens for ingestion by phagocytic cells, whereas Th2 cells initiate the humoral response by activating naïve B cells to secrete IgM, and induce the production of weakly opsonising antibodies such as IgG1 and IgG3 (mouse) and IgG2 and IgG4 (human) as well as IgA and IgE (mouse and human).
Epitope (also called “antigen determinant”): An “epitope” is a portion of an antigen that is recognized by the immune system (e.g., by an antibody, an immunoglobulin receptor, a B cell receptor or a T cell receptor). An epitope can be linear or conformational. Commonly, an epitope is a polypeptide or polysaccharide in a naturally occurring antigen. T cell epitopes may comprise fragments preferably having a length of about 6 to about 20 or even more amino acids, e.g. fragments as processed and presented by MHC class I molecules, preferably having a length of about 8 to about 10 amino acids, e.g. 8, 9, or 10, (or even 11, or 12 amino acids), or fragments as processed and presented by MHC class II molecules, preferably having a length of about 13 or more amino acids, e.g. 13, 14, 15, 16, 17, 18, 19, 20 or even more amino acids, wherein these fragments may be selected from any part of the amino acid sequence. These fragments are typically recognized by T cells in form of a complex consisting of the peptide fragment and an MHC molecule. B cell epitopes are typically fragments located on the outer surface of (native) protein or peptide antigens. Accordingly, a B-cell epitope is preferably exposed on the surface of the antigen or pathogen, and can be linear or conformational. Normally, a B-cell epitope will include at least about 5 amino acids but can be as small as 3-4 amino acids.
Vaccine: A vaccine is typically understood to be a prophylactic or therapeutic material providing at least one epitope, one antigen or one antigenic function. The epitope, antigen or antigenic function may stimulate the body's adaptive immune system to provide an adaptive immune response.
Antigen-providing mRNA: An antigen-providing mRNA or an epitope-providing mRNA may typically be an mRNA, having at least one open reading frame that can be translated by a cell or an organism provided with that mRNA. The product of this translation is a peptide or protein that may act as an antigen, preferably as an immunogen. The product may also be a fusion protein composed of more than one immunogen, e.g. a fusion protein that consist of two or more epitopes, peptides or proteins, wherein the epitopes, peptides or proteins may be linked by linker sequences.
5′-CAP-Structure: A 5′-CAP is typically a modified nucleotide, particularly a guanine nucleotide, added to the 5′ end of an mRNA molecule. Preferably, the 5′-CAP is added using a 5′-5′-triphosphate linkage (also named m7GpppN). Further examples of 5′-CAP structures include glyceryl, inverted deoxy abasic residue (moiety), 4′,5′ methylene nucleotide, 1-(beta-D-erythrofuranosyl) nucleotide, 4′-thio nucleotide, carbocyclic nucleotide, 1,5-anhydrohexitol nucleotide, L-nucleotides, alpha-nucleotide, modified base nucleotide, threo-pentofuranosyl nucleotide, acyclic 3′,4′-seco nucleotide, acyclic 3,4-dihydroxybutyl nucleotide, acyclic 3,5 dihydroxypentyl nucleotide, 3′-3′-inverted nucleotide moiety, 3′-3′-inverted abasic moiety, 3′-2′-inverted nucleotide moiety, 3′-2′-inverted abasic moiety, 1,4-butanediol phosphate, 3′-phosphoramidate, hexylphosphate, aminohexyl phosphate, 3′-phosphate, 3′phosphorothioate, phosphorodithioate, or bridging or non-bridging methylphosphonate moiety. These modified 5′-CAP structures may be used in the context of the present invention to modify the mRNA sequence. Further modified 5′-CAP structures which may be used in the context of the present invention are CAP1 (methylation of the ribose of the adjacent nucleotide of m7GpppN), CAP2 (methylation of the ribose of the 2nd nucleotide downstream of the m7GpppN), CAP3 (methylation of the ribose of the 3rd nucleotide downstream of the m7GpppN), CAP4 (methylation of the ribose of the 4th nucleotide downstream of the m7GpppN), ARCA (anti-reverse CAP analogue), modified ARCA (e.g. phosphothioate modified ARCA), inosine, N1-methyl-guanosine, 2′-fluoro-guanosine, 7-deaza-guanosine, 8-oxo-guanosine, 2-amino-guanosine, LNA-guanosine, and 2-azido-guanosine.
Nucleic acid: The term nucleic acid means any DNA or RNA molecule and is used synonymous with polynucleotide. Wherever herein reference is made to a nucleic acid or nucleic acid sequence encoding for a particular protein and/or peptide, said nucleic acid or nucleic acid sequence, respectively, preferably also comprises regulatory sequences allowing in a suitable host, e.g. a human being, its expression, i.e. transcription and/or translation of the nucleic acid sequence encoding for the particular protein or peptide.
Peptide: A peptide is a polymer of amino acid monomers. Usually the monomers are linked by peptide bonds. The term “peptide” does not limit the length of the polymer chain of amino acids. In some embodiments of the present invention a peptide may for example contain less than 50 monomer units. Longer peptides are also called polypeptides, typically having 50 to 600 monomeric units, more specifically 50 to 300 monomeric units.
Pharmaceutically effective amount: A pharmaceutically effective amount in the context of the invention is typically understood to be an amount that is sufficient and/or efficient to induce an immune response.
Protein: A protein typically consists of one or more peptides and/or polypeptides folded into 3-dimensional form, facilitating a biological function.
Poly(C) sequence: A poly(C) sequence is typically a long sequence of cytosine nucleotides, typically about 10 to about 200 cytosine nucleotides, preferably about 10 to about 100 cytosine nucleotides, more preferably about 10 to about 70 cytosine nucleotides or even more, preferably about 20 to about 50, or even about 20 to about 30 cytosine nucleotides. A poly(C) sequence may preferably be located 3′ of the coding region comprised by a nucleic acid.
Poly(A) tail: A poly(A) tail also called “3′-poly(A) tail” is typically a long sequence of adenosine nucleotides of up to about 400 adenosine nucleotides, e.g. from about 25 to about 400, preferably from about 50 to about 400, more preferably from about 50 to about 300, even more preferably from about 50 to about 250, most preferably from about 60 to about 250 adenosine nucleotides, added to the 3′ end of a RNA.
Stabilized nucleic acid: A stabilized nucleic acid, typically, exhibits a modification increasing resistance to in vivo degradation (e.g. degradation by an exo- or endo-nuclease) and/or ex vivo degradation (e.g. by the manufacturing process prior to vaccine administration, e.g. in the course of the preparation of the vaccine solution to be administered). Stabilization of RNA can, e.g., be achieved by providing a 5′-CAP-Structure, a poly(A) tail, or any other UTR-modification. It can also be achieved by backbone-modification or modification of the G/C content of the nucleic acid. Various other methods are known in the art and conceivable in the context of the invention.
Carrier/polymeric carrier: A carrier in the context of the invention may typically be a compound that facilitates transport and/or complexation of another compound. Said carrier may form a complex with said other compound. A polymeric carrier is a carrier that is formed of a polymer.
Cationic component: The term “cationic component” typically refers to a charged molecule, which is positively charged (cation) at a pH value of typically about 1 to 9, preferably of a pH value of or below 9 (e.g. 5 to 9), of or below 8 (e.g. 5 to 8), of or below 7 (e.g. 5 to 7), most preferably at physiological pH values, e.g. about 7.3 to 7.4. Accordingly, a cationic peptide, protein or polymer according to the present invention is positively charged under physiological conditions, particularly under physiological salt conditions of the cell in vivo. A cationic peptide or protein preferably contains a larger number of cationic amino acids, e.g. a larger number of Arg, His, Lys or Orn than other amino acid residues (in particular more cationic amino acids than anionic amino acid residues like Asp or Glu) or contains blocks predominantly formed by cationic amino acid residues. The definition “cationic” may also refer to “polycationic” components.
3′-untranslated region (3′-UTR): A 3′-UTR is typically the part of an mRNA which is located between the protein coding region (i.e. the open reading frame) and the poly(A) sequence of the mRNA. A 3′-UTR of the mRNA is not translated into an amino acid sequence. The 3′-UTR sequence is generally encoded by the gene which is transcribed into the respective mRNA during the gene expression process. The genomic sequence is first transcribed into pre-mature mRNA, which comprises optional introns. The pre-mature mRNA is then further processed into mature mRNA in a maturation process. This maturation process comprises the steps of 5′-capping, splicing the pre-mature mRNA to excise optional introns and modifications of the 3′-end, such as polyadenylation of the 3′-end of the pre-mature mRNA and optional endo- or exonuclease cleavages etc. In the context of the present invention, a 3′-UTR corresponds to the sequence of a mature mRNA which is located 3′ to the stop codon of the protein coding region, preferably immediately 3′ to the stop codon of the protein coding region, and which extends to the 5′-side of the poly(A) sequence, preferably to the nucleotide immediately 5′ to the poly(A) sequence. The term “corresponds to” means that the 3′-UTR sequence may be an RNA sequence, such as in the mRNA sequence used for defining the 3′-UTR sequence, or a DNA sequence which corresponds to such RNA sequence. In the context of the present invention, the term “a 3′-UTR of a gene”, such as “a 3′-UTR of an albumin gene”, is the sequence which corresponds to the 3′-UTR of the mature mRNA derived from this gene, i.e. the mRNA obtained by transcription of the gene and maturation of the pre-mature mRNA. The term “3′-UTR of a gene” encompasses the DNA sequence and the RNA sequence of the 3′-UTR.
5′-untranslated region (5′-UTR): A 5′-UTR is typically understood to be a particular section of messenger RNA (mRNA). It is located 5′ of the open reading frame of the mRNA. Typically, the 5′-UTR starts with the transcriptional start site and ends one nucleotide before the start codon of the open reading frame. The 5′-UTR may comprise elements for controlling gene expression, also called regulatory elements. Such regulatory elements may be, for example, ribosomal binding sites or a 5′-Terminal Oligopyrimidine Tract. The 5′-UTR may be posttranscriptionally modified, for example by addition of a 5′-CAP. In the context of the present invention, a 5′UTR corresponds to the sequence of a mature mRNA which is located between the 5′-CAP and the start codon. Preferably, the 5′-UTR corresponds to the sequence which extends from a nucleotide located 3′ to the 5′-CAP, preferably from the nucleotide located immediately 3′ to the 5′-CAP, to a nucleotide located 5′ to the start codon of the protein coding region, preferably to the nucleotide located immediately 5′ to the start codon of the protein coding region. The nucleotide located immediately 3′ to the 5′-CAP of a mature mRNA typically corresponds to the transcriptional start site. The term “corresponds to” means that the 5′-UTR sequence may be an RNA sequence, such as in the mRNA sequence used for defining the 5′-UTR sequence, or a DNA sequence which corresponds to such RNA sequence. In the context of the present invention, the term “a 5′-UTR of a gene”, such as “a 5′-UTR of a TOP gene”, is the sequence which corresponds to the 5′-UTR of the mature mRNA derived from this gene, i.e. the mRNA obtained by transcription of the gene and maturation of the pre-mature mRNA. The term “5′-UTR of a gene” encompasses the DNA sequence and the RNA sequence of the 5′-UTR.
5′ Terminal Oligopyrimidine Tract (TOP): The 5′ terminal oligopyrimidine tract (TOP) is typically a stretch of pyrimidine nucleotides located at the 5′ terminal region of a nucleic acid molecule, such as the 5′ terminal region of certain mRNA molecules or the 5′ terminal region of a functional entity, e.g. the transcribed region, of certain genes. The sequence starts with a cytidine, which usually corresponds to the transcriptional start site, and is followed by a stretch of usually about 3 to 30 pyrimidine nucleotides. For example, the TOP may comprise 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or even more nucleotides. The pyrimidine stretch and thus the 5′ TOP ends one nucleotide 5′ to the first purine nucleotide located downstream of the TOP. mRNA that contains a 5′ terminal oligopyrimidine tract is often referred to as TOP mRNA. Accordingly, genes that provide such messenger RNAs are referred to as TOP genes. TOP sequences have, for example, been found in genes and mRNAs encoding peptide elongation factors and ribosomal proteins.
TOP motif: In the context of the present invention, a TOP motif is a nucleic acid sequence which corresponds to a 5′ TOP as defined above. Thus, a TOP motif in the context of the present invention is preferably a stretch of pyrimidine nucleotides having a length of 3-30 nucleotides. Preferably, the TOP-motif consists of at least 3 pyrimidine nucleotides, preferably at least 4 pyrimidine nucleotides, preferably at least 5 pyrimidine nucleotides, more preferably at least 6 nucleotides, more preferably at least 7 nucleotides, most preferably at least 8 pyrimidine nucleotides, wherein the stretch of pyrimidine nucleotides preferably starts at its 5′ end with a cytosine nucleotide. In TOP genes and TOP mRNAs, the TOP-motif preferably starts at its 5′ end with the transcriptional start site and ends one nucleotide 5′ to the first purine residue in said gene or mRNA. A TOP motif in the sense of the present invention is preferably located at the 5′end of a sequence which represents a 5′-UTR or at the 5′ end of a sequence which codes for a 5′-UTR. Thus, preferably, a stretch of 3 or more pyrimidine nucleotides is called “TOP motif” in the sense of the present invention if this stretch is located at the 5′ end of a respective sequence, such as the m RNA, the 5′-UTR element of the mRNA, or the nucleic acid sequence which is derived from the 5′-UTR of a TOP gene as described herein. In other words, a stretch of 3 or more pyrimidine nucleotides which is not located at the 5′-end of a 5′-UTR or a 5′-UTR element but anywhere within a 5′-UTR or a 5′-UTR element is preferably not referred to as “TOP motif”.
TOP gene: TOP genes are typically characterised by the presence of a 5′ terminal oligopyrimidine tract. Furthermore, most TOP genes are characterized by a growth-associated translational regulation. However, also TOP genes with a tissue specific translational regulation are known. As defined above, the 5′-UTR of a TOP gene corresponds to the sequence of a 5′-UTR of a mature mRNA derived from a TOP gene, which preferably extends from the nucleotide located 3′ to the 5′-CAP to the nucleotide located 5′ to the start codon. A 5′-UTR of a TOP gene typically does not comprise any start codons, preferably no upstream AUGs (uAUGs) or upstream open reading frames (uORFs). Therein, upstream AUGs and upstream open reading frames are typically understood to be AUGs and open reading frames that occur 5′ of the start codon (AUG) of the open reading frame that should be translated. The 5′-UTRs of TOP genes are generally rather short. The lengths of 5′-UTRs of TOP genes may vary between 20 nucleotides up to 500 nucleotides, and are typically less than about 200 nucleotides, preferably less than about 150 nucleotides, more preferably less than about 100 nucleotides. Exemplary 5′-UTRs of TOP genes in the sense of the present invention are the nucleic acid sequences extending from the nucleotide at position 5 to the nucleotide located immediately 5′ to the start codon (e.g. the ATG) in the sequences according to SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the international patent application WO2013/143700 or homologs or variants thereof, whose disclosure is incorporated herewith by reference. In this context a particularly preferred fragment of a 5′UTR of a TOP gene is a 5′-UTR of a TOP gene lacking the 5′ TOP motif. The term “5′UTR of a TOP gene” preferably refers to the 5′-UTR of a naturally occurring TOP gene.
Fragment of a nucleic acid sequence, particularly an mRNA: A fragment of a nucleic acid sequence consists of a continuous stretch of nucleotides corresponding to a continuous stretch of nucleotides in the full-length nucleic acid sequence which is the basis for the nucleic acid sequence of the fragment, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, even more preferably at least 80%, and most preferably at least 90% of the full-length nucleic acid sequence. Such a fragment, in the sense of the present invention, is preferably a functional fragment of the full-length nucleic acid sequence.
Identity of a sequence: In order to determine the percentage to which two sequences are identical, e.g. nucleic acid sequences or amino acid sequences as defined herein, preferably the amino acid sequences encoded by a nucleic acid sequence as defined herein or the amino acid sequences themselves, the sequences can be aligned in order to be subsequently compared to one another. Therefore, e.g. a position of a first sequence may be compared with the corresponding position of the second sequence. If a position in the first sequence is occupied by the same component (residue) as is the case at a position in the second sequence, the two sequences are identical at this position. If this is not the case, the sequences differ at this position. If insertions occur in the second sequence in comparison to the first sequence, gaps can be inserted into the first sequence to allow a further alignment. If deletions occur in the second sequence in comparison to the first sequence, gaps can be inserted into the second sequence to allow a further alignment. The percentage to which two sequences are identical is then a function of the number of identical positions divided by the total number of positions including those positions which are only occupied in one sequence. The percentage to which two sequences are identical can be determined using a mathematical algorithm. A preferred, but not limiting, example of a mathematical algorithm which can be used is the algorithm of Karlin et al. (1993), PNAS USA, 90:5873-5877 or Altschul et al. (1997), Nucleic Acids Res., 25:3389-3402. Such an algorithm is integrated in the BLAST program. Sequences which are identical to the sequences of the present invention to a certain extent can be identified by this program.
Variant of a nucleic acid sequence, particularly an mRNA: A variant of a nucleic acid sequence refers to a variant of nucleic acid sequences which forms the basis of a nucleic acid sequence. For example, a variant nucleic acid sequence may exhibit one or more nucleotide deletions, insertions, additions and/or substitutions compared to the nucleic acid sequence from which the variant is derived. Preferably, a variant of a nucleic acid sequence is at least 40%, preferably at least 50%, more preferably at least 60%, more preferably at least 70%, even more preferably at least 80%, even more preferably at least 90%, most preferably at least 95% identical to the nucleic acid sequence the variant is derived from. Preferably, the variant is a functional variant. A “variant” of a nucleic acid sequence may have at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% nucleotide identity over a stretch of 10, 20, 30, 50, 75 or 100 nucleotide of such nucleic acid sequence.
The mRNA sequence according to the invention comprises at least one coding region, encoding at least one epitope of a protein, or of a fragment, variant or derivative thereof, of a virus of the genus rotavirus. This mRNA sequence may form the basis for an mRNA based vaccine which is a subunit vaccine. Such a subunit vaccine is particularly advantageous in treatment and especially prophylaxis of rotavirus infections. The design as subunit vaccine is especially advantageous since a subunit vaccine is significantly less risky in terms of side effects associated with live vaccines in general. The vaccine based on the inventive mRNA allows parenteral delivery that is not affected by possible efficacy reductions which may occur via the oral route. Generally, protein-based vaccines, as they are known in the prior art, are suboptimal in developing countries due to their high production costs. In contrast, the mRNA-based vaccines according to the present invention allow very cost-effective production. Therefore, in comparison with known vaccines the vaccine based on the inventive mRNA can be produced significantly cheaper, which is very advantageous particularly for use in developing countries. One further advantage of a vaccine based on the inventive mRNA may be its temperature-stable nature in comparison with the life oral rotavirus vaccines available or with other protein or peptide-based vaccine compositions.
Because of these advantages the vaccine based on the present invention may be used especially for widely used prophylaxis or treatment of rotavirus infections particularly in developing countries, where rotavirus infections are a severe problem.
According to the invention the mRNA sequence comprises at least one coding region encoding at least one epitope of a protein or of a fragment, variant or derivative thereof of a virus of the genus rotavirus. In this context the rotavirus species may be selected form the species A, B, C, D, E, F, G or H, wherein it is particularly preferred that the rotavirus is selected from species A or B or C. Species A, B and C are known to infect humans and various animals. In an especially preferred embodiment the selected rotavirus species is rotavirus A (RVA) which is particularly important for human infections.
The rotavirus may be selected from any one of the following G-serotypes and P-serotypes: G1, G2, G3, G4, G5, G6, G7, G8, G9, G10, G11, G12, G13, G14, G15, G16, G17, G18, G19, G20, G21, G22, G23, G24, G25, G26, G27, P[1], P[2], P[3], P[4], P[5], P[6], P[7], P[8], P[9], P[10], P[11], P[12], P[13], P[14], P[15], P[16], P[17], P[18], P[19], P[20], P[21], P[22], P[23], P[24], P[25], P[26], P[27], P[28], P[29], P[30], P[31], P[32], P[33], P[34], or P[35]. In this context the serotypes G1, G2, G3, G4, G9, G12, P[4], P[6] or P[8] are particularly preferred.
Preferably, the inventive mRNA sequence encodes at least one epitope of a protein, or of a fragment, variant or derivative thereof of a virus of the genus rotavirus, which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical with the wild type protein or fragment thereof, respectively the wild type epitope. The percentage of identities may refer to the identity of amino acid sequences of the protein or to the identity of nucleotide sequences of the mRNA.
The term wild type is to be understood according to the skilled person's general understanding in the art and denotes the protein, the fragment thereof or the epitope in the form of its occurrence in nature without any mutation or amino acid amendment by man. In the context of the invention the term wild type protein corresponds to the protein of the respective isolated rotavirus.
In other embodiments the amino acid sequence of the encoded protein, the fragment thereof or the epitope may be mutated. For example it may be particularly preferred that glycosylation sites in the encoded amino acid sequence are mutated which means that encoded amino acids which may be glycosylated after translation in vivo are exchanged to a different amino acid. Preferably, codons encoding asparagine which are predicted to be glycosylated (N glycosylation sites) are mutated to encode glutamine.
Preferably, the protein of a virus of the genus rotavirus is selected from a structural protein of rotavirus, especially from VP1 (preferably according to SEQ ID NO: 1) or VP2 (preferably according to SEQ ID NO: 2) or VP3 (preferably according to SEQ ID NO: 3) or VP4 (preferably according to SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, SEQ ID NO: 9, SEQ ID NO: 10, SEQ ID NO: 11, SEQ ID NO: 12, SEQ ID NO: 13 or SEQ ID NO: 14; SEQ ID Nos: 42-44; 46; 48; 5076) or VP6 (preferably according to SEQ ID NO: 15) or VP7 (preferably according to SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO: 20, SEQ ID NO: 21, SEQ ID NO: 22, SEQ ID NO: 23, SEQ ID NO: 24, SEQ ID NO: 25 or SEQ ID NO: 26). Moreover, it is possible, that the protein is selected from a non-structural protein of rotavirus, especially from NSP1 (preferably according to SEQ ID NO: 27) or NSP2 (preferably according to SEQ ID NO: 28) or NSP3 (preferably according to SEQ ID NO: 29) or NSP4 (preferably according to SEQ ID NO: 30, SEQ ID NO: 31, SEQ ID NO: 32, SEQ ID NO: 33, SEQ ID NO:34, SEQ ID NO: 35, SEQ ID NO: 36 or SEQ ID NO: 37) or NSP5 (preferably according to SEQ ID NO: 38) or NSP6 (preferably according to SEQ ID NO: 39).
It is especially preferred that the protein is selected from VP4 or VP7 or a cleavage product of VP4. These proteins are particularly preferred because they are the components of the outermost protein layer of the rotavirus which may be especially relevant for an immune response. In an especially preferred embodiment the protein derives from a cleavage product of VP4, preferably VP5* (preferably according to SEQ ID NO: 40) or VP8* (preferably according to SEQ ID NO: 41, SEQ ID NO: 45, SEQ ID NO: 47 or SEQ ID NO: 49, wherein VP8* is particularly preferred. Although VP5* and VP8* are cleavage products of the protein VP4 they are nevertheless referred to as proteins. Moreover, the proteins VP2, VP6, NSP2 and/or NSP4 may be especially preferred in other embodiments. The amino acid sequences as shown in the sequence listing derive from different rotavirus A strains, in particular from Human rotavirus A Ecu534, Human rotavirus A RVA/Human_wt/VNM/30378/2009/G26P[19], Human rotavirus A RVA/Human-tc/IDN/57M/1980/G4P[10], Human rotavirus A RVA/Human-wt/BEL/BE1058/2008/G2P[4], Human rotavirus A RVA/Human-wt/BEL/BE1128/2009/G1P[8], Human rotavirus A RVA/Vaccine/USA/Rotarix-A41CB052A/1988/G1P1A[8], Human rotavirus A strain CMH222, Human rotavirus A Wa variant VirWa, Human rotavirus HMG035, Lamb rotavirus Lp14, Murine rotavirus EB, Murine rotavirus EHP, Porcine rotavirus A 61/07/Ire, Porcine rotavirus A strain 134/04-15, Porcine rotavirus A34, Porcine rotavirus A46, Rabbit rotavirus 160/01, Rotavirus A 344-04-1, Rotavirus A Ch-02V0002G3, Rotavirus A Ch-06V0661, Rotavirus A FGP51, Rotavirus A Hu/BEL/F01322/2009/G3P[6], Rotavirus A human-wt/ITA/ME848/12/2012/G12P9, Rotavirus A RVA/Cow-wt/ARG/791_BA/1999/G10P[5], Rotavirus A RVA/Cow-wt/ARG/B1191_B_BA/2000/G10P[11], Rotavirus A RVA/Human-TC/USA/Rotarix/2009/G1P[8], Rotavirus A RVA/Human-wt/BEL/B4106/2000/G3P[14], Rotavirus A RVA/Human-wt/Bel/BE00112/2009/G1P[8], Rotavirus A RVA/Human-wt/BGD/Dhaka6/2001/G11P[25], Rotavirus A RVA/Vaccine/USA/RotaTeq-BrB-9/1996/G4P7[5], Rotavirus A RVA/Vaccine/USA/RotaTeq-SC2-9/1992/G2P7[5], Rotavirus A RVA/Vaccine/USA/RotaTeq-W178-8/1992/G3P7[5], Rotavirus A RVA/Vaccine/USA/RotaTeq-W179-4/1992/G6P1A[8], Rotavirus A RVA/Vaccine/USA/RotaTeq-W179-9/1992/G1P7[5], Rotavirus A Tu-03V0001E10, Rotavirus A turkey-tc/GER/03V0002E10/2003/G22P[35], Rotavirus strain TUCH, Bovine rotavirus A AzuK-1, Bovine rotavirus A Dai-10, Bovine rotavirus H1, Bovine Rotavirus Strain Hg18, Bovine rotavirus WC3, Equine rotavirus H2, Equine rotavirus L338. Nevertheless, the proteins may also derive from all other strains with the following Taxonomy ID or taxID: 1699092, 1637496, 1604860, 1560372, 1537146, 1496075, 1481391, 1461314, 1461313, 1461312, 1461311, 1461310, 1461309, 1461308, 1461307, 1461306, 1461305, 1461304, 1461303, 1461302, 1461301, 1461300, 1461299, 1461298, 1461297, 1461296, 1461295, 1461294, 1461293, 1461292, 1461291, 1461290, 1461289, 1461288, 1461287, 1461286, 1461285, 1460005, 1459051, 1459048, 1459047, 1459046, 1459045, 1454907, 1454906, 1454905, 1454904, 1454903, 1454902, 1454901, 1454900, 1454899, 1454898, 1454897, 1454896, 1454895, 1454894, 1454893, 1454892, 1454891, 1454890, 1454889, 1454888, 1454887, 1454886, 1454885, 1454884, 1454883, 1452758, 1452757, 1424309, 1420855, 1417760, 1399970, 1386847, 1349389, 1348384, 1340801, 1337059, 1327767, 1307516, 1307515, 1307514, 1307513, 1307512, 1307511, 1307510, 1307509, 1307508, 1307507, 1307506, 1307505, 1307170, 1307169, 1307168, 1307167, 1307166, 1307165, 1307164, 1307163, 1307162, 1307161, 1304751, 1302703, 1302702, 1302701, 1302700, 1302699, 1302698, 1271535, 1271534, 1265591, 1260938, 1231331, 1211022, 1211021, 1211020, 1211019, 1211018, 1211017, 1211016, 1204231, 1204230, 1193399, 1193398, 1193397, 1193396, 1193395, 1193394, 1193393, 1193392, 1193391, 1193390, 1193389, 1193388, 1193387, 1193386, 1187998, 1187994, 1184798, 1184797, 1184796, 1184795, 1184792, 1184791, 1184790, 1184789, 1184788, 1184787, 1184786, 1184785, 1184784, 1184783, 1184782, 1184781, 1184780, 1184779, 1184778, 1184777, 1184776, 1184775, 1184774, 1184773, 1184772, 1184771, 1184770, 1184769, 1184768, 1182925, 1182528, 1182527, 1182526, 1182525, 1182524, 1182523, 1182522, 1179682, 1179680, 1179678, 1169021, 1169020, 1169019, 1169018, 1169017, 1169016, 1169015, 1169014, 1169013, 1169012, 1169011, 1169010, 1169009, 1169008, 1168279, 1168278, 1168277, 1168276, 1168275, 1168274, 1168273, 1168272, 1168271, 1168270, 1168269, 1168268, 1168267, 1168266, 1168265, 1168264, 1168263, 1168262, 1168261, 1168260, 1168259, 1168258, 1168257, 1168256, 1168255, 1168254, 1168253, 1168252, 1168251, 1168250, 1168249, 1168248, 1168247, 1168246, 1168245, 1159003, 1159002, 1159001, 1159000, 1148771, 1146936, 1146935, 1146934, 1133028, 1133027, 1133026, 1133025, 1133024, 1133023, 1129884, 1129883, 1129882, 1129881, 1129880, 1129879, 1129878, 1129877, 1129876, 1129875, 1129874, 1127364, 1127363, 1127362, 1127361, 1127360, 1127359, 1127358, 1094389, 1094388, 1094387, 1094386, 1092563, 1092562, 1092561, 1077213, 1077212, 1077211, 1077210, 1077209, 1077208, 1074901, 1074900, 1074899, 1074898, 1073965, 1068803, 1056490, 1056489, 1055980, 1052890, 1052889, 1052888, 1049161, 1046565, 1004946, 1004945, 1004944, 1004943, 1004942, 1004941, 1004888, 1004887, 1004886, 1004885, 1004884, 1004883, 1004882, 1004881, 1004880, 1004879, 1004878, 1004877, 1004876, 1004875, 1004874, 1004873, 1004872, 1004871, 1004870, 1004869, 1004868, 1004867, 1004866, 1004865, 1004864, 1004863, 1004862, 1004861, 1004860, 1004859, 1004858, 1004857, 1004856, 1004855, 1004854, 1004853, 1004852, 1004851, 1004850, 1004849, 1004848, 1004847, 1004846, 1004845, 1004844, 1004843, 1004842, 1004841, 1004840, 1004839, 1004838, 1004837, 1004835, 1004834, 1004833, 1004832, 1004831, 1004830, 1004829, 1004828, 1004827, 1004826, 1004825, 1004824, 1004823, 1004822, 1004821, 1004820, 1004819, 1004818, 1004817, 1004816, 1004815, 1004814, 1004813, 1004812, 1004811, 1004810, 1004809, 1004808, 1004807, 1004806, 1004805, 1004804, 1004803, 1004802, 1004801, 1004800, 1004799, 1004798, 1004797, 1004796, 1004795, 1004794, 1004793, 999467, 999466, 994995, 994994, 993362, 984291, 984290, 984289, 984288, 984287, 984286, 984285, 984284, 984283, 981020, 981019, 981018, 981017, 980178, 980177, 980176, 980175, 980174, 980173, 980172, 980094, 980093, 980092, 980091, 980090, 948991, 948990, 948989, 931249, 915344, 910402, 909654, 909653, 909652, 909651, 909650, 909649, 909648, 909647, 909646, 909645, 909644, 909643, 909642, 909641, 909640, 909639, 909638, 909637, 909636, 909635, 909634, 909633, 909632, 909631, 909630, 909629, 905892, 905891, 905890, 905889, 905888, 905887, 905886, 905885, 905884, 905883, 905882, 905881, 905880, 905879, 905878, 905877, 905876, 905875, 905874, 905873, 905872, 905871, 905870, 905869, 905868, 905867, 905866, 905865, 905864, 905863, 905862, 905861, 905860, 905859, 905858, 905857, 905856, 905855, 905854, 905853, 905852, 905851, 905850, 889507, 889506, 889505, 889098, 889097, 889096, 889095, 889091, 889090, 889089, 889087, 889086, 889085, 884200, 882810, 882809, 882808, 882807, 882806, 882805, 882804, 882803, 879654, 879553, 879552, 879551, 879550, 879549, 864624, 864623, 862183, 862182, 862181, 862180, 862179, 862178, 862177, 862176, 862175, 862174, 862173, 862172, 862171, 862170, 862169, 862168, 862167, 861475, 861474, 861473, 861472, 861471, 861470, 861469, 861468, 861467, 861466, 861465, 861464, 861463, 861462, 861461, 861460, 861458, 758915, 758914, 758913, 758912, 758911, 758910, 758909, 758908, 758907, 758906, 758905, 758904, 758903, 758902, 758901, 758900, 758899, 758898, 758897, 758896, 758895, 758894, 758893, 758892, 758891, 758890, 758889, 757426, 757025, 757024, 757023, 757022, 757021, 757020, 757019, 757018, 757017, 757016, 757015, 757014, 757013, 757012, 757011, 757010, 757009, 757008, 757007, 757006, 757005, 757004, 757003, 757002, 757001, 757000, 756999, 756998, 756997, 756996, 756995, 756994, 756993, 756992, 756991, 756990, 756071, 756070, 756069, 756068, 749226, 748551, 748550, 748549, 746014, 743963, 743962, 743959, 743958, 743957, 743956, 743954, 743493, 713948, 713947, 713946, 713945, 713944, 713943, 713942, 713941, 713940, 713939, 713938, 713937, 713936, 713935, 713934, 713933, 713932, 705080, 703409, 697626, 697625, 697624, 697623, 697622, 697621, 697620, 697619, 697618, 697298, 693632, 686194, 686193, 686192, 686191, 686190, 686189, 686188, 686187, 686186, 686185, 686184, 686183, 686182, 686181, 686180, 686179, 686178, 686177, 686176, 686175, 686174, 686173, 686172, 686171, 686170, 686169, 686168, 686167, 686166, 686165, 686164, 686163, 686162, 686161, 686160, 686159, 686158, 686157, 686156, 686155, 686154, 686153, 686152, 686151, 686150, 686149, 686148, 686147, 686146, 686145, 686144, 686143, 686142, 686141, 686140, 686139, 686138, 686137, 686136, 684598, 684597, 684596, 684595, 684594, 679019, 672377, 670792, 670791, 670788, 670787, 670786, 670785, 666970, 666969, 666968, 666967, 666966, 666965, 664658, 664646, 664644, 663259, 663258, 663257, 663256, 663255, 663254, 663253, 663252, 663251, 663250, 663249, 663248, 663247, 663246, 663245, 663244, 662783, 661085, 661084, 661083, 661082, 661081, 661080, 661079, 661078, 661077, 661076, 661075, 661074, 661073, 661072, 661071, 661070, 661069, 661068, 661067, 661066, 661065, 661064, 661063, 661062, 661061, 661060, 661059, 661058, 661057, 661056, 661055, 661054, 661053, 661052, 652664, 652649, 650120, 644773, 641364, 641363, 641362, 641361, 641360, 641359, 641358, 641357, 641356, 641355, 641354, 641353, 641352, 641351, 641350, 641349, 641348, 641347, 641346, 641345, 641344, 641343, 641342, 641341, 641340, 641339, 641338, 641337, 641336, 641335, 641334, 641333, 641332, 641331, 641330, 641329, 641328, 641327, 641326, 641325, 641324, 641323, 641322, 641321, 641320, 641319, 641318, 641317, 641316, 641315, 641314, 641313, 641312, 639281, 639280, 639279, 639278, 639277, 639276, 639275, 639274, 639273, 639272, 639271, 639270, 639269, 639268, 639267, 639266, 639265, 639264, 639263, 639262, 639261, 639260, 639259, 639258, 639257, 639256, 639255, 639254, 639253, 639252, 639251, 639250, 639249, 639248, 639247, 639246, 639245, 639244, 639243, 639242, 638299, 638288, 638287, 638286, 638285, 638284, 638281, 638280, 638279, 638276, 638275, 626776, 626346, 626345, 626344, 587025, 584720, 584719, 578843, 578842, 578841, 578840, 578839, 578838, 578837, 578834, 578832, 578831, 578830, 578829, 578828, 578827, 574986, 574985, 574984, 573023, 573022, 573021, 573020, 573019, 573018, 573017, 573016, 573015, 573014, 573013, 572153, 571650, 571648, 566029, 566028, 561309, 560626, 560613, 557247, 557245, 557243, 557242, 557241, 557232, 556181, 552809, 545676, 531928, 531927, 529839, 529838, 529837, 529836, 529835, 529834, 529833, 529832, 529831, 529830, 529829, 529828, 529827, 526810, 526809, 526808, 526807, 526806, 526805, 526804, 526803, 526802, 526801, 526800, 526799, 526798, 526797, 526796, 526795, 526794, 526793, 526792, 526791, 523819, 523177, 517318, 517317, 517316, 517315, 517314, 517313, 517312, 517311, 517310, 517309, 517308, 517307, 517306, 517305, 517116, 517115, 517114, 517113, 517112, 517111, 517110, 516714, 500141, 500140, 500139, 500138, 500137, 500136, 500135, 500134, 500133, 500132, 500131, 500130, 500129, 500128, 500127, 500126, 500125, 500124, 500123, 500122, 500121, 500120, 500119, 500118, 500117, 500116, 500115, 500114, 500113, 498924, 492519, 492511, 492509, 492508, 492507, 492506, 490299, 490298, 490297, 490296, 490295, 490294, 490293, 490292, 490291, 487340, 487339, 487338, 487337, 487336, 487335, 487334, 487333, 487332, 487331, 487330, 487329, 487328, 487327, 487326, 487325, 487324, 487323, 487322, 487321, 486187, 486186, 481539, 481538, 478084, 471094, 468920, 468919, 468918, 468917, 460769, 458288, 458287, 458286, 458285, 458284, 458283, 458282, 458281, 458280, 458279, 458278, 458277, 458276, 458275, 458274, 458273, 458272, 458271, 458270, 458269, 458267, 450149, 449582, 449126, 444186, 444185, 444184, 429343, 420252, 419438, 418976, 416558, 416557, 408599, 408598, 407261, 407260, 407259, 407258, 407257, 407256, 401627, 401074, 397542, 391897, 380901, 380390, 380389, 375192, 374507, 370535, 370534, 370533, 370532, 370531, 370529, 359942, 356494, 348136, 335103, 335102, 335100, 334591, 308425, 308418, 295016, 293396, 292226, 290547, 290544, 290543, 290542, 290541, 290255, 290254, 285424, 285423, 285422, 285421, 285420, 285419, 285418, 285262, 285002, 273497, 263599, 263598, 263596, 263595, 240592, 238845, 237917, 215680, 204932, 204519, 200686, 200685, 200684, 199305, 199304, 199303, 195702, 195701, 195700, 195481, 195480, 195478, 195477, 195476, 195475, 183407, 183405, 183403, 180231, 173527, 171615, 161240, 161239, 148357, 141270, 141265, 139395, 121794, 111609, 107263, 106517, 106330, 105506, 105505, 105503, 105502, 104626, 104625, 104624, 101363, 101362, 101361, 101360, 101359, 101358, 101357, 101355, 101353, 101352, 101351, 101350, 96046, 94432, 80340, 79955, 79954, 79953, 79952, 79951, 79732, 79694, 79065, 79064, 76592, 76591, 76590, 75918, 73036, 73034, 72140, 72139, 72138, 72137, 72136, 72135, 72134, 72133, 72132, 71031, 70865, 69831, 61856, 57726, 57202, 53982, 53981, 53980, 53979, 53978, 53977, 53976, 53975, 53974, 53965, 53964, 53963, 49892, 48539, 45408, 44572, 42567, 39013, 39012, 39011, 39010, 39009, 37324, 37137, 37136, 36443, 36442, 36441, 36440, 36439, 36437, 36436, 36435, 36434, 36433, 36432, 36427, 35337, 35336, 35334, 35333, 33723, 33722, 31590, 31589, 31588, 31587, 31586, 31583, 31581, 31580, 31579, 31578, 31577, 31574, 31571, 31570, 31569, 31568, 31567, 31566, 31565, 31564, 31563, 28877, 28876, 28875, 28327, 28326, 12705, 12584, 12578, 10978, 10971, 10970, 10969, 10968, 10967, 10963, 10962, 10961, 10960, 10959, 10958, 10957, 10956, 10955, 10954, 10953, 10952, 10951, 10950, 10949, 10948, 10947, 10946, 10944, 10943, 10942, 10941, 10939, 10937, 10935, 10934, 10933, 10931, 10930, 10929, 10928, 10927, 10926, 10925, 10923, 10922, 10921, 10919, 10918, 10917, 10916, 10915, 10913, 10912. 10912.
A Taxonimy ID or taxID is a stable unique identifier for each taxon (for a species, a family, an order, or any other group in the taxonomy database). The taxID is seen in the GenBank records as a “source” feature table entry; for example, /db_xref=“taxon:<9606>” is the taxID for Homo sapiens, and the line is therefore found in all recent human sequence records).
In the context of the invention additionally to the here disclosed amino acid sequences according to SEQ ID NO: 1-827 also amino acid sequences of different rotavirus strains or rotavirus isolates can be used according to the invention and are incorporated herewith. These different rotavirus isolates show preferably an identity of at least 70%, more preferably of at least 80% and most preferably of at least 90% with the amino acid sequences according to SEQ ID NO: 1-827.
The invention relates to mRNA sequences encoding at least one epitope of a protein as mentioned above, but also of a fragment, variant of derivative thereof. The terms “fragment”, “variant” and “derivative” should be understood as follows:
Fragments of proteins: “Fragments” of proteins (or peptides) in the context of the present invention may, typically, comprise a sequence of a protein or peptide as defined herein, which is, with regard to its amino acid sequence (or its encoded nucleic acid molecule), N-terminally and/or C-terminally truncated compared to the amino acid sequence of the original (native) protein (or its encoded nucleic acid molecule). Such truncation may thus occur either on the amino acid level or correspondingly on the nucleic acid level. A sequence identity with respect to such a fragment as defined herein may therefore preferably refer to the entire protein or peptide as defined herein or to the entire (coding) nucleic acid molecule of such a protein or peptide. For example such fragment may have a length of about 6 to about 20 or even more amino acids, e.g. fragments as processed and presented by MHC class I molecules, preferably having a length of about 8 to about 10 amino acids, e.g. 8, 9, or 10, (or even 6, 7, 11, or 12 amino acids), or fragments as processed and presented by MHC class II molecules, preferably having a length of about 13 or more amino acids, e.g. 13, 14, 15, 16, 17, 18, 19, 20 or even more amino acids, wherein these fragments may be selected from any part of the amino acid sequence. These fragments are typically recognized by T-cells in form of a complex consisting of the peptide fragment and an MHC molecule, i.e. the fragments are typically not recognized in their native form. Fragments of proteins or peptides may comprise at least one epitope of those proteins or peptides. Furthermore also domains of a protein, like the extracellular domain, the intracellular domain or the transmembrane domain and shortened or truncated versions of a protein may be understood to comprise a fragment of a protein.
Variants of proteins: “Variants” of proteins (or peptides) as defined in the context of the present invention may be generated, having an amino acid sequence which differs from the original sequence in one or more mutation(s), such as one or more substituted, inserted and/or deleted amino acid(s). Preferably, these fragments and/or variants have the same biological function or specific activity compared to the full-length native protein, e.g. its specific antigenic property. “Variants” of proteins or peptides as defined in the context of the present invention may comprise conservative amino acid substitution(s) compared to their native, i.e. non-mutated physiological, sequence. Those amino acid sequences as well as their encoding nucleotide sequences in particular fall under the term variants as defined herein. Substitutions in which amino acids, which originate from the same class, are exchanged for one another are called conservative substitutions. In particular, these are amino acids having aliphatic side chains, positively or negatively charged side chains, aromatic groups in the side chains or amino acids, the side chains of which can enter into hydrogen bridges, e.g. side chains which have a hydroxyl function. This means that e.g. an amino acid having a polar side chain is replaced by another amino acid having a likewise polar side chain, or, for example, an amino acid characterized by a hydrophobic side chain is substituted by another amino acid having a likewise hydrophobic side chain (e.g. serine (threonine) by threonine (serine) or leucine (isoleucine) by isoleucine (leucine)). Insertions and substitutions are possible, in particular, at those sequence positions which cause no modification to the three-dimensional structure or do not affect the binding region. Modifications to a three-dimensional structure by insertion(s) or deletion(s) can easily be determined e.g. using CD spectra (circular dichroism spectra) (Urry, 1985, Absorption, Circular Dichroism and ORD of Polypeptides, in: Modern Physical Methods in Biochemistry, Neuberger et al. (ed.), Elsevier, Amsterdam). A “variant” of a protein or peptide may have at least 70%, 75%, 80%, 85%, 90%, 95%, 98% or 99% amino acid identity over a stretch of 10, 20, 30, 50, 75 or 100 amino acids of such protein or peptide. Furthermore, variants of proteins or peptides as defined herein, which may be encoded by a nucleic acid molecule, may also comprise those sequences, wherein nucleotides of the encoding nucleic acid sequence are exchanged according to the degeneration of the genetic code, without leading to an alteration of the respective amino acid sequence of the protein or peptide, i.e. the amino acid sequence or at least part thereof may not differ from the original sequence within the above meaning.
Derivatives of a protein: A derivative of a protein (or peptide) is typically understood to be a molecule that is derived from another molecule, such as said peptide or protein. A “derivative” of a peptide or protein also encompasses fusions comprising a peptide or protein used in the present invention. For example, the fusion comprises a label, such as, for example, an epitope, e.g., a FLAG epitope or a V5 epitope. For example, the epitope is a FLAG epitope. Such a tag is useful for, for example, purifying the fusion protein.
In the following the term “rotavirus protein” is used. This term has to be understood to relate to rotavirus proteins as mentioned above but also to fragments, variants or derivatives thereof.
VP8* is a protein (or protein cleavage product) that is generated upon naturally occurring proteolytic cleavage of the viral cell surface protein VP4 to VP5* and VP8*. VP4 is one of two rotavirus proteins present on the outermost layer of the virus and forms trimeric spikes on the viral surface. A variety of studies employing VP8* as a protein vaccine have shown its ability to induce neutralising antibodies against rotaviruses in mice and guinea pigs (Larralde G et al., J Virol. 1991, 65(6):3213-3218; Dunn S J et al, Arch Virol. 1995; 140(11):1969-1978; Gil M T et al, Viral Immunol. 2000; 13(2):187-200; Perez Filgueira D M et al, Arch Virol. 2004, 149(12):2337-2348; Wen et al, Vaccine. 2012, 30(43):6121-6126) as well as in gnotobiotic pigs (Wen et al, Vaccine. 2014 Jul. 31; 32(35):4420-4427). Now, the inventors were able to demonstrate, that an mRNA-based subunit vaccine comprising a coding region encoding for at least one epitope of especially VP8* is particularly effective in inducing antigen-specific immune responses against rotavirus.
The at least one epitope of a protein or of a fragment, variant or derivative thereof of a virus of the genus rotavirus is preferably derived from one of the serotypes of rotavirus A. For example, the protein VP4 or the protein cleavage product VP5* or especially VP8* may preferably be derived from one of the serotypes P[1]-P[35] (preferably a protein sequence according to any of SEQ ID NOs: 40-76 is chosen). Preferably, the protein VP4 or the protein cleavage product VP5* or especially VP8* is selected from one of the serotypes P[4] or P[6] or P[8], most preferably a protein sequence according to any of SEQ ID Nos: 41; 45; 47; or 49 is chosen. The protein VP7 is preferably selected from one of the serotypes G1 or G2 or G3 or G4 or G9 or G12, most preferably a protein sequence according to any of SEQ ID Nos. 1-26 is chosen.
In this context VP8* is particularly preferred, wherein the inventive mRNA sequence encodes the VP8* protein, preferably the wild type VP8* protein, or a fragment, variant or derivative thereof, which preferably derives from the serotypes P[4] (preferably according to SEQ ID NO: 45) or P[6] (preferably according to SEQ ID NO: 47) or P[8] (preferably according to SEQ ID NO: 41 or SEQ ID NO: 49) of rotavirus A as will be outlined in more detail below. The term wild type is to be understood according to the skilled person's general understanding in the art and denotes the protein in the form of its occurrence in nature without any mutation or amino acid amendment by man. The serotypes P[4] and P[6] and P[8] are the most prevalent P-serotypes important for rotavirus infections and are especially effective in the inventive approach.
In the following the term “VP8* coding region” is used. This term is to be understood that the coding region may encode complete (full-length) VP8*, or a fragment, variant or derivative thereof, also considering the possible percentage of identity of the amino acid sequences (or nucleotide sequences) as outlined above.
Preferably, the amino acid sequence of VP8* derived from serotype P[4] according to SEQ ID NO: 45 (derived from Human rotavirus A RVA/Human-wt/BEL/BE1058/2008/G2P[4]; JN849123.1; GI:371455744) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferably, the amino acid sequence of VP8* derived from serotype P[6] according to SEQ ID NO: 47 (derived from Rotavirus A Hu/BEL/F01322/2009/G3P[6]; JF460826.1; GI:37531451) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferably, the amino acid sequence of VP8* derived from serotype P[8] according to SEQ ID NO: 41 (derived from Human rotavirus A RVA/Human-wt/BEL/BE1128/2009/G1P[8]; JN849135.1; GI:371455756) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferably, the amino acid sequence of VP8* derived from serotype P[8] according to SEQ ID NO: 49 (derived from Human rotavirus A Wa variant VirWa; ACR22783.1) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferably, the inventive mRNA sequence encodes an amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to the sequence according to SEQ ID NO: 41 or 45 or 47 or 49.
Preferred protein designs of VP8* for protein-based vaccines represent a shortened form of VP8* that facilitates protein isolation from producer cells (usually bacterial cells), since the full length protein is insoluble in the bacterial expression system. Given that conditions for cytoplasmatic expression in bacteria are not directly transferable to expression and export via the secretory pathway in eukaryotic cells, current mRNA designs feature full length VP8* due to the presence of predicted T-cell epitopes in fragments removed in a shortened form of VP8*. Therefore, according to a preferred embodiment of the inventive mRNA sequence the full length VP8* is encoded. Nevertheless, it may also be preferred to apply a shortened form of VP8* for the inventive mRNA sequence.
In particularly preferred embodiments the inventive mRNA sequence encodes the following shortened forms of VP8*: amino acids (aa) 2-240; 2-230; 11-223; 11-240, 21-240; 4-223; 65-223 or 64-223. Particularly preferred in this context are the amino acid sequences as shown in Table 1.
TABLE 1
amino acid sequences
Amino
acids of
VP8*
Serotyp P[4]
Serotyp P[6]
Serotyp P[8]
2-240
SEQ ID NO. 77
SEQ ID NO. 78
SEQ ID Nos. 79-80
2-230
SEQ ID NO. 173
SEQ ID NO. 174
SEQ ID NOs. 175-176
11-223
SEQ ID NO. 269
SEQ ID NO. 270
SEQ ID NOs. 271-272
11-240
SEQ ID NO. 365
SEQ ID NO. 366
SEQ ID Nos 367-368
21-240
SEQ ID NO. 461
SEQ ID NO. 462
SEQ ID NOs. 463-464
41-223
SEQ ID NO. 557
SEQ ID NO. 558
SEQ ID NOs. 559-560
65-223
SEQ ID NO. 653
SEQ ID NO. 654
SEQ ID Nos. 655-656
64-223
SEQ ID NO. 749
SEQ ID NO. 750
Preferably, the inventive mRNA sequence encodes at least one amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the amino acid sequences as shown in Table 1 or to a respective fragment thereof.
It is possible to apply the unmodified coding sequences to the inventive mRNA sequences. For example, unmodified nucleotide sequences encoding the VP8* are shown in Table 2 Column A respectively.
Nevertheless, in especially preferred embodiments of the invention optimized and modified nucleotide sequences are used as will be outlined in more detail below.
In preferred embodiments of the invention the G/C content of the coding region of the mRNA sequence is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. For example, coding regions with an increased G/C content encoding VP8* are shown in Table 2 Column B respectively.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 2 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 2 Column C.
TABLE 2
inventive mRNA sequences encoding VP8*
Amino
Serotyp P[4]
Serotyp P[6]
Serotyp P[8]
acids of
SEQ ID NO(s)
SEQ ID NO(s)
SEQ ID NO(s)
VP8*
A
B
C
D
A
B
C
D
A
B
C
D
1-240
828
878
1626
2386
829
879
1627
2387
830-
880-
1628-
2388-
831
881
1629
2389
1-230
974
1722
2482
975
1723
2483
976-
1724-
2484-
977
1725
2485
11-223
1070
1818
2578
1071
1819
2579
1072-
1820-
2580-
1073
1821
2581
11-240
1166
1914
2674
1167
1915
2675
1168-
1916-
2676-
1169
1917
2677
21-240
1262
2010
2770
1263
2011
2771
1264-
2012-
2772-
1265
2013
2773
41-223
1358
2106
2866
1359
2107
2867
1360-
2108-
2868-
1361
2109
2869
65-223
833
1454
2202
2962
835
1455
2203
2963
836-
1456-
2204-
2964-
837
1457
2205
2965
64-223
832
1550
2298
3058
834
1551
2298
3059
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 2 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 2.
Using mRNA as a basis poses challenges concerning antigen localisation that differ from a protein-based approach. During the viral life cycle, VP4 is produced in the cytoplasm and associates with the assembling viral particle on the cytoplasmatic side of the endoplasmatic reticulum (ER) membrane. In a process mediated by a viral accessory protein, the complex then buds into the ER, allowing assembly with the ER resident glycoprotein VP7. In polarised cells, the virus then leaves the infected cell via the secretory pathway (reviewed on Trask et al, Nat Rev Microbiol. 2012 Jan. 23; 10(3):165-177) and is activated by cleavage of VP4 into VP5* and VP8* by trypsin-like proteases in the intestinal lumen. Hence, mRNA-based expression of unmodified VP8* in the absence of additional viral proteins may result in translation of the nascent protein in the cytoplasm of the producing cell, rendering the antigen caught inside the cell. In such a scenario, the antigen as a protein would be inaccessible to the immune system unless the producing cell undergoes cell death. Anyway this might be sufficient to induce an immune response.
Nevertheless, in preferred embodiments of the inventive mRNA sequence, the mRNA sequence encoding at least one epitope of a rotavirus protein or a fragment, variant or derivative thereof, particularly VP8* or a fragment, variant or derivative thereof, is artificially modified and amended, thereby enhancing the immune responses.
In an especially preferred embodiment of the inventive mRNA sequence the mRNA sequence encoding at least one epitope of a rotavirus protein or a fragment, variant or derivative thereof, particularly VP8* or a fragment, variant or derivative thereof, is combined with at least one sequence section encoding a helper peptide for enhancement of immunogenicity. In preferred embodiments the helper peptide is located at the 5′-end of the coding region, encoding the at least one epitope of a rotavirus protein or a fragment, variant or derivative thereof. Preferably, the helper peptide is derived from Tetanus toxin, or a fragment, variant or derivative thereof. By addition of an mRNA sequence encoding such a peptide it is possible to increase immune responses. In the context of a protein-based approach it has already been shown by Wen et al. (Vaccine. 2014 Jul. 31; 32(35):4420-4427) that the N-terminal P2 helper peptide derived from tetanus toxin was able to increase immune responses against VP8*. Now, the inventors were able to show that the addition of a sequence encoding a helper peptide may be particularly effective in enhancing the immune response in an mRNA-based vaccine approach.
Preferably, the amino acid sequence of P2 helper peptide of tetanus toxin according to SEQ ID NO: 3147 (GenBank X04436 derived from Kovacs-Nolan et al.; PMID 16978788; P2: aa 830-844) may serve as a basis for advantageous designs of the inventive mRNA sequence.
If the sequence section encoding the rotavirus protein is combined with a sequence section encoding a helper peptide at the 5′-end of the sequence section encoding the rotavirus protein a peptide linker may optionally separate the rotavirus protein from the helper peptide.
Preferably, the sequence section encoding the peptide linker is coding for a flexible or a rigid linker.
Flexible linkers are especially preferred, wherein joined domains allow a certain degree of movement or interaction. They are preferably composed of small, non-polar (e.g. Gly) or polar (e.g. Ser or Thr) amino acids. The small size of these amino acids provides flexibility, and allows for mobility of the connecting functional domains, as described by Chen et al. (Adv Drug Deliv Reb. 2013; 65(10):1357-1369). The incorporation of Ser or Thr can maintain the stability of the linker in aqueous solutions by forming hydrogen bonds with the water molecules, and therefore reduces an interaction between the linker and the protein moieties. Rigid linkers generally maintain the distance between the protein domains and they may be based on helical structures and/or they have a sequence that is rich in proline.
A typical sequence of a flexible linker is composed of repeats of the amino acids Glycine and Serine. For instance, the linker may have the following sequence: GS, GSG, SGG. In some embodiments, the same sequence is repeated multiple times (e.g. two, three, four, five or six times) to create a longer linker. In other embodiments, a single amino acid such as S or G can be used as a linker.
Preferably, the sequence section encoding the peptide linker codes for a flexible linker comprising sequences consisting of at least one stretch of glycine and/or serine residues. The term “stretch” describes a sequence of consecutive identical amino acids. More preferably, the flexible linker comprises more than one stretch of glycine and/or serine residues. Advantageously, the stretch or stretches consist(s) of four and/or five amino acids.
Preferably, the amino acid sequences of the peptide linker according to SEQ ID NO: 3174 (derived from publication: PMID 16227284) or SEQ ID NOs: 3175-3176 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferred amino acid sequences comprising P2 helper peptide, optionally a peptide linker and VP8* proteins (full-length and shortened forms) are shown in Table 3:
TABLE 3
amino acid sequences for VP8* additionally comprising the P2
helper peptide and optionally a peptide linker
Amino acids of VP8*
Serotyp P[4]
Serotyp P[6]
Serotyp P[8]
2-240
SEQ ID NO. 81, 797
SEQ ID NO. 82, 798
SEQ ID Nos. 83-84, 799
2-230
SEQ ID NO. 177
SEQ ID NO. 178
SEQ ID NOs. 179-180
11-223
SEQ ID NO. 273
SEQ ID NO. 274
SEQ ID NOs. 275-276
11-240
SEQ ID NO. 369
SEQ ID NO. 370
SEQ ID Nos 371-372
21-240
SEQ ID NO. 465
SEQ ID NO. 466
SEQ ID NOs. 467-468
41-223
SEQ ID NO. 561
SEQ ID NO. 562
SEQ ID NOs. 563-564
65-223
SEQ ID NO. 657
SEQ ID NO. 658
SEQ ID Nos. 559-660,
802-803
64-223
SEQ ID NO. 751, 800
SEQ ID NO. 752, 801
Preferably, the inventive mRNA sequence encodes at least one amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the amino acid sequences as shown in Table 3 or to a respective fragment thereof.
Additionally to the amino acid sequences in Table 3 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 4.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 4 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 4 Column C.
TABLE 4
mRNA sequences encoding VP8* and additionally encoding
the P2 helper peptide and optionally a peptide linker
Amino
Serotyp P[4]
Serotyp P[6]
Serotyp P[8]
acids of
SEQ ID NO(s)
SEQ ID NO(s)
SEQ ID NO(s)
VP8*
A
B
C
D
A
B
C
D
A
B
C
D
1-240
838
882
2346
2390
839
883
2347
2391
840
884-
2348;
2392-
1598
1630
1599
1631
885
1632-
2393
1600
1633
1-230
978
1726
2486
979
1727
2487
980-
1728-
2488-
981
1729
2489
11-223
1074
1822
2582
1075
1823
2583
1076-
1824-
2584-
1077
1825
2585
11-240
1170
1918
2678
1171
1919
2679
1172-
1920-
2680-
1173
1921
2681
21-240
1266
2014
2774
1267
2015
2775
1268-
2016-
2776-
1269
2017
2777
41-223
1362
2110
2870
1363
2111
2871
1364-
2112-
2872-
1365
2113
2873
65-223
1458
2206
2966
1459
2207
2967
843-
1460-
2351-
2968-
844
1461;
2352;
2969
1603-
2208-
1604
2209
64-223
841
1552
2349
3060
842
1553
2350
3061
1601
2300
1602
2301
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 4 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 4.
In an especially preferred embodiment the inventive mRNA additionally comprises at least one sequence section encoding a signal peptide (see therefore for example the preferred construct “secreted” shown in
In preferred embodiments the signal peptide derives from tissue plasminogen activator (TPA) or albumin, especially human serum albumin (HSA), or CD5 (CD—cluster of differentiation) or HLA-A2 (HLA—human leucocyte antigen) or luciferase, especially Gaussian luciferase, or immunoglobulin (e.g. IgG or IgE heavy chain) or IL-2 (human IL-2) or chymotrypsinogen (human chymotrypsinogen), or a fragment, variant or derivative thereof. By inclusion of a signal peptide that is aimed to allow co-translational transport of VP8* into the ER followed by protein secretion it is possible to increase antigen accessibility to the immune system.
Preferably, the amino acid sequence of the signal peptide of tissue plasminogen activator according to SEQ ID NO: 3148 (derived from NM_000930.3; GI:132626665; 1-21aa/1-63 bp) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of human serum albumin (HSA) according to SEQ ID NO: 3149 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of CD5 according to SEQ ID NO: 3151 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of HLA-A2 according to SEQ ID NOs: 3152-3153 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of Gaussia luciferase according to SEQ ID NO: 3154 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of immunoglobulin IgG heavy chain according to SEQ ID NO: 3155 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of human IL-2 according to SEQ ID NO: 3156 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of human chymotrypsinogen according to SEQ ID NO: 3157 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of an immunoglobulin (human, heavy chain) according to SEQ ID NO: 3158 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the signal peptide of an immunoglobulin (human, heavy chain) according to SEQ ID NO: 3159 may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferred amino acid sequences of the combination of a signal peptide and VP8* (full-length and shortened forms) are shown in Table 5:
TABLE 5
amino acid sequences for VP8* additionally comprising a
signal peptide
Amino acids of VP8*
SEQ ID Nos.
2-240
85-104
2-230
181-200
11-223
277-296
11-240
373-392
21-240
469-488
41-223
565-584
65-223
661-680
64-223
753-762
Preferably, the inventive mRNA sequence encodes at least one amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the amino acid sequences as shown in Table 5 or to a respective fragment thereof.
Additionally to the amino acid sequences in Table 5 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 6.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 6 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 6 Column C.
TABLE 6
mRNA sequences encoding VP8* and additionally encoding a
signal peptide
Amino acids
SEQ ID NO(s)
of VP8*
A
B
C
D
2-240
—
886-905
1634-1653
2394-2413
2-230
—
982-1001
1730-1749
2490-2509
11-223
—
1078-1097
1826-1845
2586-2605
11-240
—
1174-1193
1922-1941
2682-2701
21-240
—
1270-1289
2018-2037
2778-2797
41-223
—
1366-1385
2114-2133
2874-2893
65-223
—
1462-1481
2210-2229
2970-2989
64-223
—
1554-1563
2302-2311
3062-3071
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 6 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 6.
Preferably, the signal peptide is applied in mRNA sequences encoding at least one epitope of a rotavirus protein according to the invention in combination with a helper peptide and optionally in combination with a peptide linker (as disclosed above).
Preferred amino acid sequences of the combination of a signal peptide, a helper peptide and optionally a peptide linker together with VP8* (full-length and shortened forms) are shown in Table 7:
TABLE 7
amino acid sequences for VP8* additionally comprising a
signal peptide, the P2 helper peptide and optionally a
peptide linker
Amino acids of VP8*
SEQ ID Nos.
2-240
105-124
2-230
201-220
11-223
297-316
11-240
393-412
21-240
489-508
41-223
585-604
65-223
681-700
64-223
763-772
Preferably, the inventive mRNA sequence encodes at least one amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the amino acid sequences as shown in Table 7 or to a respective fragment thereof.
Additionally to the amino acid sequences in Table 7 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 8.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 8 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 8 Column C.
TABLE 8
mRNA sequences encoding VP8* and additionally encoding a
signal peptide, the P2 helper peptide and optionally a
peptide linker
SEQ ID NO(s)
Amino acids
A
B
C
D
2-240
—
906-925
1654-1673
2414-2433
2-230
—
1002-1021
1750-1769
2510-2529
11-223
—
1098-1117
1846-1865
2606-2625
11-240
—
1194-1213
1942-1961
2702-2721
21-240
—
1290-1309
2038-2057
2798-2817
41-223
—
1386-1405
2134-2153
2894-2913
65-223
—
1482-1501
2230-2249
2990-3009
64-223
—
1564-1573
2312-2321
3072-3081
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 8 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 8.
Nevertheless, the signal peptide may also be applied in an mRNA sequence encoding the at least one epitope of a rotavirus protein according to the invention without the helper peptide.
In a particularly preferred embodiment of the inventive mRNA sequence the coding region encoding at least one epitope of a rotavirus protein, or a fragment, variant or derivative thereof, is mutated to delete at least one predicted or potential glycosylation site. As described e.g. by Chauhan et al. (PLOS ONE (2013) Vol. 8 issue 6, e67008) glycosylation is an important post-translational or co-translational modification of proteins. The majority of proteins synthesized in the rough endoplasmatic reticulum (ER) undergo glycosylation. There are mainly two types of glycosylation: a) In N-glycosylation, the addition of sugar chains takes place at the amide nitrogen on the side-chain of the asparagine or arginine. b) In O-glycosylation, the addition of sugar chains takes place on the hydroxyl oxygen on the side-chain of hydroxylysine, hydroxyproline, serine, tyrosine or threonine. Moreover, phospho-glycans linked through the phosphate of a phospho-serine and C-linked glycans, a rare form of glycosylation where a sugar is added to a carbon on a tryptophan side-chain, are known. Since VP8* or some other rotavirus proteins are not glycosylated in the viral life cycle (Prasad B.V.V. et al., 1988,199(2):269-275), entry in the ER might lead to modifications by glycosylation that could lead to epitope shielding and therefore prevent an efficient immune response. Therefore, it is particularly advantageous to delete the potential glycosylation sites of the encoded rotavirus protein. By mutation of the relevant amino acids the glycosylation is prevented. In this context preferably at least one codon coding for an asparagine, arginine, serine, threonine, tyrosine, lysine, proline or tryptophan is mutated in such a way that a different amino acid is encoded thereby deleting at least one predicted or potential glycosylation site. The predicted glycosylation sites may be predicted by using artificial neural networks that examine the sequence for common glycosylation sites, e.g. N-glycosylation sites may be predicted by using the NetNGlyc 1.0 Server.
In an especially preferred embodiment of the inventive mRNA sequence encoding at least one epitope of a rotavirus protein at least one codon coding for asparagine (N) is mutated into a codon coding for glutamine (Q). Thereby, the sequence is modified to encode for Q instead of N at predicted N-glycosylation sites, for example at predicted N-glycosylation sites of the encoded VP8* protein, or a fragment, variant or derivative thereof. In this context the term “mutated VP8* means that at least one (predicted) glycosylation site is mutated.
It is preferred that not all predicted glycosylation sites of the coding region encoding at least one epitope of a rotavirus protein, or a fragment, variant or derivative thereof, are mutated to partially prevent glycosylation of the resulting protein or peptide. This aspect of the invention may apply for e.g. all N-glycosylation sites or for all O-glycosylation sites or for all glycosylation sites irrespective of their biochemical nature.
It is particularly preferred that all predicted glycosylation sites of the coding region encoding at least one epitope of a rotavirus protein, or a fragment, variant or derivative thereof, are mutated to completely prevent glycosylation of the resulting protein or peptide. This aspect of the invention may apply for e.g. all N-glycosylation sites or for all O-glycosylation sites or for all glycosylation sites irrespective of their biochemical nature.
A preferred amino acid sequence for mutated VP8* of P-serotype P[4] is shown in SEQ ID NO. 125, wherein N-glycosylation modifications were done at N67Q; N91Q; N132Q; N148Q; N230Q.
A preferred amino acid sequence for mutated VP8* of P-serotype P[6] is shown in SEQ ID No. 126, wherein N-glycosylation modifications were done at N67Q; N91Q; N132Q; N146Q.
A preferred amino acid sequence for mutated VP8* of P-serotype P[6] is shown in SEQ ID No. 3210, wherein N-glycosylation modifications were done at N67Q; N91Q; N132Q; N146Q and.
A preferred amino acid sequence for mutated VP8* of P-serotype P[8] is shown in SEQ ID NO. 127 and SEQ ID NO. 128, wherein N-glycosylation modifications were done at N67Q; N91Q; N132Q.
Preferred amino acid sequences of mutated VP8* (full-length and shortened forms optional in combination with helper peptide (and peptide linker) and/or signal peptide) comprising N-glycosylation modifications are shown in Table 9:
TABLE 9
amino acid sequences for mutated VP8* (full-length and shortened forms)
Additional
helper
Additional
peptide P2,
Protein
helper peptide
Additional
peptide
sequence
P2 and
helper peptide
Additional
linker and
Amino
of mutated
additional signal
P2 and
signal
signal
acids of
VP8*
peptide
peptide linker
peptide
peptide
VP8*
SEQ ID Nos.
SEQ ID Nos.
SEQ ID Nos.
SEQ ID Nos.
SEQ ID Nos.
2-240
125-128;
804-806
129-132; 3211
133-152;
153-172;
3210
3212-3216
3217-3221
2-230
221-224;
225-228; 3223
229-248;
249-268;
3222
3224-3228
3229-3233
11-223
317-320;
321-324; 3235
325-344;
345-364;
3234
3236-3240
3241-3245
11-240
413-416;
417-420; 3247
421-440;
441-460;
3246
3248-3252
3253-3257
21-240
509-512;
513-516; 3259
517-536;
537-556;
3258
3260-3264
3265-3269
41-223
605-608;
609-612; 3271
613-632;
633-652;
3270
3272-3276
3277-3281
65-223
701-704;
809-810
705-708; 3283
709-728;
729-748;
3282
3284-3288
3289-3293
64-223
773-774;
807-808
775-776; 3295
777-786;
787-796;
3294
3296-3300
3301-3305
Preferably, the inventive mRNA sequence encodes at least one amino acid sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the amino acid sequences as shown in Table 9 or to a respective fragment thereof.
Additionally to the amino acid sequences in Table 9 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Tables 10-13.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Tables 10-13 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Tables 10-13 Column C.
TABLE 10
mRNA sequences encoding mutated VP8* (full-length and
shortened forms)
Amino
acids
SEQ ID NO(s)
of VP8*
A
B
C
D
1-240
926-929; 3306
1674-1675; 3402
2434-2437; 3498
1-230
1022-1025; 3318
1770-1773; 3414
2530-2533; 3510
11-223
1118-1121; 3330
1866-1869; 3426
2626-2629; 3522
11-240
1214-1217; 3342
1962-1965; 3438
2722-2725; 3534
21-240
1310-1313; 3354
2058-2061; 3450
2818-2821; 3546
41-223
1406-1409; 3366
2154-2157; 3462
2914-2917; 3558
65-223
1502-1505; 3378
2250-2253; 3474
3010-3013; 3570
64-223
1574-1575; 3390
2322-2323; 3486
3082-3083; 3582
TABLE 11
mRNA sequences encoding mutated VP8* (full-length and shortened
forms) and the P2 helper peptide and optionally a peptide linker
Amino
acids
SEQ ID NO(s)
of VP8*
A
B
C
D
2-240
930-933; 3307
1678-1681; 3403
2438-2441;
3106-3108; 3499
2-230
1026-1029; 3319
1774-1777; 3415
2534-2537; 3511
11-223
1122-1125; 3331
1870-1873; 3427
2630-2633; 3523
11-240
1218-1221; 3343
1966-1969; 3439
2726-2729; 3535
21-240
1314-1317; 3355
2062-2065; 3451
2822-2825; 3547
41-223
1410-1413; 3367
2158-2161; 3463
2918-2921; 3559
65-223
1506-1509; 3379
2254-2257; 3475
3014-3017;
3111-3112; 3571
64-223
1576-1577; 3391
2324-2325; 3487
3084-3085;
3109-3110; 3583
TABLE 12
mRNA sequences encoding mutated VP8* (full-length and shortened forms)
and a signal peptide
Amino acids
SEQ ID NO(s)
of VP8*
A
B
C
D
2-240
934-953; 3308-3312
1682-1701; 3404-3408
2442-2461; 3500-3504
2-230
1030-1049; 3320-3324
1778-1797; 3416-3420
2538-2557; 3512-3516
11-223
1126-1145; 3332-3336
1874-1893; 3428-3432
2634-2653; 3524-3528
11-240
1222-1241; 3344-3348
1970-1989; 3440-3444
2730-2749; 3536-3540
21-240
1318-1337; 3356-3360
2066-2085; 3452-3456
2826-2845; 3548-3552
41-223
1414-1433; 3368-3372
2162-2181; 3464-3468
2922-2941; 3560-3564
65-223
1510-1529; 3380-3384
2258-2277; 3476-3480
3018-3037; 3572-3576
64-223
1578-1587; 3392-3396
2326-2335; 3488-3492
3086-3095; 3584-3588
TABLE 13
mRNA sequences encoding mutated VP8* (full-length and shortened forms),
the P2 helper peptide, optionally a peptide linker and a signal peptide
Amino acids
SEQ ID NO(s)
of VP8*
A
B
C
D
2-240
845-847
954-973; 1605-1607;
1702-1721; 2353-2355;
2462-2481; 3113-3115;
3313-3317; 3313-3317
3409-3413
3505-3509
2-230
1050-1069; 3325-3329
1798-1817; 3421-3425
2558-2577; 3517-3521
11-223
1146-1165; 3337-3341
1894-1913; 3433-3437
2654-2673; 3529-3533
11-240
1242-1261; 3349-3353
1990-2009; 3445-3449
2750-2769; 3541-3545
21-240
1338-1357; 3361-3365
2086-2105; 3457-3461
2846-2865; 3553-3557
41-223
1434-1453; 3373-3377
2182-2201; 3469-3473
2942-2961; 3565-3569
65-223
850-851
1530-1549; 1610-1611;
2278-2297; 2358-2359;
3038-3057; 3118-3119;
3385-3389
3481-3485
3577-3581
64-223
848-849
1588-1597; 1608-1609;
2336-2345; 2356-2357;
3096-3105; 3116-3117;
3397-3401
3493-3497
3589-3593
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Tables 10-13 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Tables 10-13.
In a particularly preferred embodiment of the inventive mRNA sequence the sequence additionally comprises at least one sequence section encoding a transmembrane domain of a protein, or a fragment, variant or derivative thereof. The addition of a sequence encoding a transmembrane domain of a protein of a fragment, variant of derivative thereof may enhance the expression of the antigen on the surface of the expressing cell. By addition of a transmembrane domain to the translated product according to this aspect of the invention it is possible to further enhance the immune response, wherein the translated rotavirus protein, e.g. VP8*, anchors to the plasma membrane thereby increasing immune responses by antigen clustering.
According to a preferred embodiment of the invention the transmembrane domain is selected from the transmembrane domain of hemagglutinin (HA) of Influenza virus, or Env of HIV-1 (Human Immunodeficiency Virus), or EIAV (equine infectious anemia virus), or MLV (murine leukemia virus), or mouse mammary tumor virus, or G protein of VSV (vesicular stomatitis virus), or Rabies virus, or a fragment, variant or derivative of these proteins. Preferably, the at least one sequence section encoding the transmembrane domain is located 3′ of the coding region, encoding at least one epitope of a rotavirus protein or fragment, variant or derivative thereof. That means that in the translated product the transmembrane domain is fused to the C-term of the rotavirus protein (see therefore for example the preferred construct “Transmembrane” shown in
The transmembrane domain of hemagglutinin of influenza virus is especially preferred. Preferably, the amino acid sequence of the transmembrane domain of Influenza hemagglutinin (HA) according to SEQ ID NO: 3160 (derived from CY148227.1; GI:538117274; HA/Netherlands/602/2009) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Moreover, the amino acid sequence of the transmembrane domains of Env of HIV-1 according to SEQ ID NOs: 3161-3163 (derived from BAF32550.1; GI:114842138) may serve as a basis for advantageous designs of the inventive mRNA sequence. In addition, the amino acid sequence of the transmembrane domains of Env of EIAV according to SEQ ID NOs: 3164-3166 (derived from AAC03762.1; GI:2905989) or the amino acid sequence of the transmembrane domain of Env of MLV according to the SEQ ID NO:3167 (derived from AAA46526.1; GI:332067) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Further preferred amino acid sequences are the transmembrane domains of Env of mouse mammery tumor virus according to SEQ ID NOs: 3168-3169 (derived from BAA03768.1; GI:391744).
Moreover, the amino acid sequence of the transmembrane domains of the G protein of VSV according to SEQ ID NO: 3170 (derived from CAA24525.1; GI:61840) or of the G protein of Rabies according to SEQ ID NO: 3171 (derived from AEV43288.1; GI:359374323) may serve as a basis for advantageous designs of the inventive mRNA sequence.
In an especially preferred embodiment of the inventive mRNA sequence a sequence section encoding the transmembrane domain (preferably the transmembrane of Influenza HA according to SEQ ID NO. 3160) is fused to the 3′-end of the VP8* coding region, wherein preferably also sequence sections encoding a signal peptide, preferably tissue plasminogen activator according to SEQ ID NO. 3148, and/or a helper peptide, preferably P2 helper peptide of tetanus toxin according to SEQ ID NO. 3147, and optionally a peptide linker are added to the 5′-end of the VP8* coding region. Preferably also feature mutations of the N-glycosylation sites are added, as described above. Preferred constructs refer to VP8* and to the shortened forms of VP8*, preferably of the serotypes P[4], P[6] and P[8].
Preferred amino acid sequences are shown in Table 14:
TABLE 14
amino acid sequences for mutated VP8* (full-length and shortened forms)
additionally comprising a transmembrane domain, and a signal peptide,
the P2 helper peptide and optionally a peptide linker
Amino acids of VP8*
SEQ ID Nos.
2-240
811-813
65-223
816-817
64-223
814-815
Preferably, the inventive mRNA sequence comprises a coding region which encodes for the amino acid sequences according to any one of Table 14 or amino acid sequences which show an identity of at least 60%, preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, or most preferably at least 99% to the sequences as shown in Table 14.
Additionally to the amino acid sequences in Table 14 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 15.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 15 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 15 Column C.
TABLE 15
mRNA sequences encoding mutated VP8* (full-length and shortened
forms) additionally encoding a transmembrane domain, and a signal
peptide, the P2 helper peptide and optionally a peptide linker
SEQ ID NO(s)
Amino acids
A
B
C
D
2-240
852-854;
2360-2362
3120-3122
1612-1614
65-223
857-858;
2365-2366
3125-3126
1617-1618
64-223
855-856;
2363-2364
3123-3124
1615-1616
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 15 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 15.
In a particularly preferred embodiment of the inventive mRNA sequence the sequence additionally comprises at least one sequence section encoding a peptide or protein enabling VLP (virus like particle) formation. By formation of virus like particles it is possible to further increase immune responses by antigen clustering. Generally, virus like particles resemble viruses, but they are non-infectious because they do not contain any viral genetic material. The expression of viral structural proteins or peptides in the context of the inventive mRNA results in the self-assembly of virus like particles which enhance the immungenicity of the inventive approach. In general, VLP formation can be facilitated via two distinct mechanisms for the purpose of the invention:
Advantageously, the peptide or protein enabling VLP formation is derived from non-human pathogenic viruses, because potential problems with pre-existing immunity by using proteins derived from the human viruses are avoided. Non-human pathogenic viruses refer to viruses that normally do not infect humans.
In especially preferred embodiments the peptide or protein enabling VLP formation is derived from Hepatitis B virus core antigen (HBcAg) or from Alfalfa mosaic virus coat protein (CP).
Hepatitis B virus core antigen (HBcAg), which is able to assemble into VLPs, is a well-documented carrier of foreign antigens (reviewed in Buonaguro L, Expert Rev Vaccines. 2011; 10(11):1569-1583). Mostly preferred is Woodchuck Hepatitis virus core protein (WHcAg), as the basis for VLP formation. The use of Woodchuck Hepatitis virus core protein has the particular advantage, that potential problems with pre-existing immunity by using proteins derived from the human Hepatitis B (Billaud J N, J Virol. 2005 November; 79(21):13641-13655) are avoided.
Preferably, the peptide or protein enabling VLP formation comprises or consists of the amino acid sequence of Woodchuck Hepatitis virus core protein (WHcAg) according to SEQ ID NO: 3172 (derived from NCBI accession no.: NC_004107.1; 1-149 with C-terminal C added; derived from publication: PMID 16227284) may serve as a basis for advantageous designs of the inventive mRNA sequence.
In a further preferred embodiment the peptide or protein enabling VLP formation comprises or consists of the amino acid sequence of Alfalfa mosaic virus CP (AIMV CP) according to SEQ ID NO: 3173 (derived from NCBI accession no.: NP_041195.1; derived from publication: PMID 24260245) may serve as a basis for advantageous designs of the inventive mRNA sequence.
Preferably, the sequence section encoding the peptide or protein enabling VLP formation derived from WHcAg is located 5′ of the coding region encoding the at least one epitope of the rotavirus protein resulting in an N-terminal fusion to the rotavirus protein. If the construct additionally comprises a sequence section(s) encoding a helper peptide, the sequence section encoding the peptide or protein enabling VLP formation is preferably located 5′ of the helper peptide. If the construct additionally comprises sequence sections encoding a signal peptide, the sequence section encoding the peptide or protein enabling VLP formation is preferably located 3′ of the signal peptide.
The sequence section encoding the peptide or protein enabling VLP formation derived from Alfalfa mosaic virus CP is preferably located 3′ of the coding region for the rotavirus protein resulting in C-terminal fusion of CP to the rotavirus protein, wherein preferably the construct additionally comprises sequence sections encoding a helper peptide and/or a signal peptide at the 5′ end of the coding region for the rotavirus protein.
In particular preferred embodiments in the context of fusion proteins according to the invention the sequence section encoding the peptide or protein enabling VLP formation is separated from the coding region for the rotavirus protein by at least one sequence section encoding a peptide linker. If the sequence section encoding the rotavirus protein is combined with a sequence section encoding a helper peptide at the 5′-end of the sequence section encoding the peptide linker may separate the VLP forming peptide or protein from the helper peptide plus rotavirus protein.
Preferably, the sequence section encoding the peptide linker is coding for a flexible or a rigid linker as disclosed above.
In preferred embodiments the inventive mRNA sequence comprises at least one sequence section encoding a peptide linker according to SEQ ID Nos. 3174-3176 Furthermore, the mRNA sequence according to this aspect of the invention preferably comprises a sequence section coding for an N-terminal signal peptide, as described above. Preferably, the predicted N-glycosylation sites are mutated, as described above.
In an especially preferred embodiment a relevant portion of WHcAg is fused to the N-term of VP8*, separated by a short peptide linker as described above. Furthermore, the construct preferably contains an N-terminal signal peptide. The predicted N-glycosylation sites are removed from VP8* (see therefore the preferred construct “VLP” in
Preferred amino acid sequences are shown in Table 16:
TABLE 16
amino acid sequences for mutated VP8* (full-length and shortened forms)
additionally comprising a signal peptide, the VLP enabling protein
WHcAg, a peptide linker, and the P2 helper peptide
Amino acids of VP8*
SEQ ID Nos.
2-240
818-820
65-223
823-824
64-223
821-822
Preferably, the inventive mRNA sequence comprises a coding region which encodes for the amino acid sequences according to any one of Table 16 or amino acid sequences which show an identity of at least 60%, preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, or most preferably at least 99% to the sequences as shown in Table 16. Additionally to the amino acid sequences in Table 16 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 17.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 17 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 17 Column C.
TABLE 17
mRNA sequences encoding mutated VP8* (full-length and shortened
forms) additionally encoding a signal peptide, the VLP enabling
protein WHcAg, a peptide linker, and the P2 helper peptide
SEQ ID NO(s)
Amino acids
A
B
C
D
2-240
859-861
1619-1621
2367-2369
3127-3129
65-223
864-865
1624-1625
2372-2373
3132-3133
64-223
862-863
1622-1623
2370-2371
3130-3131
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 17 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 17.
In an especially preferred embodiment a relevant portion of the amino acid sequence of Alfalfa mosaic virus CP is fused to the N-term of VP8*, separated by a short peptide linker as described above. Furthermore, the construct preferably contains an N-terminal signal peptide. The predicted N-glycosylation sites may be removed from VP8*. The region of Alfalfa mosaic virus CP may replace the relevant portion of WHcAg in all disclosed sequences or constructs.
Preferably, the inventive mRNA sequence comprises at least two coding regions, each encoding at least one epitope of a rotavirus protein respectively of a protein of a virus of the genus rotavirus, or of a fragment, variant or derivative thereof. Preferably, the mRNA sequence comprises at least three or four coding regions.
In preferred embodiments of this aspect of the invention the rotavirus proteins derive from different serotypes of the rotavirus, wherein preferably the at least one epitope is derived from the same protein. In particular preferred embodiment the at least one epitope is derived from VP8*. It is possible that the inventive mRNA construct comprises two or more coding regions each encoding the same or different epitopes of the same rotavirus protein, e.g. VP8*, of different serotypes, wherein two or more coding regions encoding the same epitope of the same rotavirus protein, e.g. VP8*, of different serotypes is preferred.
In further preferred embodiments of this aspect of the invention the two or more coding regions encodes epitopes of rotavirus proteins derived from the same serotype of rotavirus. It is possible that the mRNA sequence encodes different epitopes of the same rotavirus protein, preferably VP8*, or that the mRNA sequence encodes two or more copies (or repeats) of the same coding region encoding the same epitope or the same rotavirus protein, preferably VP8*, of the same serotype. It is also possible that the mRNA construct encodes epitopes of two or more different rotavirus proteins of the same or different serotypes.
To further illustrate this aspect of the invention the following examples of preferred inventive mRNA sequences comprise two or more repeats, for example four repeats (i.e. in summary four times the same sequence), of the same coding region encoding at least one epitope of a VP8* protein, or a fragment, variant or derivative thereof, of a virus of the genus rotavirus. The VP8* coding regions of the repeats are preferably combined with sequence sections encoding a helper peptide, especially P2 helper peptide of tetanus toxin, and/or a signal peptide for co-translational transport into the ER, especially tissue plasminogen activator, and/or a transmembrane domain, and/or a peptide or protein enabling VLP formation, especially derived from WHcAg, and/or a peptide linker, as disclosed above or fragments, variants or derivatives thereof.
In preferred embodiments the inventive mRNA sequence comprises at least one sequence section encoding a peptide linker according to SEQ ID NOs: 3174-3176
Preferably, the predicted N-glycosylation sites are mutated, as described above.
VP8* is a relatively small antigen. By the use of repeats of the coding region within the inventive mRNA sequence it is possible to increase antigen clustering, thereby increasing immune responses. Generally, the number of repeats is only limited by practical consideration. For example, four repeats of the VP8* coding region is suitable for the inventive mRNA sequence. In a particularly preferred embodiment of the invention the secreted form described above was used as a basis for this, so the repeat design also contains one N-terminal signal peptide (tissue plasminogen activator) followed by four repeats of VP8*, in which the predicted N-glycosylation sites were mutated (see therefore the preferred construct “secreted-repeat” in
In a preferred embodiment, the repeats of the coding region may be separated by a peptide linker as described above. The inventive mRNA sequence therefore may comprise at least one sequence section encoding a peptide linker according to SEQ ID NOs: 3174-3176
Preferred amino acid sequences are shown in Table 18.
TABLE 18
amino acid sequences comprising a signal peptide, 4 repeats of
mutated (mutated glycosylation sites) VP8* and the P2 helper peptide
Amino acids of VP8*
SEQ ID Nos.
2-240
825-827
Preferably, the inventive mRNA sequence comprises a coding region which encodes for the amino acid sequences according to any one of Table 18 or amino acid sequences which show an identity of at least 60%, preferably at least 70%, more preferably at least 80%, more preferably at least 90%, more preferably at least 95%, or most preferably at least 99% to the sequences as shown in Table 18.
Additionally to the amino acid sequences in Table 18 also the inventive mRNA sequences encoding the respective amino acid sequences are disclosed. In this context it is possible to apply unmodified nucleotide sequences or mRNA sequences wherein the G/C content of the coding region is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. Furthermore, mRNA sequences additionally comprising UTR sequences are disclosed in Table 19.
In especially preferred designs the inventive mRNA sequence comprises UTRs e.g. as 5′-UTR a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), as 3′-UTR a 3′-UTR derived from human albumin 3′-UTR (according to SEQ ID NO. 3205), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 19 Column D. Alternatively the inventive mRNA sequence comprises a 3′-UTR derived from alpha globin 3′-UTR (according to SEQ ID NO. 3199), a histone stem-loop (according to SEQ ID NO. 3207), a poly(A) sequence, and a poly(C) sequence. Respective mRNA sequences are shown in Table 19 Column C.
TABLE 19
mRNA sequences encoding a signal peptide, 4 repeats of mutated
(mutated glycosylation sites) VP8* and the P2 helper peptide
SEQ ID NO(s)
Amino acids
A
B
C
D
2-240
866-868
2374-2376
3134-3136
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences)
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence as shown in Table 19 or which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 85%, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences shown in Table 19.
The inventive mRNA sequence comprising two or more coding regions each encoding at least one epitope of a rotavirus protein as mentioned in the previous paragraphs may also be designed as an mRNA sequence with a multicistronic coding region, wherein the term “multicistronic” in this context is meant to describe an mRNA sequence that codes for two or more (same or different) translation products. By the use of an mRNA sequence with a multicistronic coding region it is possible to induce immune responses against different rotavirus serotypes and/or against different rotavirus antigens/proteins. Generally, it is known that in the context of rotavirus infections there is no cross-protection against infection between the different serotypes. The inventive mRNA sequence according to this aspect of the invention is multivalent to confer full protection against different rotavirus infections.
Each coding region may comprise independently from each other a helper peptide and/or a signal peptide and/or a transmembrane domain, and/or a peptide or protein enabling VLP formation, and/or a peptide linker. In each coding region at least one or all predicted glycosylation sites may be removed.
Preferably, the inventive mRNA sequence according to this aspect of the invention combines the coding sequence of two or more VP8* proteins derived from different serotypes, e.g. serotype P[4], P[6] and P[8]. Nevertheless, it is also possible to combine the coding regions of VP8* (or other rotavirus proteins) of one or more serotypes with the coding regions of one or more different antigens of rotavirus, wherein also antigens of different serotypes may be applied.
For the design of a multicistronic mRNA construct according to the invention the at least two coding regions (coding sections), each encoding at least one epitope of a rotavirus protein, are separated by intermitting sequences, for example by internal ribosomal entry sites (IRES). Preferred examples are IRES of encephalomyocarditis virus (EMCV) and/or IRES of foot-and-mouth disease virus (FMDV). By separating the coding sections by intermitting sequences it is ensured, that each coding section is translated into a separate peptide or protein which is able to induce an immune response. Another possibility to separate the coding sections of the multicistronic coding region is the use of further intermitting sequence sections encoding for a self-cleaving peptide. Preferred examples of self-cleaving peptides are F2A peptide derived from foot-and-mouth diseases virus or self-cleaving peptides from equine rhinitis A virus or Thosea asigna virus or porcine teschovirus-1.
A preferred example of an intermitting sequence is the nucleotide sequence of EMCV-IRES-4 according to SEQ ID NO: 3181 that may serve as a basis for advantageous designs of the inventive mRNA sequence.
Another preferred example is the nucleotide sequence of FMDV IRES strain C, isolate c-s8c1 (drived from GenBank: AJ133357.1; GI:6318187; 5′ UTR 578-1038; start codon defined from PMID: 8389904) according to SEQ ID NO: 3182 may serve as a basis for advantageous designs of the inventive mRNA sequence, wherein the alternative start codon on position 454-456 was removed by nucleotide changing T454A. Moreover, a point mutation according to PMID: 8389904: T86C was introduced. Moreover, the following sequences encoding self-cleaving peptides may be used, wherein SEQ ID NOs: 3177 and 3179 refer to F2Amod1 and SEQ ID NOs: 3178 and 3180 refer to F2Amod2.
Preferably, the inventive mRNA sequence comprises sequences according to any one of the SEQ ID NOs: 3179-3180.
It is further contemplated that different types of intermitting sequences (e.g. IRES sequences and sequences coding for self-cleaving peptides) can be used in combination in one multicistronic mRNA construct. For example, in a multicistronic construct comprising three coding regions, the intermitting sequence between the first and second coding region may be an IRES and the intermitting sequence between the second and third coding region may encode a self-cleaving peptide.
In particularly preferred embodiments of this aspect of the invention, the basic (P2 VP8*), secreted (P2 VP8* plus signal peptide) and membrane-bound (P2 VP8* plus transmembrane domain of Influenza HA plus signal peptide) forms or constructs of P2 VP8* of the serotypes P[8] and P[6] and P[4] as described above or as illustrated in
In a particularly preferred embodiment a multicistronic mRNA sequence encodes VP8* of serotype P[8] and serotype P[6] and serotype P[4].
Particularly preferred mRNA sequences are shown in Table 20.
TABLE 20
mRNA sequences encoding multicistronically P2 helper peptide together
with VP8* (wild type or mutated; full-length and shortened forms) of
serotype P[8] and serotype P[6] and serotype P[4] separated by an
IRES sequence or by a sequence encoding a self-cleaving peptide and
optionally encoding a transmembrane domain
SEQ ID NO(s)
Amino acids of VP8*
A
B
C
D
2-240
869-871
2377-2379
3137-3139
wild type
2-240
872-874
2380-2382
3140-3142
glycosylation sites
mutated
2-240
875-877
2383-2385
3143-3145
glycosylation sites
mutated and encoding
a transmembrane
domain
Column A = SEQ ID Nos. regarding wild type CDS (coding sequences
Column B = SEQ ID Nos. regarding CDS (coding sequences) with an increased G/C content
Column C = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 3′-UTR derived from alpha globin (according to SEQ ID No. 3199), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Column D = SEQ ID Nos. regarding mRNA sequences comprising a CDS with an increased G/C content and comprising a 5′-UTR derived from 32L TOP UTR (according to SEQ ID NO. 3189), a 3′-UTR derived from albumin 3′-UTR (according to SEQ ID No. 3205), a histon stem-loop sequence (according to SEQ ID No. 3207) and a poly(A) sequence and a poly(C) sequence.
Preferably, the inventive mRNA sequence comprises or consists of a nucleotide sequence which is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any of the sequences as shown in Table 20.
According to an especially preferred embodiment of the invention, the at least one coding region of the mRNA sequence is modified. Preferably the mRNA is stabilized by modifying and increasing the G (guanosine)/C (cytosine) content of the coding region of the mRNA thereof. Therein, the G/C content of the coding region of the mRNA is increased compared to the G/C content of the coding region of its particular wild type coding sequence, i.e. the unmodified mRNA. However, the encoded amino acid sequence of the mRNA is preferably not modified compared to the encoded amino acid sequence of the particular wild type/unmodified mRNA. The term wild type is to be understood according to the skilled person's general understanding in the art and denotes the nucleic acid or the mRNA in the form of its occurrence in nature without any mutation or nucleotide amendment by man.
The modification of the G/C content of the inventive mRNA sequence is based on the fact that RNA sequences having an increased G (guanosine)/C (cytosine) content are more stable than RNA sequences having an increased A (adenosine)/U (uracil) content. The codons of a coding sequence or a whole RNA might therefore be varied compared to the wild type coding sequence or mRNA, such that they include an increased amount of G/C nucleotides while the translated amino acid sequence is retained. In respect to the fact that several codons code for one and the same amino acid (so-called degeneration of the genetic code), the most favourable codons for the stability can be determined (so-called alternative codon usage). Preferably, the G/C content of the coding region of the mRNA according to the invention is increased by at least 7%, more preferably by at least 15%, particularly preferably by at least 20%, compared to the G/C content of the coding region of the wild type RNA. According to a specific embodiment at least 5%, 10%, 20%, 30%, 40%, 50%, 60%, more preferably at least 70%, even more preferably at least 80% and most preferably at least 90%, 95% or even 100% of the substitutable codons in the region coding for a protein or peptide as defined herein or its fragment or variant thereof or the whole sequence of the wild type mRNA sequence or coding sequence are substituted, thereby increasing the G/C content of said sequence. In this context, it is particularly preferable to increase the G/C content of the mRNA to the maximum (i.e. 100% of the substitutable codons), in particular in the coding region, compared to the wild type sequence.
By a further embodiment, the inventive mRNA sequence preferably comprises at least one of the following structural elements: a 5′- and/or 3′-untranslated region element (UTR element), particularly a 5′-UTR element which comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a TOP gene or from a fragment, homolog or a variant thereof, or a 5′- and/or 3′-UTR element which may be derivable from a gene that provides a stable mRNA or from a homolog, fragment or variant thereof; a histone stem-loop structure, preferably a histone stem-loop in its 3′ untranslated region; a 5′-CAP structure; a poly-A tail (poly(A) sequence); or a poly(C) sequence as will be outlined in more detail below.
In a preferred embodiment the mRNA comprises at least one 5′- or 3′-UTR element. In this context an UTR element comprises or consists of a nucleic acid sequence which is derived from the 5′- or 3′-UTR of any naturally occurring gene or which is derived from a fragment, a homolog or a variant of the 5′- or 3′-UTR of a gene. Preferably, the 5′- or 3′-UTR element used according to the present invention is heterologous to the coding region of the inventive mRNA sequence. Even if 5′- or 3′-UTR elements derived from naturally occurring genes are preferred, also synthetically engineered UTR elements may be used in the context of the present invention.
In a particularly preferred embodiment the inventive mRNA sequence further comprises at least on 3′-UTR element which is derived from a 3′-UTR element of a gene, thereby providing a stabilization of the mRNA. Particularly preferred is a 3′-UTR element which comprises or consists of a nucleic acid sequence derived from the 3′-UTR of a chordate gene, preferably a vertebrate gene, more preferably a mammalian gene, most preferably a human gene, or from a variant of the 3′-UTR of a chordate gene, preferably a vertebrate gene, more preferably a mammalian gene, most preferably a human gene.
The term ‘3′-UTR element’ refers to a nucleic acid sequence which comprises or consists of a nucleic acid sequence that is derived from a 3′-UTR or from a variant of a 3′-UTR. A 3′-UTR element in the sense of the present invention may represent the 3′-UTR of an mRNA. Thus, in the sense of the present invention, preferably, a 3′-UTR element may be the 3′-UTR of an mRNA, preferably of an artificial mRNA, or it may be the transcription template for a 3′-UTR of an mRNA. Thus, a 3′-UTR element preferably is a nucleic acid sequence which corresponds to the 3′-UTR of an mRNA, preferably to the 3′-UTR of an artificial mRNA, such as an mRNA obtained by transcription of a genetically engineered vector construct. Preferably, the 3′-UTR element fulfils the function of a 3′-UTR or encodes a sequence which fulfils the function of a 3′-UTR.
Preferably, the mRNA comprises a 3′-UTR element which may be derivable from a gene that relates to an mRNA with an enhanced half-life (that provides a stable mRNA), for example a 3′-UTR element as defined and described below.
In a particularly preferred embodiment, the 3′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of a gene selected from the group consisting of an albumin gene, an α-globin gene, a β-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen alpha gene, such as a collagen alpha 1(I) gene, or from a variant of a 3′-UTR of a gene selected from the group consisting of an albumin gene, an α-globin gene, a β-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, and a collagen alpha gene, such as a collagen alpha 1(I) gene according to SEQ ID NO: 1369-1390 of the patent application WO2013/143700 whose disclosure is incorporated herein by reference. In a particularly preferred embodiment, the 3′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of an albumin gene, preferably a vertebrate albumin gene, more preferably a mammalian albumin gene, most preferably a human albumin gene according SEQ ID NO: 1369 of the patent application WO2013/143700. The mRNA sequence may comprise or consist of a nucleic acid sequence which is derived from the 3′-UTR of the human albumin gene according to GenBank Accession number NM_000477.5, or from a fragment or variant thereof.
In this context it is particularly preferred that the inventive mRNA sequence comprises a 3′-UTR element comprising a corresponding RNA sequence derived from the nucleic acids according to SEQ ID NO: 1369-1390 of the patent application WO2013/143700 or a fragment, homolog or variant thereof.
Most preferably the 3′-UTR element comprises the nucleic acid sequence derived from a fragment of the human albumin gene according to SEQ ID NO: 1376 of the patent application WO2013/143700.
In another particularly preferred embodiment, the 3′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 3′-UTR of an α-globin or β-globin gene, preferably a vertebrate α- or β-globin gene, more preferably a mammalian α- or β-globin gene, most preferably a human α- or β-globin gene according to SEQ ID NO: 1370 of the patent application WO2013/143700 (3′-UTR of Homo sapiens hemoglobin, alpha 1 (HBA1)), or according to SEQ ID NO: 1371 of the patent application WO2013/143700 (3′-UTR of Homo sapiens hemoglobin, alpha 2 (HBA2)), or according to SEQ ID NO: 1372 of the patent application WO2013/143700 (3′-UTR of Homo sapiens hemoglobin, beta (HBB)).
For example, the 3′-UTR element may comprise or consist of the center, α-complex-binding portion of the 3′-UTR of an α-globin gene, corresponding to SEQ ID NO: 1393 of the patent application WO2013/143700.
In this context it is particularly preferred that the 3′-UTR element of the inventive mRNA sequence comprises or consists of a corresponding RNA sequence of the nucleic acid sequence according to the above or a homolog, a fragment or variant thereof.
The term ‘a nucleic acid sequence which is derived from the 3′-UTR of a [ . . . ] gene’ preferably refers to a nucleic acid sequence which is based on the 3′-UTR sequence of a [ . . . ] gene or on a part thereof, such as on the 3′-UTR of an albumin gene, an α-globin gene, a β-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, or a collagen alpha gene, such as a collagen alpha 1(I) gene, preferably of an albumin gene or on a part thereof. This term includes sequences corresponding to the entire 3′-UTR sequence, i.e. the full length 3′-UTR sequence of a gene, and sequences corresponding to a fragment of the 3′-UTR sequence of a gene, such as an albumin gene, α-globin gene, β-globin gene, tyrosine hydroxylase gene, lipoxygenase gene, or collagen alpha gene, such as a collagen alpha 1(I) gene, preferably of an albumin gene.
The term ‘a nucleic acid sequence which is derived from a variant of the 3′-UTR of a [ . . . ] gene’ preferably refers to a nucleic acid sequence which is based on a variant of the 3′-UTR sequence of a gene, such as on a variant of the 3′-UTR of an albumin gene, an α-globin gene, a β-globin gene, a tyrosine hydroxylase gene, a lipoxygenase gene, or a collagen alpha gene, such as a collagen alpha 1(I) gene, or on a part thereof as described above. This term includes sequences corresponding to the entire sequence of the variant of the 3′-UTR of a gene, i.e. the full length variant 3′-UTR sequence of a gene, and sequences corresponding to a fragment of the variant 3′-UTR sequence of a gene. A fragment in this context preferably consists of a continuous stretch of nucleotides corresponding to a continuous stretch of nucleotides in the full-length variant 3′-UTR, which represents at least 20%, preferably at least 30%, more preferably at least 40%, more preferably at least 50%, even more preferably at least 60%, even more preferably at least 70%, even more preferably at least 80%, and most preferably at least 90% of the full-length variant 3′-UTR. Such a fragment of a variant, in the sense of the present invention, is preferably a functional fragment of a variant as described herein.
In especially preferred embodiments of the inventive mRNA sequence the 3′-UTR element is derived form a nucleic acid sequence according to SEQ ID NO: 3203 (3′-UTR of human albumin gene) or SEQ ID NO: 3199 (3′-UTR of α-globin gene) or from a corresponding RNA sequence, a homolog, a fragment of a variant thereof.
In a particularly preferred embodiment the mRNA sequence comprises at least one 5′-untranslated region element (5′-UTR element) which comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a TOP gene or which is derived from corresponding RNA sequence, a fragment, homolog or variant of the 5′-UTR of a TOP gene, wherein it is particularly preferred that the 5′-UTR element does not comprise a TOP-motif or a 5′-TOP, as defined above.
In some embodiments, the nucleic acid sequence of the 5′-UTR element which is derived from a 5′-UTR of a TOP gene terminates at its 3′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 upstream of the start codon (e.g. A(U/T)G) of the gene or mRNA it is derived from. Thus, the 5′-UTR element does not comprise any part of the protein coding region. Thus, preferably, the only protein coding part of the inventive mRNA is provided by the VP8* coding region and optionally further sequence sections as described above.
The nucleic acid sequence which is derived from the 5′-UTR of a TOP gene is preferably derived from a eukaryotic TOP gene, preferably a plant or animal TOP gene, more preferably a chordate TOP gene, even more preferably a vertebrate TOP gene, most preferably a mammalian TOP gene, such as a human TOP gene.
For example, the 5′-UTR element is preferably selected from 5′-UTR elements comprising or consisting of a nucleic acid sequence which is derived from a nucleic acid sequence selected from the group consisting of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, whose disclosure is incorporated herein by reference, from the homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from a variant thereof, or preferably from a corresponding RNA sequence. The term “homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700” refers to sequences of other species than Homo sapiens, which are homologous to the sequences according to SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700.
In a preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a nucleic acid sequence extending from nucleotide position 5 (i.e. the nucleotide that is located at position 5 in the sequence) to the nucleotide position immediately 5′ to the start codon (located at the 3′ end of the sequences), e.g. the nucleotide position immediately 5′ to the ATG sequence, of a nucleic acid sequence selected from SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from the homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700 from a variant thereof, or a corresponding RNA sequence. It is particularly preferred that the 5′-UTR element is derived from a nucleic acid sequence extending from the nucleotide position immediately 3′ to the 5′-TOP to the nucleotide position immediately 5′ to the start codon (located at the 3′ end of the sequences), e.g. the nucleotide position immediately 5′ to the ATG sequence, of a nucleic acid sequence selected from SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from the homologs of SEQ ID NOs: 1-1363, SEQ ID NO: 1395, SEQ ID NO: 1421 and SEQ ID NO: 1422 of the patent application WO2013/143700, from a variant thereof, or a corresponding RNA sequence.
In a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a TOP gene encoding a ribosomal protein or from a variant of a 5′-UTR of a TOP gene encoding a ribosomal protein. For example, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a nucleic acid sequence according to any of SEQ ID NOs: 67, 170, 193, 244, 259, 554, 650, 675, 700, 721, 913, 1016, 1063, 1120, 1138, and 1284-1360 of the patent application WO2013/143700, a corresponding RNA sequence, a homolog thereof, or a variant thereof as described herein, preferably lacking the 5′-TOP motif. As described above, the sequence extending from position 5 to the nucleotide immediately 5′ to the ATG (which is located at the 3′end of the sequences) corresponds to the 5′-UTR of said sequences. Preferably, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a TOP gene encoding a ribosomal large protein (RPL) or from a homolog, a fragment or variant of a 5′-UTR of a TOP gene encoding a ribosomal large protein (RPL). For example, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from a 5′-UTR of a nucleic acid sequence according to any of SEQ ID NOs: 67, 259, 1284-1318, 1344, 1346, 1348-1354, 1357, 1358, 1421 and 1422 of the patent application WO2013/143700, a corresponding RNA sequence, a homolog thereof, or a variant thereof as described herein, preferably lacking the 5′-TOP motif.
In a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a ribosomal protein Large 32 gene, preferably from a vertebrate ribosomal protein Large 32 (L32) gene, more preferably from a mammalian ribosomal protein Large 32 (L32) gene, most preferably from a human ribosomal protein Large 32 (L32) gene, or from a variant of the 5′-UTR of a ribosomal protein Large 32 gene, preferably from a vertebrate ribosomal protein Large 32 (L32) gene, more preferably from a mammalian ribosomal protein Large 32 (L32) gene, most preferably from a human ribosomal protein Large 32 (L32) gene, wherein preferably the 5′-UTR element does not comprise the 5′-TOP of said gene. A preferred sequence for a 5′-UTR element corresponds to SEQ ID NO: 1368 of the patent application WO2013/143700.
In a preferred embodiment of the inventive mRNA construct the 5′-UTR element is derived from a nucleic acid sequence according to SEQ ID NO: 3189 or from a corresponding RNA sequence, a homolog, a fragment or a variant thereof.
Accordingly, in a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which has an identity of at least about 20%, preferably of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence as mentioned above, wherein, preferably, the fragment is as described above, i.e. being a continuous stretch of nucleotides representing at least 20% etc. of the full-length 5′-UTR. Preferably, the fragment exhibits a length of at least about 20 nucleotides or more, preferably of at least about 30 nucleotides or more, more preferably of at least about 40 nucleotides or more. Preferably, the fragment is a functional fragment as described herein.
In some embodiments, the inventive mRNA sequence comprises a 5′-UTR element which comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a vertebrate TOP gene, such as a mammalian, e.g. a human TOP gene, selected from RPSA, RPS2, RPS3, RPS3A, RPS4, RPS5, RPS6, RPS7, RPS8, RPS9, RPS10, RPS11, RPS12, RPS13, RPS14, RPS15, RPS15A, RPS16, RPS17, RPS18, RPS19, RPS20, RPS21, RPS23, RPS24, RPS25, RPS26, RPS27, RPS27A, RPS28, RPS29, RPS30, RPL3, RPL4, RPL5, RPL6, RPL7, RPL7A, RPL8, RPL9, RPL10, RPL10A, RPL11, RPL12, RPL13, RPL13A, RPL14, RPL15, RPL17, RPL18, RPL18A, RPL19, RPL21, RPL22, RPL23, RPL23A, RPL24, RPL26, RPL27, RPL27A, RPL28, RPL29, RPL30, RPL31, RPL32, RPL34, RPL35, RPL35A, RPL36, RPL36A, RPL37, RPL37A, RPL38, RPL39, RPL40, RPL41, RPLP0, RPLP1, RPLP2, RPLP3, RPLP0, RPLP1, RPLP2, EEF1A1, EEF1B2, EEF1D, EEF1G, EEF2, EIF3E, EIF3F, EIF3H, EIF2S3, EIF3C, EIF3K, EIF3EIP, EIF4A2, PABPC1, HNRNPA1, TPT1, TUBB1, UBA52, NPM1, ATP5G2, GNB2L1, NM E2, UQCRB, or from a homolog or variant thereof, wherein preferably the 5′-UTR element does not comprise a TOP-motif or the 5′-TOP of said genes, and wherein optionally the 5′-UTR element starts at its 5′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 downstream of the 5′ terminal oligopyrimidine tract (TOP) and wherein further optionally the 5′-UTR element which is derived from a 5′-UTR of a TOP gene terminates at its 3′-end with a nucleotide located at position 1, 2, 3, 4, 5, 6, 7, 8, 9 or 10 upstream of the start codon (A(U/T)G) of the gene it is derived from.
In further particularly preferred embodiments, the 5′-UTR element comprises or consists of a nucleic acid sequence which is derived from the 5′-UTR of a ribosomal protein Large 32 gene (RPL32), a ribosomal protein Large 35 gene (RPL35), a ribosomal protein Large 21 gene (RPL21), an ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, an hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), an androgen-induced 1 gene (AIG1), cytochrome c oxidase subunit Vic gene (COX6C), or a N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, preferably from a vertebrate ribosomal protein Large 32 gene (RPL32), a vertebrate ribosomal protein Large 35 gene (RPL35), a vertebrate ribosomal protein Large 21 gene (RPL21), a vertebrate ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a vertebrate hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a vertebrate androgen-induced 1 gene (AIG1), a vertebrate cytochrome c oxidase subunit Vic gene (COX6C), or a vertebrate N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, more preferably from a mammalian ribosomal protein Large 32 gene (RPL32), a ribosomal protein Large 35 gene (RPL35), a ribosomal protein Large 21 gene (RPL21), a mammalian ATP synthase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a mammalian hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a mammalian androgen-induced 1 gene (AIG1), a mammalian cyto-chrome c oxidase subunit Vic gene (COX6C), or a mammalian N-acylsphingosine ami-dohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, most preferably from a human ribosomal protein Large 32 gene (RPL32), a human ribosomal protein Large 35 gene (RPL35), a human ribosomal protein Large 21 gene (RPL21), a human ATP syn-thase, H+ transporting, mitochondrial F1 complex, alpha subunit 1, cardiac muscle (ATP5A1) gene, a human hydroxysteroid (17-beta) dehydrogenase 4 gene (HSD17B4), a human androgen-induced 1 gene (AIG1), a human cytochrome c oxidase subunit Vic gene (COX6C), or a human N-acylsphingosine amidohydrolase (acid ceramidase) 1 gene (ASAH1) or from a variant thereof, wherein preferably the 5′-UTR element does not comprise the 5′-TOP of said gene.
Accordingly, in a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which has an identity of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 1368, or SEQ ID NOs: 1412-1420 of the patent application WO2013/143700, or a corresponding RNA sequence, or wherein the at least one 5′-UTR element comprises or consists of a fragment of a nucleic acid sequence which has an identity of at least about 20%, preferably of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according to SEQ ID NO: 1368, or SEQ ID NOs: 1412-1420 of the patent application WO2013/143700, wherein, preferably, the fragment is as described above, i.e. being a continuous stretch of nucleotides representing at least 20% etc. of the full-length 5′-UTR. Preferably, the fragment exhibits a length of at least about 20 nucleotides or more, preferably of at least about 30 nucleotides or more, more preferably of at least about 40 nucleotides or more. Preferably, the fragment is a functional fragment as described herein.
Accordingly, in a particularly preferred embodiment, the 5′-UTR element comprises or consists of a nucleic acid sequence which has an identity of at least about 20%, preferably of at least about 40%, preferably of at least about 50%, preferably of at least about 60%, preferably of at least about 70%, more preferably of at least about 80%, more preferably of at least about 90%, even more preferably of at least about 95%, even more preferably of at least about 99% to the nucleic acid sequence according SEQ ID NO: 1414 of the patent application WO2013/143700 (5′-UTR of ATP5A1 lacking the 5′ terminal oligopyrimidine tract) or preferably to a corresponding RNA sequence, wherein, preferably, the fragment is as described above, i.e. being a continuous stretch of nucleotides representing at least 20% etc. of the full-length 5′-UTR. Preferably, the fragment exhibits a length of at least about 20 nucleotides or more, preferably of at least about 30 nucleotides or more, more preferably of at least about 40 nucleotides or more. Preferably, the fragment is a functional fragment as described herein.
Preferably, the at least one 5′-UTR element and the at least one 3′-UTR element act synergistically to increase protein production from the inventive mRNA sequence as described above.
In a particularly preferred embodiment, the inventive mRNA sequence comprises a histone stem-loop sequence/structure. Such histone stem-loop sequences are preferably selected from histone stem-loop sequences as disclosed in WO 2012/019780, whose disclosure is incorporated herewith by reference.
A histone stem-loop sequence, suitable to be used within the present invention, is preferably selected from at least one of the following formulae (I) or (II):
formula (I) (stem-loop sequence without stem bordering elements):
##STR00001##
formula (II) (stem-loop sequence with stem bordering elements):
##STR00002##
wherein:
stem1 or
is a consecutive sequence of 1 to 6, preferably of 2 to 6,
stem2
more preferably of 2 to 5, even more preferably of 3 to
bordering
5, most preferably of 4 to 5 or 5 N, wherein each N is
elements N1-6
independently from another selected from a nucleotide
selected from A, U, T, G and C, or a nucleotide analogue
thereof;
stem1
is reverse complementary or partially reverse
[N0-2GN3-5]
complementary with element stem2, and is a
consecutive sequence between of 5 to 7 nucleotides;
wherein N0-2 is a consecutive sequence of 0 to 2, preferably of 0 to 1, more preferably of 1 N, wherein each N is independently from another selected from a nucleotide selected from A, U, T, G and C or a nucleotide analogue thereof;
wherein N3-5 is a consecutive sequence of 3 to 5, preferably of 4 to 5, more preferably of 4 N, wherein each N is independently from another selected from a nucleotide selected from A, U, T, G and C or a nucleotide analogue thereof, and
wherein G is guanosine or an analogue thereof, and may be optionally replaced by a cytidine or an analogue thereof, provided that its complementary nucleotide cytidine in stem2 is replaced by guanosine;
loop sequence
is located between elements steml and stem2, and is a
[N0-4(U/T)N0-4]
consecutive sequence of 3 to 5 nucleotides, more
preferably of 4 nucleotides;
wherein each N0-4 is independent from another a consecutive sequence of 0 to 4, preferably of 1 to 3, more preferably of 1 to 2 N, wherein each N is independently from another selected from a nucleotide selected from A, U, T, G and C or a nucleotide analogue thereof; and
wherein U/T represents uridine, or optionally thymidine;
stem2 [N3-5CN0-2]
is reverse complementary or partially reverse
complementary with element stem1, and is a
consecutive sequence between of 5 to 7 nucleotides;
wherein N3-5 is a consecutive sequence of 3 to 5, preferably of 4 to 5, more preferably of 4 N, wherein each N is independently from another selected from a nucleotide selected from A, U, T, G and C or a nucleotide analogue thereof;
wherein N0-2 is a consecutive sequence of 0 to 2, preferably of 0 to 1, more preferably of 1 N, wherein each N is independently from another selected from a nucleotide selected from A, U, T, G or C or a nucleotide analogue thereof; and
wherein C is cytidine or an analogue thereof, and may be optionally replaced by a guanosine or an analogue thereof provided that its complementary nucleoside guanosine in stem1 is replaced by cytidine;
wherein stem1 and stem2 are capable of base pairing with each other forming a reverse complementary sequence, wherein base pairing may occur between stem1 and stem2, e.g. by Watson-Crick base pairing of nucleotides A and U/T or G and C or by non-Watson-Crick base pairing e.g. wobble base pairing, reverse Watson-Crick base pairing, Hoogsteen base pairing, reverse Hoogsteen base pairing or are capable of base pairing with each other forming a partially reverse complementary sequence, wherein an incomplete base pairing may occur between stem1 and stem2, on the basis that one or more bases in one stem do not have a complementary base in the reverse complementary sequence of the other stem.
According to a further preferred embodiment of the first inventive aspect, the inventive mRNA sequence may comprise at least one histone stem-loop sequence according to at least one of the following specific formulae (Ia) or (IIa):
formula (Ia) (stem-loop sequence without stem bordering elements):
##STR00003##
formula (IIa) (stem-loop sequence with stem bordering elements):
##STR00004##
wherein N, C, G, T and U are as defined above.
According to a further more particularly preferred embodiment of the first aspect, the inventive mRNA sequence may comprise at least one histone stem-loop sequence according to at least one of the following specific formulae (Ib) or (IIb):
formula (Ib) (stem-loop sequence without stem bordering elements):
##STR00005##
formula (IIb) (stem-loop sequence with stem bordering elements):
##STR00006##
wherein N, C, G, T and U are as defined above.
In a preferred embodiment of the inventive mRNA sequence the histone stem-loop sequence is derived from a nucleic acid sequence according to SEQ ID NO: 3206 or from a corresponding RNA sequence (according to SEQ ID NO: 3207), a homolog, a fragment or a variant thereof.
In a particular preferred embodiment, the inventive mRNA sequence comprises, additionally to the rotavirus protein coding region and possibly further sequence sections, as described above, a poly(A) sequence, also called poly-A tail, preferably at the 3′ terminus of the mRNA. When present, such a poly(A) sequence comprises a sequence of about 25 to about 400 adenosine nucleotides, preferably a sequence of about 50 to about 400 adenosine nucleotides, more preferably a sequence of about 50 to about 300 adenosine nucleotides, even more preferably a sequence of about 50 to about 250 adenosine nucleotides, most preferably a sequence of about 60 to about 250 adenosine nucleotides. In this context the term “about” refers to a deviation of ±10% of the value(s) it is attached to. Mostly preferred is a poly-A tail of 64 adenosine nucleotide. The poly(A) sequence is preferably located 3′ of the coding region comprised in the mRNA according to the invention.
According to a further preferred embodiment, the inventive mRNA sequence can be modified by a sequence of at least 10 cytosines, preferably at least 20 cytosines, more preferably at least 30 cytosines (so-called “poly(C) sequence”). Particularly, the mRNA may contain a poly(C) sequence of typically about 10 to 200 cytosine nucleotides, preferably about 10 to 100 cytosine nucleotides, more preferably about 10 to 70 cytosine nucleotides or even more preferably about 20 to 50 or even 20 to 30 cytosine nucleotides. Mostly preferred is a poly(C) sequence of 30 cytosine nucleotides. The poly(C) sequence is preferably located 3′ of the coding region, more preferably 3′ of an optional poly(A) sequence comprised in the mRNA according to the present invention.
For further improvement of the resistance to e.g. in vivo degradation (e.g. by an exo- or endonuclease), the inventive mRNA sequence is provided as a stabilized nucleic acid, e.g. in the form of a modified nucleic acid. In this context the G/C content is preferably increased as outlined above. According to further preferred embodiments of the invention the mRNA is further stabilized, preferably by backbone modifications, sugar modifications and/or base modifications. All of these modifications may be introduced into the mRNA without impairing the mRNA's function to be translated in the host cell.
A backbone modification in the context of the present invention is preferably a modification in which phosphates of the backbone of the nucleotides contained in the mRNA are chemically modified, e.g. anionic internucleoside linkage, N3′→P5′ modifications, replacement of non-bridging oxygen atoms by boranes, neutral internucleoside linkage, amide linkage of the nucleosides, methylene(methylimino) linkages, formacetal and thioformacetal linkages, introduction of sulfonyl groups, or the like.
A sugar modification in the context of the present invention is preferably a chemical modification of the sugar of the nucleotides of the mRNA, e.g. methylation of the ribose residue or the like.
Further details about the chemical modification of the RNA, especially the mRNA, will be apparent from the following, wherein the term “RNA modification” as used herein may refer to chemical modifications comprising sugar modifications, backbone modifications as well as base modifications or lipid modifications. In this context, a modified RNA molecule as defined herein may contain nucleotide analogues/modifications, e.g. backbone modifications, sugar modifications or base modifications. A backbone modification in connection with the present invention is a modification, in which phosphates of the backbone of the nucleotides contained in an RNA molecule as defined herein are chemically modified. A sugar modification in connection with the present invention is a chemical modification of the sugar of the nucleotides of the RNA molecule as defined herein. Furthermore, a base modification in connection with the present invention is a chemical modification of the base moiety of the nucleotides of the RNA molecule. In this context, nucleotide analogues or modifications are preferably selected from nucleotide analogues, which are applicable for transcription and/or translation.
The modified nucleosides and nucleotides, which may be incorporated into a modified RNA molecule as described herein, can be modified in the sugar moiety. For example, the 2′ hydroxyl group (OH) can be modified or replaced with a number of different “oxy” or “deoxy” substituents. Examples of “oxy”-2′ hydroxyl group modifications include, but are not limited to, alkoxy or aryloxy (—OR, e.g., R═H, alkyl, cycloalkyl, aryl, aralkyl, heteroaryl or sugar); polyethyleneglycols (PEG), —O(CH2CH2O)nCH2CH2OR; “locked” nucleic acids (LNA) in which the 2′ hydroxyl is connected, e.g., by a methylene bridge, to the 4′ carbon of the same ribose sugar; and amino groups (—O-amino, wherein the amino group, e.g., NRR, can be alkylamino, dialkylamino, heterocyclyl, arylamino, diarylamino, heteroarylamino, or diheteroaryl amino, ethylene diamine, polyamino) or aminoalkoxy.
“Deoxy” modifications include hydrogen, amino (e.g. NH2; alkylamino, dialkylamino, heterocyclyl, arylamino, diaryl amino, heteroaryl amino, diheteroaryl amino, or amino acid); or the amino group can be attached to the sugar through a linker, wherein the linker comprises one or more of the atoms C, N, and O.
The sugar group can also contain one or more carbons that possess the opposite stereochemical configuration than that of the corresponding carbon in ribose. Thus, a modified RNA molecule can include nucleotides containing, for instance, arabinose as the sugar.
The phosphate backbone may further be modified in the modified nucleosides and nucleotides, which may be incorporated into a modified RNA molecule as described herein. The phosphate groups of the backbone can be modified by replacing one or more of the oxygen atoms with a different substituent. Further, the modified nucleosides and nucleotides can include the full replacement of an unmodified phosphate moiety with a modified phosphate as described herein. Examples of modified phosphate groups include, but are not limited to, phosphorothioate, phosphoroselenates, borano phosphates, borano phosphate esters, hydrogen phosphonates, phosphoroamidates, alkyl or aryl phosphonates and phosphotriesters. Phosphorodithioates have both non-linking oxygens replaced by sulfur. The phosphate linker can also be modified by the replacement of a linking oxygen with nitrogen (bridged phosphoroamidates), sulfur (bridged phosphorothioates) and carbon (bridged methylene-phosphonates).
The modified nucleosides and nucleotides, which may be incorporated into a modified RNA molecule as described herein, can further be modified in the nucleobase moiety. Examples of nucleobases found in RNA include, but are not limited to, adenine, guanine, cytosine and uracil. For example, the nucleosides and nucleotides described herein can be chemically modified on the major groove face. In some embodiments, the major groove chemical modifications can include an amino group, a thiol group, an alkyl group, or a halo group.
In particularly preferred embodiments of the present invention, the nucleotide analogues/modifications are selected from base modifications, which are preferably selected from 2-amino-6-chloropurineriboside-5′-triphosphate, 2-Aminopurine-riboside-5′-triphosphate; 2-aminoadenosine-5′-triphosphate, 2′-Amino-2′-deoxycytidine-triphosphate, 2-thiocytidine-5′-triphosphate, 2-thiouridine-5′-triphosphate, 2′-Fluorothymidine-5′-triphosphate, 2′-O-Methyl inosine-5′-triphosphate 4-thiouridine-5′-triphosphate, 5-aminoallylcytidine-5′-triphosphate, 5-aminoallyluridine-5′-triphosphate, 5-bromocytidine-5′-triphosphate, 5-bromouridine-5′-triphosphate, 5-Bromo-2′-deoxycytidine-5′-triphosphate, 5-Bromo-2′-deoxyuridine-5′-triphosphate, 5-iodocytidine-5′-triphosphate, 5-Iodo-2′-deoxycytidine-5′-triphosphate, 5-iodouridine-5′-triphosphate, 5-Iodo-2′-deoxyuridine-5′-triphosphate, 5-methylcytidine-5′-triphosphate, 5-methyluridine-5′-triphosphate, 5-Propynyl-2′-deoxycytidine-5′-triphosphate, 5-Propynyl-2′-deoxyuridine-5′-triphosphate, 6-azacytidine-5′-triphosphate, 6-azauridine-5′-triphosphate, 6-chloropurineriboside-5′-triphosphate, 7-deazaadenosine-5′-triphosphate, 7-deazaguanosine-5′-triphosphate, 8-azaadenosine-5′-triphosphate, 8-azidoadenosine-5′-triphosphate, benzimidazole-riboside-5′-triphosphate, N1-methyladenosine-5′-triphosphate, N1-methylguanosine-5′-triphosphate, N6-methyladenosine-5′-triphosphate, 06-methylguanosine-5′-triphosphate, pseudouridine-5′-triphosphate, or puromycin-5′-triphosphate, xanthosine-5′-triphosphate. Particular preference is given to nucleotides for base modifications selected from the group of base-modified nucleotides consisting of 5-methylcytidine-5′-triphosphate, 7-deazaguanosine-5′-triphosphate, 5-bromocytidine-5′-triphosphate, and pseudouridine-5′-triphosphate.
In some embodiments, modified nucleosides include pyridin-4-one ribonucleoside, 5-aza-uridine, 2-thio-5-aza-uridine, 2-thiouridine, 4-thio-pseudouridine, 2-thio-pseudouridine, 5-hydroxyuridine, 3-methyluridine, 5-carboxymethyl-uridine, 1-carboxymethyl-pseudouridine, 5-propynyl-uridine, 1-propynyl-pseudouridine, 5-taurinomethyluridine, 1-taurinomethyl-pseudouridine, 5-taurinomethyl-2-thio-uridine, 1-taurinomethyl-4-thio-uridine, 5-methyl-uridine, 1-methyl-pseudouridine, 4-thio-1-methyl-pseudouridine, 2-thio-1-methyl-pseudouridine, 1-methyl-1-deaza-pseudouridine, 2-thio-1-methyl-1-deaza-pseudouridine, dihydrouridine, dihydropseudouridine, 2-thio-dihydrouridine, 2-thio-dihydropseudouridine, 2-methoxyuridine, 2-methoxy-4-thio-uridine, 4-methoxy-pseudouridine, and 4-methoxy-2-thio-pseudouridine.
In some embodiments, modified nucleosides include 5-aza-cytidine, pseudoisocytidine, 3-methyl-cytidine, N4-acetylcytidine, 5-formylcytidine, N4-methylcytidine, 5-hydroxymethylcytidine, 1-methyl-pseudoisocytidine, pyrrolo-cytidine, pyrrolo-pseudoisocytidine, 2-thio-cytidine, 2-thio-5-methyl-cytidine, 4-thio-pseudoisocytidine, 4-thio-1-methyl-pseudoisocytidine, 4-thio-1-methyl-1-deaza-pseudoisocytidine, 1-methyl-1-deaza-pseudoisocytidine, zebularine, 5-aza-zebularine, 5-methyl-zebularine, 5-aza-2-thio-zebularine, 2-thio-zebularine, 2-methoxy-cytidine, 2-methoxy-5-methyl-cytidine, 4-methoxy-pseudoisocytidine, and 4-methoxy-1-methyl-pseudoisocytidine.
In other embodiments, modified nucleosides include 2-aminopurine, 2, 6-diaminopurine, 7-deaza-adenine, 7-deaza-8-aza-adenine, 7-deaza-2-aminopurine, 7-deaza-8-aza-2-aminopurine, 7-deaza-2,6-diaminopurine, 7-deaza-8-aza-2,6-diaminopurine, 1-methyladenosine, N6-methyladenosine, N6-isopentenyladenosine, N6-(cis-hydroxyisopentenyl)adenosine, 2-methylthio-N6-(cis-hydroxyisopentenyl) adenosine, N6-glycinylcarbamoyladenosine, N6-threonylcarbamoyladenosine, 2-methylthio-N6-threonyl carbamoyladenosine, N6,N6-dimethyladenosine, 7-methyladenine, 2-methylthio-adenine, and 2-methoxy-adenine.
In other embodiments, modified nucleosides include inosine, 1-methyl-inosine, wyosine, wybutosine, 7-deaza-guanosine, 7-deaza-8-aza-guanosine, 6-thio-guanosine, 6-thio-7-deaza-guanosine, 6-thio-7-deaza-8-aza-guanosine, 7-methyl-guanosine, 6-thio-7-methyl-guanosine, 7-methylinosine, 6-methoxy-guanosine, 1-methylguanosine, N2-methylguanosine, N2,N2-dimethylguanosine, 8-oxo-guanosine, 7-methyl-8-oxo-guanosine, 1-methyl-6-thio-guanosine, N2-methyl-6-thio-guanosine, and N2,N2-dimethyl-6-thio-guanosine.
In some embodiments, the nucleotide can be modified on the major groove face and can include replacing hydrogen on C-5 of uracil with a methyl group or a halo group. In specific embodiments, a modified nucleoside is 5′-O-(1-Thiophosphate)-Adenosine, 5′-O-(1-Thiophosphate)-Cytidine, 5′-O-(1-Thiophosphate)-Guanosine, 5′-O-(1-Thiophosphate)-Uridine or 5′-O-(1-Thiophosphate)-Pseudouridine.
In further specific embodiments, a modified RNA may comprise nucleoside modifications selected from 6-aza-cytidine, 2-thio-cytidine, α-thio-cytidine, Pseudo-iso-cytidine, 5-aminoallyl-uridine, 5-iodo-uridine, N1-methyl-pseudouridine, 5,6-dihydrouridine, α-thio-uridine, 4-thio-uridine, 6-aza-uridine, 5-hydroxy-uridine, deoxy-thymidine, 5-methyl-uridine, Pyrrolo-cytidine, inosine, α-thio-guanosine, 6-methyl-guanosine, 5-methyl-cytdine, 8-oxo-guanosine, 7-deaza-guanosine, N1-methyl-adenosine, 2-amino-6-Chloro-purine, N6-methyl-2-amino-purine, Pseudo-iso-cytidine, 6-Chloro-purine, N6-methyl-adenosine, α-thio-adenosine, 8-azido-adenosine, 7-deaza-adenosine.
Further nucleotide analogues are such as those disclosed in WO2013/052523, WO2014093924 WO2015051173, WO2015/051169 and WO2015/089511.
According to a further embodiment, a modified RNA molecule as defined herein can contain a lipid modification. Such a lipid-modified RNA molecule typically comprises an RNA molecule as defined herein. Such a lipid-modified RNA molecule as defined herein typically further comprises at least one linker covalently linked with that RNA molecule, and at least one lipid covalently linked with the respective linker. Alternatively, the lipid-modified RNA molecule comprises at least one RNA molecule as defined herein and at least one (bifunctional) lipid covalently linked (without a linker) with that RNA molecule. According to a third alternative, the lipid-modified RNA molecule comprises an RNA molecule as defined herein, at least one linker covalently linked with that RNA molecule, and at least one lipid covalently linked with the respective linker, and also at least one (bifunctional) lipid covalently linked (without a linker) with that RNA molecule. In this context, it is particularly preferred that the lipid modification is present at the terminal ends of a linear RNA sequence.
According to another preferred embodiment of the invention, a modified RNA molecule as defined herein, can be modified by the addition of a so-called “5′-CAP” structure, namely by modification of the 5′-end of a RNA molecule.
A 5′-cap is an entity, typically a modified nucleotide entity, which generally “caps” the 5′-end of a mature mRNA. A 5′-cap may typically be formed by a modified nucleotide, particularly by a derivative of a guanine nucleotide. Preferably, the 5′-cap is linked to the 5′-terminus via a 5′-5′-triphosphate linkage. A 5′-cap may be methylated, e.g. m7GpppN, wherein N is the terminal 5′ nucleotide of the nucleic acid carrying the 5′-cap, typically the 5′-end of an RNA. m7GpppN is the 5′-CAP structure which naturally occurs in mRNA transcribed by polymerase II and is therefore not considered as modification comprised in a modified RNA in this context. Accordingly, a modified RNA of the present invention may comprise a m7GpppN as 5′-CAP, but additionally the modified RNA comprises at least one further modification as defined herein.
Further examples of 5′-cap structures include glyceryl, inverted deoxy abasic residue (moiety), 4′,5′ methylene nucleotide, 1-(beta-D-erythrofuranosyl) nucleotide, 4′-thio nucleotide, carbocyclic nucleotide, 1,5-anhydrohexitol nucleotide, L-nucleotides, alpha-nucleotide, modified base nucleotide, threo-pentofuranosyl nucleotide, acyclic 3′,4′-seco nucleotide, acyclic 3,4-dihydroxybutyl nucleotide, acyclic 3,5 dihydroxypentyl nucleotide, 3′-3′-inverted nucleotide moiety, 3′-3′-inverted abasic moiety, 3′-2′-inverted nucleotide moiety, 3′-2′-inverted abasic moiety, 1,4-butanediol phosphate, 3′-phosphoramidate, hexylphosphate, aminohexyl phosphate, 3′-phosphate, 3′phosphorothioate, phosphorodithioate, or bridging or non-bridging methylphosphonate moiety. These modified 5′-CAP structures are regarded as at least one modification in this context.
Particularly preferred modified 5′-cap structures are CAP1 (methylation of the ribose of the adjacent nucleotide of m7G), CAP2 (methylation of the ribose of the 2nd nucleotide downstream of the m7G), CAP3 (methylation of the ribose of the 3rd nucleotide downstream of the m7G), CAP4 (methylation of the ribose of the 4th nucleotide downstream of the m7G), ARCA (anti-reverse CAP analogue, modified ARCA (e.g. phosphothioate modified ARCA), inosine, N1-methyl-guanosine, 2′-fluoro-guanosine, 7-deaza-guanosine, 8-oxo-guanosine, 2-amino-guanosine, LNA-guanosine, and 2-azido-guanosine. According to a further preferred embodiment of the invention, the inventive mRNA sequence is optimized for translation, preferably optimized for translation by replacing codons for less frequent tRNAs of a given amino acid by codons for more frequently occurring tRNAs of the respective amino acid. This is based on the finding that the translation efficiency is also determined by a different frequency in the occurrence of tRNAs in cells. Thus, if so-called “less frequent codons” are present in the inventive mRNA to an increased extent, the corresponding modified RNA is translated to a significantly poorer degree than in the case where codons coding for more frequent tRNAs are present. Preferably, the coding region of the mRNA is modified compared to the corresponding region of the wild type RNA or coding sequence such that at least one codon of the wild type sequence which codes for a tRNA which is relatively rare or less frequent in the cell is exchanged for a codon which codes for a tRNA which is more or most frequent in the cell and carries the same amino acid as the relatively rare or less frequent tRNA. By this modification, the sequences of the mRNA can be modified such that codons for which more frequently occurring tRNAs are available are inserted. In other words, according to the invention, by this modification all codons of the wild type sequence which code for a tRNA which is relatively rare in the cell can in each case be exchanged for a codon which codes for a respective tRNA which is relatively frequent in the cell and which, in each case, carries the same amino acid as the relatively rare tRNA. Furthermore, it is particularly preferable to link the sequential G/C content which is increased, in particular maximized, in the mRNA with the “frequent” codons without modifying the amino acid sequence of the protein encoded by the coding region of the mRNA or of the coding region. This preferred embodiment allows provision of a particularly efficiently translated and stabilized (modified) mRNA.
Substitutions, additions or eliminations of bases are preferably carried out using a DNA matrix for preparation of the nucleic acid molecule by techniques of the well known site directed mutagenesis or with an oligonucleotide ligation. In such a process, for preparation of the inventive mRNA sequence as defined herein a corresponding DNA molecule may be transcribed in vitro. This DNA matrix preferably comprises a suitable promoter, e.g. a T7 or SP6 promoter, for in vitro transcription, which is followed by the desired nucleotide sequence for the at least one RNA to be prepared and a termination signal for in vitro transcription. The DNA molecule, which forms the matrix of the inventive mRNA sequence of interest, may be prepared by fermentative proliferation and subsequent isolation as part of a plasmid which can be replicated in bacteria. Plasmids which may be mentioned as suitable for the present invention are e.g. the plasmids pT7 Ts (GenBank accession number AB255037.1; Lai et al., Development 1995; 121:2349-2360), pGEM® series, e.g. pGEM®-1 (GenBank accession number X65300.1; from Promega) and pSP64 (GenBank accession number X65327.1); cf. also Mezei and Storts, Purification of PCR Products, in: Griffin and Griffin (ed.), PCR Technology: Current Innovation, CRC Press, Boca Raton, Fla., 2001.
The mRNA may be prepared using any method known in the art, including synthetic methods such as e.g. solid phase synthesis, as well as in vitro methods, such as in vitro transcription reactions.
In summary, in the context of the present invention, an mRNA is typically an RNA, which is composed of several structural elements, e.g. an optional 5′-CAP structure, an optional 5′-UTR region, an upstream positioned ribosomal binding site followed by a coding region, an optional 3′-UTR region, which may be followed by a poly-A tail (and/or a poly-C-tail). An mRNA may occur as a mono-, di-, or even multicistronic RNA, i.e. a RNA which carries the VP8* coding region or repeats of the VP8* coding region, and optionally further sequence sections as described above. Such coding sequences in di-, or even multicistronic mRNA may be separated by at least one IRES sequence as described herein.
In particularly preferred embodiments of the invention the inventive mRNA sequence has the following structure, wherein the mRNA sequence comprises, preferably in 5′- to 3′-direction:
In a further particularly preferred embodiment of the invention the inventive mRNA sequence has the following structure, wherein the mRNA sequence comprises, preferably in 5′- to 3′-direction:
With respect to the at least one coding region encoding at least one epitope of a rotavirus protein it is referred to the above description. Preferably, the at least one coding region encodes respectively the VP8* protein of one or more rotavirus strains/serotypes(or fragments or variants thereof) and preferably further sequence sections as described above, especially at least one sequence section encoding a helper peptide and/or at least one sequence section encoding a signal peptide for co-translational transport and/or at least one sequence section encoding a factor for antigen clustering/VLP formation and/or at least one sequence section encoding a transmembrane domain and/or a sequence section encoding a linker peptide.
In especially preferred embodiments the at least one coding region encoding the at least one epitope of a rotavirus protein encodes VP8*, wherein preferably the mRNA sequence comprises at least one sequence according to any one of SEQ ID NOs: 828-3146 or 3306-3593 or a sequence that is at least 60% identical, more preferably at least 70% identical, more preferably at least 80% identical, more preferably at least 90% identical, more preferably at least 95% identical, or most preferably at least 99% identical to any one of the sequences according to SEQ ID NOs: 828-3146 or 3306-3593.
According to a main aspect of the invention the mRNA sequence as described above is prepared for use as a vaccine, wherein the vaccine is especially advantageous for use in prophylaxis and/or treatment of rotavirus infections.
Furthermore, the invention relates to a composition that comprises at least one mRNA (one or more mRNAs) comprising at least one of the mRNA sequences as defined above and optionally a pharmaceutically acceptable carrier. In this context a pharmaceutically acceptable carrier or vehicle is an agent which typically may be used within a pharmaceutical composition for facilitating administering of the components of the pharmaceutical composition to an individual. A pharmaceutically acceptable carrier or vehicle typically includes a liquid or non-liquid material, which is mixed with the component(s) of the inventive composition. If the components of the inventive composition are provided in liquid form, the carrier will typically be pyrogen-free water, isotonic saline or buffered aqueous solutions, e.g phosphate, citrate etc. buffered solutions. Ringer or Ringer-Lactate solution is particularly preferred as a liquid basis. It may be preferred that at least one of the components of the inventive composition is prepared for sustained and/or delayed release.
In a preferred embodiment of the composition according to the invention, the at least one mRNA according to the invention is complexed with one or more cationic or polycationic compounds, preferably with cationic or polycationic polymers, cationic or polycationic peptides or proteins, e.g. protamine, cationic or polycationic polysaccharides and/or cationic or polycationic lipids.
According to a preferred embodiment, the at least one mRNA of the composition according to the present invention may be complexed with lipids to form one or more liposomes, lipoplexes, or lipid nanoparticles. Therefore, in one embodiment, the inventive composition comprises liposomes, lipoplexes, and/or lipid nanoparticles comprising the at least one mRNA.
Lipid-based formulations have been increasingly recognized as one of the most promising delivery systems for RNA due to their biocompatibility and their ease of large-scale production. Cationic lipids have been widely studied as synthetic materials for delivery of RNA. After mixing together, nucleic acids are condensed by cationic lipids to form lipid/nucleic acid complexes known as lipoplexes. These lipid complexes are able to protect genetic material from the action of nucleases and deliver it into cells by interacting with the negatively charged cell membrane. Lipoplexes can be prepared by directly mixing positively charged lipids at physiological pH with negatively charged nucleic acids.
Conventional liposomes consist of a lipid bilayer that can be composed of cationic, anionic, or neutral (phospho)lipids and cholesterol, which encloses an aqueous core. Both the lipid bilayer and the aqueous space can incorporate hydrophobic or hydrophilic compounds, respectively. Liposome characteristics and behaviour in vivo can be modified by addition of a hydrophilic polymer coating, e.g. polyethylene glycol (PEG), to the liposome surface to confer steric stabilization. Furthermore, liposomes can be used for specific targeting by attaching ligands (e.g., antibodies, peptides, and carbohydrates) to its surface or to the terminal end of the attached PEG chains (Front Pharmacol. 2015 Dec. 1; 6:286).
Liposomes are colloidal lipid-based and surfactant-based delivery systems composed of a phospholipid bilayer surrounding an aqueous compartment. They may present as spherical vesicles and can range in size from 20 nm to a few microns. Cationic lipid-based liposomes are able to complex with negatively charged nucleic acids via electrostatic interactions, resulting in complexes that offer biocompatibility, low toxicity, and the possibility of the large-scale production required for in vivo clinical applications. Liposomes can fuse with the plasma membrane for uptake; once inside the cell, the liposomes are processed via the endocytic pathway and the genetic material is then released from the endosome/carrier into the cytoplasm. Liposomes have long been perceived as drug delivery vehicles because of their superior biocompatibility, given that liposomes are basically analogs of biological membranes, and can be prepared from both natural and synthetic phospholipids (Int J Nanomedicine. 2014; 9: 1833-1843).
Cationic liposomes have been traditionally the most commonly used non-viral delivery systems for oligonucleotides, including plasmid DNA, antisense oligos, and siRNA/small hairpin RNA-shRNA). Cationic lipids, such as DOTAP, (1,2-dioleoyl-3-trimethylammonium-propane) and DOTMA (N-[1-(2,3-dioleoyloxy)propyl]-N,N,N-trimethyl-ammonium methyl sulfate) can form complexes or lipoplexes with negatively charged nucleic acids to form nanoparticles by electrostatic interaction, providing high in vitro transfection efficiency. Furthermore, neutral lipid-based nanoliposomes for RNA delivery as e.g. neutral 1,2-dioleoyl-sn-glycero-3-phosphatidylcholine (DOPC)-based nanoliposomes were developed. (Adv Drug Deliv Rev. 2014 February; 66: 110-116.).
Therefore, in one embodiment the at least one mRNA of the composition according to the present invention is complexed with cationic lipids and/or neutral lipids and thereby forms liposomes, lipid nanoparticles, lipoplexes or neutral lipid-based nanoliposomes.
In a preferred embodiment, the composition according to the invention comprises the mRNA comprising at least one mRNA sequence according to the invention that is formulated together with a cationic or polycationic compound and/or with a polymeric carrier. Accordingly, in a further embodiment of the invention, it is preferred that the mRNA as defined herein or any other nucleic acid comprised in the inventive (pharmaceutical) composition or vaccine is associated with or complexed with a cationic or polycationic compound or a polymeric carrier, optionally in a weight ratio selected from a range of about 6:1 (w/w) to about 0.25:1 (w/w), more preferably from about 5:1 (w/w) to about 0.5:1 (w/w), even more preferably of about 4:1 (w/w) to about 1:1 (w/w) or of about 3:1 (w/w) to about 1:1 (w/w), and most preferably a ratio of about 3:1 (w/w) to about 2:1 (w/w) of mRNA or nucleic acid to cationic or polycationic compound and/or with a polymeric carrier; or optionally in a nitrogen/phosphate (N/P) ratio of mRNA or nucleic acid to cationic or polycationic compound and/or polymeric carrier in the range of about 0.1-10, preferably in a range of about 0.3-4 or 0.3-1, and most preferably in a range of about 0.5-1 or 0.7-1, and even most preferably in a range of about 0.3-0.9 or 0.5-0.9. More preferably, the N/P ratio of the at least one mRNA to the one or more polycations is in the range of about 0.1 to 10, including a range of about 0.3 to 4, of about 0.5 to 2, of about 0.7 to 2 and of about 0.7 to 1.5.
Therein, the mRNA as defined herein or any other nucleic acid comprised in the (pharmaceutical) composition or vaccine according to the invention can also be associated with a vehicle, transfection or complexation agent for increasing the transfection efficiency and/or the immunostimulatory properties of the mRNA according to the invention or of optionally comprised further included nucleic acids.
In a particularly preferred embodiment of the composition the at least one mRNA is at least partially associated with or complexed (formulated) with a cationic or polycationic compound and/or a polymeric carrier. Cationic compounds being particularly preferred agents in this context include protamine, nucleoline, spermine or spermidine, or other cationic peptides or proteins, such as poly-L-lysine (PLL), poly-arginine, basic polypeptides, cell penetrating peptides (CPPs), including HIV-binding peptides, HIV-1 Tat (HIV), Tat-derived peptides, Penetratin, VP22 derived or analog peptides, HSV VP22 (Herpes simplex), MAP, KALA or protein transduction domains (PTDs), PpT620, proline-rich peptides, arginine-rich peptides, lysine-rich peptides, MPG-peptide(s), Pep-1, L-oligomers, Calcitonin peptide(s), Antennapedia-derived peptides (particularly from Drosophila antennapedia), pAntp, plsI, FGF, Lactoferrin, Transportan, Buforin-2, Bac715-24, SynB, SynB(1), pVEC, hCT-derived peptides, SAP, or histones. Protamine is particularly preferred. Nevertheless, it is also possible that the mRNA of the inventive composition is naked or the composition comprises a mixture of naked and complexed mRNA.
Additionally, preferred cationic or polycationic proteins or peptides may be selected from the following proteins or peptides having the following total formula (III):
(Arg)l;(Lys)m;(His)n;(Orn)o;(Xaa)x, (formula (III))
wherein l+m+n+o+x=8-15, and l, m, n or o independently of each other may be any number selected from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14 or 15, provided that the overall content of Arg, Lys, His and Orn represents at least 50% of all amino acids of the oligopeptide; and Xaa may be any amino acid selected from native (=naturally occurring) or non-native amino acids except of Arg, Lys, His or Orn; and x may be any number selected from 0, 1, 2, 3 or 4, provided, that the overall content of Xaa does not exceed 50% of all amino acids of the oligopeptide. Particularly preferred cationic peptides in this context are e.g. Arg7, Arg8, Arg9, H3R9, R9H3, H3R9H3, YSSR9SSY, (RKH)4, Y(RKH)2R, etc. In this context the disclosure of WO 2009/030481 is incorporated herewith by reference.
Further preferred cationic or polycationic compounds, which can be used as transfection or complexation agent may include cationic polysaccharides, for example chitosan, polybrene, cationic polymers, e.g. polyethyleneimine (PEI), cationic lipids, e.g. DOTMA: [1-(2,3-sioleyloxy)propyl)]-N,N,N-trimethylammonium chloride, DMRIE, di-C14-amidine, DOTIM, SAINT, DC-Chol, BGTC, CTAP, DOPC, DODAP, DOPE: Dioleyl phosphatidylethanol-amine, DOSPA, DODAB, DOIC, DMEPC, DOGS: Dioctadecylamidoglicylspermin, DIMRI: Dimyristo-oxypropyl dimethyl hydroxyethyl ammonium bromide, DOTAP: dioleoyloxy-3-(trimethylammonio)propane, DC-6-14: O,O-ditetradecanoyl-N-(α-trimethylammonioacetyl)diethanolamine chloride, CLIP1: rac-[(2,3-dioctadecyloxypropyl)(2-hydroxyethyl)]-dimethylammonium chloride, CLIP6: rac-[2(2,3-dihexadecyloxypropyl-oxymethyloxy)ethyl]trimethylammonium, CLIP9: rac-[2(2,3-dihexadecyloxypropyl-oxysuccinyloxy)ethyl]-trimethylammonium, oligofectamine, or cationic or polycationic polymers, e.g. modified polyaminoacids, such as β-aminoacid-polymers or reversed polyamides, etc., modified polyethylenes, such as PVP (poly(N-ethyl-4-vinylpyridinium bromide)), etc., modified acrylates, such as pDMAEMA (poly(dimethylaminoethyl methylacrylate)), etc., modified amidoamines such as pAMAM (poly(amidoamine)), etc., modified polybetaaminoester (PBAE), such as diamine end modified 1,4 butanediol diacrylate-co-5-amino-1-pentanol polymers, etc., dendrimers, such as polypropylamine dendrimers or pAMAM based dendrimers, etc., polyimine(s), such as PEI: poly(ethyleneimine), poly(propyleneimine), etc., polyallylamine, sugar backbone based polymers, such as cyclodextrin based polymers, dextran based polymers, chitosan, etc., silan backbone based polymers, such as PMOXA-PDMS copolymers, etc., blockpolymers consisting of a combination of one or more cationic blocks (e.g. selected from a cationic polymer as mentioned above) and of one or more hydrophilic or hydrophobic blocks (e.g. polyethyleneglycole); etc.
According to a preferred embodiment, the composition of the present invention comprises the mRNA as defined herein and a polymeric carrier. A polymeric carrier used according to the invention might be a polymeric carrier formed by disulfide-crosslinked cationic components. The disulfide-crosslinked cationic components may be the same or different from each other. The polymeric carrier can also contain further components. It is also particularly preferred that the polymeric carrier used according to the present invention comprises mixtures of cationic peptides, proteins or polymers and optionally further components as defined herein, which are crosslinked by disulfide bonds as described herein. In this context, the disclosure of WO 2012/013326 is incorporated herewith by reference.
In this context, the cationic components, which form basis for the polymeric carrier by disulfide-crosslinkage, are typically selected from any suitable cationic or polycationic peptide, protein or polymer suitable for this purpose, particular any cationic or polycationic peptide, protein or polymer capable of complexing the mRNA as defined herein or a further nucleic acid comprised in the composition, and thereby preferably condensing the mRNA or the nucleic acid. The cationic or polycationic peptide, protein or polymer, is preferably a linear molecule, however, branched cationic or polycationic peptides, proteins or polymers may also be used.
Every disulfide-crosslinking cationic or polycationic protein, peptide or polymer of the polymeric carrier, which may be used to complex the mRNA according to the invention or any further nucleic acid comprised in the (pharmaceutical) composition or vaccine of the present invention contains at least one -SH moiety, most preferably at least one cysteine residue or any further chemical group exhibiting an -SH moiety, capable of forming a disulfide linkage upon condensation with at least one further cationic or polycationic protein, peptide or polymer as cationic component of the polymeric carrier as mentioned herein.
As defined above, the polymeric carrier, which may be used to complex the mRNA of the present invention or any further nucleic acid comprised in the (pharmaceutical) composition or vaccine according to the invention may be formed by disulfide-crosslinked cationic (or polycationic) components. Preferably, such cationic or polycationic peptides or proteins or polymers of the polymeric carrier, which comprise or are additionally modified to comprise at least one -SH moiety, are selected from, proteins, peptides and polymers as defined herein for complexation agent.
In a further particular embodiment, the polymeric carrier which may be used to complex the RNA as defined herein or any further nucleic acid comprised in the (pharmaceutical) composition or vaccine according to the invention may be selected from a polymeric carrier molecule according to generic formula (IV):
L-P1-S-[S-P2-S]n-S-P3-L formula (IV)
In this context, the disclosure of WO 2011/026641 is incorporated herewith by reference. Each of hydrophilic polymers P1 and P3 typically exhibits at least one -SH-moiety, wherein the at least one -SH-moiety is capable to form a disulfide linkage upon reaction with component P2 or with component (AA) or (AA)x, if used as linker between P1 and P2 or P3 and P2 as defined below and optionally with a further component, e.g. L and/or (AA) or (AA)x, e.g. if two or more -SH-moieties are contained. The following subformulae “P1-S-S-P2” and “P2-S-S-P3” within generic formula (IV) above (the brackets are omitted for better readability), wherein any of S, P1 and P3 are as defined herein, typically represent a situation, wherein one-SH-moiety of hydrophilic polymers P1 and P3 was condensed with one -SH-moiety of component P2 of generic formula (IV) above, wherein both sulphurs of these -SH-moieties form a disulfide bond -S-S- as defined herein in formula (IV). These -SH-moieties are typically provided by each of the hydrophilic polymers P1 and P3, e.g. via an internal cysteine or any further (modified) amino acid or compound which carries a -SH moiety. Accordingly, the subformulae “P1-S-S-P2” and “P2-S-S-P3” may also be written as “P1-Cys-Cys-P2” and “P2-Cys-Cys-P3”, if the -SH— moiety is provided by a cysteine, wherein the term Cys-Cys represents two cysteines coupled via a disulfide bond, not via a peptide bond. In this case, the term “-S-S-” in these formulae may also be written as “-S-Cys”, as “-Cys-S” or as “-Cys-Cys-”. In this context, the term “-Cys-Cys-” does not represent a peptide bond but a linkage of two cysteines via their -SH-moieties to form a disulfide bond. Accordingly, the term “-Cys-Cys-”also may be understood generally as “-(Cys-S)-(S-Cys)-”, wherein in this specific case S indicates the sulphur of the -SH-moiety of cysteine. Likewise, the terms “-S-Cys” and “-Cys-S” indicate a disulfide bond between a -SH containing moiety and a cysteine, which may also be written as “-S-(S-Cys)” and “-(Cys-S)-S”. Alternatively, the hydrophilic polymers P1 and P3 may be modified with a -SH moiety, preferably via a chemical reaction with a compound carrying a -SH moiety, such that each of the hydrophilic polymers P1 and P3 carries at least one such -SH moiety. Such a compound carrying a -SH moiety may be e.g. an (additional) cysteine or any further (modified) amino acid, which carries a -SH moiety. Such a compound may also be any non-amino compound or moiety, which contains or allows to introduce a -SH moiety into hydrophilic polymers P1 and P3 as defined herein. Such non-amino compounds may be attached to the hydrophilic polymers P1 and P3 of formula (IV) of the polymeric carrier according to the present invention via chemical reactions or binding of compounds, e.g. by binding of a 3-thio propionic acid or thioimolane, by amide formation (e.g. carboxylic acids, sulphonic acids, amines, etc), by Michael addition (e.g maleinimide moieties, α,β-unsatured carbonyls, etc), by click chemistry (e.g. azides or alkines), by alkene/alkine methatesis (e.g. alkenes or alkines), imine or hydrozone formation (aldehydes or ketons, hydrazins, hydroxylamins, amines), complexation reactions (avidin, biotin, protein G) or components which allow Sn-type substitution reactions (e.g halogenalkans, thiols, alcohols, amines, hydrazines, hydrazides, sulphonic acid esters, oxyphosphonium salts) or other chemical moieties which can be utilized in the attachment of further components. A particularly preferred PEG derivate in this context is alpha-Methoxy-omega-mercapto poly(ethylene glycol). In each case, the SH-moiety, e.g. of a cysteine or of any further (modified) amino acid or compound, may be present at the terminal ends or internally at any position of hydrophilic polymers P1 and P3. As defined herein, each of hydrophilic polymers P1 and P3 typically exhibits at least one -SH-moiety preferably at one terminal end, but may also contain two or even more -SH-moieties, which may be used to additionally attach further components as defined herein, preferably further functional peptides or proteins e.g. a ligand, an amino acid component (AA) or (AA)x, antibodies, cell penetrating peptides or enhancer peptides (e.g. TAT, KALA), etc.
In this context it is particularly preferred to select the peptide sequence according to SEQ ID No. 3594 as P2.
Preferably, the inventive composition comprises at least one mRNA as defined herein, which is complexed with one or more polycations, and at least one free mRNA, wherein the at least one complexed mRNA is preferably identical to the at least one free mRNA. In this context, it is particularly preferred that the composition of the present invention comprises the mRNA according to the invention that is complexed at least partially with a cationic or polycationic compound and/or a polymeric carrier, preferably cationic proteins or peptides. In this context, the disclosure of WO 2010/037539 and WO 2012/113513 is incorporated herewith by reference. Partially means that only a part of the mRNA as defined herein is complexed in the composition according to the invention with a cationic compound and that the rest of the mRNA as defined herein is (comprised in the inventive (pharmaceutical) composition or vaccine) in uncomplexed form (“free”). Preferably, the molar ratio of the complexed mRNA to the free mRNA is selected from a molar ratio of about 0.001:1 to about 1:0.001, including a ratio of about 1:1. More preferably the ratio of complexed mRNA to free mRNA (in the (pharmaceutical) composition or vaccine of the present invention) is selected from a range of about 5:1 (w/w) to about 1:10 (w/w), more preferably from a range of about 4:1 (w/w) to about 1:8 (w/w), even more preferably from a range of about 3:1 (w/w) to about 1:5 (w/w) or 1:3 (w/w), and most preferably the ratio of complexed mRNA to free mRNA in the inventive pharmaceutical composition or vaccine is selected from a ratio of about 1:1 (w/w).
The complexed mRNA in the (pharmaceutical) composition or vaccine according to the present invention, is preferably prepared according to a first step by complexing the mRNA according to the invention with a cationic or polycationic compound and/or with a polymeric carrier, preferably as defined herein, in a specific ratio to form a stable complex. In this context, it is highly preferable, that no free cationic or polycationic compound or polymeric carrier or only a negligibly small amount thereof remains in the component of the complexed mRNA after complexing the mRNA. Accordingly, the ratio of the mRNA and the cationic or polycationic compound and/or the polymeric carrier in the component of the complexed RNA is typically selected in a range so that the mRNA is entirely complexed and no free cationic or polycationic compound or polymeric carrier or only a negligibly small amount thereof remains in the composition.
In other embodiments, the composition according to the invention comprising the mRNA as defined herein may be administered naked without being associated with any further vehicle, transfection or complexation agent.
It has to be understood and recognized, that according to the present invention, the inventive composition may comprise at least one naked mRNA as defined herein and/or at least one formulated/complexed mRNA as defined herein, wherein every formulation and/or complexation as disclosed above may be used.
In an especially preferred embodiment of the inventive composition the mRNA sequence encoding a rotavirus protein is combined with at least one VLP (virus like particle) forming protein or peptide or a fragment, variant or derivative thereof. Additionally or alternatively the mRNA sequence encoding a rotavirus protein is combined with a nucleic acid molecule, preferably an mRNA sequence, encoding a VLP forming protein or peptide or a fragment, variant or derivative thereof. In this context it is preferred that the mRNA sequence encoding at least one epitope of a rotavirus protein (e.g. VP8*) is fused to a transmembrane domain of a protein, or a fragment, variant or derivative thereof (e.g. transmembrane domain of Influenza HA) resulting in a membrane-bound form of the rotavirus protein. More preferably the mRNA sequence encoding the rotavirus protein additionally comprises a sequence section encoding a signal peptide as described above. A preferred construct may comprise in 5′-3′ direction sequence sections coding for a signal peptide, VP8* and a transmembrane domain. This embodiment of the inventive composition, namely a combination of an mRNA construct encoding the rotavirus protein and at least one mRNA construct encoding a VLP forming protein or peptide, leads to co-expression of the membrane-bound rotavirus protein and the at least one VLP forming protein or peptide allowing formation of virus like particles thereby further increasing immune responses by antigen clustering.
The co-expression may be implemented in one single multicistronic mRNA construct or in at least two separated mRNA constructs.
In an especially preferred embodiment of this aspect of the invention the VLP forming protein or peptide is a viral matrix protein, or a fragment or derivative thereof, derived from an enveloped virus. The co-expression of such VLP forming proteins or peptides with the membrane-bound rotavirus protein advantageously results in the formation of VLPs with a lipid envelope. In preferred embodiments the matrix protein is a gag protein derived from an enveloped virus selected from HIV-1, EIAV, and MLV. In further preferred embodiments, the matrix protein is the matrix protein of vesiclular stomatitis virus (VSV), Rabies virus or the VP40 protein derived from an Ebola virus. Preferably the VLP forming protein is selected from any of the sequences according to SEQ ID Nos. 3172-3173.
Moreover, the invention relates to a pharmaceutical composition that comprises the composition as defined above, preferably in combination with a pharmaceutically acceptable carrier and/or vehicle.
The inventive pharmaceutical composition may be administered in various ways. Generally, the inventive pharmaceutical composition may be administered orally, parenterally, by inhalation spray, topically, rectally, nasally, buccally, vaginally or via an implanted reservoir. The term parenteral as used herein includes subcutaneous, intravenous, intramuscular, intraarticular, intranodal, intrasynovial, intrasternal, intrathecal, intrahepatic, intralesional, intracranial, transdermal, intradermal, intrapulmonal, intraperitoneal, intracardial, intraarterial, and sublingual injection or infusion techniques.
Preferably, the inventive pharmaceutical composition may be administered by parenteral injection, more preferably by subcutaneous, intravenous, intramuscular, intraarticular, intranodal, intrasynovial, intrasternal, intrathecal, intrahepatic, intralesional, intracranial, transdermal, intradermal, intrapulmonal, intraperitoneal, intracardial, intraarterial, and sublingual injection or via infusion techniques. Particularly preferred is intradermal and intramuscular injection. In one particularly preferred embodiment, the pharmaceutical composition is administered intramuscularly.
Methods for intramuscular administration are known in the art. Typically, a liquid is injected into a skeletal muscle (such as M. gluteus, M. deltoideus or M. vastus lateralis) using, for example, a syringe or a needle-free injection system, such as a jet injection system. Jet injection refers to a needle-free injection method, wherein a fluid comprising the inventive composition and, optionally, further suitable excipients is forced through an orifice, thus generating an ultra-fine liquid stream of high pressure that is capable of penetrating mammalian skin. In principle, the liquid stream forms a hole in the skin, through which the liquid stream is pushed into the target tissue.
Sterile injectable forms of the inventive pharmaceutical composition may be aqueous or oleaginous suspension. These suspensions may be formulated according to techniques known in the art using suitable dispersing or wetting agents and suspending agents. The sterile injectable preparation may also be a sterile injectable solution or suspension in a non-toxic parenterally-acceptable diluent or solvent, for example as a solution in 1,3-butanediol. Among the acceptable vehicles and solvents that may be employed are water, Ringer's solution and isotonic sodium chloride solution. In addition, sterile, fixed oils are conventionally employed as a solvent or suspending medium. For this purpose, any bland fixed oil may be employed including synthetic mono- or di-glycerides. Fatty acids, such as oleic acid and its glyceride derivatives are useful in the preparation of injectables, as are natural pharmaceutically-acceptable oils, such as olive oil or castor oil, especially in their polyoxyethylated versions. These oil solutions or suspensions may also contain a long-chain alcohol diluent or dispersant, such as carboxymethyl cellulose or similar dispersing agents that are commonly used in the formulation of pharmaceutically acceptable dosage forms including emulsions and suspensions. Other commonly used surfactants, such as Tweens, Spans and other emulsifying agents or bioavailability enhancers which are commonly used in the manufacture of pharmaceutically acceptable solid, liquid, or other dosage forms may also be used for the purposes of formulation of the inventive pharmaceutical composition.
Moreover, the inventive pharmaceutical composition as defined herein may also be administered orally in any orally acceptable dosage form including, but not limited to, capsules, tablets, aqueous suspensions or solutions. In the case of tablets for oral use, carriers commonly used include lactose and corn starch. Lubricating agents, such as magnesium stearate, are also typically added. For oral administration in a capsule form, useful diluents include lactose and dried cornstarch. When aqueous suspensions are required for oral use, the active ingredients are combined with emulsifying and suspending agents. If desired, certain sweetening, flavoring or coloring agents may also be added.
In a particularly preferred embodiment the inventive pharmaceutical composition is administered topically. For topical applications, the inventive pharmaceutical composition may be formulated in a suitable ointment, containing the component(s) of the inventive composition suspended or dissolved in one or more carriers. Carriers for topical administration include, but are not limited to, mineral oil, liquid petrolatum, white petrolatum, propylene glycol, polyoxyethylene, polyoxypropylene compound, emulsifying wax and water. Alternatively, the inventive pharmaceutical composition can be formulated in a suitable lotion or cream. In the context of the present invention, suitable carriers include, but are not limited to, mineral oil, sorbitan monostearate, polysorbate 60, cetyl esters wax, cetearyl alcohol, 2-octyldodecanol, benzyl alcohol and water. In especially preferred embodiments of the inventive composition enhancers for topical administration may be added to the composition, particularly substances which enhance skin permeability.
Moreover, transdermal administration may be preferred. In an especially preferred embodiment the pharmaceutical composition is prepared in the form of a transdermal patch, e.g. a microneedle patch (solid, hollow or dissolving), and the composition optionally comprises enhancers for transdermal delivery. Dermal or transdermal patches are particularly advantageous for slow release of the component(s) of the inventive composition.
In an especially preferred embodiment of the invention the inventive composition or the inventive pharmaceutical composition is provided or used as a vaccine. Therefore, the invention also relates to a vaccine comprising or consisting of a composition or pharmaceutical composition as defined above. Typically, such a vaccine is as defined above for pharmaceutical compositions. In the specific context of the inventive vaccine, the choice of a pharmaceutically acceptable carrier is determined in principle by the manner in which the inventive vaccine is administered. Routes for local administration in general include, for example, topical administration routes but also intradermal, transdermal, subcutaneous, or intramuscular injections or intralesional, intracranial, intrapulmonal, intracardial, and sublingual injection. Inventive vaccines are therefore preferably formulated in liquid (or sometimes in solid) form. Preferably, the inventive vaccine may be administered by conventional needle injection or needle-free jet injection. In a preferred embodiment the inventive vaccine may be administered by topical or transdermal routes.
According to another embodiment, the (pharmaceutical) composition or vaccine according to the invention may comprise an adjuvant, which is preferably added in order to enhance the immunostimulatory properties of the composition. In this context, an adjuvant may be understood as any compound, which is suitable to support administration and delivery of the composition according to the invention. Furthermore, such an adjuvant may, without being bound thereto, initiate or increase an immune response of the innate immune system, i.e. a non-specific immune response. In other words, when administered, the composition according to the invention typically initiates an adaptive immune response due to an antigen as defined herein or a fragment or variant thereof, which is encoded by the at least one coding region of the inventive mRNA contained in the composition of the present invention. Additionally, the composition according to the invention may generate an (supportive) innate immune response due to addition of an adjuvant as defined herein to the composition according to the invention.
Such an adjuvant may be selected from any adjuvant known to a skilled person and suitable for the present case, i.e. supporting the induction of an immune response in a mammal. Preferably, the adjuvant may be selected from the group consisting of, without being limited thereto, TDM, MDP, muramyl dipeptide, pluronics, alum solution, aluminium hydroxide, ADJUMER™ (polyphosphazene); aluminium phosphate gel; glucans from algae; algammulin; aluminium hydroxide gel (alum); highly protein-adsorbing aluminium hydroxide gel; low viscosity aluminium hydroxide gel; AF or SPT (emulsion of squalane (5%), Tween 80 (0.2%), Pluronic L121 (1.25%), phosphate-buffered saline, pH 7.4); AVRIDINE™ (propanediamine); BAY R1005™ ((N-(2-deoxy-2-L-leucylamino-β-D-glucopyranosyl)-N-octadecyl-dodecanoyl-amide hydroacetate); CALCITRIOL™ (1-alpha,25-dihydroxy-vitamin D3); calcium phosphate gel; CAP™ (calcium phosphate nanoparticles); cholera holotoxin, cholera-toxin-A1-protein-A-D-fragment fusion protein, sub-unit B of the cholera toxin; CRL 1005 (block copolymer P1205); cytokine-containing liposomes; DDA (dimethyldioctadecylammonium bromide); DHEA (dehydroepiandrosterone); DMPC (dimyristoylphosphatidylcholine); DMPG (dimyristoylphosphatidylglycerol); DOC/alum complex (deoxycholic acid sodium salt); Freund's complete adjuvant; Freund's incomplete adjuvant; gamma inulin; Gerbu adjuvant (mixture of: i) N-acetylglucosaminyl-(P1-4)-N-acetylmuramyl-L-alanyl-D-glutamine (GMDP), ii) dimethyldioctadecylammonium chloride (DDA), iii) zinc-L-proline salt complex (ZnPro-8); GM-CSF); GMDP (N-acetylglucosaminyl-(b1-4)-N-acetylmuramyl-L-alanyl-D-isoglutamine); imiquimod (1-(2-methypropyl)-1H-imidazo[4,5-c]quinoline-4-amine); ImmTher™ (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-glycerol dipalmitate); DRVs (immunoliposomes prepared from dehydration-rehydration vesicles); interferon-gamma; interleukin-1beta; interleukin-2; interleukin-7; interleukin-12; ISCOMS™; ISCOPREP 70.0.3™; liposomes; LOXORIBINE™ (7-allyl-8-oxoguanosine); LT oral adjuvant (E. coli labile enterotoxin-protoxin); microspheres and microparticles of any composition; MF59™; (squalene-water emulsion); MONTANIDE ISA 51™ (purified incomplete Freund's adjuvant); MONTANIDE ISA 720™ (metabolisable oil adjuvant); MPL™ (3-Q-desacyl-4′-monophosphoryl lipid A); MTP-PE and MTP-PE liposomes ((N-acetyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1,2-dipalmitoyl-sn-glycero-3-(hydroxyphosphoryloxy))-ethylamide, monosodium salt); MURAMETIDE™ (Nac-Mur-L-Ala-D-Gln-OCH3); MURAPALMITINE™ and D-MURAPALMITINE™ (Nac-Mur-L-Thr-D-isoGln-sn-glyceroldipalmitoyl); NAGO (neuraminidase-galactose oxidase); nanospheres or nanoparticles of any composition; NISVs (non-ionic surfactant vesicles); PLEURAN™ (@3-glucan); PLGA, PGA and PLA (homo- and co-polymers of lactic acid and glycolic acid; microspheres/nanospheres); PLURONIC L121™; PMMA (polymethyl methacrylate); PODDS™ (proteinoid microspheres); polyethylene carbamate derivatives; poly-rA: poly-rU (polyadenylic acid-polyuridylic acid complex); polysorbate 80 (Tween 80); protein cochleates (Avanti Polar Lipids, Inc., Alabaster, Ala.); STIMULON™ (QS-21); Quil-A (Quil-A saponin); S-28463 (4-amino-otec-dimethyl-2-ethoxymethyl-1H-imidazo[4,5 c]quinoline-1-ethanol); SAF-1™ (“Syntex adjuvant formulation”); Sendai proteoliposomes and Sendai-containing lipid matrices; Span-85 (sorbitan trioleate); Specol (emulsion of Marcol 52, Span 85 and Tween 85); squalene or Robane© (2,6,10,15,19,23-hexamethyltetracosan and 2,6,10,15,19,23-hexamethyl-2,6,10,14,18,22-tetracosahexane); stearyltyrosine (octadecyltyrosine hydrochloride); Theramid® (N-acetylglucosaminyl-N-acetylmuramyl-L-Ala-D-isoGlu-L-Ala-dipalmitoxypropylamide); Theronyl-MDP (Termurtide™ or [thr 1]-MDP; N-acetylmuramyl-L-threonyl-D-isoglutamine); Ty particles (Ty-VLPs or virus-like particles); Walter-Reed liposomes (liposomes containing lipid A adsorbed on aluminium hydroxide), and lipopeptides, including Pam3Cys, in particular aluminium salts, such as Adju-phos, Alhydrogel, Rehydragel; emulsions, including CFA, SAF, IFA, MF59, Provax, TiterMax, Montanide, Vaxfectin; copolymers, including Optivax (CRL1005), L121, Poloaxmer4010), etc.; liposomes, including Stealth, cochleates, including BIORAL; plant derived adjuvants, including QS21, Quil A, Iscomatrix, ISCOM; adjuvants suitable for costimulation including Tomatine, biopolymers, including PLG, PMM, Inulin; microbe derived adjuvants, including Romurtide, DETOX, MPL, CWS, Mannose, CpG nucleic acid sequences, CpG7909, ligands of human TLR 1-10, ligands of murine TLR 1-13, ISS-1018, IC31, Imidazoquinolines, Ampligen, Ribi529, IMOxine, IRIVs, VLPs, cholera toxin, heat-labile toxin, Pam3Cys, Flagellin, GPI anchor, LNFPIII/Lewis X, antimicrobial peptides, UC-1V150, RSV fusion protein, cdiGMP; and adjuvants suitable as antagonists including CGRP neuropeptide.
Particularly preferred, an adjuvant may be selected from adjuvants, which support induction of a Th1-immune response or maturation of naïve T-cells, such as GM-CSF, IL-12, IFN7, any immunostimulatory nucleic acid as defined herein, preferably an immunostimulatory RNA, CpG DNA, etc.
In a further preferred embodiment it is also possible that the inventive composition contains besides the antigen-providing mRNA further components which are selected from the group comprising: further antigens (e.g. in the form of a peptide or protein) or further antigen-encoding nucleic acids; a further immunotherapeutic agent; one or more auxiliary substances; or any further compound, which is known to be immunostimulating due to its binding affinity (as ligands) to human Toll-like receptors; and/or an adjuvant nucleic acid, preferably an immunostimulatory RNA (isRNA).
The composition of the present invention can additionally contain one or more auxiliary substances in order to increase its immunogenicity or immunostimulatory capacity, if desired. A synergistic action of the mRNA as defined herein and of an auxiliary substance, which may be optionally contained in the inventive composition, is preferably achieved thereby. Depending on the various types of auxiliary substances, various mechanisms can come into consideration in this respect. For example, compounds that permit the maturation of dendritic cells (DCs), for example lipopolysaccharides, TNF-alpha or CD40 ligand, form a first class of suitable auxiliary substances. In general, it is possible to use as auxiliary substance any agent that influences the immune system in the manner of a “danger signal” (LPS, GP96, etc.) or cytokines, such as GM-CFS, which allow an immune response to be enhanced and/or influenced in a targeted manner. Particularly preferred auxiliary substances are cytokines, such as monokines, lymphokines, interleukins or chemokines, that further promote the innate immune response, such as IL-1, IL-2, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL-12, IL-13, IL-14, IL-15, IL-16, IL-17, IL-18, IL-19, IL- 20, IL-21, IL-22, IL-23, IL-24, IL-25, IL-26, IL-27, IL-28, IL-29, IL-30, IL-31, IL-32, IL-33, IFN-alpha, IFN-beta, IFN-gamma, GM-CSF, G-CSF, M-CSF, LT-beta or TNF-alpha, growth factors, such as hGH. A synergistic action of the mRNA contained in the inventive composition and of an auxiliary substance, which may be optionally be co-formulated (or separately formulated) with the inventive vaccine or composition as described above, is preferably achieved thereby. Preferably, such immunogenicity increasing agents or compounds are provided separately (not co-formulated with the inventive vaccine or composition) and administered individually.
Suitable adjuvants may also be selected from cationic or polycationic compounds wherein the adjuvant is preferably prepared upon complexing the mRNA of the composition according to the invention with the cationic or polycationic compound as disclosed above.
The inventive vaccine or composition can also additionally contain any further compound, which is known to be immune-stimulating due to its binding affinity (as ligands) to human Toll-like receptors TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, or due to its binding affinity (as ligands) to murine Toll-like receptors TLR1, TLR2, TLR3, TLR4, TLR5, TLR6, TLR7, TLR8, TLR9, TLR10, TLR11, TLR12 or TLR13.
Another class of compounds, which may be added to an inventive vaccine or composition in this context, may be CpG nucleic acids, in particular CpG-RNA or CpG-DNA. A CpG-RNA or CpG-DNA can be a single-stranded CpG-DNA (ss CpG-DNA), a double-stranded CpG-DNA (dsDNA), a single-stranded CpG-RNA (ss CpG-RNA) or a double-stranded CpG-RNA (ds CpG-RNA). The CpG nucleic acid is preferably in the form of CpG-RNA, more preferably in the form of single-stranded CpG-RNA (ss CpG-RNA). The CpG nucleic acid preferably contains at least one or more (mitogenic) cytosine/guanine dinucleotide sequence(s) (CpG motif(s)). According to a first preferred alternative, at least one CpG motif contained in these sequences, that is to say the C (cytosine) and the G (guanine) of the CpG motif, is unmethylated. All further cytosines or guanines optionally contained in these sequences can be either methylated or unmethylated. According to a further preferred alternative, however, the C (cytosine) and the G (guanine) of the CpG motif can also be present in methylated form.
Suitable adjuvants may furthermore be selected from nucleic acids having the formula (Va): GiXmGn, wherein: G is guanosine (guanine), uridine (uracil) or an analogue of guanosine (guanine) or uridine (uracil); X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of the above-mentioned nucleotides (nucleosides); l is an integer from 1 to 40, wherein when I=1 G is guanosine (guanine) or an analogue thereof, when l>1 at least 50% of the nucleotides are guanosine (guanine) or an analogue thereof; m is an integer and is at least 3; wherein when m=3 X is uridine (uracil) or an analogue thereof, when m>3 at least 3 successive uridines (uracils) or analogues of uridine (uracil) occur; n is an integer from 1 to 40, wherein when n=1 G is guanosine (guanine) or an analogue thereof, when n>1 at least 50% of the nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof, or formula (Vb): (NuGiXmGnNv)a, wherein: G is guanosine (guanine), uridine (uracil) or an analogue of guanosine (guanine) or uridine (uracil), preferably guanosine (guanine) or an analogue thereof; X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine), or an analogue of these nucleotides (nucleosides), preferably uridine (uracil) or an analogue thereof; N is a nucleic acid sequence having a length of about 4 to 50, preferably of about 4 to 40, more preferably of about 4 to 30 or 4 to 20 nucleic acids, each N independently being selected from guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of these nucleotides (nucleosides); a is an integer from 1 to 20, preferably from 1 to 15, most preferably from 1 to 10; l is an integer from 1 to 40, wherein when l=1, G is guanosine (guanine) or an analogue thereof, when l>1, at least 50% of these nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof; m is an integer and is at least 3; wherein when m=3, X is uridine (uracil) or an analogue thereof, and when m>3, at least 3 successive uridines (uracils) or analogues of uridine (uracil) occur; n is an integer from 1 to 40, wherein when n=1, G is guanosine (guanine) or an analogue thereof, when n>1, at least 50% of these nucleotides (nucleosides) are guanosine (guanine) or an analogue thereof; u,v may be independently from each other an integer from 0 to 50, preferably wherein when u=0, v≥1, or when v=0, u≥1; wherein the nucleic acid molecule of formula (Vb) has a length of at least 50 nucleotides, preferably of at least 100 nucleotides, more preferably of at least 150 nucleotides, even more preferably of at least 200 nucleotides and most preferably of at least 250 nucleotides.
Other suitable adjuvants may furthermore be selected from nucleic acids having the formula (VI): CiXmCn, wherein: C is cytidine (cytosine), uridine (uracil) or an analogue of cytidine (cytosine) or uridine (uracil); X is guanosine (guanine), uridine (uracil), adenosine (adenine), thymidine (thymine), cytidine (cytosine) or an analogue of the above-mentioned nucleotides (nucleosides); l is an integer from 1 to 40, wherein when I=1 C is cytidine (cytosine) or an analogue thereof, when l>1 at least 50% of the nucleotides are cytidine (cytosine) or an analogue thereof; m is an integer and is at least 3; wherein when m=3 X is uridine (uracil) or an analogue thereof, when m>3 at least 3 successive uridines (uracils) or analogues of uridine (uracil) occur; n is an integer from 1 to 40, wherein when n=1 C is cytidine (cytosine) or an analogue thereof, when n>1 at least 50% of the nucleotides (nucleosides) are cytidine (cytosine) or an analogue thereof.
In this context the disclosure of WO002008014979 and WO2009095226 is also incorporated herein by reference.
Further additives which may be included in the inventive vaccine or composition are emulsifiers, such as, for example, Tween; wetting agents, such as, for example, sodium lauryl sulfate; colouring agents; taste-imparting agents, pharmaceutical carriers; tablet-forming agents; stabilizers; antioxidants; preservatives.
In a further aspect, the present invention provides a vaccine, which is based on the mRNA sequence according to the invention comprising at least one coding region as defined herein. The vaccine according to the invention is preferably a (pharmaceutical) composition as defined herein. Accordingly, the vaccine according to the invention is based on the same components as the (pharmaceutical) composition described herein. Insofar, it may be referred to the description of the (pharmaceutical) composition as provided herein. Preferably, the vaccine according to the invention comprises at least one mRNA comprising at least one mRNA sequence as defined herein and a pharmaceutically acceptable carrier. In embodiments, where the vaccine comprises more than one mRNA sequence (such as a plurality of RNA sequences according to the invention, wherein each preferably encodes a distinct antigenic peptide or protein), the vaccine may be provided in physically separate form and may be administered by separate administration steps. The vaccine according to the invention may correspond to the (pharmaceutical) composition as described herein, especially where the mRNA sequences are provided by one single composition. However, the inventive vaccine may also be provided physically separated. For instance, in embodiments, wherein the vaccine comprises more than one mRNA sequences/species, these RNA species may be provided such that, for example, two, three, four, five or six separate compositions, which may contain at least one mRNA species/sequence each (e.g. three distinct mRNA species/sequences), each encoding distinct antigenic peptides or proteins, are provided, which may or may not be combined. Also, the inventive vaccine may be a combination of at least two distinct compositions, each composition comprising at least one mRNA encoding at least one of the antigenic peptides or proteins defined herein. Alternatively, the vaccine may be provided as a combination of at least one mRNA, preferably at least two, three, four, five, six or more mRNAs, each encoding one of the antigenic peptides or proteins defined herein. The vaccine may be combined to provide one single composition prior to its use or it may be used such that more than one administration is required to administer the distinct mRNA sequences/species encoding any of the antigenic peptides or proteins as defined herein. If the vaccine contains at least one mRNA sequence, typically at least two mRNA sequences, encoding the antigen combinations defined herein, it may e.g. be administered by one single administration (combining all mRNA species/sequences), by at least two separate administrations. Accordingly; any combination of mono-, bi- or multicistronic mRNAs encoding the at least one antigenic peptide or protein or any combination of antigens as defined herein (and optionally further antigens), provided as separate entities (containing one mRNA species) or as combined entity (containing more than one mRNA species), is understood as a vaccine according to the present invention. According to a particularly preferred embodiment of the inventive vaccine, the at least one antigen, preferably a combination as defined herein of at least two, three, four, five, six or more antigens encoded by the inventive composition as a whole, is provided as an individual (monocistronic) mRNA, which is administered separately.
As with the (pharmaceutical) composition according to the present invention, the entities of the vaccine may be provided in liquid and or in dry (e.g. lyophilized) form. They may contain further components, in particular further components allowing for its pharmaceutical use. The vaccine or the (pharmaceutical) composition may, e.g., additionally contain a pharmaceutically acceptable carrier and/or further auxiliary substances and additives and/or adjuvants.
The vaccine or (pharmaceutical) composition typically comprises a safe and effective amount of the mRNA according to the invention as defined herein, encoding an antigenic peptide or protein as defined herein or a fragment or variant thereof or a combination of antigens, preferably as defined herein. As used herein, “safe and effective amount” means an amount of the mRNA that is sufficient to significantly induce a positive modification of cancer or a disease or disorder related to cancer. At the same time, however, a “safe and effective amount” is small enough to avoid serious side-effects, that is to say to permit a sensible relationship between advantage and risk. The determination of these limits typically lies within the scope of sensible medical judgment. In relation to the vaccine or (pharmaceutical) composition of the present invention, the expression “safe and effective amount” preferably means an amount of the mRNA (and thus of the encoded antigen) that is suitable for stimulating the adaptive immune system in such a manner that no excessive or damaging immune reactions are achieved but, preferably, also no such immune reactions below a measurable level. Such a “safe and effective amount” of the mRNA of the (pharmaceutical) composition or vaccine as defined herein may furthermore be selected in dependence of the type of mRNA, e.g. monocistronic, bi- or even multicistronic mRNA, since a bi- or even multicistronic mRNA may lead to a significantly higher expression of the encoded antigen(s) than the use of an equal amount of a monocistronic mRNA. A “safe and effective amount” of the mRNA of the (pharmaceutical) composition or vaccine as defined above will furthermore vary in connection with the particular condition to be treated and also with the age and physical condition of the patient to be treated, the severity of the condition, the duration of the treatment, the nature of the accompanying therapy, of the particular pharmaceutically acceptable carrier used, and similar factors, within the knowledge and experience of the accompanying doctor. The vaccine or composition according to the invention can be used according to the invention for human and also for veterinary medical purposes, as a pharmaceutical composition or as a vaccine.
In a preferred embodiment, the mRNA of the (pharmaceutical) composition, vaccine or kit of parts according to the invention is provided in lyophilized form. Preferably, the lyophilized mRNA is reconstituted in a suitable buffer, advantageously based on an aqueous carrier, priorto administration, e.g. Ringer-Lactate solution, which is preferred, Ringer solution, a phosphate buffer solution. In a preferred embodiment, the (pharmaceutical) composition, the vaccine or the kit of parts according to the invention contains at least one, two, three, four, five, six or more mRNAs, preferably mRNAs which are provided separately in lyophilized form (optionally together with at least one further additive) and which are preferably reconstituted separately in a suitable buffer (such as Ringer-Lactate solution) prior to their use so as to allow individual administration of each of the (monocistronic) mRNAs.
The vaccine or (pharmaceutical) composition according to the invention may typically contain a pharmaceutically acceptable carrier. The expression “pharmaceutically acceptable carrier” as used herein preferably includes the liquid or non-liquid basis of the inventive vaccine. If the inventive vaccine is provided in liquid form, the carrier will be water, typically pyrogen-free water; isotonic saline or buffered (aqueous) solutions, e.g phosphate, citrate etc. buffered solutions. Particularly for injection of the inventive vaccine, water or preferably a buffer, more preferably an aqueous buffer, may be used, containing a sodium salt, preferably at least 50 mM of a sodium salt, a calcium salt, preferably at least 0.01 mM of a calcium salt, and optionally a potassium salt, preferably at least 3 mM of a potassium salt. According to a preferred embodiment, the sodium, calcium and, optionally, potassium salts may occur in the form of their halogenides, e.g. chlorides, iodides, or bromides, in the form of their hydroxides, carbonates, hydrogen carbonates, or sulfates, etc. Without being limited thereto, examples of sodium salts include e.g. NaCl, NaI, NaBr, Na2CO3, NaHCO3, Na2SO4, examples of the optional potassium salts include e.g. KCl, KI, KBr, K2CO3, KHCO3, K2SO4, and examples of calcium salts include e.g. CaCl2, Cal2, CaBr2, CaCO3, CaSO4, Ca(OH)2. Furthermore, organic anions of the aforementioned cations may be contained in the buffer. According to a more preferred embodiment, the buffer suitable for injection purposes as defined above, may contain salts selected from sodium chloride (NaCl), calcium chloride (CaCl2) and optionally potassium chloride (KCl), wherein further anions may be present additional to the chlorides. CaCl2 can also be replaced by another salt like KCl. Typically, the salts in the injection buffer are present in a concentration of at least 50 mM sodium chloride (NaCl), at least 3 mM potassium chloride (KCl) and at least 0.01 mM calcium chloride (CaCl2). The injection buffer may be hypertonic, isotonic or hypotonic with reference to the specific reference medium, i.e. the buffer may have a higher, identical or lower salt content with reference to the specific reference medium, wherein preferably such concentrations of the afore mentioned salts may be used, which do not lead to damage of cells due to osmosis or other concentration effects. Reference media are e.g. in “in vivo” methods occurring liquids such as blood, lymph, cytosolic liquids, or other body liquids, or e.g. liquids, which may be used as reference media in “in vitro” methods, such as common buffers or liquids. Such common buffers or liquids are known to a skilled person. Ringer-Lactate solution is particularly preferred as a liquid basis.
However, one or more compatible solid or liquid fillers or diluents or encapsulating compounds may be used as well, which are suitable for administration to a person. The term “compatible” as used herein means that the constituents of the inventive vaccine are capable of being mixed with the mRNA according to the invention as defined herein, in such a manner that no interaction occurs, which would substantially reduce the pharmaceutical effectiveness of the inventive vaccine under typical use conditions. Pharmaceutically acceptable carriers, fillers and diluents must, of course, have sufficiently high purity and sufficiently low toxicity to make them suitable for administration to a person to be treated. Some examples of compounds which can be used as pharmaceutically acceptable carriers, fillers or constituents thereof are sugars, such as, for example, lactose, glucose, trehalose and sucrose; starches, such as, for example, corn starch or potato starch; dextrose; cellulose and its derivatives, such as, for example, sodium carboxymethylcellulose, ethylcellulose, cellulose acetate; powdered tragacanth; malt; gelatin; tallow; solid glidants, such as, for example, stearic acid, magnesium stearate; calcium sulfate; vegetable oils, such as, for example, groundnut oil, cottonseed oil, sesame oil, olive oil, corn oil and oil from theobroma; polyols, such as, for example, polypropylene glycol, glycerol, sorbitol, mannitol and polyethylene glycol; alginic acid.
The choice of a pharmaceutically acceptable carrier is determined, in principle, by the manner, in which the pharmaceutical composition or vaccine according to the invention is administered. The composition or vaccine can be administered, for example, systemically or locally. Routes for systemic administration in general include, for example, transdermal, oral, parenteral routes, including subcutaneous, intravenous, intramuscular, intraarterial, intradermal and intraperitoneal injections and/or intranasal administration routes. Routes for local administration in general include, for example, topical administration routes but also intradermal, transdermal, subcutaneous, or intramuscular injections or intralesional, intracranial, intrapulmonal, intracardial, and sublingual injections. More preferably, composition or vaccines according to the present invention may be administered by an intradermal, subcutaneous, or intramuscular route, preferably by injection, which may be needle-free and/or needle injection. Compositions/vaccines are therefore preferably formulated in liquid or solid form. The suitable amount of the vaccine or composition according to the invention to be administered can be determined by routine experiments, e.g. by using animal models. Such models include, without implying any limitation, rabbit, sheep, mouse, rat, dog and non-human primate models. Preferred unit dose forms for injection include sterile solutions of water, physiological saline or mixtures thereof. The pH of such solutions should be adjusted to about 7.4. Suitable carriers for injection include hydrogels, devices for controlled or delayed release, polylactic acid and collagen matrices. Suitable pharmaceutically acceptable carriers for topical application include those which are suitable for use in lotions, creams, gels and the like. If the inventive composition or vaccine is to be administered perorally, tablets, capsules and the like are the preferred unit dose form. The pharmaceutically acceptable carriers for the preparation of unit dose forms which can be used for oral administration are well known in the prior art. The choice thereof will depend on secondary considerations such as taste, costs and storability, which are not critical for the purposes of the present invention, and can be made without difficulty by a person skilled in the art.
Despite, the inventive composition may comprise further components for facilitating administration and uptake of components of the pharmaceutical composition. Such further components may be an appropriate carrier or vehicle, additional adjuvants for supporting any immune response, antibacterial and/or antiviral agents.
A further component of the inventive pharmaceutical composition may be an immunotherapeutic agent that can be selected from immunoglobulins, preferably IgGs, monoclonal or polyclonal antibodies, polyclonal serum or sera, etc. Preferably, such a further immunotherapeutic agent may be provided as a peptide/protein or may be encoded by a nucleic acid, preferably by a DNA or an RNA, more preferably an mRNA.
Further additives which may be included in the inventive composition are emulsifiers, such as, for example, Tween®; wetting agents, such as, for example, sodium lauryl sulfate; colouring agents; taste-imparting agents, pharmaceutical carriers; tablet-forming agents; stabilizers; antioxidants; preservatives.
The inventive composition and especially the pharmaceutical composition or vaccine typically comprises a “safe and effective amount” of the components of the composition, particularly of the mRNA sequence molecule(s) as defined herein. As used herein, a “safe and effective amount” means an amount of the mRNA molecule(s) as defined herein as such that is sufficient to significantly induce a positive protection or treatment of rotavirus disease. At the same time, however, a “safe and effective amount” is small enough to avoid serious side-effects and to permit a sensible relationship between advantage and risk. The determination of these limits typically lies within the scope of sensible medical judgment. A “safe and effective amount” of the components of the inventive pharmaceutical composition will furthermore vary in connection with the particular condition to be treated and also with the age and physical condition of the patient to be treated, the body weight, general health, sex, diet, time of administration, rate of excretion, drug combination, the severity of the condition, the duration of the treatment, the nature of the accompanying therapy, of the particular pharmaceutically acceptable carrier used, and similar factors, within the knowledge and experience of the accompanying doctor.
The inventive pharmaceutical composition may be used for human and also for veterinary medical purposes, preferably for human medical purposes, as a pharmaceutical composition in general or preferably as a vaccine or immunostimulating agent for prophylaxis and/or treatment of rotavirus infections.
Moreover, one or more compatible solid or liquid fillers or diluents or encapsulating compounds, which are suitable for administration to a patient to be treated, may be used as well for the pharmaceutical composition or vaccine according to the invention. The term “compatible” as used here means that these constituents of the inventive pharmaceutical composition or vaccine are capable of being mixed with the components of the inventive pharmaceutical composition or vaccine in such a manner that no interaction occurs which would substantially reduce the pharmaceutical effectiveness of the pharmaceutical composition or vaccine under typical use conditions.
Furthermore, the inventive pharmaceutical composition or vaccine may comprise at least one additional pharmaceutically active component. A pharmaceutically active component in this connection is a compound that has a therapeutic effect to heal, ameliorate or prevent a particular indication or disease. Such compounds include, without implying any limitation, peptides or proteins, preferably as defined herein, nucleic acids, preferably as defined herein, (therapeutically active) low molecular weight organic or inorganic compounds (molecular weight less than 5000, preferably less than 1000), sugars, antigens or antibodies, preferably as defined herein, therapeutic agents already known in the prior art, antigenic cells, antigenic cellular fragments, cellular fractions, cell wall components (e.g. polysaccharides), modified, attenuated or de-activated (e.g. chemically or by irradiation) pathogens (virus, bacteria etc.), antibacterial agents etc.
In a further aspect, the invention relates to a kit, preferably a kit of parts comprising one or more of the inventive mRNA sequence or the mRNA containing (pharmaceutical) composition or vaccine as described above, optionally a liquid vehicle for solubilising, and optionally technical instructions with information on the administration and dosage of the components respectively the composition or the pharmaceutical composition or the vaccine. Preferably, the mRNA component of the composition and possibly further components are provided in lyophilized form as a separate part. Preferably, the kit contains as a part Ringer-Lactate solution.
Beside the components of the inventive mRNA containing composition, the kit may additionally contain a pharmaceutically acceptable vehicle and/or one or more adjuvant components and optionally further components as described below, as well as means for administration and technical instructions. In a preferred embodiment, prior to use of the kit, the provided vehicle is than added to the lyophilized components in a predetermined amount as written e.g. in the provided technical instructions.
Moreover, the invention relates to the composition as defined above, or the pharmaceutical composition as defined above, or the vaccine as defined above, or the kit as defined above for use as a medicament and especially for use for the preparation of a medicament.
Moreover, the invention relates to a use of the composition as defined above, or the pharmaceutical composition as defined above, or the vaccine as defined above, or the kit as defined above for use in the treatment or prophylaxis of rotavirus infections. It is especially preferred to use the composition or the pharmaceutical composition or the vaccine or the kit in prophylaxis of rotavirus infections. It is especially preferred to use it as a prophylactic vaccine because the inventive vaccine is very efficient and safe and also cheap in production.
Preferably the composition as defined above, or the pharmaceutical composition as defined above, or the vaccine as defined above, or the kit as defined above is prepared for parenteral administration, more preferably for administration by subcutaneous or intramuscular or intradermal or topical or transdermal application. Compared to oral administration parenteral administration is particularly preferred because the efficiency or oral vaccination is significantly reduced in developing countries as already described above. Preferably, the administration is done by conventional needle injection orjet injection, preferably by using jet injection. For practical reasons in developing countries particularly conventional needle injection may be preferred, whereas also jet injection may be preferred, because it is an especially effective administration route. Nevertheless, other administration routes may also be applied, especially topical or transdermal administration may be applied, wherein the pharmaceutical composition or vaccine may be prepared in the form of a transdermal patch. The efficiency of the transdermal application may be enhanced by addition of enhancers for transdermal delivery.
Moreover, the invention relates to a method of treatment or prophylaxis of rotavirus infections, wherein one or more mRNA sequences or the composition or the pharmaceutical composition or the vaccine or the kit or kit of parts as defined above is provided and applied or administered to a tissue or an organism. Preferably, a therapeutically effective amount thereof is applied or administered to a subject in need thereof. In preferred embodiments the mRNA sequence(s) or the composition or the pharmaceutical composition or the vaccine or the kit or kit of parts is administered by subcutaneous or intramuscular or intradermal or topical or transdermal application, e. g. by conventional needle injection or jet injection.
Moreover, topical or transdermal administration may be especially preferred, for example by iontophoresis or by non-cavitational ultrasound or by cavitational ultrasound or by electroporation of by microneedles or by thermal ablation or by microdermabrasion. In this context it is referred to the article of Prausnitz M. R. and Langer R. (Prausnitz M. R. and Langer R. (2008), Nat Biotechnol November 26(11):1261-1268) generally describing methods for transdermal drug delivery, which may be used for the inventive composition. It may be especially preferred to use transdermal patches for administration, as described above. Moreover, patches with microneedles may be used, especially for slow release. Moreover, in especially preferred embodiments, creams, lotions or gels containing the inventive composition may be used.
According to a specific embodiment, the inventive mRNA sequence(s), or the inventive (pharmaceutical) composition or vaccine may be administered to the patient as a single dose or as at least one single dose, respectively. In certain embodiments, the inventive mRNA sequence(s) or the inventive (pharmaceutical) composition or vaccine may be administered to a patient as a single dose followed by a second dose later and optionally even a third, fourth (or more) dose subsequent thereto etc. In accordance with this embodiment, booster inoculations with the inventive mRNA sequence(s) or the inventive (pharmaceutical) composition or vaccine may be administered to a patient at specific time intervals, preferably as defined below, following the second (or third, fourth, etc.) inoculation. Preferably, at least one dose of the inventive mRNA sequence, pharmaceutical composition or vaccine is administered, preferably from 1 to 10 doses, more preferably from 2 to 7 doses, even more preferably from 2 to 5 doses and most preferably from 3 to 5 doses. In a particularly preferred embodiment, 3 doses are administered. In another embodiment 5 doses are administered. In that embodiment, the doses are given in a specific time period, e.g. 20-30 or 20-60 days. The interval between the administration of two or more doses is preferably from 5 to 120 days, more preferably from 7 to 15 days or 15 to 30 days. In a preferred embodiment, the interval between the administration of two or more doses is at least 7 days, more preferably 28 days.
In a preferred embodiment, a single dose of the inventive mRNA sequence(s), composition or vaccine comprises a specific amount of the mRNA according to the invention. Preferably, the inventive mRNA sequence(s) is provided in an amount of at least 40 μg per dose, preferably in an amount of from 40 to 700 μg per dose, more preferably in an amount of from 80 to 400 μg per dose. More specifically, in the case of intradermal injection, which is preferably carried out by using a conventional needle, the amount of the inventive mRNA sequence(s) comprised in a single dose is typically at least 200 μg, preferably from 200 μg to 1.000 μg, more preferably from 300 μg to 850 μg, even more preferably from 300 μg to 700 μg. In the case of intradermal injection, which is preferably carried out via jet injection (e.g. using a Tropis device), the amount of the inventive mRNA sequence(s) comprised in a single dose is typically at least 80 μg, preferably from 80 μg to 700 μg, more preferably from 80 μg to 400 μg. Moreover, in the case of intramuscular injection, which is preferably carried out by using a conventional needle or via jet injection, the amount of the inventive mRNA sequence(s) comprised in a single dose is typically at least 80 μg, preferably from 80 μg to 1.000 μg, more preferably from 80 μg to 850 μg, even more preferably from 80 μg to 700 μg.
More specifically, the following specific embodiments are particularly preferred:
In certain embodiments, such booster inoculations with the inventive mRNA sequence(s) or inventive (pharmaceutical) composition or vaccine as disclosed above (second, third etc. vaccination) may utilize an additional compound or component as defined for the inventive mRNA sequence(s) or inventive (pharmaceutical) composition or vaccine as defined herein.
In the present invention, if not otherwise indicated, different features of alternatives and embodiments may be combined with each other, where suitable. Furthermore, the term “comprising” shall not be narrowly construed as being limited to “consisting of” only, if not specifically mentioned. Rather, in the context of the present invention, “consisting of” is an embodiment specifically contemplated by the inventors to fall under the scope of “comprising”, wherever “comprising” is used herein.
All publications, patents and patent applications cited in this specification are herein incorporated by reference as if each individual publication or patent application were specifically and individually indicated to be incorporated by reference. Although the foregoing invention has been described in some detail by way of illustration and example for purposes of clarity of understanding, it will be readily apparent to those of ordinary skill in the art in light of the teachings of this invention that certain changes and modifications may be made thereto without departing from the spirit or scope of the appended claims.
The examples and figures shown in the following are merely illustrative and shall describe the present invention in a further way. These figures and examples shall not be construed to limit the present invention thereto.
The examples shown in the following are merely illustrative and shall describe the present invention in a further way. These examples shall not be construed to limit the present invention thereto.
1. Preparation of DNA and mRNA Constructs
For the present examples DNA sequences encoding the VP8* protein of different serotypes of rotavirus were prepared and used for subsequent in vitro transcription. Schematics of the constructs are shown in
2. In Vitro Transcription
The respective DNA plasmids prepared according to paragraph 1 were transcribed in vitro using T7 RNA polymerase in the presence of a CAP analogue (m7GpppG). Subsequently the mRNA was purified using PureMessenger© (CureVac, Tubingen, Germany; WO2008/077592A1).
The following mRNA sequences were prepared:
TABLE 21
R
SEQ ID
AA in
Helper
Signal
N
Transmembrane
VLP
number
NO.
Serotype
VP4
peptide
peptide
glycosylation sites
domain
domain
1.
R3718
3114
P[6]
+P2
N −> Q
2.
R3720
3135
P[6]
+P2
N −> Q
3.
R3722
3121
P[6]
+P2
N −> Q
4.
R3724
3128
P[6]
+P2
N −> Q
WhcAg
5.
R5471
2968
P[8]
65-223
+P2
—
wt
6.
R5473
2872
P[8]
40-223
+P2
—
wt
7.
R5475
2488
P[8]
2-230
+P2
—
wt
8.
R5479
2484
P[8]
1-230
—
—
wt
9.
R5481
2496
P[8]
2-230
—
+HSA
wt
10.
R5483
2592
P[8]
10-223
—
+HSA
wt
11.
R5485
2880
P[8]
40-223
—
+HSA
wt
12.
R5487
2900
P[8]
40-223
+P2
+HSA
wt
13.
R5489
2904
P[8]
40-223
+P2
+IgE
wt
14.
R5491
2736
P[8]
10-240
—
+HSA
N −> Q
15.
R5493
2928
P[8]
40-223
—
+HSA
N −> Q
16.
R5594
2875
P[6]
40-223
—
+HSA
wt
17.
R5595
2895
P[6]
40-223
+P2
+TPA
wt
18.
R5596
2491
P[6]
2-230
—
+TPA
wt
19.
R5597
2827
P[6]
20-240
—
+TPA
N −> Q
The mRNA sequences comprise in 5′- to 3′-direction:
3. Preparation of the mRNA Vaccine
3.1. Protamine Complexation:
The mRNA vaccine consisted of a mixture of 50% free mRNA and 50% mRNA complexed with protamine at a weight ratio of 2:1. First, mRNA was complexed with protamine by addition of protamine-Ringer's lactate solution to mRNA. After incubation for 10 minutes, when the complexes were stably generated, free mRNA was added, and the final concentration of the vaccine was adjusted with Ringer's lactate solution.
3.2. LNP Formulation
Lipid nano particle (LNP)-formulated mRNA was generated using an ionizable amino lipid, phospholipid, cholesterol and a PEGylated lipid, similar in composition as described in Thess et al. Mol Ther J Am Soc Gene Ther. 2015; 23(9):1456-1464.
3.3. CVCM Formulation:
The polyethylene glycol/peptide polymers (HO-PEG 5000-S-(S-CHHHHHHRRRRHHHHHHC-S-)7-S-PEG 5000-OH according to formula IV (referred to as PB83) is used for complexation of the inventive mRNA sequences, optional in combination with a lipid or lipidoid as disclosed in PCT/EP2016/063228 (incorporated herewith by reference).
4. Analysis of VP8* Specific Antibodies by ELISA
ELISA plates were coated with 1 μg/ml (for P[4] P2-VP8* and P[6] P2-VP8*) or 10 μg/ml (for P[8] P2-VP8*) protein. Coated plates were blocked (in 1% milk; 0.05% Tween in PBS) and incubated with serum in different dilutions. Binding of specific antibodies to the P2 VP8* protein was detected using HRP (horse radish peroxidase) coupled rat monoclonal anti-mouse IgG1 or IgG2a using Amplex Ultra Red as a substrate for detection.
Statistical analysis was done by Mann Whitney test. Asterisks represent the following p values: *p<0.05; **p<0.01; ***p<0.001.
Balb/c mice were immunised with mRNA vaccines (as prepared in Example 1) encoding P2-VP8* from serotype P[6] or RiLa (Ringer lactate) as a negative control as indicated in Table 22 below. Intradermal (i.d.) vaccinations were performed on day 0, day 21 and day 42. Blood samples taken on day 56 were analysed for the presence of VP8* specific IgG1 and IgG2a antibodies by ELISA using P2 VP8* P[6] protein (
TABLE 22
Animal groups and treatment
Strain/
Treatment/
Route/
Vaccin.
gender
Nr.
Vaccine dose
Construct
Serotype
Volume
schedule
1
BALB/c
8
80 μg R3718
Secreted
P[6]
i.d.
d0/21/42
Female
P2-VP8*
2 × 50 μl
2
BALB/c
8
80 μg R3720
Secreted repeat
P[6]
i.d.
d0/21/42
Female
P2-VP8*
2 × 50 μl
3
BALB/c
8
80 μg R3722
Transmembrane
P[6]
i.d.
d0/21/42
Female
P2-VP8*
2 × 50 μl
4
BALB/c
8
80 μg R3724
VLP P2-VP8*
P[6]
i.d.
d0/21/42
Female
2 × 50 μl
5
BALB/c
8
100% RiLa
—
—
i.d.
d0/21/42
Female
2 × 50 μl
Results
As shown in
Balb/c mice are immunised intradermally with the new mRNA vaccine designs encoding P2-VP8* (table 21). RiLa (Ringer lactate) and adjuvanted P2-VP8* protein are employed as a negative and positive control, respectively, as indicated in Table 23 and 24. Vaccinations are performed on day 0, day 21 and day 42. Blood samples taken on day 56 are analysed for the presence of VP8* specific IgG1 and IgG2a antibodies by ELISA using P2 VP8* P[8] or P[6] protein, respectively, as a coating reagent as described above.
TABLE 23
Animal groups and treatment part A
Strain/
Route/
Gr.
gender
Number
Treatment
Volume
Vaccin. schedule
1
BALB/c
6
Negative control
i.d.
d0/21/42
female
2 × 50 μl
2
BALB/c
6
Positive control
i.m.
d0/21/42
female
4 × 25 μl
3
BALB/c
12
80 μg R3718
i.d.
d0/21/42
female
2 × 50 μl
4
BALB/c
12
80 μg R5594
i.d.
d0/21/42
female
2 × 50 μl
5
BALB/c
12
80 μg R5595
i.d.
d0/21/42
female
2 × 50 μl
6
BALB/c
12
80 μg R5596
i.d.
d0/21/42
female
2 × 50 μl
7
BALB/c
12
80 μg R5597
i.d.
d0/21/42
female
2 × 50 μl
8
BALB/c
12
80 μg R5471
i.d.
d0/21/42
female
2 × 50 μl
9
BALB/c
12
80 μg R5473
i.d.
d0/21/42
female
2 × 50 μl
10
BALB/c
12
80 μg R5475
i.d.
d0/21/42
female
2 × 50 μl
11
BALB/c
12
80 μg R5479
i.d.
d0/21/42
female
2 × 50 μl
TABLE 24
Animal groups and treatment part B
Strain/
Route/
Gr.
gender
Number
Treatment
Volume
Vaccin. schedule
1
BALB/c
6
Negative control
i.d.
d0/21/42
Female
2 × 50 μl
2
BALB/c
6
Positive control
i.m.
d0/21/42
Female
4 × 25 μl
3
BALB/c
12
80 μg R3718
i.d.
d0/21/42
Female
2 × 50 μl
4
BALB/c
12
80 μg R5481
i.d.
d0/21/42
Female
2 × 50 μl
5
BALB/c
12
80 μg R5483
i.d.
d0/21/42
Female
2 × 50 μl
6
BALB/c
12
80 μg R5485
i.d.
d0/21/42
Female
2 × 50 μl
7
BALB/c
12
80 μg R5487
i.d.
d0/21/42
Female
2 × 50 μl
8
BALB/c
12
80 μg R5489
i.d.
d0/21/42
Female
2 × 50 μl
9
BALB/c
12
80 μg R5490
i.d.
d0/21/42
Female
2 × 50 μl
10
BALB/c
12
80 μg R5491
i.d.
d0/21/42
Female
2 × 50 μl
Five mRNA designs are tested as mRNA formulations with LNPs (lipid nanoparticle) and CVCMs formulations. For this, the respective mRNA sequences are formulated as described above and are tested upon intramuscular injection. RiLa (Ringer lactate) and adjuvanted P2-VP8* protein are employed as a negative and positive control, respectively, as indicated in Table 25 and 26 vaccinations are performed on day 0, day 21 and day 42. Blood samples taken on day 56 are analysed for the presence of VP8* specific IgG1 and IgG2a antibodies by ELISA using P2 VP8* P[8] or P[6] protein, respectively, as a coating reagent (
TABLE 25
Animal groups and treatment LNP formulations
Strain/
Vaccine
Route/
Vaccin.
Gr.
gender
Number
Treatment
formulation
Volume
schedule
1
BALB/c
6
Negative
—
i.d.
d0/21/42
female
control
2 × 50 μl
2
BALB/c
6
Positive
—
i.m.
d0/21/42
female
control
4 × 25 μl
3
BALB/c
12
Design 1
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
4
BALB/c
12
Design 2
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
5
BALB/c
12
Design 3
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
6
BALB/c
12
Design 4
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
7
BALB/c
12
Design 5
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
8
BALB/c
12
Design 1
LNP
i.m.
d0/21/42
female
5 μg
formulation
1 × 25 μl
9
BALB/c
12
Design 2
LNP
i.m.
d0/21/42
female
5 μg
formulation
1 × 25 μl
10
BALB/c
12
Design 3
LNP
i.m.
d0/21/42
female
5 μg
formulation
1 × 25 μl
11
BALB/c
12
Design 4
LNP
i.m.
d0/21/42
female
5 μg
formulation
1 × 25 μl
12
BALB/c
12
Design 5
LNP
i.m.
d0/21/42
female
5 μg
formulation
1 × 25 μl
TABLE 26
Animal groups and treatment CVCM formulations
Strain/
Vaccine
Route/
Vaccin.
Gr.
gender
Number
Treatment
formulation
Volume
schedule
1
BALB/c
6
Negative
—
i.d.
d0/21/42
female
control
2 × 50 μl
2
BALB/c
6
Positive
—
i.m.
d0/21/42
female
control
4 × 25 μl
3
BALB/c
12
Design 1
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
4
BALB/c
12
Design 2
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
5
BALB/c
12
Design 3
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
6
BALB/c
12
Design 4
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
7
BALB/c
12
Design 5
Protamine
i.d.
d0/21/42
female
80 μg
formulation
2 × 50 μl
8
BALB/c
12
Design 1
CVCM
i.m.
d0/21/42
female
10 μg
formulation
1 × 25 μl
9
BALB/c
12
Design 2
CVCM
i.m.
d0/21/42
female
10 μg
formulation
1 × 25 μl
10
BALB/c
12
Design 3
CVCM
i.m.
d0/21/42
female
10 μg
formulation
1 × 25 μl
11
BALB/c
12
Design 4
CVCM
i.m.
d0/21/42
female
10 μg
formulation
1 × 25 μl
12
BALB/c
12
Design 5
CVCM
i.m.
d0/21/42
female
10 μg
formulation
1 × 25 μl
Petsch, Benjamin, Rauch, Susanne
Patent | Priority | Assignee | Title |
11786590, | Nov 09 2015 | CureVac SE | Rotavirus vaccines |
11834651, | May 29 2015 | CureVac Manufacturing GmbH | Method for producing and purifying RNA, comprising at least one step of tangential flow filtration |
Executed on | Assignor | Assignee | Conveyance | Frame | Reel | Doc |
Nov 09 2016 | CureVac AG | (assignment on the face of the patent) | / | |||
Mar 21 2019 | PETSCH, BENJAMIN | CureVac AG | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 048924 | /0419 | |
Apr 10 2019 | RAUCH, SUSANNE | CureVac AG | ASSIGNMENT OF ASSIGNORS INTEREST SEE DOCUMENT FOR DETAILS | 048924 | /0419 | |
Sep 26 2022 | CureVac AG | CureVac SE | CHANGE OF NAME SEE DOCUMENT FOR DETAILS | 062404 | /0030 |
Date | Maintenance Fee Events |
May 08 2018 | BIG: Entity status set to Undiscounted (note the period is included in the code). |
Date | Maintenance Schedule |
Aug 16 2025 | 4 years fee payment window open |
Feb 16 2026 | 6 months grace period start (w surcharge) |
Aug 16 2026 | patent expiry (for year 4) |
Aug 16 2028 | 2 years to revive unintentionally abandoned end. (for year 4) |
Aug 16 2029 | 8 years fee payment window open |
Feb 16 2030 | 6 months grace period start (w surcharge) |
Aug 16 2030 | patent expiry (for year 8) |
Aug 16 2032 | 2 years to revive unintentionally abandoned end. (for year 8) |
Aug 16 2033 | 12 years fee payment window open |
Feb 16 2034 | 6 months grace period start (w surcharge) |
Aug 16 2034 | patent expiry (for year 12) |
Aug 16 2036 | 2 years to revive unintentionally abandoned end. (for year 12) |