AU603883B2 - Genetic engineering process for the preparation of polypeptides - Google Patents
Genetic engineering process for the preparation of polypeptides Download PDFInfo
- Publication number
- AU603883B2 AU603883B2 AU14469/88A AU1446988A AU603883B2 AU 603883 B2 AU603883 B2 AU 603883B2 AU 14469/88 A AU14469/88 A AU 14469/88A AU 1446988 A AU1446988 A AU 1446988A AU 603883 B2 AU603883 B2 AU 603883B2
- Authority
- AU
- Australia
- Prior art keywords
- galactosidase
- gene
- fragment
- polypeptide
- codons
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
- 108090000765 processed proteins & peptides Proteins 0.000 title claims description 50
- 102000004196 processed proteins & peptides Human genes 0.000 title claims description 50
- 229920001184 polypeptide Polymers 0.000 title claims description 48
- 238000000034 method Methods 0.000 title claims description 21
- 230000008569 process Effects 0.000 title claims description 17
- 238000002360 preparation method Methods 0.000 title claims description 6
- 238000010353 genetic engineering Methods 0.000 title description 6
- 239000012634 fragment Substances 0.000 claims description 71
- 108090000623 proteins and genes Proteins 0.000 claims description 65
- 108020004705 Codon Proteins 0.000 claims description 51
- 150000001413 amino acids Chemical class 0.000 claims description 33
- 235000001014 amino acid Nutrition 0.000 claims description 29
- 108010076181 Proinsulin Proteins 0.000 claims description 19
- 102000037865 fusion proteins Human genes 0.000 claims description 19
- 108020001507 fusion proteins Proteins 0.000 claims description 19
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 claims description 17
- 229930182817 methionine Natural products 0.000 claims description 17
- 239000013598 vector Substances 0.000 claims description 12
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 claims description 11
- 235000018417 cysteine Nutrition 0.000 claims description 11
- 241000894006 Bacteria Species 0.000 claims description 10
- 239000004475 Arginine Substances 0.000 claims description 9
- 241000588724 Escherichia coli Species 0.000 claims description 9
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 claims description 9
- 230000002255 enzymatic effect Effects 0.000 claims description 6
- 239000000126 substance Substances 0.000 claims description 6
- BBBFJLBPOGFECG-VJVYQDLKSA-N calcitonin Chemical compound N([C@H](C(=O)N[C@@H](CC(C)C)C(=O)NCC(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC=1C=CC(O)=CC=1)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N[C@@H](CO)C(=O)NCC(=O)N[C@@H]([C@@H](C)O)C(=O)N1[C@@H](CCC1)C(N)=O)C(C)C)C(=O)[C@@H]1CSSC[C@H](N)C(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H]([C@@H](C)O)C(=O)N1 BBBFJLBPOGFECG-VJVYQDLKSA-N 0.000 claims description 5
- 230000004927 fusion Effects 0.000 claims description 5
- 229920001308 poly(aminoacid) Polymers 0.000 claims description 5
- 102000055006 Calcitonin Human genes 0.000 claims description 4
- 108060001064 Calcitonin Proteins 0.000 claims description 4
- 108091028043 Nucleic acid sequence Proteins 0.000 claims description 4
- 229960004015 calcitonin Drugs 0.000 claims description 4
- 238000000926 separation method Methods 0.000 claims description 4
- 239000000095 Growth Hormone-Releasing Hormone Substances 0.000 claims description 3
- 102000007625 Hirudins Human genes 0.000 claims description 3
- 108010007267 Hirudins Proteins 0.000 claims description 3
- 108010002350 Interleukin-2 Proteins 0.000 claims description 3
- 102000000588 Interleukin-2 Human genes 0.000 claims description 3
- 102100022831 Somatoliberin Human genes 0.000 claims description 3
- 101710142969 Somatoliberin Proteins 0.000 claims description 3
- WQPDUTSPKFMPDP-OUMQNGNKSA-N hirudin Chemical compound C([C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H]([C@@H](C)CC)C(=O)N1[C@@H](CCC1)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC=1C=CC(OS(O)(=O)=O)=CC=1)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(O)=O)NC(=O)[C@H](CC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC=1NC=NC=1)NC(=O)[C@H](CO)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H]1N(CCC1)C(=O)[C@H](CCCCN)NC(=O)[C@H]1N(CCC1)C(=O)[C@@H](NC(=O)CNC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@@H](NC(=O)[C@@H](NC(=O)[C@H]1NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CCC(O)=O)NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)CNC(=O)[C@H](CC(C)C)NC(=O)[C@H]([C@@H](C)CC)NC(=O)[C@@H]2CSSC[C@@H](C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](CO)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@H](C(=O)N[C@H](C(NCC(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](CCCCN)C(=O)N2)=O)CSSC1)C(C)C)NC(=O)[C@H](CC(C)C)NC(=O)[C@H]1NC(=O)[C@H](CC(C)C)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)CNC(=O)[C@H](CO)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H]([C@@H](C)O)NC(=O)[C@@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@@H](NC(=O)[C@H](CC=2C=CC(O)=CC=2)NC(=O)[C@@H](NC(=O)[C@@H](N)C(C)C)C(C)C)[C@@H](C)O)CSSC1)C(C)C)[C@@H](C)O)[C@@H](C)O)C1=CC=CC=C1 WQPDUTSPKFMPDP-OUMQNGNKSA-N 0.000 claims description 3
- 229940006607 hirudin Drugs 0.000 claims description 3
- 229940079322 interferon Drugs 0.000 claims description 3
- OWMZNFCDEHGFEP-NFBCVYDUSA-N secretin human Chemical compound C([C@@H](C(=O)N[C@H](C(=O)N[C@@H](CO)C(=O)N[C@@H](CCC(O)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CO)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CCC(O)=O)C(=O)NCC(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(N)=O)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](C(C)C)C(N)=O)[C@@H](C)O)NC(=O)[C@@H](NC(=O)CNC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CO)NC(=O)[C@@H](N)CC=1NC=NC=1)[C@@H](C)O)C1=CC=CC=C1 OWMZNFCDEHGFEP-NFBCVYDUSA-N 0.000 claims description 3
- 238000011144 upstream manufacturing Methods 0.000 claims description 3
- 239000003643 water by type Substances 0.000 claims description 2
- 102000014150 Interferons Human genes 0.000 claims 2
- 108010050904 Interferons Proteins 0.000 claims 2
- 108010086019 Secretin Proteins 0.000 claims 2
- 102100037505 Secretin Human genes 0.000 claims 2
- 229960002101 secretin Drugs 0.000 claims 2
- 230000008878 coupling Effects 0.000 claims 1
- 238000010168 coupling process Methods 0.000 claims 1
- 238000005859 coupling reaction Methods 0.000 claims 1
- 239000013612 plasmid Substances 0.000 description 46
- 238000003776 cleavage reaction Methods 0.000 description 41
- 230000007017 scission Effects 0.000 description 41
- 108010054576 Deoxyribonuclease EcoRI Proteins 0.000 description 35
- 108020004414 DNA Proteins 0.000 description 34
- 229940024606 amino acid Drugs 0.000 description 23
- 108091008146 restriction endonucleases Proteins 0.000 description 15
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 14
- 238000010276 construction Methods 0.000 description 14
- 238000006243 chemical reaction Methods 0.000 description 11
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 10
- 102000004142 Trypsin Human genes 0.000 description 8
- 108090000631 Trypsin Proteins 0.000 description 8
- 230000029087 digestion Effects 0.000 description 8
- 238000002955 isolation Methods 0.000 description 8
- 229920002401 polyacrylamide Polymers 0.000 description 8
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 8
- 235000018102 proteins Nutrition 0.000 description 8
- 102000004169 proteins and genes Human genes 0.000 description 8
- 239000012588 trypsin Substances 0.000 description 8
- 241000282693 Cercopithecidae Species 0.000 description 7
- ATDGTVJJHBUTRL-UHFFFAOYSA-N cyanogen bromide Chemical compound BrC#N ATDGTVJJHBUTRL-UHFFFAOYSA-N 0.000 description 7
- 238000000338 in vitro Methods 0.000 description 7
- 231100000350 mutagenesis Toxicity 0.000 description 7
- 238000002703 mutagenesis Methods 0.000 description 7
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 6
- 238000010367 cloning Methods 0.000 description 6
- 239000002773 nucleotide Substances 0.000 description 6
- 102000053602 DNA Human genes 0.000 description 5
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 5
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 238000001502 gel electrophoresis Methods 0.000 description 5
- NOESYZHRGYRDHS-UHFFFAOYSA-N insulin Chemical compound N1C(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(NC(=O)CN)C(C)CC)CSSCC(C(NC(CO)C(=O)NC(CC(C)C)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CCC(N)=O)C(=O)NC(CC(C)C)C(=O)NC(CCC(O)=O)C(=O)NC(CC(N)=O)C(=O)NC(CC=2C=CC(O)=CC=2)C(=O)NC(CSSCC(NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2C=CC(O)=CC=2)NC(=O)C(CC(C)C)NC(=O)C(C)NC(=O)C(CCC(O)=O)NC(=O)C(C(C)C)NC(=O)C(CC(C)C)NC(=O)C(CC=2NC=NC=2)NC(=O)C(CO)NC(=O)CNC2=O)C(=O)NCC(=O)NC(CCC(O)=O)C(=O)NC(CCCNC(N)=N)C(=O)NCC(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC=CC=3)C(=O)NC(CC=3C=CC(O)=CC=3)C(=O)NC(C(C)O)C(=O)N3C(CCC3)C(=O)NC(CCCCN)C(=O)NC(C)C(O)=O)C(=O)NC(CC(N)=O)C(O)=O)=O)NC(=O)C(C(C)CC)NC(=O)C(CO)NC(=O)C(C(C)O)NC(=O)C1CSSCC2NC(=O)C(CC(C)C)NC(=O)C(NC(=O)C(CCC(N)=O)NC(=O)C(CC(N)=O)NC(=O)C(NC(=O)C(N)CC=1C=CC=CC=1)C(C)C)CC1=CN=CN1 NOESYZHRGYRDHS-UHFFFAOYSA-N 0.000 description 5
- 125000003729 nucleotide group Chemical group 0.000 description 5
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 4
- 230000001580 bacterial effect Effects 0.000 description 4
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 4
- 229960000310 isoleucine Drugs 0.000 description 4
- 231100000219 mutagenic Toxicity 0.000 description 4
- 230000003505 mutagenic effect Effects 0.000 description 4
- 238000004904 shortening Methods 0.000 description 4
- 102000004190 Enzymes Human genes 0.000 description 3
- 108090000790 Enzymes Proteins 0.000 description 3
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 3
- 239000004471 Glycine Substances 0.000 description 3
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 3
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 3
- 108091081024 Start codon Proteins 0.000 description 3
- 235000003704 aspartic acid Nutrition 0.000 description 3
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 3
- 230000015572 biosynthetic process Effects 0.000 description 3
- 238000005194 fractionation Methods 0.000 description 3
- 230000035772 mutation Effects 0.000 description 3
- 108010066381 preproinsulin Proteins 0.000 description 3
- 238000000746 purification Methods 0.000 description 3
- 230000009466 transformation Effects 0.000 description 3
- 108700010070 Codon Usage Proteins 0.000 description 2
- 102000002464 Galactosidases Human genes 0.000 description 2
- 108010093031 Galactosidases Proteins 0.000 description 2
- 108090001061 Insulin Proteins 0.000 description 2
- 102000004877 Insulin Human genes 0.000 description 2
- COLNVLDHVKWLRT-QMMMGPOBSA-N L-phenylalanine Chemical compound OC(=O)[C@@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- 150000007513 acids Chemical class 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 239000003795 chemical substances by application Substances 0.000 description 2
- QPJDMGCKMHUXFD-UHFFFAOYSA-N cyanogen chloride Chemical compound ClC#N QPJDMGCKMHUXFD-UHFFFAOYSA-N 0.000 description 2
- 239000013613 expression plasmid Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- 125000003630 glycyl group Chemical group [H]N([H])C([H])([H])C(*)=O 0.000 description 2
- 238000010348 incorporation Methods 0.000 description 2
- 238000003780 insertion Methods 0.000 description 2
- 230000037431 insertion Effects 0.000 description 2
- 229940125396 insulin Drugs 0.000 description 2
- 230000010354 integration Effects 0.000 description 2
- -1 poly(amino acid) Polymers 0.000 description 2
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-Amino-3-hydroxypropansäure Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 1
- AUXMWYRZQPIXCC-KNIFDHDWSA-N (2s)-2-amino-4-methylpentanoic acid;(2s)-2-aminopropanoic acid Chemical compound C[C@H](N)C(O)=O.CC(C)C[C@H](N)C(O)=O AUXMWYRZQPIXCC-KNIFDHDWSA-N 0.000 description 1
- MUSGYEMSJUFFHT-UWABRSFTSA-N 2-[(4R,7S,10S,13S,19S,22S,25S,28S,31S,34R)-34-[[(2S,3S)-2-[[(2R)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]-3-methylpentanoyl]amino]-4-[[(2S,3S)-1-amino-3-methyl-1-oxopentan-2-yl]-methylcarbamoyl]-25-(3-amino-3-oxopropyl)-7-(3-carbamimidamidopropyl)-10-(1H-imidazol-5-ylmethyl)-19-(1H-indol-3-ylmethyl)-13,17-dimethyl-28-[(1-methylindol-3-yl)methyl]-6,9,12,15,18,21,24,27,30,33-decaoxo-31-propan-2-yl-1,2-dithia-5,8,11,14,17,20,23,26,29,32-decazacyclopentatriacont-22-yl]acetic acid Chemical group CC[C@H](C)[C@H](NC(=O)[C@H](N)Cc1ccc(O)cc1)C(=O)N[C@H]1CSSC[C@H](NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](Cc2cnc[nH]2)NC(=O)[C@H](C)NC(=O)CN(C)C(=O)[C@H](Cc2c[nH]c3ccccc23)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(N)=O)NC(=O)[C@H](Cc2cn(C)c3ccccc23)NC(=O)[C@@H](NC1=O)C(C)C)C(=O)N(C)[C@@H]([C@@H](C)CC)C(N)=O MUSGYEMSJUFFHT-UWABRSFTSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 101150001836 Adgrb1 gene Proteins 0.000 description 1
- OMLWNBVRVJYMBQ-YUMQZZPRSA-N Arg-Arg Chemical compound NC(N)=NCCC[C@H](N)C(=O)N[C@@H](CCCN=C(N)N)C(O)=O OMLWNBVRVJYMBQ-YUMQZZPRSA-N 0.000 description 1
- UKGGPJNBONZZCM-WDSKDSINSA-N Asp-Pro Chemical compound OC(=O)C[C@H](N)C(=O)N1CCC[C@H]1C(O)=O UKGGPJNBONZZCM-WDSKDSINSA-N 0.000 description 1
- 102000003670 Carboxypeptidase B Human genes 0.000 description 1
- 108090000087 Carboxypeptidase B Proteins 0.000 description 1
- 108091026890 Coding region Proteins 0.000 description 1
- 108091033380 Coding strand Proteins 0.000 description 1
- OABOXRPGTFRBFZ-IMJSIDKUSA-N Cys-Cys Chemical compound SC[C@H](N)C(=O)N[C@@H](CS)C(O)=O OABOXRPGTFRBFZ-IMJSIDKUSA-N 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 241001131785 Escherichia coli HB101 Species 0.000 description 1
- JZDHUJAFXGNDSB-WHFBIAKZSA-N Glu-Ala Chemical compound OC(=O)[C@H](C)NC(=O)[C@@H](N)CCC(O)=O JZDHUJAFXGNDSB-WHFBIAKZSA-N 0.000 description 1
- NWOUBJNMZDDGDT-AVGNSLFASA-N Glu-Leu-His Chemical compound OC(=O)CC[C@H](N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C(O)=O)CC1=CN=CN1 NWOUBJNMZDDGDT-AVGNSLFASA-N 0.000 description 1
- BUEFQXUHTUZXHR-LURJTMIESA-N Gly-Gly-Pro zwitterion Chemical compound NCC(=O)NCC(=O)N1CCC[C@H]1C(O)=O BUEFQXUHTUZXHR-LURJTMIESA-N 0.000 description 1
- WNGHUXFWEWTKAO-YUMQZZPRSA-N Gly-Ser-Leu Chemical compound CC(C)C[C@@H](C(O)=O)NC(=O)[C@H](CO)NC(=O)CN WNGHUXFWEWTKAO-YUMQZZPRSA-N 0.000 description 1
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 1
- ZUKPVRWZDMRIEO-VKHMYHEASA-N L-cysteinylglycine Chemical compound SC[C@H]([NH3+])C(=O)NCC([O-])=O ZUKPVRWZDMRIEO-VKHMYHEASA-N 0.000 description 1
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 1
- RNKSNIBMTUYWSH-YFKPBYRVSA-N L-prolylglycine Chemical compound [O-]C(=O)CNC(=O)[C@@H]1CCC[NH2+]1 RNKSNIBMTUYWSH-YFKPBYRVSA-N 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- XGDCYUQSFDQISZ-BQBZGAKWSA-N Leu-Ser Chemical compound CC(C)C[C@H](N)C(=O)N[C@@H](CO)C(O)=O XGDCYUQSFDQISZ-BQBZGAKWSA-N 0.000 description 1
- NPBGTPKLVJEOBE-IUCAKERBSA-N Lys-Arg Chemical compound NCCCC[C@H](N)C(=O)N[C@H](C(O)=O)CCCNC(N)=N NPBGTPKLVJEOBE-IUCAKERBSA-N 0.000 description 1
- NDYNTQWSJLPEMK-WDSKDSINSA-N Met-Cys Chemical compound CSCC[C@H](N)C(=O)N[C@@H](CS)C(O)=O NDYNTQWSJLPEMK-WDSKDSINSA-N 0.000 description 1
- 108010079364 N-glycylalanine Proteins 0.000 description 1
- 108091034117 Oligonucleotide Proteins 0.000 description 1
- 102000035195 Peptidases Human genes 0.000 description 1
- 108091005804 Peptidases Proteins 0.000 description 1
- GKZIWHRNKRBEOH-HOTGVXAUSA-N Phe-Phe Chemical compound C([C@H]([NH3+])C(=O)N[C@@H](CC=1C=CC=CC=1)C([O-])=O)C1=CC=CC=C1 GKZIWHRNKRBEOH-HOTGVXAUSA-N 0.000 description 1
- 239000004365 Protease Substances 0.000 description 1
- FIXILCYTSAUERA-FXQIFTODSA-N Ser-Ala-Arg Chemical compound [H]N[C@@H](CO)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(O)=O FIXILCYTSAUERA-FXQIFTODSA-N 0.000 description 1
- SOACHCFYJMCMHC-BWBBJGPYSA-N Ser-Thr-Cys Chemical compound C[C@H]([C@@H](C(=O)N[C@@H](CS)C(=O)O)NC(=O)[C@H](CO)N)O SOACHCFYJMCMHC-BWBBJGPYSA-N 0.000 description 1
- PQLXHSACXPGWPD-GSSVUCPTSA-N Thr-Asn-Thr Chemical compound [H]N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H]([C@@H](C)O)C(O)=O PQLXHSACXPGWPD-GSSVUCPTSA-N 0.000 description 1
- GXDLGHLJTHMDII-WISUUJSJSA-N Thr-Ser Chemical compound C[C@@H](O)[C@H](N)C(=O)N[C@@H](CO)C(O)=O GXDLGHLJTHMDII-WISUUJSJSA-N 0.000 description 1
- VNYDHJARLHNEGA-RYUDHWBXSA-N Tyr-Pro Chemical compound C([C@H](N)C(=O)N1[C@@H](CCC1)C(O)=O)C1=CC=C(O)C=C1 VNYDHJARLHNEGA-RYUDHWBXSA-N 0.000 description 1
- 108091023045 Untranslated Region Proteins 0.000 description 1
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 235000004279 alanine Nutrition 0.000 description 1
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 1
- 229960000723 ampicillin Drugs 0.000 description 1
- 238000004458 analytical method Methods 0.000 description 1
- 108010068380 arginylarginine Proteins 0.000 description 1
- 108010093581 aspartyl-proline Proteins 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 229960003773 calcitonin (salmon synthetic) Drugs 0.000 description 1
- 230000001413 cellular effect Effects 0.000 description 1
- 238000005119 centrifugation Methods 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 108010004073 cysteinylcysteine Proteins 0.000 description 1
- 108010016616 cysteinylglycine Proteins 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 238000001962 electrophoresis Methods 0.000 description 1
- 230000008030 elimination Effects 0.000 description 1
- 238000003379 elimination reaction Methods 0.000 description 1
- 230000002349 favourable effect Effects 0.000 description 1
- 230000002068 genetic effect Effects 0.000 description 1
- 108010049041 glutamylalanine Proteins 0.000 description 1
- VPZXBVLAVMBEQI-UHFFFAOYSA-N glycyl-DL-alpha-alanine Natural products OC(=O)C(C)NC(=O)CN VPZXBVLAVMBEQI-UHFFFAOYSA-N 0.000 description 1
- 108010051307 glycyl-glycyl-proline Proteins 0.000 description 1
- 230000006872 improvement Effects 0.000 description 1
- 239000000411 inducer Substances 0.000 description 1
- 230000006698 induction Effects 0.000 description 1
- 239000004026 insulin derivative Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 1
- 239000000203 mixture Substances 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 238000010369 molecular cloning Methods 0.000 description 1
- 239000005445 natural material Substances 0.000 description 1
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 1
- 108010073025 phenylalanylphenylalanine Proteins 0.000 description 1
- 238000006116 polymerization reaction Methods 0.000 description 1
- 238000007639 printing Methods 0.000 description 1
- 108010029020 prolylglycine Proteins 0.000 description 1
- 230000009467 reduction Effects 0.000 description 1
- 108010068072 salmon calcitonin Proteins 0.000 description 1
- 238000012163 sequencing technique Methods 0.000 description 1
- 238000010561 standard procedure Methods 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000003786 synthesis reaction Methods 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 108010020532 tyrosyl-proline Proteins 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N15/00—Mutation or genetic engineering; DNA or RNA concerning genetic engineering, vectors, e.g. plasmids, or their isolation, preparation or purification; Use of hosts therefor
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/585—Calcitonins
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/435—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from animals; from humans
- C07K14/575—Hormones
- C07K14/62—Insulins
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12N—MICROORGANISMS OR ENZYMES; COMPOSITIONS THEREOF; PROPAGATING, PRESERVING, OR MAINTAINING MICROORGANISMS; MUTATION OR GENETIC ENGINEERING; CULTURE MEDIA
- C12N9/00—Enzymes; Proenzymes; Compositions thereof; Processes for preparing, activating, inhibiting, separating or purifying enzymes
- C12N9/14—Hydrolases (3)
- C12N9/24—Hydrolases (3) acting on glycosyl compounds (3.2)
- C12N9/2402—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1)
- C12N9/2468—Hydrolases (3) acting on glycosyl compounds (3.2) hydrolysing O- and S- glycosyl compounds (3.2.1) acting on beta-galactose-glycoside bonds, e.g. carrageenases (3.2.1.83; 3.2.1.157); beta-agarase (3.2.1.81)
- C12N9/2471—Beta-galactosidase (3.2.1.23), i.e. exo-(1-->4)-beta-D-galactanase
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Y—ENZYMES
- C12Y302/00—Hydrolases acting on glycosyl compounds, i.e. glycosylases (3.2)
- C12Y302/01—Glycosidases, i.e. enzymes hydrolysing O- and S-glycosyl compounds (3.2.1)
- C12Y302/01023—Beta-galactosidase (3.2.1.23), i.e. exo-(1-->4)-beta-D-galactanase
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/61—Fusion polypeptide containing an enzyme fusion for detection (lacZ, luciferase)
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K2319/00—Fusion polypeptide
- C07K2319/70—Fusion polypeptide containing domain for protein-protein interaction
- C07K2319/74—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor
- C07K2319/75—Fusion polypeptide containing domain for protein-protein interaction containing a fusion for binding to a cell surface receptor containing a fusion for activation of a cell surface receptor, e.g. thrombopoeitin, NPY and other peptide hormones
Landscapes
- Health & Medical Sciences (AREA)
- Chemical & Material Sciences (AREA)
- Life Sciences & Earth Sciences (AREA)
- Organic Chemistry (AREA)
- Genetics & Genomics (AREA)
- Zoology (AREA)
- Engineering & Computer Science (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Health & Medical Sciences (AREA)
- Biochemistry (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- General Engineering & Computer Science (AREA)
- Biomedical Technology (AREA)
- Biotechnology (AREA)
- Medicinal Chemistry (AREA)
- Biophysics (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- Microbiology (AREA)
- Endocrinology (AREA)
- Toxicology (AREA)
- Gastroenterology & Hepatology (AREA)
- Diabetes (AREA)
- Physics & Mathematics (AREA)
- Plant Pathology (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Peptides Or Proteins (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Enzymes And Modification Thereof (AREA)
Description
b 4 Form COMMONWEALTH OF AUSTRALIA PATENTS ACT 1952-69 COMPLETE SPECIFICATION
-IRIGINAL)
class Application Number: Lodged.
Int. Class Co mplete Specification Lodged: Accepted: Published: N)iority: a IRated Art.
I his dmismnent conl'liins th1 iendnients made unde.r Section 4-9and con Ixt ftu 1 Printing.
Name of Applicant. HOECH4ST AKTIIENGESELLSCHAFT Addressof Applicant: 45 Bruningstrasse, D-6230 Frankfurt/Main 80, Federal Republic of Germany SDActual Inventor: Address for Service WALDEMAR WETEKAM, HANS WILLI JANSEN AND WOLFGkNG BECKER EDWD. WATERS SONS, 50 QUEEN STREET, MELBOURNE, AUSTRALIA, 3000.
Complete Specification for the invention entitled: GENETIC ENGINEERING PROCESS FOR THE PREPARATION OF POLYP EPT IDES The following statement is a full description of this invention, including the best method of performing it known to,. us To the Oommies±Qnor or Patanto AKTIENGESELLSI' 1AFT PAT 510 Prokuriot Authorized Bigna* y ppa. Ilenbruck i.V. Lapice VOECHST AKTIENGESELLSCHAFT HOE 87/F 110J Dr.KL/mu Specification Genetic engine&ring process for the preparation of polypeptides In the preparation, by genetic manipulation, of polypeptides in bacteria, the structural gene for the desired polypeptide is frequently coupled in the reading frame to the gene for the polypeptide B-galactosidase which is intrinsic to the bacteria. The bacterium then produces a S fusion protein in which the amino terminus of the desired c oe, .polypeptide is bonded to the carboxyl terminus of the B- *galactosidase.
o* j 0" In the chemical or enzymatic elimination of the -galactosidase portion the latter is broken down into many fragments. This results in fragments of the B-galactosidase which, in respect of the properties relevant for fraction- 20 ation, such as molecular weight, are similar to the desired protein. Thus, this makes the working up and isolation of the desired polypeptide very difficult, and re- Sduces the yield. There is a description, in Broker, Gene Anal. Techn. 3 (1986) 53-57, of how fusion proteins having shortened B-galactosidase portions can be prepared. It is mentioned as an advantage of these fusion proteins that cleavage with cyanogen bromide yields fewer cleavage products, which facilitates purification. Although this reduces the number of cleavage products, it does not eliminate the abovementioned disadvantage because this known process also results in fragments which are very similar to the desired product in the properties relevant for fractionation.
The disadvantages of the known processes are avoided according to the invention by replacing, in whole or in part, codons for methionine and/or arginine and/or cysteine in the gene for 8-gaactosidase, or for a fragment of B-galactosidase, by cadons of other amino acids.
_I
In this way it is possible to "tailor" the fragments of the B-galactosidase portion in such a way that there are no problems with the removal of the desired polypeptide.
Hence the invention relates to a process for the prepara- 1i tion of a genetically codable polypeptide, with the structural gene for this polypeptide being coupled in the correct reading frame via the gene for B-gaLactosidase, or a fragment of B-galactosidase, to a regulator region, introduction of this gene structure into a bacterium, expression therein of an insoluble fusion protein, isolation thereof after cell disruption, and the desired polypeptide being obtained by chemical or enzymatic cleavage, which comprises codons for methionine and/or arginine and/or 15 cysteine in the gene for B-galactosidase, or for the fragment of B-galactosidase,.being replaced, in whole or in i part, by codons of other amino acids.
Further aspects of the invention and its preferred embodi- S. 20 ments are explained hereinafter and defined in the patent 4 4 claims.
t 44 It has been found that constructions having a B-galactosidase fragment with more than 250 amino acids, but significantly less than the total sequence of B-galactosidase, result in insoluble fusion proteins which can easily be isolated. This B-galactosidase portion advantageously has about 300 to about 800, preferably about 320 to about 650, amino acids and contains an amino-terminal and/or carboxylterminal portion of B-galactosidase.
The gene for B-galactosidase, or the B-galactosidase fragment, is bonded in the correct reading frame to a regulator region and, if necessary via an adaptor, to the structural gene for the desired polypeptide.
This adaptor, which can also be omitted where appropriate, advantageously codes for an amino acid (or a group of amino acids) which is Located in front of the amino i Is -3terminus of the desired polypeptide and permits the easy chemical or enzymatic separation thereof from the Bgalactosidase portion. If, for example, the desired polypeptide contains no methionine or has been modified by genetic manipulation in such a way that it contains no i methionine, then it is advantageous to choose as amino acid in front of the amino terminus of the desired polypeptide methionine, after which the desired polypeptide can be separated from the B-galactosidase portion by cleavage witn cyanogen chloride or cyanogen bromide. If the desired polypeptide is not inactivated by trypsin, then it t is possible to program the amino acid arginine in front of the amino terminus of the desired polypeptide, after which the desired polypeptide is obtained by trypsin cleavage.
S. 15 Of course, it is also possible in this case to eliminate any trypsin cleavage site present in the desired poly- S peptide by genetic manipulation.
Shortened 0-galactosidase constructions are generally preferred because the capacity of the host cell for the production of the foreign protein is limited, and thus a larger portion of the fusion protein is available for the desired protein. Hence, this by itself results in a crucial improvement in the yield. Another advantage is that not only can the insoluble fusion proteins be readily isolated after cell disruption, they are also not degraded to a noteworthy extent by proteases intrinsic to the host.
A Longer induction period is thus possible, which results in greater accumulation of the foreign protein in the bacterium. Furthermore, owing to the easier working up, there are lower losses of yield on isolation of the desired protein so that, overall, significantly higher yields are obtained than result by the known processes.
The figures illustrate the preferred embodiment of the invention with shortened 8-galactosidase constructions, no account being taken of the extent to which there has been a replacement of "undesired" by "desired" codons.
'P
-4- Fig. 1 shows constructions of shortened 0-galactosidase sequences without (A 1. and with (B 1. Linkers.
Fig. 2 shows the construction of the plasmid pWZRI (a derivative of the plasmid pBR 322 with B-galactosidase fragments from the plasmids pUC 9 and pUR 270).
Figs. 3A 3D show the construction of the plasmid pWI 6 having monkey proinsulin DNA.
Fig. 4 shows the construction of the plasmid pWZIP dMdC A 2 from Fig. 1.
o* 15 Fig. 5 shows the construction of the plasmid pWZPWB1 dMdC from pWI 6 (Fig. 3D) and pWZIP dMdC (Fig. pWZPWB1 S°dMdC contains a polylinker (MCS) which permits the insertion of genes for expression of the desired polypeptides.
20 The regulator region can be natural, especially intrinsic t to the bacteria, chemically synthesized or a hybrid region, for example of the fusion promoter tac. The regulator regions additionally contain an operator, for example the Lac operator, and 6 to 14 nucleotides upstream of the methionine codon of the B-galactosidase fragments a ribosome binding site.
The S-galactosidase fragment advantageously comprises a fusion of an amino-terminal and carboxyl-terminal partsequence. This results in a considerable reduction in the B-galactosidase portion in the fusion protein. In addition, this construction allows the replacement of various regulator systems without special effort. However, it is also possible to use exclusively amino-terminal or predominantly carboxyl-terminal sequences of B-galactosidase.
It is possible to use natural restriction enzyme cleavage sites for these shortened B-galactosidase constructions.
However, it is also possible to employ constructions having chemically synthesized linkers or adaptors which i 9 guarantee a correct reading frame free of stop codons and, where appropriate, contain an ATG start codon. Figure 1 A shows B-galactosidase fragments in which natural restriction enzyme cleavage sites have been used, and Figure 1 B shows the like where a linker is employed.
Further modifications of the B-galactosidase fragment may prove advantageous, singly or in combination, in particular cases.
In the case of polypeptides which are to be separated from the B-galactosidase fragment by cleavage with 4, cyanogen chloride or cyanogen bromide, the purification of the former is facilitated if, by targeted in vitro mutagenesis, some, or advantageously all, of the codons of the interfering methionine residues are replaced by codons for other amino acids, preferably leucine or isoleucine.
In a fusion protein modified in this way, the result of cleavage is, besides the desired polypeptide, a reduced number of B-galactosidase cleavage fragments which can be ties chosen such that they can easily be separated from the desired polypeptide on the basis of their size and/or charge.
In the case of polypeptides which are to be separated from the B-galactosidase fragment by acid cleavage of the polypeptide bond between aspartic acid and proline, it may be advantageous to modify the appropriate codons in the BgaLactosidase fragment by targeted in vitro mutagenesis in such a way that acid cleavage is no longer possible at these points, preferably by converting the codon for aspartic acid into a codon for glutamic acid.
In the case of polypeptides which are to be separated from the B-galactosidase fragment by cleavage with trypsin, it is possible to modify, by targeted in vitro mutagenesis, codons for arginine and/or Lysine in such a way that the
I
6 8-galactosidase fragments resulting after cleavage with trypsin can easily be separated from the desired polypeptide on the basis of their size and/or charge.
In the case of potypeptides which contain no cysteine, it is possible to insert a codon for the amino acid cysteine between the DNA coding for the S-galactosidase fragment and the DNA coding for the desired polypeptide. It is then possible, by subsequent specific S-cyanylation, to cleave the desired polypeptide off from the 0-galactosidase portion.
0 0 In the case of polypeptides for which the formation of di- 0t. o 15 generally proves beneficial to convert the codons for cysoo teine which are.present in the 0-galactosidase fragment, o j by targeted in vitro mutagenesis, into codons for other o amino acids, preferably serine, in order in this way to avoid any possible formation of wronq disulfide bridges 0o 20 between the 8-galactosidase fragment and the polypeptide.
too# The adaptor between the 8-galactosidase fragment and the structural gene for the desired polypeptide, which adaptor can be omitted in favorable cases, codes immediately in front of the amino terminus of the desired polypeptide for an amino acid or a sequence of amino acids which allow S' easy separation of the desired polypeptide from the 8galactosidase portion. As already mentioned, this amino acid can be methionine, which allows straightforward cleavage with cyanogen bromide, as long as the desired polypeptide contains no methionine or the corresponding codons have been modified by genetic manipulation. One example of an adaptor of this type has the following nucleotide sequence: AAT TAT GAA TTC GCA ATG (Eco RI) TA CTT AAG CGT TAC One example of an adaptor which in the reading frame -7encodes a trypsin cleavage site has the following sequence: AAT TAT GAA TTC GCA AGA (Eco RI) TA CTT AAG CGT TCT An additional facilitation of the various modalities of cleavage off chemical or enzymatic can be achieved by steric separation of the desired polypeptide from the 8galactosidase portion. For this purpose, the codons for a poly(amino acid) are inserted, via a special chemically synthesized adaptor, between the B-galactosidase portion ne, and the polypeptide. In general, it is possible to use as oo amino acids, in view of the different structuring of this poly(dmino acid) "arm", all genetically codable amino I i 15 acids, for example small uncharged amino acids such as glycine, alanine, serine or proline, or charged amino acids o such as aspartic acid and glutamic acid on the one hand, or lysine and arginine on the other hand. Inc poly(amino acid) chain expediently encompasses 5 to 30, preferably 20 10 to 24, in particular 15 to 20, amino acids. Depending on the choice of the poly(amino acids), it is possible to achieve direct folding-back of the desired gene product in conjunction with the 8-galactosidase portion.
The structural gene for the desired polypeptide can be obtained in a manner known per se from natural sources or chemically synthesized. Reference may be made to EP-B 0,032,675 as an example of the isolation of a gene from natural material. Chemically synthesized genes suited to the specific codon usage in the host cell, such as described in, for example, German Oftenlegungsschriften 3,327,007 (derivatives of growth hormone releasing factor), 3,328,793 (derivative of secretin), 3,409,966 (human y-interferon), 3,414,831 (derivatives of human yinterferon), 3,419,995 (interleukin-2 and derivatives) and 3,429,430 (hirudin derivatives) or proposed in (not prepublished) German Offenlegungsschrift 3,632,037 (calcitonin), are advantageous.
8 The incorporation of the gene structure composed of regulator region, 8-galactosidase gene fragment, where appropriate adaptor and structural gene into a suitable vector, the introduction of the hybrid vector obtained in this way into a suitable host cell, the cultivation of the host cells, the cell disruption, the isolation and cleavage of the fusion protein, and the isolation of the desired polypeptide are generally known. Reference may be made to the widely available textbooks and handbooks for this purpose.
Preferred vectors are plasmids, in particular the plasmids compatible with E. coli, such as pBR 322, pBR 325, pUC 8 00 and pUC 9, as well as other commercially available or geno ,o erally accessible plasmids. The preferred bacterial host 0000 0ooo 15 is E. coLi.
0 00 The invention is explained in detail in the examples which S follow.
20 Example 1 20 pg of the commercially available plasmid pUC 9 (cf.
Vieira et al., Gene 19 (1982) 259 268; The Molecular Biology Catalogue, Pharmacia P-L Biochemicals, 1984, appendix, p. 40)are subjected to double digestion with the restriction endonucleases Eco RI and Pvu I and a DNA fragment 123 base-pairs (Bp) in length is separated out by gel electrophoresis. This fragment encompasses part of the amino-terminal coding sequence of B-galactosidase.
To isolate the carboxyl-terminal portion of the 8-galactosidase gene up to the natural Eco RI cleavage site, 20 ig of the plasmid pUR 270 (RUther and MUller-Hill, EMBO J. 2 (1983) 1791-1794) are initially digested with Eco RI and then subjected to partial digestion with -the enzyme Pvu I.
A DNA fragment of 2895 Bp is separated out by electrophoresis on a 5% polyacrylamide gel and is isolated.
The amino-terminal and carboxyl-terminal DNA fragments of I I l IL 1 9 the 8-gaLactosidase gene are ligated together over the course of 6 hours at 16 0 C, and the ligation product is precipitated with ethanol. The precipitated and resuspended DNA is cut with Eco RI and is then fractionated again on a 5% polyacrylamide gel. The DNA fragment 3018 Bp in length is isolated from the gel by electroelution, and is Ligated into the Eco RI cleavage site of the plasmid pBR 322. The hybrid plasmid obtained in this way is called pWZ RI.
The reaction steps described above are shown in Figure 2.
The individual measures were carried out in a known manner (Maniatis et at., Molecular Cloning, Cold Spring Harbor 1982).
t The plasmid pWZ RI is transformed into E. coli, and is .amplified there and re-isolated. It is possible, by digestion with Eco RI, to cleave out the B-galactosidase gene fragment which is shortened at the amino and carboxyl termini, and to isolate it preparatively. The known restriction enzyme cleavage sites can be used for shortening the Sconstruction further and for insertion into suitable expression plasmids. Figure 1 A shows these types of shortening.
Figure 1 B shows constructions having a chemically synthesized linker.
Both the constructions in Figure 1 A and those in Figure 1 B are chosen such that the reading frame for the shortened B-galactosidase is directly joined to the reading frame for the desired carboxyl-terminal polypeptide. The chemically synthesized linkers in Figure 1 B can have any desired form but, of course, must guarantee, where appropriate, an ATG start codon and, of course, a reading frame without a stop codon, and have the desired restriction enzyme cleavage sites.
10 Example 2 A fusion protein composed of monkey proinsulin and a shortened and modified B-galactosidase can be obtained as foL- Lows: ug of the plasmid pWZ RI (Fig. 2) are cut with the restriction enzymes Eco RI and Pvu I and fractionated on a polyacrylamide gel. DNA fragments 123 and 1222 Bp in Length are isolated. Equimolar amounts of the two DNA fragments are subsequently Ligated together at 10°C for 6 hours, and are then digested with Eco RI. The Ligation mixture obtained in this way is fractionated on a polyacrylamide gel, and the DNA band having a Length of 15 1345 Bp is preparatively isolated. This DNA fragment is Ligated into the vector pBR 322 which has been opened with Eco RI and subsequently dephosphorylated. The plasmid obtained in this way is called pWZP RI.
To remove the 8 codons for methionine which pWZP RI contains (M1-M8) and the 6 codons for cysteine which are pre- 4. sent in this plasmid (C1-C6), 1 yg of DNA from pWZP RI is cleaved with Eco RI. The fragment 1345 Bp in size is ligated into the phage vector M13mpl9am(Patschinsky et al., J. Virol. 59 (1986) 341-353) which has been opened with Eco RI and subsequently dephosphorylated. The phage obtained after transfection of E. coli JM101 is cal- Led MWZPam. As a first step for removing the codons for methionine, targeted in vitro mutagenesis by the gapped duplex method (Kramer et al., Nucl. Acids Res. 12 (1984) 9441-9456) is carried out, with, because of the high efficiency of this method (70% on average), the 4 oligonucleotides dM5-dM8 (Tab. 1) being used as mutagenic primers.
The ssDNA of 12 of the resulting phages is sequenced, with, besides the normal 17mer primer, use also being made of dM7 and dC5 (Tab. 1) as primers for the sequencing. 2 of the 12 DNAs have all 4 desired mutations, i.e. the codons for M5-M8 have been altered to codons for isoleucine. These phages are called MWZP dM5,8. To remove the 11 remaining methionine codons, the RF DNA of MWZP dMS,8 is cleaved with Eco RI, and the DNA fragment 1345 Bp in size is cloned into dephosphorylated M13mpl9am. The phage obtained in this way is called MWZP dM5,8am. The ssDNA of this phage is subjected to another in vitro mutagenesis with dMl-dM4 as mutagenic primers. The ssDNA from 12 of the resulting phages is sequenced, and 3 of the 12 phages have all the desired mutations, i.e. the codons for M1-M3 have been altered to codons for leucine, and the codon for M4 has been altered to a codon for isoleucine. These phages are called MWZPdM. The plasmid pWZP dM is obtained by cloning the 1345 Bp Eco RI fragment from the RF form of o"oS° these phages into the dephosphorylated vector pBR 322.
B0 0 oo For the additional conversion of the codons for cysteine o 15 into codons for serine, the 1345 Bp Eco RI fragment from S0"" pWZP dM is isolated and cloned into the dephosphorylated I 4 n phage vector M13mpl9am. The ssDNA of the phage MWZP dMam 0 obtained in this way is subjected to in vitro mutagenesis with dC1-dc6 as mutagenic primers. The ssDNA of 24 of the resulting phages is sequenced, with 4 of the phage clones, which are called MWZP dMdC, proving to be the correct ones in which all the codons for cysteine have been converted into condons for serine. The RF DNA of these phages is cleaved with Eco RI, anc the 1345 Bp Eco RI fragment is cloned into the dephosphorylated vector pBR 322, resulting in the plasmid pWZP dMdC.
The fragment of the B-galactosidase gene which is obtained in this way and is flanked by Eco RI cleavage sites can be integrated as follows into a plasmid which connects together a bacterial regulator region and the gene for monkey proinsulin via an Eco RI cleavage site: pg of the plasmid pBR 322 are digested with the restriction endonucleases Eco RI and Pvu II, and subsequently a fill-in reaction is carried out with Klenow polymerase at the Eco RI cleavage site. Following fractionation by gel electrophoresis in a 5% polyacrylamide gel the plasmid fragment 2293 Bp in length can be obtained by i 12 electroelution (Figure 3 A).
Monkey preproinsulin DNA (Tab. 2; Wetekam et al., Gene 19 (1982) 179 183) integrated in the plasmid pBR 322 is isolated from the latter by digestion with the restriction endonucleases Hind III and Fsp I and is recloned into the plasmid pUC 9 as follows: the plasmid pUC 9 is cleaved with the enzyme Bam HI, a standard fill-in reaction with KLenow polymerase (large fragment) is carried out on the cleavage site, then cutting with the restriction enzyme Hind III is carried out, and the DNA is separated by gel electrophoresis in a 5% polyacrylamide gel from the other 1 DNA fragments. It was possible to integrate the isolated So, insulin DNA fragment about 1250 Bp in length into the S 15 opened plasmid. To remove the untranslated region and the ,Pit presequence, digestion is carried out with Hae III, and the fragment 143 Bp in length is digested under limiting enzyme conditions with Bal 31 to cleave off the last two nucleotides from the presequence. There is obtained in this way TTT as the first codon at the amino terminus, which represents phenylalanine as the first amino acid of the B chain.
An adaptor specific for Eco RI is now ligated onto this fragment in a blunt-end ligation reaction: a) 5' AAT TAT GAA TTC GCA GGA GGC GGG GGT GGC GGT GGG (Eco RI) TA CTT AAG CGT CCT CCG CCC CCA CCG CCA CCC GGC GGA GGT GGT GGC GGT GGA GGC GGT GGA GGC GGG CCG CCT CCA CCA CCG CCA CCT CCG CCA CCT CCG CCC GGT ATG CCA TAC i i i i -13 b) 5' AAT TAT GAA TTC GCA GGA GGC GGG GGT GGC GGT GGG (Eco RI) TA CTT AAG CGT CCT CCG CCC CCA CCG CCA CCC GGC GGA GGT GGT GGC GGT GGA GGC GGT GGA GGC GGG CCG CCT CCA CCA CCG CCA CCT CCG CCA CCT CCG CCC GGT AGA CCA TCT In order to prevent polymerization of the adaptors, they were used unphosphorylated in the ligation reaction. The adaptor a) has at the end a codon for methionine, and the adaptor b) has the codon for arginine. Thus, the gene product obtained by variant a) permits removal of the bacterial portion by cleavage with cyanogen bromide, whereas variant b) allows trypsin cleavage The ligation product is digested with Mbo II. Fractionation by gel electrophoresis results in a DNA 'fragment which is 1J9 Bp in length and has the information for amino acids Nos. 1 to 21 of the B chain.
The gene for the remaining information of the proinsulin molecule (including a G-C sequence from the cloning, and 21 Bp from pBR 322 following the stop codon) is obtained from the pUC 9 plasmid with the complete monkey preproinsulin information by digestion with Mbo II/Sma I and isolation of a DNA fragment about 240 Bp in length. The correct ligation product of about 380 Bp in length (including the adaptor of 78 Bp) is obtained by ligation of the two proinsulin fragments. The proinsulin DNA fragment constructed in this way can now be ligated together with a regulator region via the Eco RI-negative cleavage site.
The entire sequence of reactions is shown in Fig. 3 B, in which A, B and C denote the DNA for the relevant peptide 14 chains of the proinsulin molecule, Ad denotes the (dephosphorylated) adaptor (a or b) and Prae is the DNA for the presequence of monkey preproinsulin.
Example 3 A chemically synthesized regulator region composed of a recognition sequence for Bam HI, the Lac operator, a bacterial promoter and a ribosome binding site (RB) with an ATG start codon 6 to 14 nucLeotides away from the RB, and a subsequent recognition sequence for Eco RI (Figure 3 C) is Ligated via the common Eco RI overlap region with the proinsulin gene fragment obtained in the previous example.
Following a double digestion with Sma I/ Bai1 HI and a 15 fill-in reaction of the Bam HI cleavage site with the list KLenow fragment, the ligatinn product (about 480 Bp) is isolated by gel electrophoresis.
The fragment obtained in this way can subsequently be Ligated, via a blunt-end Ligation, into the pBR 322 partplasmid shown in Figure 3 A (Figure 3 The hybrid plasmid pWI 6 is obtained.
Following transformation into the E. coli strain HB 101 ano selection on ampicillin plates, the plasmid DNA of individual clones was tested for the integration of a 480 Bp fragment having the regulator region and the Bal 31-shortened proinsulin gene. To verify the correct shortening of the proinsulin gene by Bat 31 (Figure 3 the plasmids having the integrated proinsulin fragment were sequenced starting from the Eco RI cleavage site.
Three of 60 sequenced clones had the desired shortening of two nucleotides (Figure 3 D).
The hybrid plasmid pWI6 is now used as starting material for integrating the B-galactosidase gene fragments depicted in Figures 1A and B. This reaction is presented here, by way of example, by cloning of the shortened 15 B-galactosidase gene which is detailed in Figure 1A under 2.
and in which, in addition, all the codons for methionine and cysteine have been replaced by codons for other amino acids: equimotar amounts of the shortened and modified B-galactosidase gene sequence (composed of about 120 Bp of the aminoterminal region and 1220 Bp of the carboxyl-terminal region, flanked by 2 Eco RI cleavage sites) and of the hybrid plasmid pWI 6 cleaved with Eco RI (Figure 3 D) are ligated (Figure Following transformation into indicator bacteria with a B-galactosidase A M15 deletion (The Lactose Operon, Ed. J. Beckwith and D. Zipser, Cold Spring Harbor, 1970) the only colonies of bacteria to react to the indicator dye X-gal (5-bromo-4-chloro-3-indolyl-B-D-galactoo side) by a blue coloration are those which have integrated 15 the B-galactosidase gene fragment in the correct orientaoo tion. Any integration of several shortened 8-galactosidase genes into a plasmid can be detected by standard analysis with restriction enzymes.
The plasmid pWZIP dMdC obtained in this way was induced with IPTG (isopropyl-B-D-thiogalactopyranoside), the inducer for B-galactosidase synthesis, and then tested for its capacity to form a fusion product of B-galactosidase protein and p'oinsulin. The contribution of this product to the total cellular protein is about 15 to 20%. The product is insoluble and can easily be separated from the other cell constituents and proteins by centrifugation.
The other shortened B-galactosidase genes detailed in Figures 1 A and B can be integrated in the same way into the plasmid pWI 6 (Figure 3 D) and, in E. coli, have similar synthetic capacities, with insoluble fusion proteins being obtained.
Depending on the choice of the adaptor a or b, either the proinsulin is cleaved off from the fusion protein with cyanogen bromide, or else, after digestion with trypsin, the insulin derivative B 31 Arg is obtained, from which the arginine is removed by enzymatic cleavage with 16 carboxypeptidase B. The proinsulin or insulin liberated in this way can be purified by standard methods.
Example 4 Plasmids such as pWI 6 which are used as the starting construction for preparing fusion expression plasmids with proinsulin and B-galactosidase gene fragments can advantageously be made utilizable for the expression of other gene products. For this purpose, 10 pg of the plasmid pWI 6 are cut with the restriction enzymes Eco RI and Pvu II (Fig. The opened and shortened plasmid is separated from *he proinsulin gene fragments (324 Bp in length) on a polyacrylamide gel. Following preparative isolation of the plasmid fragment from the polyacrylamide gel, it is possible to ligate into a plasmid which has been opened in this way a chemically synthesized DNA sequence which, apart from the Pst I cleavage sjite, has a multiplicity of i unique restriction enzyme cleavage sites: (Eco RI) SstI SmaI BamHI XbaI Sall PstI 3' AA TTC GAG CTC GCC CGG GGA TCC TCT AGA GTC GAC CTG CAG 5' G CTC GAG CGG GCC CGT AGG AGA TCT CAG CTG GAC GTC HindIII Nrul AvaIII BglII Ncol SphI 3' CCC AAG CTT CGC GAT GCA TCA GAT CTA CCA TGG CAT GCC GGG TTC GAA GCG CTA CGT AGT CTA GAT GGT ACC GTA CGG There is obtained in this way the plasmid pWB-1 (Fig. which, besides a regulator region, also contains a DNA sequence which is suitable, with its multiple unique restriction enzyme cleavage sites, for the cloning of various genes of natural and chemical synthetic origin. In analogy to Example 2, a chemically synthesized adaptor with DNA for a poly(amino acid) sequence and an appropriate cleavage 17 site can be inserted upstream of each gene. Between the regulator region and the multiple cloning site (MCS) there is a unique restriction enzyme cleavage site for Eco RI, into which each shortened B-galactosidase gene with the various mutation modalities can be Ligated, in analogy to Example 3.
2 pg of DNA of the plasmid pWB-1 are cleaved with Eco RI and subsequently dephosphorylated. The shortened and modified B-galactosidase gene fragment which has been obtained from pWZIP dMdC by cleavage with Eco RI is ligated into the vector treated in this way. Transformation of E. coli cells and selection for expression results in the plasmid ,o pWZPWB1 dMdC which contains, downstream of the shortened 15 and modified B-galactosidase gene, the multiple cloning site described above (Fig. Example As an example of the incorporation of a chemically synthesized gene, the preparation of a fusion protein having the amino acid sequence of salmon calcitonin to which a carboxyl-terminal glycine has been attached is described hereinafter.
The chemically synthesized DNA sequence for calcitonin which is shown in Table 3, whose nucleotide sequence is specifically suited to the codon usage of E. coli and which has at the 5' end a Bam HI "protrusion" and, up- 30 stream of the codon for the first amino acid, an adaptor i coding for 15 glycine residues and a methionine codon, and i at the 3' end following the proline codon a glycine codon, Stwo stop codons and an Sph I "protrusion", is cloned into the plasmid pWZPWB1 dMdC which has been opened with the restriction enzymes Bam HI and Sph 1. The plasmid pWZP dMdC catc. is obtained.
This plasmid can be transformed in a known manner into E. coli, and the corresponding fusion protein is expressed j 1 L- 18 there and isolated in analogy to the proinsulin fusion protein. The galactosidase portion is cLeaved off with cyanogen bromide, and the subsequent purification is carried out by known methods.
Table 1 Mutagenic primers for the conversion of codons for methionine into codons for leucine or isoleucine and for the conversion of codons for cysteine into codons for serine: dM1 AG ACC GTT CAG ACA GAA CTG G dM2 AG CGC CAC CAG CCA GTG CAG G dM3 GCA AAA ATC CAG TTC GCT GGT G dM4 C GCC AAT CCA TAT CTG TGA AA C GGT AAT CGC AAT TTG ACC AC dM6 A CGG GGT ATA GAT GTC TGA CA dM7 G GCT GGT TTC AAT CAG TTG CT dM8 C ACC AAT CCC TAT ATG GAA ACC dC1 G ACC GTT CAG AGA GAA CTG GCG dC2 CAG CTC GAT GGA AAA ATC CAG TTC dC3 ATC TGC CGT GGA CTG CAA CAA dC4 CGC CAG CTG GGA GTT CAG GCC GCG CTC AAA AGA GGC GGC AGT dC6 GCG CGT CCC GGA GCG CAG ACC 19 TabLe 2 AAT TCT GCA AGA 3' GA CGT TCT (Asn)Ser Ala Arg (EcoRI)
BEI
TTT
AAA
Phe
TAC
ATG
Tyr
GTG
CAC
Val
AAC
TTG
Asn CTG GTG GAC CAC Leu Vai B stN I
CAG
GTC
Gin
TGC
ACG
Cys
GCA
CGT
Ala
CAC
GTG
His
GGG
ccc Gly
CTG
GAC
Lou
GAG
CTC
Giu TGC GGC ACG CCG Cys Gly Fnu4H
TCC
AGO
Ser
CAC
GTG
His
CGA
OCT
Arg
CTA
GAT
Leu
TAC
ATG
Tyr
GGC
CCG
Gly TTC TTC AAG AAG Phe Phe Mb ol
GTG
CAC
Val
ACA
TGT
Thr
GTG
CAC
Val GAA GCT CTT CGA Glu Ala AluI ccc
GGG
Pro
AAG
TTC
Lys
CTC
GAG
Leu
ACC
TGG
Thr
GGC
CCG
Gly Cl1 CGC CG GCG GCC Arg Arg HpaI I
GAG
CTC
Glu
GAC
CTC
G1iL GAC CCT CA CTG GGA GT Asp Pro G1 Avail DdeI
G
C
n
GTG
CAC
Val GGG CAG CCC GTC Gly Gin GAG CTG CTC GAC Giu Leu Alul GGG GGC CCT GGC GCA CCC CCG GGA CCG CGT Gly Gly Pro Gly Ala HaeI1I BstNIHhaI GGC AGC CTG CAG CCC CCG TCG GAC GTC GGG Gly Ser Leu Gin Pro Fnu4HI PstIFnu4HI
TTG
AAC
Leu GCG CTG CGC GAC Ala Leu Hh a
GAG
CTC
Glu
GGG
CCC
Giy Avail
ATC
TAG
Ile
TCC
AGG
Ser
TGC
ACG
Cys CTG CAG GAC GTC Leu Gin PstI AAG CGC TTC GCG Lys Arg Hha I AlI
GGC
CCG
Gly
ATC
TAG
Ile
GAG
CTC
Giu
GTG
CAC
Val
AAC
TTG
Asn
GAG
CTC
Giu
TAC
ATG
Tyr
CAG
GTC
Gin
TGC
ACG
Cys TGC TGC ACG ACG Cys Cys Fnu4H I ACC ACC TGG TCG Thr Ser
TCC
AGG
Ser
CTC
GAG
Leu
TAC
ATG
Tyr CAG CTG GTC GAC Gin Leu PvUI I
AAC
TTG
Asn TAA TAG TCG ACC ATT ATC AGC TGG Sail TGC AGC CA ACG TCG GTT CGA Pstl (HindlII) B 1, C I and A 1 designate the start of the B, C and A chains of monkey proinsuLin 20 TabLe 3 Triplet No.
Amino acid Nucteotide No.
Coding strand Non-coding strand 0 1 met Cys 1 5 10 5' GA TCC ATG TGC 31 G TAC ACG 2 Ser
TCT
AGA
3 Asn
AAC,
TTG
0 004 0 00 ~0 4 0 00 00 0 0000 0404 0 0000 4004 0 4040 040404 0 0 0 40 0 4 4 .4 4 4~ 04 0 444.
4 I I 4 Leu
CTF
GAC
14 Gin 50
CAG
GTC
24 Arg 80
CGC
GCG
34 Stp
I:
5 6 7 8 Ser Thr Cys Val 25 30 TCG ACT TGC OTT AGC TGA ACG CAA 15 16 17 18 Glu Leu His Lys 55 60 GAA CTT CAT AAA CTT GAA GTA 'TT 25 26 27 28 Thr Asn Thr Giy 85 90 ACT AAT ACC GGC TGA TTA TGG CCG 9 Leu 35
CTT
GAA
19 Leu 65
CTG
GAC
29 Ser 95
'ICT
AGA
10 Giy
GGT
CCA
20 Gin
CAG
GTC
30 Giy
GGT
CCA
11 Lys 40
AAG
TTC
21 Thr 70
ACC
TGG
31 Thr 100
ACC
TGG
12 13 Leu Ser CTT TCT GAA AGA 22 23 Tyr Pro TAT CCG ATA GGC 32 33 Pro Gly 105 CCT GGT GGA CCA 1.0 Stp 115 TAA TAG CAT G 3' 'ITT ATC
Claims (14)
1. A process for the preparation of a genetically codable polypeptide by coupling the structural gene for this polypeptide in the correct reading frame via the gene for P-galactosidase, or a fragment of A-galactosidase, to a regulator region, introducing this gene structure into a bacterium, expressing therein an insoluble fusion protein, isolating it after cell disruption, and cleaving off the desired polypeptide chemically or enzymatically, which comprises replacing codons for methionine and/or arginine and/or cysteine in the gene for B-galacto- sidase, or for the fragment of A-galactosidase, in whole or in part, by codons of other amino acids.
2. The process as claimed in claim 1, wherein the gene structure codes for a B-galactosidase fragment of more than 250 amino acids but significantly less than the total $-galactosidase sequence.
3. The process as claimed in claim 1 or 2, wherein the B-galactosidase fragment is composed of a fusion of an amino-terminal and/or a carboxyl-terminal part- sequence of P-3alacQco dase
4. The process as claimed in cLaim 1, 2 or 3, wherein the B-galactosidase fragment has about 300 to about 800 amino acids. The process as claimed in one or more of the pre- ceding claims, wherein the B-galactosidase fragment has about 320 to about 650 amino acids.
6. The process as claimed in one or more of the pre- ceding claims, wherein the gene for the B-galacto- sidase fragment corresponds to a DNA sequence shown in Figure 1. M S* 22-
7. The process as claimed in one or more of the pre- ceding claims, wherein the structura ee for the genetically codable polypeptide is -4-ed- via an adaptor to the gene for the modified B-galactosidase or the B-galactosidase fragment.
8. The process as claimed in claim 7, wherein the adap- tor codes for a poly(amino acid) sequence.
9. The process as claimed in one or more of the pre- ceding claims, wherein a codon for an amino acid which allows chemical or enzymatic separation of the polypeptide from the B-galactosidase portion is loca- ted immediately upstream of the amino-terminal end of the structural gene. The process as claimed in one or more of the pre- ceding claims, wherein the genetically codable poly- peptide is a derivative of growth hormone releasing factor, an interferon, a proinsulin, secretin, inter- leukin-2, a calcitonin or a hirudin.
11. A gene structure containing a regulator region, a gene for the modified B-galactosidase or a B-galac- tosidase fragment, in which codons for methionine and/or arginine and/or cysteine have been replaced, in whole or in part, by codons of other amino acids, and a structural gene for a genetically codable poly- Speptide.
12. A gene structure as claimed in claim 11, wherein the structural gene for the genetically codable poly- peptide is coupled via an adaptor, which ensures the correct reading frame, to the gene for the modified B-galactosidase or the B-galactosidase fragment.
13. A vector containing a gene structure as claimed in claim 11 or 12, i- 23
14. A bacterium containing a vector as claimed in claim 13. E. coli containing a vector as claimed in claim 13.
16. A fusion protein containing modified B-galactosidase or a B-galactosidase fragment as claimed in one or more of claims 1 to 9, and a eukaryotic genetically codable polypeptide.
17. A fusion protein as claimed in claim 16, wherein the eukaryotic genetically codable polypeptide portion is a derivative of growth hormone releasing factor, an interferon, a proinsulin, secretin, interleukin-2, a calcitonin or a hirudin. DATED this 8th day of April 1988. HOECHST AKTIENGESELLSCHAFT EDWD. WATERS SONS PATENT ATTORNEYS QUEEN STREET MELBOURNE. VIC. 3000.
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| DE3712361 | 1987-04-11 | ||
| DE3712361 | 1987-04-11 | ||
| DE3805150 | 1988-02-19 | ||
| DE3805150A DE3805150A1 (en) | 1987-04-11 | 1988-02-19 | GENE TECHNOLOGICAL METHOD FOR PRODUCING POLYPEPTIDES |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU1446988A AU1446988A (en) | 1988-10-13 |
| AU603883B2 true AU603883B2 (en) | 1990-11-29 |
Family
ID=25854588
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| AU14469/88A Ceased AU603883B2 (en) | 1987-04-11 | 1988-04-11 | Genetic engineering process for the preparation of polypeptides |
Country Status (12)
| Country | Link |
|---|---|
| EP (1) | EP0286956B1 (en) |
| JP (1) | JPS63263087A (en) |
| KR (1) | KR880012758A (en) |
| AU (1) | AU603883B2 (en) |
| DE (2) | DE3805150A1 (en) |
| DK (1) | DK196188A (en) |
| FI (1) | FI881617A7 (en) |
| GR (1) | GR3007417T3 (en) |
| IL (1) | IL86023A (en) |
| NO (1) | NO881548L (en) |
| NZ (1) | NZ224184A (en) |
| PT (1) | PT87211B (en) |
Families Citing this family (11)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US4987070A (en) * | 1987-03-04 | 1991-01-22 | Suntory Limited | Use of a 97 amino acid leader sequence from the E. coli B-galactosidase gene for the production of hanp and hptc as fusion proteins |
| DE3716722A1 (en) * | 1987-05-19 | 1988-12-01 | Hoechst Ag | GENE TECHNOLOGICAL METHOD FOR PRODUCING ANGIOGENINES |
| DE3844211A1 (en) * | 1988-12-29 | 1990-07-05 | Hoechst Ag | NEW INSULINE DERIVATIVES, THE PROCESS FOR THEIR PRODUCTION, THEIR USE AND A PHARMACEUTICAL PREPARATION CONTAINING THEM |
| US5179196A (en) * | 1989-05-04 | 1993-01-12 | Sri International | Purification of proteins employing ctap-iii fusions |
| US5527687A (en) * | 1993-05-25 | 1996-06-18 | American Cyanamid Company | Enzyme induction screen for ergosterol biosynthesis inhibitors |
| CA2198966C (en) * | 1996-03-04 | 2011-06-21 | Yuji Suzuki | Method for cleaving chimeric protein using processing enzyme |
| PT821006E (en) | 1996-07-26 | 2004-09-30 | Aventis Pharma Gmbh | INSULIN DERIVATIVES WITH ENHANCED ZINC LINK |
| DE19825447A1 (en) | 1998-06-06 | 1999-12-09 | Hoechst Marion Roussel De Gmbh | New insulin analogues with increased zinc formation |
| RU2694686C2 (en) * | 2013-03-12 | 2019-07-16 | Е.И.Дюпон Де Немур Энд Компани | Methods for identifying variant recognition sites for rare-cutting engineered double-strand-break-inducing agents and compositions and uses thereof |
| KR20200080747A (en) | 2018-12-27 | 2020-07-07 | 주식회사 폴루스 | An Enzymatic Conversion Composition for Producing Insulin from Proinsulin and a Method for Producing Insulin from Proinsulin Using the Same |
| KR20200080748A (en) | 2018-12-27 | 2020-07-07 | 주식회사 폴루스 | A Method for Purifying Proinsulin Using Anion Exchange Chromatography |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU1563188A (en) * | 1987-05-06 | 1988-11-10 | Rudolf Glockshuber | A process for the preparation of a genetically encodable polypeptide |
Family Cites Families (4)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NO158880C (en) * | 1977-11-08 | 1988-11-09 | Genentech Inc | PROCEDURE FOR THE PREPARATION OF A SPECIFIC HUMAN HORMON. |
| JPS6028994A (en) * | 1983-07-08 | 1985-02-14 | Wakunaga Seiyaku Kk | (21-leucine) human urogastrone, corresponding gene, corresponding recombinant plasmid, transformed cell and their preparation |
| WO1985000831A1 (en) * | 1983-08-10 | 1985-02-28 | Amgen | Microbial expression of insulin-like growth factor |
| US4987070A (en) * | 1987-03-04 | 1991-01-22 | Suntory Limited | Use of a 97 amino acid leader sequence from the E. coli B-galactosidase gene for the production of hanp and hptc as fusion proteins |
-
1988
- 1988-02-19 DE DE3805150A patent/DE3805150A1/en not_active Withdrawn
- 1988-04-06 EP EP88105438A patent/EP0286956B1/en not_active Expired - Lifetime
- 1988-04-06 DE DE8888105438T patent/DE3877035D1/en not_active Expired - Fee Related
- 1988-04-07 FI FI881617A patent/FI881617A7/en not_active IP Right Cessation
- 1988-04-08 NO NO881548A patent/NO881548L/en unknown
- 1988-04-08 NZ NZ224184A patent/NZ224184A/en unknown
- 1988-04-10 IL IL86023A patent/IL86023A/en unknown
- 1988-04-11 PT PT87211A patent/PT87211B/en not_active IP Right Cessation
- 1988-04-11 JP JP63088888A patent/JPS63263087A/en active Pending
- 1988-04-11 AU AU14469/88A patent/AU603883B2/en not_active Ceased
- 1988-04-11 KR KR1019880004072A patent/KR880012758A/en not_active Ceased
- 1988-04-11 DK DK196188A patent/DK196188A/en not_active Application Discontinuation
-
1993
- 1993-03-19 GR GR930400200T patent/GR3007417T3/el unknown
Patent Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU1563188A (en) * | 1987-05-06 | 1988-11-10 | Rudolf Glockshuber | A process for the preparation of a genetically encodable polypeptide |
Also Published As
| Publication number | Publication date |
|---|---|
| FI881617A0 (en) | 1988-04-07 |
| GR3007417T3 (en) | 1993-07-30 |
| JPS63263087A (en) | 1988-10-31 |
| IL86023A0 (en) | 1988-09-30 |
| EP0286956A2 (en) | 1988-10-19 |
| DK196188D0 (en) | 1988-04-11 |
| AU1446988A (en) | 1988-10-13 |
| NO881548D0 (en) | 1988-04-08 |
| PT87211B (en) | 1992-07-31 |
| KR880012758A (en) | 1988-11-29 |
| EP0286956B1 (en) | 1992-12-30 |
| DE3877035D1 (en) | 1993-02-11 |
| DK196188A (en) | 1988-10-12 |
| EP0286956A3 (en) | 1989-04-05 |
| NO881548L (en) | 1988-10-12 |
| IL86023A (en) | 1993-01-31 |
| PT87211A (en) | 1988-05-01 |
| DE3805150A1 (en) | 1988-10-20 |
| FI881617A7 (en) | 1988-10-12 |
| NZ224184A (en) | 1989-10-27 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| JP2566933B2 (en) | Eukaryotic fusion protein, production and use thereof, and method | |
| AU638277B2 (en) | Fusion proteins, their preparation and use | |
| CA1336329C (en) | Fusion proteins, a process for their preparation and their use | |
| CA1337181C (en) | Recombinant herpes simplex gb-gd vaccine | |
| IE53897B1 (en) | Methods and products for facile microbial expression of dna sequences | |
| FI93734B (en) | Process for producing fusion proteins containing a eukaryotic ballast fraction and products useful in the process | |
| WO1992002550A1 (en) | Expression of recombinant polypeptides with improved purification | |
| AU603883B2 (en) | Genetic engineering process for the preparation of polypeptides | |
| EP0205475B1 (en) | Recombinant methods for production of serine protease inhibitors and dna sequences useful for same | |
| HU214236B (en) | Method for the preparation of hirudin and hirudin variants | |
| Zhu et al. | Determination of the carboxyl termini of the alpha and beta subunits of yeast K1 killer toxin. Requirement of a carboxypeptidase B-like activity for maturation. | |
| AU595805B2 (en) | Releasing desired peptides from fusion polypeptides using endopeptidases | |
| US5087564A (en) | Release of recombinant peptides from polypeptides using V8 endopeptidase | |
| US5420242A (en) | Production of human parathyroid hormone from microorganisms | |
| WO1989003886A1 (en) | Expression systems for preparation of polypeptides in prokaryotic cells | |
| CA2040810C (en) | Process for producing foreign proteins in streptomyces | |
| AU617999B2 (en) | A genetic engineering process for the preparation of angiogenins | |
| AU655316B2 (en) | Increased expression of low molecular weight recombinant polypeptides | |
| US6194200B1 (en) | Expression systems for preparation of polypeptides in prokaryotic cells | |
| Glück et al. | The primary structure of rat ribosomal protein S10 | |
| US5426036A (en) | Processes for the preparation of foreign proteins in streptomycetes | |
| FI97549B (en) | Process for the preparation of foreign proteins in Streptomyces cells | |
| Šebo et al. | Delineation of the minimal portion of the Bacillus sphaericus 1593M toxin required for the expression of larvicidal activity | |
| Daum et al. | Production of human adrenocorticotropin by cleavage of alkaline‐phosphatase‐derived fusion proteins containing repetitive recognition sequences for collagenases | |
| Lukacsovich et al. | A family of expression vectors based on the rrnBP2 promoter of Escherichia coli |