AU726892B2 - Nucleic acid and amino acid sequences relating to Helicobacter pylori and vaccine compositions thereof
- Publication number: AU726892B2 (AU25984/97A, AU2598497A)
- Authority: AU (Australia)
- Prior art keywords: seq, pylori, polypeptide, fragment, amino acid
- Prior art date
- Legal status: Ceased (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- 150000007523 nucleic acids Chemical class 0.000 title claims description 257
- 108020004707 nucleic acids Proteins 0.000 title claims description 254
- 102000039446 nucleic acids Human genes 0.000 title claims description 254
- 125000003275 alpha amino acid group Chemical group 0.000 title claims description 120
- 239000000203 mixture Substances 0.000 title claims description 30
- 229960005486 vaccine Drugs 0.000 title description 38
- 241000590002 Helicobacter pylori Species 0.000 title description 15
- 229940037467 helicobacter pylori Drugs 0.000 title description 14
- 108090000765 processed proteins & peptides Proteins 0.000 claims description 624
- 102000004196 processed proteins & peptides Human genes 0.000 claims description 574
- 229920001184 polypeptide Polymers 0.000 claims description 534
- 239000012634 fragment Substances 0.000 claims description 240
- 210000004027 cell Anatomy 0.000 claims description 162
- 239000002773 nucleotide Substances 0.000 claims description 119
- 125000003729 nucleotide group Chemical group 0.000 claims description 119
- 238000000034 method Methods 0.000 claims description 104
- 239000012528 membrane Substances 0.000 claims description 90
- 230000014509 gene expression Effects 0.000 claims description 79
- 238000006243 chemical reaction Methods 0.000 claims description 55
- 238000003556 assay Methods 0.000 claims description 39
- 150000001413 amino acids Chemical class 0.000 claims description 38
- 239000000523 sample Substances 0.000 claims description 38
- 241000588724 Escherichia coli Species 0.000 claims description 36
- 230000001086 cytosolic effect Effects 0.000 claims description 34
- 238000012360 testing method Methods 0.000 claims description 32
- 230000002163 immunogen Effects 0.000 claims description 28
- 241000894006 Bacteria Species 0.000 claims description 27
- 102000009016 Cholera Toxin Human genes 0.000 claims description 26
- 108010049048 Cholera Toxin Proteins 0.000 claims description 26
- 230000001580 bacterial effect Effects 0.000 claims description 25
- 150000001875 compounds Chemical class 0.000 claims description 25
- 239000000126 substance Substances 0.000 claims description 24
- 230000032258 transport Effects 0.000 claims description 22
- 239000013604 expression vector Substances 0.000 claims description 21
- 241000589989 Helicobacter Species 0.000 claims description 16
- 230000004060 metabolic process Effects 0.000 claims description 16
- 230000028327 secretion Effects 0.000 claims description 15
- 238000011282 treatment Methods 0.000 claims description 15
- 210000004899 c-terminal region Anatomy 0.000 claims description 14
- 230000001225 therapeutic effect Effects 0.000 claims description 14
- 230000035897 transcription Effects 0.000 claims description 14
- 238000013518 transcription Methods 0.000 claims description 14
- 239000002671 adjuvant Substances 0.000 claims description 12
- 239000002245 particle Substances 0.000 claims description 12
- COLNVLDHVKWLRT-QMMMGPOBSA-N phenylalanine group Chemical group N[C@@H](CC1=CC=CC=C1)C(=O)O COLNVLDHVKWLRT-QMMMGPOBSA-N 0.000 claims description 12
- OUYCCCASQSFEME-UHFFFAOYSA-N tyrosine Natural products OC(=O)C(N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-UHFFFAOYSA-N 0.000 claims description 12
- 238000004519 manufacturing process Methods 0.000 claims description 11
- 230000001413 cellular effect Effects 0.000 claims description 10
- 239000003937 drug carrier Substances 0.000 claims description 10
- 125000001493 tyrosinyl group Chemical group [H]OC1=C([H])C([H])=C(C([H])=C1[H])C([H])([H])C([H])(N([H])[H])C(*)=O 0.000 claims description 10
- 230000000295 complement effect Effects 0.000 claims description 9
- 238000001514 detection method Methods 0.000 claims description 7
- 230000010076 replication Effects 0.000 claims description 7
- 230000004260 plant-type cell wall biogenesis Effects 0.000 claims description 6
- 230000001105 regulatory effect Effects 0.000 claims description 6
- 241000700605 Viruses Species 0.000 claims description 5
- 230000037354 amino acid metabolism Effects 0.000 claims description 5
- 230000037356 lipid metabolism Effects 0.000 claims description 5
- 230000037360 nucleotide metabolism Effects 0.000 claims description 5
- 230000006798 recombination Effects 0.000 claims description 5
- 238000005215 recombination Methods 0.000 claims description 5
- 230000008439 repair process Effects 0.000 claims description 5
- 238000007423 screening assay Methods 0.000 claims description 5
- 230000002238 attenuated effect Effects 0.000 claims description 4
- 239000002502 liposome Substances 0.000 claims description 4
- 239000006166 lysate Substances 0.000 claims description 4
- 239000003094 microcapsule Substances 0.000 claims description 4
- BSOQXXWZTUDTEL-ZUYCGGNHSA-N muramyl dipeptide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)O[C@@H](O)[C@@H]1NC(C)=O BSOQXXWZTUDTEL-ZUYCGGNHSA-N 0.000 claims description 4
- 238000003259 recombinant expression Methods 0.000 claims description 4
- 108010042708 Acetylmuramyl-Alanyl-Isoglutamine Proteins 0.000 claims description 2
- 238000012258 culturing Methods 0.000 claims description 2
- 230000002538 fungal effect Effects 0.000 claims description 2
- 150000004676 glycans Chemical class 0.000 claims description 2
- 231100000252 nontoxic Toxicity 0.000 claims description 2
- 230000003000 nontoxic effect Effects 0.000 claims description 2
- 229920000642 polymer Polymers 0.000 claims description 2
- 229920001282 polysaccharide Polymers 0.000 claims description 2
- 239000005017 polysaccharide Substances 0.000 claims description 2
- 229930182490 saponin Natural products 0.000 claims description 2
- 150000007949 saponins Chemical class 0.000 claims description 2
- 239000003053 toxin Substances 0.000 claims description 2
- 231100000765 toxin Toxicity 0.000 claims description 2
- 206010019375 Helicobacter infections Diseases 0.000 claims 2
- 238000011321 prophylaxis Methods 0.000 claims 2
- QGVLYPPODPLXMB-UBTYZVCOSA-N (1aR,1bS,4aR,7aS,7bS,8R,9R,9aS)-4a,7b,9,9a-tetrahydroxy-3-(hydroxymethyl)-1,1,6,8-tetramethyl-1,1a,1b,4,4a,7a,7b,8,9,9a-decahydro-5H-cyclopropa[3,4]benzo[1,2-e]azulen-5-one Chemical compound C1=C(CO)C[C@]2(O)C(=O)C(C)=C[C@H]2[C@@]2(O)[C@H](C)[C@@H](O)[C@@]3(O)C(C)(C)[C@H]3[C@@H]21 QGVLYPPODPLXMB-UBTYZVCOSA-N 0.000 claims 1
- 150000002148 esters Chemical class 0.000 claims 1
- QGVLYPPODPLXMB-QXYKVGAMSA-N phorbol Natural products C[C@@H]1[C@@H](O)[C@]2(O)[C@H]([C@H]3C=C(CO)C[C@@]4(O)[C@H](C=C(C)C4=O)[C@@]13O)C2(C)C QGVLYPPODPLXMB-QXYKVGAMSA-N 0.000 claims 1
- 239000001397 quillaja saponaria molina bark Substances 0.000 claims 1
- 108090000623 proteins and genes Proteins 0.000 description 355
- 102000004169 proteins and genes Human genes 0.000 description 207
- 235000018102 proteins Nutrition 0.000 description 202
- 108020004414 DNA Proteins 0.000 description 161
- 108700026244 Open Reading Frames Proteins 0.000 description 91
- 238000003752 polymerase chain reaction Methods 0.000 description 78
- 239000000047 product Substances 0.000 description 67
- 210000004379 membrane Anatomy 0.000 description 65
- 239000013615 primer Substances 0.000 description 60
- 239000013612 plasmid Substances 0.000 description 55
- 238000010367 cloning Methods 0.000 description 54
- 239000013598 vector Substances 0.000 description 49
- 108091028043 Nucleic acid sequence Proteins 0.000 description 47
- 230000000694 effects Effects 0.000 description 44
- 235000001014 amino acid Nutrition 0.000 description 38
- 208000015181 infectious disease Diseases 0.000 description 36
- 108091007433 antigens Proteins 0.000 description 35
- 102000036639 antigens Human genes 0.000 description 35
- 229940024606 amino acid Drugs 0.000 description 34
- 108091034117 Oligonucleotide Proteins 0.000 description 32
- -1 Orn Chemical compound 0.000 description 31
- 239000000427 antigen Substances 0.000 description 31
- 238000002360 preparation method Methods 0.000 description 31
- 238000000746 purification Methods 0.000 description 28
- 230000027455 binding Effects 0.000 description 27
- 229930027917 kanamycin Natural products 0.000 description 26
- 229960000318 kanamycin Drugs 0.000 description 26
- SBUJHOSQTJFQJX-NOAMYHISSA-N kanamycin Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N SBUJHOSQTJFQJX-NOAMYHISSA-N 0.000 description 26
- 229930182823 kanamycin A Natural products 0.000 description 26
- 238000009396 hybridization Methods 0.000 description 23
- 241001465754 Metazoa Species 0.000 description 22
- 238000012163 sequencing technique Methods 0.000 description 22
- PEDCQBHIVMGVHV-UHFFFAOYSA-N Glycerine Chemical compound OCC(O)CO PEDCQBHIVMGVHV-UHFFFAOYSA-N 0.000 description 21
- 108010052285 Membrane Proteins Proteins 0.000 description 21
- 238000004925 denaturation Methods 0.000 description 20
- 230000036425 denaturation Effects 0.000 description 20
- 230000003053 immunization Effects 0.000 description 20
- 108010008281 Recombinant Fusion Proteins Proteins 0.000 description 19
- 102000007056 Recombinant Fusion Proteins Human genes 0.000 description 19
- 230000004927 fusion Effects 0.000 description 19
- 230000014616 translation Effects 0.000 description 19
- 238000012408 PCR amplification Methods 0.000 description 18
- 230000003321 amplification Effects 0.000 description 18
- 238000002649 immunization Methods 0.000 description 18
- 238000003199 nucleic acid amplification method Methods 0.000 description 18
- 108020004705 Codon Proteins 0.000 description 17
- 125000000539 amino acid group Chemical group 0.000 description 17
- 239000003599 detergent Substances 0.000 description 16
- 238000000338 in vitro Methods 0.000 description 16
- YBYRMVIVWMBXKQ-UHFFFAOYSA-N phenylmethanesulfonyl fluoride Chemical compound FS(=O)(=O)CC1=CC=CC=C1 YBYRMVIVWMBXKQ-UHFFFAOYSA-N 0.000 description 16
- 238000012216 screening Methods 0.000 description 16
- 102000004190 Enzymes Human genes 0.000 description 15
- 108090000790 Enzymes Proteins 0.000 description 15
- 108010020062 Peptidylprolyl Isomerase Proteins 0.000 description 15
- 102000009658 Peptidylprolyl Isomerase Human genes 0.000 description 15
- 230000004071 biological effect Effects 0.000 description 15
- 229940088598 enzyme Drugs 0.000 description 15
- 239000003446 ligand Substances 0.000 description 15
- 238000013519 translation Methods 0.000 description 15
- 102000018697 Membrane Proteins Human genes 0.000 description 14
- 230000006870 function Effects 0.000 description 14
- 238000006467 substitution reaction Methods 0.000 description 14
- 108091026890 Coding region Proteins 0.000 description 13
- 210000001744 T-lymphocyte Anatomy 0.000 description 13
- 238000004458 analytical method Methods 0.000 description 13
- 230000029087 digestion Effects 0.000 description 13
- 102000037865 fusion proteins Human genes 0.000 description 13
- 108020001507 fusion proteins Proteins 0.000 description 13
- 230000004048 modification Effects 0.000 description 13
- 238000012986 modification Methods 0.000 description 13
- 238000002703 mutagenesis Methods 0.000 description 13
- 231100000350 mutagenesis Toxicity 0.000 description 13
- 230000035772 mutation Effects 0.000 description 13
- XSQUKJJJFZCRTK-UHFFFAOYSA-N Urea Chemical compound NC(N)=O XSQUKJJJFZCRTK-UHFFFAOYSA-N 0.000 description 12
- JLCPHMBAVCMARE-UHFFFAOYSA-N [3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[3-[[3-[[3-[[3-[[3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-[[5-(2-amino-6-oxo-1H-purin-9-yl)-3-hydroxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxyoxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(5-methyl-2,4-dioxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(6-aminopurin-9-yl)oxolan-2-yl]methoxy-hydroxyphosphoryl]oxy-5-(4-amino-2-oxopyrimidin-1-yl)oxolan-2-yl]methyl [5-(6-aminopurin-9-yl)-2-(hydroxymethyl)oxolan-3-yl] hydrogen phosphate Polymers 
Cc1cn(C2CC(OP(O)(=O)OCC3OC(CC3OP(O)(=O)OCC3OC(CC3O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c3nc(N)[nH]c4=O)C(COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3COP(O)(=O)OC3CC(OC3CO)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3ccc(N)nc3=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cc(C)c(=O)[nH]c3=O)n3cc(C)c(=O)[nH]c3=O)n3ccc(N)nc3=O)n3cc(C)c(=O)[nH]c3=O)n3cnc4c3nc(N)[nH]c4=O)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)n3cnc4c(N)ncnc34)O2)c(=O)[nH]c1=O JLCPHMBAVCMARE-UHFFFAOYSA-N 0.000 description 12
- 239000003814 drug Substances 0.000 description 12
- 108020004999 messenger RNA Proteins 0.000 description 12
- FFEARJCKVFRZRR-SCSAIBSYSA-N D-methionine Chemical compound CSCC[C@@H](N)C(O)=O FFEARJCKVFRZRR-SCSAIBSYSA-N 0.000 description 11
- FAPWRFPIFSIZLT-UHFFFAOYSA-M Sodium chloride Chemical compound [Na+].[Cl-] FAPWRFPIFSIZLT-UHFFFAOYSA-M 0.000 description 11
- 239000011543 agarose gel Substances 0.000 description 11
- 230000000692 anti-sense effect Effects 0.000 description 11
- 230000015572 biosynthetic process Effects 0.000 description 11
- 239000003153 chemical reaction reagent Substances 0.000 description 11
- 239000000499 gel Substances 0.000 description 11
- 230000002441 reversible effect Effects 0.000 description 11
- 238000003786 synthesis reaction Methods 0.000 description 11
- 230000009466 transformation Effects 0.000 description 11
- 241000699666 Mus <mouse, genus> Species 0.000 description 10
- 241000699670 Mus sp. Species 0.000 description 10
- 239000007983 Tris buffer Substances 0.000 description 10
- 239000000872 buffer Substances 0.000 description 10
- 239000003795 chemical substances by application Substances 0.000 description 10
- 238000012217 deletion Methods 0.000 description 10
- 230000037430 deletion Effects 0.000 description 10
- 210000002966 serum Anatomy 0.000 description 10
- 210000001519 tissue Anatomy 0.000 description 10
- LENZDBCJOHFCAS-UHFFFAOYSA-N tris Chemical compound OCC(N)(CO)CO LENZDBCJOHFCAS-UHFFFAOYSA-N 0.000 description 10
- 108091032973 (ribonucleotides)n+m Proteins 0.000 description 9
- 241000589562 Brucella Species 0.000 description 9
- LFQSCWFLJHTTHZ-UHFFFAOYSA-N Ethanol Chemical compound CCO LFQSCWFLJHTTHZ-UHFFFAOYSA-N 0.000 description 9
- 101710116435 Outer membrane protein Proteins 0.000 description 9
- 108020004511 Recombinant DNA Proteins 0.000 description 9
- 238000013459 approach Methods 0.000 description 9
- 239000012139 lysis buffer Substances 0.000 description 9
- 239000008188 pellet Substances 0.000 description 9
- 238000005382 thermal cycling Methods 0.000 description 9
- 108091003079 Bovine Serum Albumin Proteins 0.000 description 8
- HEDRZPFGACZZDS-UHFFFAOYSA-N Chloroform Chemical compound ClC(Cl)Cl HEDRZPFGACZZDS-UHFFFAOYSA-N 0.000 description 8
- 108010014303 DNA-directed DNA polymerase Proteins 0.000 description 8
- 102000016928 DNA-directed DNA polymerase Human genes 0.000 description 8
- DHMQDGOQFOQNFH-UHFFFAOYSA-N Glycine Chemical compound NCC(O)=O DHMQDGOQFOQNFH-UHFFFAOYSA-N 0.000 description 8
- 101710137500 T7 RNA polymerase Proteins 0.000 description 8
- 230000004075 alteration Effects 0.000 description 8
- 229960000723 ampicillin Drugs 0.000 description 8
- AVKUERGKIZMTKX-NJBDSQKTSA-N ampicillin Chemical compound C1([C@@H](N)C(=O)N[C@H]2[C@H]3SC([C@@H](N3C2=O)C(O)=O)(C)C)=CC=CC=C1 AVKUERGKIZMTKX-NJBDSQKTSA-N 0.000 description 8
- 238000002474 experimental method Methods 0.000 description 8
- 238000002744 homologous recombination Methods 0.000 description 8
- 230000006801 homologous recombination Effects 0.000 description 8
- 238000001727 in vivo Methods 0.000 description 8
- 238000003780 insertion Methods 0.000 description 8
- 230000037431 insertion Effects 0.000 description 8
- BPHPUYQFMNQIOC-NXRLNHOXSA-N isopropyl beta-D-thiogalactopyranoside Chemical compound CC(C)S[C@@H]1O[C@H](CO)[C@H](O)[C@H](O)[C@H]1O BPHPUYQFMNQIOC-NXRLNHOXSA-N 0.000 description 8
- 244000052769 pathogen Species 0.000 description 8
- 101150010516 ppi gene Proteins 0.000 description 8
- 101150105899 ppiB gene Proteins 0.000 description 8
- 108091008146 restriction endonucleases Proteins 0.000 description 8
- 238000010561 standard procedure Methods 0.000 description 8
- 239000000725 suspension Substances 0.000 description 8
- NKDFYOWSKOHCCO-YPVLXUMRSA-N 20-hydroxyecdysone Chemical compound C1[C@@H](O)[C@@H](O)C[C@]2(C)[C@@H](CC[C@@]3([C@@H]([C@@](C)(O)[C@H](O)CCC(C)(O)C)CC[C@]33O)C)C3=CC(=O)[C@@H]21 NKDFYOWSKOHCCO-YPVLXUMRSA-N 0.000 description 7
- 239000003155 DNA primer Substances 0.000 description 7
- 238000001712 DNA sequencing Methods 0.000 description 7
- 230000002759 chromosomal effect Effects 0.000 description 7
- 108010048032 cyclophilin B Proteins 0.000 description 7
- SUYVUBYJARFZHO-RRKCRQDMSA-N dATP Chemical compound C1=NC=2C(N)=NC=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-RRKCRQDMSA-N 0.000 description 7
- SUYVUBYJARFZHO-UHFFFAOYSA-N dATP Natural products C1=NC=2C(N)=NC=NC=2N1C1CC(O)C(COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 SUYVUBYJARFZHO-UHFFFAOYSA-N 0.000 description 7
- HAAZLUGHYHWQIW-KVQBGUIXSA-N dGTP Chemical compound C1=NC=2C(=O)NC(N)=NC=2N1[C@H]1C[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)O1 HAAZLUGHYHWQIW-KVQBGUIXSA-N 0.000 description 7
- NHVNXKFIZYSCEB-XLPZGREQSA-N dTTP Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](COP(O)(=O)OP(O)(=O)OP(O)(O)=O)[C@@H](O)C1 NHVNXKFIZYSCEB-XLPZGREQSA-N 0.000 description 7
- 229940009976 deoxycholate Drugs 0.000 description 7
- KXGVEGMKQFWNSR-LLQZFEROSA-N deoxycholic acid Chemical compound C([C@H]1CC2)[C@H](O)CC[C@]1(C)[C@@H]1[C@@H]2[C@@H]2CC[C@H]([C@@H](CCC(O)=O)C)[C@@]2(C)[C@@H](O)C1 KXGVEGMKQFWNSR-LLQZFEROSA-N 0.000 description 7
- 238000011161 development Methods 0.000 description 7
- 208000037265 diseases, disorders, signs and symptoms Diseases 0.000 description 7
- 239000013613 expression plasmid Substances 0.000 description 7
- 239000002609 medium Substances 0.000 description 7
- 230000001717 pathogenic effect Effects 0.000 description 7
- JKMHFZQWWAIEOD-UHFFFAOYSA-N 2-[4-(2-hydroxyethyl)piperazin-1-yl]ethanesulfonic acid Chemical compound OCC[NH+]1CCN(CCS([O-])(=O)=O)CC1 JKMHFZQWWAIEOD-UHFFFAOYSA-N 0.000 description 6
- KCXVZYZYPLLWCC-UHFFFAOYSA-N EDTA Chemical compound OC(=O)CN(CC(O)=O)CCN(CC(O)=O)CC(O)=O KCXVZYZYPLLWCC-UHFFFAOYSA-N 0.000 description 6
- WQZGKKKJIJFFOK-GASJEMHNSA-N Glucose Natural products OC[C@H]1OC(O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-GASJEMHNSA-N 0.000 description 6
- 108700016167 Glutamate racemases Proteins 0.000 description 6
- WHUUTDBJXJRKMK-UHFFFAOYSA-N Glutamic acid Natural products OC(=O)C(N)CCC(O)=O WHUUTDBJXJRKMK-UHFFFAOYSA-N 0.000 description 6
- WHUUTDBJXJRKMK-VKHMYHEASA-N L-glutamic acid Chemical compound OC(=O)[C@@H](N)CCC(O)=O WHUUTDBJXJRKMK-VKHMYHEASA-N 0.000 description 6
- 108091006629 SLC13A2 Proteins 0.000 description 6
- 108091081024 Start codon Proteins 0.000 description 6
- 238000007792 addition Methods 0.000 description 6
- 229940098773 bovine serum albumin Drugs 0.000 description 6
- 229940041514 candida albicans extract Drugs 0.000 description 6
- 239000004202 carbamide Substances 0.000 description 6
- RGWHQCVHVJXOKC-SHYZEUOFSA-J dCTP(4-) Chemical compound O=C1N=C(N)C=CN1[C@@H]1O[C@H](COP([O-])(=O)OP([O-])(=O)OP([O-])([O-])=O)[C@@H](O)C1 RGWHQCVHVJXOKC-SHYZEUOFSA-J 0.000 description 6
- 229940079593 drug Drugs 0.000 description 6
- 238000001962 electrophoresis Methods 0.000 description 6
- 238000005516 engineering process Methods 0.000 description 6
- 230000002255 enzymatic effect Effects 0.000 description 6
- ZMMJGEGLRURXTF-UHFFFAOYSA-N ethidium bromide Chemical compound [Br-].C12=CC(N)=CC=C2C2=CC=C(N)C=C2[N+](CC)=C1C1=CC=CC=C1 ZMMJGEGLRURXTF-UHFFFAOYSA-N 0.000 description 6
- 229960005542 ethidium bromide Drugs 0.000 description 6
- 239000000284 extract Substances 0.000 description 6
- 230000002068 genetic effect Effects 0.000 description 6
- 230000002209 hydrophobic effect Effects 0.000 description 6
- RAXXELZNTBOGNW-UHFFFAOYSA-N imidazole Natural products C1=CNC=N1 RAXXELZNTBOGNW-UHFFFAOYSA-N 0.000 description 6
- 230000028993 immune response Effects 0.000 description 6
- 230000001965 increasing effect Effects 0.000 description 6
- 238000011534 incubation Methods 0.000 description 6
- 238000002955 isolation Methods 0.000 description 6
- 230000001404 mediated effect Effects 0.000 description 6
- 210000002729 polyribosome Anatomy 0.000 description 6
- 102000005962 receptors Human genes 0.000 description 6
- 108020003175 receptors Proteins 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 239000011780 sodium chloride Substances 0.000 description 6
- 210000002784 stomach Anatomy 0.000 description 6
- 239000012137 tryptone Substances 0.000 description 6
- 230000003612 virological effect Effects 0.000 description 6
- XLYOFNOQVPJJNP-UHFFFAOYSA-N water Substances O XLYOFNOQVPJJNP-UHFFFAOYSA-N 0.000 description 6
- 239000012138 yeast extract Substances 0.000 description 6
- AGPKZVBTJJNPAG-RFZPGFLSSA-N D-Isoleucine Chemical compound CC[C@@H](C)[C@@H](N)C(O)=O AGPKZVBTJJNPAG-RFZPGFLSSA-N 0.000 description 5
- WHUUTDBJXJRKMK-GSVOUGTGSA-N D-glutamic acid Chemical compound OC(=O)[C@H](N)CCC(O)=O WHUUTDBJXJRKMK-GSVOUGTGSA-N 0.000 description 5
- ROHFNLRQFUQHCH-RXMQYKEDSA-N D-leucine Chemical compound CC(C)C[C@@H](N)C(O)=O ROHFNLRQFUQHCH-RXMQYKEDSA-N 0.000 description 5
- KZSNJWFQEVHDMF-SCSAIBSYSA-N D-valine Chemical compound CC(C)[C@@H](N)C(O)=O KZSNJWFQEVHDMF-SCSAIBSYSA-N 0.000 description 5
- 230000006820 DNA synthesis Effects 0.000 description 5
- 238000002965 ELISA Methods 0.000 description 5
- 239000007995 HEPES buffer Substances 0.000 description 5
- DCXYFEDJOCDNAF-REOHCLBHSA-N L-asparagine Chemical compound OC(=O)[C@@H](N)CC(N)=O DCXYFEDJOCDNAF-REOHCLBHSA-N 0.000 description 5
- CKLJMWTZIZZHCS-REOHCLBHSA-N L-aspartic acid Chemical compound OC(=O)[C@@H](N)CC(O)=O CKLJMWTZIZZHCS-REOHCLBHSA-N 0.000 description 5
- AYFVYJQAPQTCCC-GBXIJSLDSA-N L-threonine Chemical compound C[C@@H](O)[C@H](N)C(O)=O AYFVYJQAPQTCCC-GBXIJSLDSA-N 0.000 description 5
- 108010076504 Protein Sorting Signals Proteins 0.000 description 5
- 108010001267 Protein Subunits Proteins 0.000 description 5
- 102000002067 Protein Subunits Human genes 0.000 description 5
- 240000004808 Saccharomyces cerevisiae Species 0.000 description 5
- 238000012300 Sequence Analysis Methods 0.000 description 5
- AYFVYJQAPQTCCC-UHFFFAOYSA-N Threonine Natural products CC(O)C(N)C(O)=O AYFVYJQAPQTCCC-UHFFFAOYSA-N 0.000 description 5
- 108700019146 Transgenes Proteins 0.000 description 5
- 239000002253 acid Substances 0.000 description 5
- 230000000890 antigenic effect Effects 0.000 description 5
- 239000006161 blood agar Substances 0.000 description 5
- 230000008859 change Effects 0.000 description 5
- 238000010276 construction Methods 0.000 description 5
- 238000013461 design Methods 0.000 description 5
- 239000008103 glucose Substances 0.000 description 5
- 230000001771 impaired effect Effects 0.000 description 5
- 239000003112 inhibitor Substances 0.000 description 5
- 230000003993 interaction Effects 0.000 description 5
- OOYGSFOGFJDDHP-KMCOLRRFSA-N kanamycin A sulfate Chemical compound OS(O)(=O)=O.O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CN)O[C@@H]1O[C@H]1[C@H](O)[C@@H](O[C@@H]2[C@@H]([C@@H](N)[C@H](O)[C@@H](CO)O2)O)[C@H](N)C[C@@H]1N OOYGSFOGFJDDHP-KMCOLRRFSA-N 0.000 description 5
- 229960002064 kanamycin sulfate Drugs 0.000 description 5
- 210000004962 mammalian cell Anatomy 0.000 description 5
- 125000002924 primary amino group Chemical group [H]N([H])* 0.000 description 5
- 238000012545 processing Methods 0.000 description 5
- 125000006853 reporter group Chemical group 0.000 description 5
- 239000000243 solution Substances 0.000 description 5
- 239000006228 supernatant Substances 0.000 description 5
- 239000001226 triphosphate Substances 0.000 description 5
- 235000011178 triphosphate Nutrition 0.000 description 5
- UNXRWKVEANCORM-UHFFFAOYSA-N triphosphoric acid Chemical compound OP(O)(=O)OP(O)(=O)OP(O)(O)=O UNXRWKVEANCORM-UHFFFAOYSA-N 0.000 description 5
- 241001515965 unidentified phage Species 0.000 description 5
- 230000001018 virulence Effects 0.000 description 5
- MTCFGRXMJLQNBG-REOHCLBHSA-N (2S)-2-amino-3-hydroxypropanoic acid Chemical compound OC[C@H](N)C(O)=O MTCFGRXMJLQNBG-REOHCLBHSA-N 0.000 description 4
- IEQAICDLOKRSRL-UHFFFAOYSA-N 2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-[2-(2-dodecoxyethoxy)ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethoxy]ethanol Chemical compound CCCCCCCCCCCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCOCCO IEQAICDLOKRSRL-UHFFFAOYSA-N 0.000 description 4
- 101710132601 Capsid protein Proteins 0.000 description 4
- 101710094648 Coat protein Proteins 0.000 description 4
- DCXYFEDJOCDNAF-UWTATZPHSA-N D-Asparagine Chemical compound OC(=O)[C@H](N)CC(N)=O DCXYFEDJOCDNAF-UWTATZPHSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UHFFFAOYSA-N D-OH-Asp Natural products OC(=O)C(N)CC(O)=O CKLJMWTZIZZHCS-UHFFFAOYSA-N 0.000 description 4
- CKLJMWTZIZZHCS-UWTATZPHSA-N D-aspartic acid Chemical compound OC(=O)[C@H](N)CC(O)=O CKLJMWTZIZZHCS-UWTATZPHSA-N 0.000 description 4
- ZDXPYRJPNDTMRX-GSVOUGTGSA-N D-glutamine Chemical compound OC(=O)[C@H](N)CCC(N)=O ZDXPYRJPNDTMRX-GSVOUGTGSA-N 0.000 description 4
- AYFVYJQAPQTCCC-STHAYSLISA-N D-threonine Chemical compound C[C@H](O)[C@@H](N)C(O)=O AYFVYJQAPQTCCC-STHAYSLISA-N 0.000 description 4
- 102000012410 DNA Ligases Human genes 0.000 description 4
- 108010061982 DNA Ligases Proteins 0.000 description 4
- 241000724791 Filamentous phage Species 0.000 description 4
- 102100021181 Golgi phosphoprotein 3 Human genes 0.000 description 4
- 241000238631 Hexapoda Species 0.000 description 4
- 102100034349 Integrase Human genes 0.000 description 4
- KFZMGEQAYNKOFK-UHFFFAOYSA-N Isopropanol Chemical compound CC(C)O KFZMGEQAYNKOFK-UHFFFAOYSA-N 0.000 description 4
- QNAYBMKLOCPYGJ-REOHCLBHSA-N L-alanine Chemical compound C[C@H](N)C(O)=O QNAYBMKLOCPYGJ-REOHCLBHSA-N 0.000 description 4
- KDXKERNSBIXSRK-YFKPBYRVSA-N L-lysine Chemical compound NCCCC[C@H](N)C(O)=O KDXKERNSBIXSRK-YFKPBYRVSA-N 0.000 description 4
- OUYCCCASQSFEME-QMMMGPOBSA-N L-tyrosine Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-QMMMGPOBSA-N 0.000 description 4
- KDXKERNSBIXSRK-UHFFFAOYSA-N Lysine Natural products NCCCCC(N)C(O)=O KDXKERNSBIXSRK-UHFFFAOYSA-N 0.000 description 4
- CSNNHWWHGAXBCP-UHFFFAOYSA-L Magnesium sulfate Chemical compound [Mg+2].[O-][S+2]([O-])([O-])[O-] CSNNHWWHGAXBCP-UHFFFAOYSA-L 0.000 description 4
- 101710125418 Major capsid protein Proteins 0.000 description 4
- 102000016943 Muramidase Human genes 0.000 description 4
- 108010014251 Muramidase Proteins 0.000 description 4
- 108010062010 N-Acetylmuramoyl-L-alanine Amidase Proteins 0.000 description 4
- BACYUWVYYTXETD-UHFFFAOYSA-N N-Lauroylsarcosine Chemical compound CCCCCCCCCCCC(=O)N(C)CC(O)=O BACYUWVYYTXETD-UHFFFAOYSA-N 0.000 description 4
- 101710141454 Nucleoprotein Proteins 0.000 description 4
- 239000002033 PVDF binder Substances 0.000 description 4
- 239000002202 Polyethylene glycol Substances 0.000 description 4
- 101710083689 Probable capsid protein Proteins 0.000 description 4
- 108010046334 Urease Proteins 0.000 description 4
- KZSNJWFQEVHDMF-UHFFFAOYSA-N Valine Natural products CC(C)C(N)C(O)=O KZSNJWFQEVHDMF-UHFFFAOYSA-N 0.000 description 4
- 235000004279 alanine Nutrition 0.000 description 4
- 238000012867 alanine scanning Methods 0.000 description 4
- 239000005557 antagonist Substances 0.000 description 4
- 229910002091 carbon monoxide Inorganic materials 0.000 description 4
- 238000005119 centrifugation Methods 0.000 description 4
- 210000004978 chinese hamster ovary cell Anatomy 0.000 description 4
- 238000004587 chromatography analysis Methods 0.000 description 4
- 238000003776 cleavage reaction Methods 0.000 description 4
- 239000002299 complementary DNA Substances 0.000 description 4
- 238000000502 dialysis Methods 0.000 description 4
- 238000009826 distribution Methods 0.000 description 4
- 230000008029 eradication Effects 0.000 description 4
- 230000001747 exhibiting effect Effects 0.000 description 4
- 238000009472 formulation Methods 0.000 description 4
- 230000002496 gastric effect Effects 0.000 description 4
- 230000012010 growth Effects 0.000 description 4
- 230000001976 improved effect Effects 0.000 description 4
- 230000000977 initiatory effect Effects 0.000 description 4
- PHTQWCKDNZKARW-UHFFFAOYSA-N isoamylol Chemical compound CC(C)CCO PHTQWCKDNZKARW-UHFFFAOYSA-N 0.000 description 4
- 229960000274 lysozyme Drugs 0.000 description 4
- 239000004325 lysozyme Substances 0.000 description 4
- 235000010335 lysozyme Nutrition 0.000 description 4
- 239000011159 matrix material Substances 0.000 description 4
- 238000002156 mixing Methods 0.000 description 4
- 238000002823 phage display Methods 0.000 description 4
- 229920002401 polyacrylamide Polymers 0.000 description 4
- 229920001223 polyethylene glycol Polymers 0.000 description 4
- 229920002981 polyvinylidene fluoride Polymers 0.000 description 4
- 239000002987 primer (paints) Substances 0.000 description 4
- 239000011541 reaction mixture Substances 0.000 description 4
- 230000007017 scission Effects 0.000 description 4
- 238000007790 scraping Methods 0.000 description 4
- 241000894007 species Species 0.000 description 4
- 230000004083 survival effect Effects 0.000 description 4
- 229940124597 therapeutic agent Drugs 0.000 description 4
- 230000002103 transcriptional effect Effects 0.000 description 4
- 238000012546 transfer Methods 0.000 description 4
- 230000009261 transgenic effect Effects 0.000 description 4
- GHKCSRZBNZQHKW-UHFFFAOYSA-N 1-sulfanylethanol Chemical compound CC(O)S GHKCSRZBNZQHKW-UHFFFAOYSA-N 0.000 description 3
- 241000283690 Bos taurus Species 0.000 description 3
- 101150029029 CAVIN4 gene Proteins 0.000 description 3
- 102100035882 Catalase Human genes 0.000 description 3
- 108010053835 Catalase Proteins 0.000 description 3
- 108020004635 Complementary DNA Proteins 0.000 description 3
- 108091035707 Consensus sequence Proteins 0.000 description 3
- 102000004127 Cytokines Human genes 0.000 description 3
- 108090000695 Cytokines Proteins 0.000 description 3
- XUJNEKJLAYXESH-UWTATZPHSA-N D-Cysteine Chemical compound SC[C@@H](N)C(O)=O XUJNEKJLAYXESH-UWTATZPHSA-N 0.000 description 3
- 102000053602 DNA Human genes 0.000 description 3
- IAZDPXIOMUYVGZ-UHFFFAOYSA-N Dimethylsulphoxide Chemical compound CS(C)=O IAZDPXIOMUYVGZ-UHFFFAOYSA-N 0.000 description 3
- 241000196324 Embryophyta Species 0.000 description 3
- 101710091045 Envelope protein Proteins 0.000 description 3
- 244000068988 Glycine max Species 0.000 description 3
- 235000010469 Glycine max Nutrition 0.000 description 3
- 241000282412 Homo Species 0.000 description 3
- 206010020751 Hypersensitivity Diseases 0.000 description 3
- 101710203526 Integrase Proteins 0.000 description 3
- 102000000588 Interleukin-2 Human genes 0.000 description 3
- 108010002350 Interleukin-2 Proteins 0.000 description 3
- XUJNEKJLAYXESH-REOHCLBHSA-N L-Cysteine Chemical compound SC[C@H](N)C(O)=O XUJNEKJLAYXESH-REOHCLBHSA-N 0.000 description 3
- ZDXPYRJPNDTMRX-VKHMYHEASA-N L-glutamine Chemical compound OC(=O)[C@@H](N)CCC(N)=O ZDXPYRJPNDTMRX-VKHMYHEASA-N 0.000 description 3
- ROHFNLRQFUQHCH-YFKPBYRVSA-N L-leucine Chemical compound CC(C)C[C@H](N)C(O)=O ROHFNLRQFUQHCH-YFKPBYRVSA-N 0.000 description 3
- KZSNJWFQEVHDMF-BYPYZUCNSA-N L-valine Chemical compound CC(C)[C@H](N)C(O)=O KZSNJWFQEVHDMF-BYPYZUCNSA-N 0.000 description 3
- 235000019687 Lamb Nutrition 0.000 description 3
- ROHFNLRQFUQHCH-UHFFFAOYSA-N Leucine Natural products CC(C)CC(N)C(O)=O ROHFNLRQFUQHCH-UHFFFAOYSA-N 0.000 description 3
- 239000006142 Luria-Bertani Agar Substances 0.000 description 3
- OKKJLVBELUTLKV-UHFFFAOYSA-N Methanol Chemical compound OC OKKJLVBELUTLKV-UHFFFAOYSA-N 0.000 description 3
- 239000004677 Nylon Substances 0.000 description 3
- WXOMTJVVIMOXJL-BOBFKVMVSA-A O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)OS(=O)(=O)OC[C@H]1O[C@@H](O[C@]2(COS(=O)(=O)O[Al](O)O)O[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]2OS(=O)(=O)O[Al](O)O)[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]1OS(=O)(=O)O[Al](O)O Chemical compound O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)O.O[Al](O)OS(=O)(=O)OC[C@H]1O[C@@H](O[C@]2(COS(=O)(=O)O[Al](O)O)O[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]2OS(=O)(=O)O[Al](O)O)[C@H](OS(=O)(=O)O[Al](O)O)[C@@H](OS(=O)(=O)O[Al](O)O)[C@@H]1OS(=O)(=O)O[Al](O)O WXOMTJVVIMOXJL-BOBFKVMVSA-A 0.000 description 3
- 241000283973 Oryctolagus cuniculus Species 0.000 description 3
- 108091005804 Peptidases Proteins 0.000 description 3
- 102000035195 Peptidases Human genes 0.000 description 3
- 239000004365 Protease Substances 0.000 description 3
- 101710188315 Protein X Proteins 0.000 description 3
- 101000697856 Rattus norvegicus Bile acid-CoA:amino acid N-acyltransferase Proteins 0.000 description 3
- MTCFGRXMJLQNBG-UHFFFAOYSA-N Serine Natural products OCC(N)C(O)=O MTCFGRXMJLQNBG-UHFFFAOYSA-N 0.000 description 3
- DBMJMQXJHONAFJ-UHFFFAOYSA-M Sodium laurylsulphate Chemical compound [Na+].CCCCCCCCCCCCOS([O-])(=O)=O DBMJMQXJHONAFJ-UHFFFAOYSA-M 0.000 description 3
- 230000006052 T cell proliferation Effects 0.000 description 3
- 239000004473 Threonine Substances 0.000 description 3
- 229940122618 Trypsin inhibitor Drugs 0.000 description 3
- 101710162629 Trypsin inhibitor Proteins 0.000 description 3
- 239000000556 agonist Substances 0.000 description 3
- NWMHDZMRVUOQGL-CZEIJOLGSA-N almurtide Chemical compound OC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)CO[C@@H]([C@H](O)[C@H](O)CO)[C@@H](NC(C)=O)C=O NWMHDZMRVUOQGL-CZEIJOLGSA-N 0.000 description 3
- 230000003115 biocidal effect Effects 0.000 description 3
- 210000004369 blood Anatomy 0.000 description 3
- 239000008280 blood Substances 0.000 description 3
- 238000004113 cell culture Methods 0.000 description 3
- 239000006143 cell culture medium Substances 0.000 description 3
- 239000013599 cloning vector Substances 0.000 description 3
- 238000002742 combinatorial mutagenesis Methods 0.000 description 3
- 239000012141 concentrate Substances 0.000 description 3
- 238000007796 conventional method Methods 0.000 description 3
- 230000003247 decreasing effect Effects 0.000 description 3
- 201000010099 disease Diseases 0.000 description 3
- 239000000975 dye Substances 0.000 description 3
- 238000004520 electroporation Methods 0.000 description 3
- 238000003018 immunoassay Methods 0.000 description 3
- 230000001939 inductive effect Effects 0.000 description 3
- 230000002458 infectious effect Effects 0.000 description 3
- 230000010354 integration Effects 0.000 description 3
- 101150066555 lacZ gene Proteins 0.000 description 3
- 235000005772 leucine Nutrition 0.000 description 3
- 239000007788 liquid Substances 0.000 description 3
- MYWUZJCMWCOHBA-VIFPVBQESA-N methamphetamine Chemical compound CN[C@@H](C)CC1=CC=CC=C1 MYWUZJCMWCOHBA-VIFPVBQESA-N 0.000 description 3
- 238000010369 molecular cloning Methods 0.000 description 3
- 229920001778 nylon Polymers 0.000 description 3
- 238000010647 peptide synthesis reaction Methods 0.000 description 3
- 238000007747 plating Methods 0.000 description 3
- 239000013641 positive control Substances 0.000 description 3
- 230000001915 proofreading effect Effects 0.000 description 3
- 230000000069 prophylactic effect Effects 0.000 description 3
- 235000019419 proteases Nutrition 0.000 description 3
- 229940126409 proton pump inhibitor Drugs 0.000 description 3
- 235000021251 pulses Nutrition 0.000 description 3
- 238000002708 random mutagenesis Methods 0.000 description 3
- 230000003362 replicative effect Effects 0.000 description 3
- 150000003839 salts Chemical class 0.000 description 3
- 235000004400 serine Nutrition 0.000 description 3
- 235000019333 sodium laurylsulphate Nutrition 0.000 description 3
- 230000004936 stimulating effect Effects 0.000 description 3
- 239000000758 substrate Substances 0.000 description 3
- 235000008521 threonine Nutrition 0.000 description 3
- 230000001131 transforming effect Effects 0.000 description 3
- 239000003656 tris buffered saline Substances 0.000 description 3
- 239000002753 trypsin inhibitor Substances 0.000 description 3
- 238000003160 two-hybrid assay Methods 0.000 description 3
- 238000009281 ultraviolet germicidal irradiation Methods 0.000 description 3
- 239000004474 valine Substances 0.000 description 3
- 230000035899 viability Effects 0.000 description 3
- OWEGMIWEEQEYGQ-UHFFFAOYSA-N 100676-05-9 Natural products OC1C(O)C(O)C(CO)OC1OCC1C(O)C(O)C(O)C(OC2C(OC(O)C(O)C2O)CO)O1 OWEGMIWEEQEYGQ-UHFFFAOYSA-N 0.000 description 2
- LKDMKWNDBAVNQZ-WJNSRDFLSA-N 4-[[(2s)-1-[[(2s)-1-[(2s)-2-[[(2s)-1-(4-nitroanilino)-1-oxo-3-phenylpropan-2-yl]carbamoyl]pyrrolidin-1-yl]-1-oxopropan-2-yl]amino]-1-oxopropan-2-yl]amino]-4-oxobutanoic acid Chemical compound OC(=O)CCC(=O)N[C@@H](C)C(=O)N[C@@H](C)C(=O)N1CCC[C@H]1C(=O)N[C@H](C(=O)NC=1C=CC(=CC=1)[N+]([O-])=O)CC1=CC=CC=C1 LKDMKWNDBAVNQZ-WJNSRDFLSA-N 0.000 description 2
- KDCGOANMDULRCW-UHFFFAOYSA-N 7H-purine Chemical compound N1=CNC2=NC=NC2=C1 KDCGOANMDULRCW-UHFFFAOYSA-N 0.000 description 2
- 229930024421 Adenine Natural products 0.000 description 2
- GFFGJBXGBJISGV-UHFFFAOYSA-N Adenine Chemical compound NC1=NC=NC2=C1N=CN2 GFFGJBXGBJISGV-UHFFFAOYSA-N 0.000 description 2
- 229920000936 Agarose Polymers 0.000 description 2
- 102000002260 Alkaline Phosphatase Human genes 0.000 description 2
- 108020004774 Alkaline Phosphatase Proteins 0.000 description 2
- 108010039627 Aprotinin Proteins 0.000 description 2
- 101100234243 Aquifex aeolicus (strain VF5) kdtA gene Proteins 0.000 description 2
- 239000004475 Arginine Substances 0.000 description 2
- DCXYFEDJOCDNAF-UHFFFAOYSA-N Asparagine Natural products OC(=O)C(N)CC(N)=O DCXYFEDJOCDNAF-UHFFFAOYSA-N 0.000 description 2
- 238000011725 BALB/c mouse Methods 0.000 description 2
- 102100026189 Beta-galactosidase Human genes 0.000 description 2
- 102000014914 Carrier Proteins Human genes 0.000 description 2
- LZZYPRNAOMGNLH-UHFFFAOYSA-M Cetrimonium bromide Chemical compound [Br-].CCCCCCCCCCCCCCCC[N+](C)(C)C LZZYPRNAOMGNLH-UHFFFAOYSA-M 0.000 description 2
- AHLPHDHHMVZTML-SCSAIBSYSA-N D-Ornithine Chemical compound NCCC[C@@H](N)C(O)=O AHLPHDHHMVZTML-SCSAIBSYSA-N 0.000 description 2
- ONIBWKKTOPOVIA-SCSAIBSYSA-N D-Proline Chemical compound OC(=O)[C@H]1CCCN1 ONIBWKKTOPOVIA-SCSAIBSYSA-N 0.000 description 2
- MTCFGRXMJLQNBG-UWTATZPHSA-N D-Serine Chemical compound OC[C@@H](N)C(O)=O MTCFGRXMJLQNBG-UWTATZPHSA-N 0.000 description 2
- QNAYBMKLOCPYGJ-UWTATZPHSA-N D-alanine Chemical compound C[C@@H](N)C(O)=O QNAYBMKLOCPYGJ-UWTATZPHSA-N 0.000 description 2
- 150000008574 D-amino acids Chemical class 0.000 description 2
- ODKSFYDXXFIFQN-SCSAIBSYSA-N D-arginine Chemical compound OC(=O)[C@H](N)CCCNC(N)=N ODKSFYDXXFIFQN-SCSAIBSYSA-N 0.000 description 2
- HNDVDQJCIGZPNO-RXMQYKEDSA-N D-histidine Chemical compound OC(=O)[C@H](N)CC1=CN=CN1 HNDVDQJCIGZPNO-RXMQYKEDSA-N 0.000 description 2
- KDXKERNSBIXSRK-RXMQYKEDSA-N D-lysine Chemical compound NCCCC[C@@H](N)C(O)=O KDXKERNSBIXSRK-RXMQYKEDSA-N 0.000 description 2
- COLNVLDHVKWLRT-MRVPVSSYSA-N D-phenylalanine Chemical compound OC(=O)[C@H](N)CC1=CC=CC=C1 COLNVLDHVKWLRT-MRVPVSSYSA-N 0.000 description 2
- 230000004543 DNA replication Effects 0.000 description 2
- 102000004163 DNA-directed RNA polymerases Human genes 0.000 description 2
- 108010042407 Endonucleases Proteins 0.000 description 2
- 102000004533 Endonucleases Human genes 0.000 description 2
- 101100322888 Escherichia coli (strain K12) metL gene Proteins 0.000 description 2
- 241001198387 Escherichia coli BL21(DE3) Species 0.000 description 2
- 241001646716 Escherichia coli K-12 Species 0.000 description 2
- 108700039887 Essential Genes Proteins 0.000 description 2
- 241000206602 Eukaryota Species 0.000 description 2
- WSFSSNUMVMOOMR-UHFFFAOYSA-N Formaldehyde Chemical compound O=C WSFSSNUMVMOOMR-UHFFFAOYSA-N 0.000 description 2
- 101001066288 Gallus gallus GATA-binding factor 3 Proteins 0.000 description 2
- 108700028146 Genetic Enhancer Elements Proteins 0.000 description 2
- 108010070675 Glutathione transferase Proteins 0.000 description 2
- 239000004471 Glycine Substances 0.000 description 2
- 102100029100 Hematopoietic prostaglandin D synthase Human genes 0.000 description 2
- WTDRDQBEARUVNC-LURJTMIESA-N L-DOPA Chemical compound OC(=O)[C@@H](N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-LURJTMIESA-N 0.000 description 2
- WTDRDQBEARUVNC-UHFFFAOYSA-N L-Dopa Natural products OC(=O)C(N)CC1=CC=C(O)C(O)=C1 WTDRDQBEARUVNC-UHFFFAOYSA-N 0.000 description 2
- ODKSFYDXXFIFQN-BYPYZUCNSA-N L-arginine Chemical compound OC(=O)[C@@H](N)CCCN=C(N)N ODKSFYDXXFIFQN-BYPYZUCNSA-N 0.000 description 2
- AGPKZVBTJJNPAG-WHFBIAKZSA-N L-isoleucine Chemical compound CC[C@H](C)[C@H](N)C(O)=O AGPKZVBTJJNPAG-WHFBIAKZSA-N 0.000 description 2
- FFEARJCKVFRZRR-BYPYZUCNSA-N L-methionine Chemical compound CSCC[C@H](N)C(O)=O FFEARJCKVFRZRR-BYPYZUCNSA-N 0.000 description 2
- QEFRNWWLZKMPFJ-YGVKFDHGSA-N L-methionine S-oxide Chemical compound CS(=O)CC[C@H](N)C(O)=O QEFRNWWLZKMPFJ-YGVKFDHGSA-N 0.000 description 2
- 239000006137 Luria-Bertani broth Substances 0.000 description 2
- 239000004472 Lysine Substances 0.000 description 2
- TWRXJAOTZQYOKJ-UHFFFAOYSA-L Magnesium chloride Chemical compound [Mg+2].[Cl-].[Cl-] TWRXJAOTZQYOKJ-UHFFFAOYSA-L 0.000 description 2
- GUBGYTABKSRVRQ-PICCSMPSSA-N Maltose Natural products O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@@H](CO)OC(O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-PICCSMPSSA-N 0.000 description 2
- 241000202946 Mycoplasma pulmonis Species 0.000 description 2
- 108700015872 N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine Proteins 0.000 description 2
- MQUQNUAYKLCRME-INIZCTEOSA-N N-tosyl-L-phenylalanyl chloromethyl ketone Chemical compound C1=CC(C)=CC=C1S(=O)(=O)N[C@H](C(=O)CCl)CC1=CC=CC=C1 MQUQNUAYKLCRME-INIZCTEOSA-N 0.000 description 2
- 239000000020 Nitrocellulose Substances 0.000 description 2
- 102000010562 Peptide Elongation Factor G Human genes 0.000 description 2
- 108010077742 Peptide Elongation Factor G Proteins 0.000 description 2
- ISWSIDIOOBJBQZ-UHFFFAOYSA-N Phenol Chemical compound OC1=CC=CC=C1 ISWSIDIOOBJBQZ-UHFFFAOYSA-N 0.000 description 2
- 101800001440 Rimorphin Proteins 0.000 description 2
- 108020004682 Single-Stranded DNA Proteins 0.000 description 2
- UIIMBOGNXHQVGW-UHFFFAOYSA-M Sodium bicarbonate Chemical compound [Na+].OC([O-])=O UIIMBOGNXHQVGW-UHFFFAOYSA-M 0.000 description 2
- 101100443856 Streptococcus pyogenes serotype M18 (strain MGAS8232) polC gene Proteins 0.000 description 2
- 230000024932 T cell mediated immunity Effects 0.000 description 2
- 108091008874 T cell receptors Proteins 0.000 description 2
- 102000016266 T-Cell Antigen Receptors Human genes 0.000 description 2
- 239000006180 TBST buffer Substances 0.000 description 2
- 102000018679 Tacrolimus Binding Proteins Human genes 0.000 description 2
- 108010027179 Tacrolimus Binding Proteins Proteins 0.000 description 2
- 108010006785 Taq Polymerase Proteins 0.000 description 2
- IQFYYKKMVGJFEH-XLPZGREQSA-N Thymidine Chemical compound O=C1NC(=O)C(C)=CN1[C@@H]1O[C@H](CO)[C@@H](O)C1 IQFYYKKMVGJFEH-XLPZGREQSA-N 0.000 description 2
- 239000013504 Triton X-100 Substances 0.000 description 2
- 229920004890 Triton X-100 Polymers 0.000 description 2
- FJWGYAHXMCUOOM-QHOUIDNNSA-N [(2s,3r,4s,5r,6r)-2-[(2r,3r,4s,5r,6s)-4,5-dinitrooxy-2-(nitrooxymethyl)-6-[(2r,3r,4s,5r,6s)-4,5,6-trinitrooxy-2-(nitrooxymethyl)oxan-3-yl]oxyoxan-3-yl]oxy-3,5-dinitrooxy-6-(nitrooxymethyl)oxan-4-yl] nitrate Chemical compound O([C@@H]1O[C@@H]([C@H]([C@H](O[N+]([O-])=O)[C@H]1O[N+]([O-])=O)O[C@H]1[C@@H]([C@@H](O[N+]([O-])=O)[C@H](O[N+]([O-])=O)[C@@H](CO[N+]([O-])=O)O1)O[N+]([O-])=O)CO[N+](=O)[O-])[C@@H]1[C@@H](CO[N+]([O-])=O)O[C@@H](O[N+]([O-])=O)[C@H](O[N+]([O-])=O)[C@H]1O[N+]([O-])=O FJWGYAHXMCUOOM-QHOUIDNNSA-N 0.000 description 2
- 238000002835 absorbance Methods 0.000 description 2
- 230000009471 action Effects 0.000 description 2
- 239000012190 activator Substances 0.000 description 2
- 230000010933 acylation Effects 0.000 description 2
- 238000005917 acylation reaction Methods 0.000 description 2
- 229960000643 adenine Drugs 0.000 description 2
- 208000026935 allergic disease Diseases 0.000 description 2
- 230000007815 allergy Effects 0.000 description 2
- 230000030741 antigen processing and presentation Effects 0.000 description 2
- 210000000612 antigen-presenting cell Anatomy 0.000 description 2
- 229960004405 aprotinin Drugs 0.000 description 2
- ODKSFYDXXFIFQN-UHFFFAOYSA-N arginine Natural products OC(=O)C(N)CCCNC(N)=N ODKSFYDXXFIFQN-UHFFFAOYSA-N 0.000 description 2
- 235000009582 asparagine Nutrition 0.000 description 2
- 229960001230 asparagine Drugs 0.000 description 2
- 235000003704 aspartic acid Nutrition 0.000 description 2
- 239000012298 atmosphere Substances 0.000 description 2
- 230000037429 base substitution Effects 0.000 description 2
- 230000008901 benefit Effects 0.000 description 2
- 108010005774 beta-Galactosidase Proteins 0.000 description 2
- OQFSQFPPLPISGP-UHFFFAOYSA-N beta-carboxyaspartic acid Natural products OC(=O)C(N)C(C(O)=O)C(O)=O OQFSQFPPLPISGP-UHFFFAOYSA-N 0.000 description 2
- 230000033228 biological regulation Effects 0.000 description 2
- 238000001574 biopsy Methods 0.000 description 2
- 210000001124 body fluid Anatomy 0.000 description 2
- 239000002775 capsule Substances 0.000 description 2
- 125000003178 carboxy group Chemical group [H]OC(*)=O 0.000 description 2
- 239000000969 carrier Substances 0.000 description 2
- 238000012219 cassette mutagenesis Methods 0.000 description 2
- 230000003197 catalytic effect Effects 0.000 description 2
- 230000032823 cell division Effects 0.000 description 2
- 230000003915 cell function Effects 0.000 description 2
- 239000006285 cell suspension Substances 0.000 description 2
- 238000012512 characterization method Methods 0.000 description 2
- 239000013522 chelant Substances 0.000 description 2
- 150000005829 chemical entities Chemical class 0.000 description 2
- YTRQFSDWAXHJCC-UHFFFAOYSA-N chloroform;phenol Chemical compound ClC(Cl)Cl.OC1=CC=CC=C1 YTRQFSDWAXHJCC-UHFFFAOYSA-N 0.000 description 2
- 239000013611 chromosomal DNA Substances 0.000 description 2
- 210000000349 chromosome Anatomy 0.000 description 2
- 230000005757 colony formation Effects 0.000 description 2
- 239000000287 crude extract Substances 0.000 description 2
- 238000005520 cutting process Methods 0.000 description 2
- 210000000805 cytoplasm Anatomy 0.000 description 2
- OPTASPLRGRRNAP-UHFFFAOYSA-N cytosine Chemical compound NC=1C=CNC(=O)N=1 OPTASPLRGRRNAP-UHFFFAOYSA-N 0.000 description 2
- 230000001419 dependent effect Effects 0.000 description 2
- 101150076598 dnaB gene Proteins 0.000 description 2
- 101150008507 dnaE gene Proteins 0.000 description 2
- 101150035285 dnaE1 gene Proteins 0.000 description 2
- 101150003155 dnaG gene Proteins 0.000 description 2
- 238000007878 drug screening assay Methods 0.000 description 2
- 208000000718 duodenal ulcer Diseases 0.000 description 2
- 210000003495 flagella Anatomy 0.000 description 2
- 239000007850 fluorescent dye Substances 0.000 description 2
- 235000013922 glutamic acid Nutrition 0.000 description 2
- 229960002989 glutamic acid Drugs 0.000 description 2
- 239000004220 glutamic acid Substances 0.000 description 2
- ZDXPYRJPNDTMRX-UHFFFAOYSA-N glutamine Natural products OC(=O)C(N)CCC(N)=O ZDXPYRJPNDTMRX-UHFFFAOYSA-N 0.000 description 2
- 230000013595 glycosylation Effects 0.000 description 2
- 238000006206 glycosylation reaction Methods 0.000 description 2
- UYTPUPDQBNUYGX-UHFFFAOYSA-N guanine Chemical compound O=C1NC(N)=NC2=C1N=CN2 UYTPUPDQBNUYGX-UHFFFAOYSA-N 0.000 description 2
- 238000012203 high throughput assay Methods 0.000 description 2
- 230000008348 humoral response Effects 0.000 description 2
- 125000001165 hydrophobic group Chemical group 0.000 description 2
- 238000003119 immunoblot Methods 0.000 description 2
- 210000003000 inclusion body Anatomy 0.000 description 2
- 230000006698 induction Effects 0.000 description 2
- ZPNFWUPYTFPOJU-LPYSRVMUSA-N iniprol Chemical compound C([C@H]1C(=O)NCC(=O)NCC(=O)N[C@H]2CSSC[C@H]3C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@H](C(N[C@H](C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC=4C=CC=CC=4)C(=O)N[C@@H](CC=4C=CC(O)=CC=4)C(=O)N[C@@H](CC(N)=O)C(=O)N[C@@H](C)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C)C(=O)NCC(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC(O)=O)NC(=O)[C@H](CCC(O)=O)NC(=O)[C@H](C)NC(=O)[C@H](CO)NC(=O)[C@H](CCCCN)NC(=O)[C@H](CC=4C=CC=CC=4)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CC(N)=O)NC(=O)[C@H](CCCNC(N)=N)NC(=O)[C@H](CCCCN)NC(=O)[C@H](C)NC(=O)[C@H](CCCNC(N)=N)NC2=O)C(=O)N[C@@H](CCSC)C(=O)N[C@@H](CCCNC(N)=N)C(=O)N[C@@H]([C@@H](C)O)C(=O)N[C@@H](CSSC[C@H](NC(=O)[C@H](CC=2C=CC=CC=2)NC(=O)[C@H](CC(O)=O)NC(=O)[C@H]2N(CCC2)C(=O)[C@@H](N)CCCNC(N)=N)C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCC(O)=O)C(=O)N2[C@@H](CCC2)C(=O)N2[C@@H](CCC2)C(=O)N[C@@H](CC=2C=CC(O)=CC=2)C(=O)N[C@@H]([C@@H](C)O)C(=O)NCC(=O)N2[C@@H](CCC2)C(=O)N3)C(=O)NCC(=O)NCC(=O)N[C@@H](C)C(O)=O)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@H](C(=O)N[C@@H](CC=2C=CC=CC=2)C(=O)N[C@H](C(=O)N1)C(C)C)[C@@H](C)O)[C@@H](C)CC)=O)[C@@H](C)CC)C1=CC=C(O)C=C1 ZPNFWUPYTFPOJU-LPYSRVMUSA-N 0.000 description 2
- 238000011081 inoculation Methods 0.000 description 2
- 230000004068 intracellular signaling Effects 0.000 description 2
- 229960000310 isoleucine Drugs 0.000 description 2
- AGPKZVBTJJNPAG-UHFFFAOYSA-N isoleucine Natural products CCC(C)C(N)C(O)=O AGPKZVBTJJNPAG-UHFFFAOYSA-N 0.000 description 2
- 238000006317 isomerization reaction Methods 0.000 description 2
- AWJUIBRHMBBTKR-UHFFFAOYSA-N isoquinoline Chemical compound C1=NC=CC2=CC=CC=C21 AWJUIBRHMBBTKR-UHFFFAOYSA-N 0.000 description 2
- 101150109249 lacI gene Proteins 0.000 description 2
- 231100000518 lethal Toxicity 0.000 description 2
- 230000001665 lethal effect Effects 0.000 description 2
- HQKMJHAJHXVSDF-UHFFFAOYSA-L magnesium stearate Chemical compound [Mg+2].CCCCCCCCCCCCCCCCCC([O-])=O.CCCCCCCCCCCCCCCCCC([O-])=O HQKMJHAJHXVSDF-UHFFFAOYSA-L 0.000 description 2
- 229910052943 magnesium sulfate Inorganic materials 0.000 description 2
- 235000019341 magnesium sulphate Nutrition 0.000 description 2
- 239000003550 marker Substances 0.000 description 2
- 229910021645 metal ion Inorganic materials 0.000 description 2
- 229930182817 methionine Natural products 0.000 description 2
- 125000001360 methionine group Chemical group N[C@@H](CCSC)C(=O)* 0.000 description 2
- 238000000520 microinjection Methods 0.000 description 2
- 230000003278 mimic effect Effects 0.000 description 2
- 210000004877 mucosa Anatomy 0.000 description 2
- 210000003097 mucus Anatomy 0.000 description 2
- YFCUZWYIPBUQBD-ZOWNYOTGSA-N n-[(3s)-7-amino-1-chloro-2-oxoheptan-3-yl]-4-methylbenzenesulfonamide;hydron;chloride Chemical compound Cl.CC1=CC=C(S(=O)(=O)N[C@@H](CCCCN)C(=O)CCl)C=C1 YFCUZWYIPBUQBD-ZOWNYOTGSA-N 0.000 description 2
- 238000002663 nebulization Methods 0.000 description 2
- 230000007935 neutral effect Effects 0.000 description 2
- 229920001220 nitrocellulos Polymers 0.000 description 2
- 229940079938 nitrocellulose Drugs 0.000 description 2
- VIKNJXKGJWUCNN-XGXHKTLJSA-N norethisterone Chemical compound O=C1CC[C@@H]2[C@H]3CC[C@](C)([C@](CC4)(O)C#C)[C@@H]4[C@@H]3CCC2=C1 VIKNJXKGJWUCNN-XGXHKTLJSA-N 0.000 description 2
- SBQLYHNEIUGQKH-UHFFFAOYSA-N omeprazole Chemical compound N1=C2[CH]C(OC)=CC=C2N=C1S(=O)CC1=NC=C(C)C(OC)=C1C SBQLYHNEIUGQKH-UHFFFAOYSA-N 0.000 description 2
- 229960000381 omeprazole Drugs 0.000 description 2
- 230000003287 optical effect Effects 0.000 description 2
- 238000004091 panning Methods 0.000 description 2
- 230000036961 partial effect Effects 0.000 description 2
- 230000037361 pathway Effects 0.000 description 2
- FAXGPCHRFPCXOO-LXTPJMTPSA-N pepstatin A Chemical compound OC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C)NC(=O)C[C@H](O)[C@H](CC(C)C)NC(=O)[C@H](C(C)C)NC(=O)[C@H](C(C)C)NC(=O)CC(C)C FAXGPCHRFPCXOO-LXTPJMTPSA-N 0.000 description 2
- 239000000816 peptidomimetic Substances 0.000 description 2
- 239000012071 phase Substances 0.000 description 2
- COLNVLDHVKWLRT-UHFFFAOYSA-N phenylalanine Natural products OC(=O)C(N)CC1=CC=CC=C1 COLNVLDHVKWLRT-UHFFFAOYSA-N 0.000 description 2
- 239000002953 phosphate buffered saline Substances 0.000 description 2
- 230000026731 phosphorylation Effects 0.000 description 2
- 238000006366 phosphorylation reaction Methods 0.000 description 2
- 210000002381 plasma Anatomy 0.000 description 2
- 238000013492 plasmid preparation Methods 0.000 description 2
- 229920000136 polysorbate Polymers 0.000 description 2
- 230000003389 potentiating effect Effects 0.000 description 2
- 230000000644 propagated effect Effects 0.000 description 2
- 238000001742 protein purification Methods 0.000 description 2
- 238000001243 protein synthesis Methods 0.000 description 2
- 230000002797 proteolythic effect Effects 0.000 description 2
- 230000009711 regulatory function Effects 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 230000000717 retained effect Effects 0.000 description 2
- 210000003705 ribosome Anatomy 0.000 description 2
- 238000002864 sequence alignment Methods 0.000 description 2
- 229940083575 sodium dodecyl sulfate Drugs 0.000 description 2
- 238000002415 sodium dodecyl sulfate polyacrylamide gel electrophoresis Methods 0.000 description 2
- 239000007790 solid phase Substances 0.000 description 2
- 238000010532 solid phase synthesis reaction Methods 0.000 description 2
- 230000003381 solubilizing effect Effects 0.000 description 2
- 238000000527 sonication Methods 0.000 description 2
- 208000024891 symptom Diseases 0.000 description 2
- 230000002194 synthesizing effect Effects 0.000 description 2
- 230000009897 systematic effect Effects 0.000 description 2
- 125000005931 tert-butyloxycarbonyl group Chemical group [H]C([H])([H])C(OC(*)=O)(C([H])([H])[H])C([H])([H])[H] 0.000 description 2
- RWQNBRDOKXIBIV-UHFFFAOYSA-N thymine Chemical compound CC1=CNC(=O)NC1=O RWQNBRDOKXIBIV-UHFFFAOYSA-N 0.000 description 2
- 238000000844 transformation Methods 0.000 description 2
- 108091005703 transmembrane proteins Proteins 0.000 description 2
- 102000035160 transmembrane proteins Human genes 0.000 description 2
- 241000701447 unidentified baculovirus Species 0.000 description 2
- 238000011144 upstream manufacturing Methods 0.000 description 2
- 229940125575 vaccine candidate Drugs 0.000 description 2
- 101150040194 waaA gene Proteins 0.000 description 2
- 238000005406 washing Methods 0.000 description 2
- HDTRYLNUVZCQOY-UHFFFAOYSA-N α-D-glucopyranosyl-α-D-glucopyranoside Natural products OC1C(O)C(O)C(CO)OC1OC1C(O)C(O)C(O)C(CO)O1 HDTRYLNUVZCQOY-UHFFFAOYSA-N 0.000 description 1
- WDQLRUYAYXDIFW-RWKIJVEZSA-N (2r,3r,4s,5r,6r)-4-[(2s,3r,4s,5r,6r)-3,5-dihydroxy-4-[(2r,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxy-6-[[(2r,3r,4s,5s,6r)-3,4,5-trihydroxy-6-(hydroxymethyl)oxan-2-yl]oxymethyl]oxan-2-yl]oxy-6-(hydroxymethyl)oxane-2,3,5-triol Chemical compound O[C@@H]1[C@@H](CO)O[C@@H](O)[C@H](O)[C@H]1O[C@H]1[C@H](O)[C@@H](O[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)[C@H](O)[C@@H](CO[C@H]2[C@@H]([C@@H](O)[C@H](O)[C@@H](CO)O2)O)O1 WDQLRUYAYXDIFW-RWKIJVEZSA-N 0.000 description 1
- XSYUPRQVAHJETO-WPMUBMLPSA-N (2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-amino-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidazol-5-yl)propanoyl]amino]-3-(1h-imidaz Chemical compound C([C@H](N)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(=O)N[C@@H](CC=1NC=NC=1)C(O)=O)C1=CN=CN1 XSYUPRQVAHJETO-WPMUBMLPSA-N 0.000 description 1
- AGTSSZRZBSNTGQ-ITZCFHCWSA-N (2s,3r)-2-[[(2s)-2-[[(2s)-2-[[(2s)-6-amino-2-[[(2s)-2-[[(2s)-5-amino-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[(2s)-2-[[2-[[2-[[(2s)-2-amino-3-(4-hydroxyphenyl)propanoyl]amino]acetyl]amino]acetyl]amino]-3-phenylpropanoyl]amino]-4-methylpentanoyl]amino]-5-(diaminomet Chemical compound C([C@@H](C(=O)N[C@@H](CC(C)C)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCCN=C(N)N)C(=O)N[C@@H](CCC(N)=O)C(=O)N[C@@H](CC=1C=CC=CC=1)C(=O)N[C@@H](CCCCN)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H](C(C)C)C(=O)N[C@@H]([C@@H](C)O)C(O)=O)NC(=O)CNC(=O)CNC(=O)[C@@H](N)CC=1C=CC(O)=CC=1)C1=CC=CC=C1 AGTSSZRZBSNTGQ-ITZCFHCWSA-N 0.000 description 1
- YHQZWWDVLJPRIF-JLHRHDQISA-N (4R)-4-[[(2S,3R)-2-[acetyl-[(3R,4R,5S,6R)-3-amino-4-[(1R)-1-carboxyethoxy]-5-hydroxy-6-(hydroxymethyl)oxan-2-yl]amino]-3-hydroxybutanoyl]amino]-5-amino-5-oxopentanoic acid Chemical compound C(C)(=O)N([C@@H]([C@H](O)C)C(=O)N[C@H](CCC(=O)O)C(N)=O)C1[C@H](N)[C@@H](O[C@@H](C(=O)O)C)[C@H](O)[C@H](O1)CO YHQZWWDVLJPRIF-JLHRHDQISA-N 0.000 description 1
- QRXMUCSWCMTJGU-UHFFFAOYSA-L (5-bromo-4-chloro-1h-indol-3-yl) phosphate Chemical compound C1=C(Br)C(Cl)=C2C(OP([O-])(=O)[O-])=CNC2=C1 QRXMUCSWCMTJGU-UHFFFAOYSA-L 0.000 description 1
- YYGNTYWPHWGJRM-UHFFFAOYSA-N (6E,10E,14E,18E)-2,6,10,15,19,23-hexamethyltetracosa-2,6,10,14,18,22-hexaene Chemical compound CC(C)=CCCC(C)=CCCC(C)=CCCC=C(C)CCC=C(C)CCC=C(C)C YYGNTYWPHWGJRM-UHFFFAOYSA-N 0.000 description 1
- DYLIWHYUXAJDOJ-OWOJBTEDSA-N (e)-4-(6-aminopurin-9-yl)but-2-en-1-ol Chemical compound NC1=NC=NC2=C1N=CN2C\C=C\CO DYLIWHYUXAJDOJ-OWOJBTEDSA-N 0.000 description 1
- LRANPJDWHYRCER-UHFFFAOYSA-N 1,2-diazepine Chemical compound N1C=CC=CC=N1 LRANPJDWHYRCER-UHFFFAOYSA-N 0.000 description 1
- CJAOGUFAAWZWNI-UHFFFAOYSA-N 1-n,1-n,4-n,4-n-tetramethylbenzene-1,4-diamine Chemical compound CN(C)C1=CC=C(N(C)C)C=C1 CJAOGUFAAWZWNI-UHFFFAOYSA-N 0.000 description 1
- SVUOLADPCWQTTE-UHFFFAOYSA-N 1h-1,2-benzodiazepine Chemical compound N1N=CC=CC2=CC=CC=C12 SVUOLADPCWQTTE-UHFFFAOYSA-N 0.000 description 1
- 238000010600 3H thymidine incorporation assay Methods 0.000 description 1
- QFVHZQCOUORWEI-UHFFFAOYSA-N 4-[(4-anilino-5-sulfonaphthalen-1-yl)diazenyl]-5-hydroxynaphthalene-2,7-disulfonic acid Chemical compound C=12C(O)=CC(S(O)(=O)=O)=CC2=CC(S(O)(=O)=O)=CC=1N=NC(C1=CC=CC(=C11)S(O)(=O)=O)=CC=C1NC1=CC=CC=C1 QFVHZQCOUORWEI-UHFFFAOYSA-N 0.000 description 1
- CFYIUBWVKZQDOG-UHFFFAOYSA-N 4-[[2-[[2-[[1-(4-nitroanilino)-1-oxo-3-phenylpropan-2-yl]amino]-2-oxoethyl]amino]-2-oxoethyl]amino]-4-oxobutanoic acid Chemical compound C=1C=C([N+]([O-])=O)C=CC=1NC(=O)C(NC(=O)CNC(=O)CNC(=O)CCC(=O)O)CC1=CC=CC=C1 CFYIUBWVKZQDOG-UHFFFAOYSA-N 0.000 description 1
- QCVGEOXPDFCNHA-UHFFFAOYSA-N 5,5-dimethyl-2,4-dioxo-1,3-oxazolidine-3-carboxamide Chemical compound CC1(C)OC(=O)N(C(N)=O)C1=O QCVGEOXPDFCNHA-UHFFFAOYSA-N 0.000 description 1
- OPIFSICVWOWJMJ-AEOCFKNESA-N 5-bromo-4-chloro-3-indolyl beta-D-galactoside Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1OC1=CNC2=CC=C(Br)C(Cl)=C12 OPIFSICVWOWJMJ-AEOCFKNESA-N 0.000 description 1
- 108010006533 ATP-Binding Cassette Transporters Proteins 0.000 description 1
- 102000005416 ATP-Binding Cassette Transporters Human genes 0.000 description 1
- HRPVXLWXLXDGHG-UHFFFAOYSA-N Acrylamide Chemical compound NC(=O)C=C HRPVXLWXLXDGHG-UHFFFAOYSA-N 0.000 description 1
- 241000251468 Actinopterygii Species 0.000 description 1
- 229920001817 Agar Polymers 0.000 description 1
- GUBGYTABKSRVRQ-XLOQQCSPSA-N Alpha-Lactose Chemical compound O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@H]1O[C@@H]1[C@@H](CO)O[C@H](O)[C@H](O)[C@H]1O GUBGYTABKSRVRQ-XLOQQCSPSA-N 0.000 description 1
- 101100165658 Alternaria brassicicola bsc5 gene Proteins 0.000 description 1
- 244000153158 Ammi visnaga Species 0.000 description 1
- 235000010585 Ammi visnaga Nutrition 0.000 description 1
- USFZMSVCRYTOJT-UHFFFAOYSA-N Ammonium acetate Chemical compound N.CC(O)=O USFZMSVCRYTOJT-UHFFFAOYSA-N 0.000 description 1
- 239000005695 Ammonium acetate Substances 0.000 description 1
- 244000105975 Antidesma platyphyllum Species 0.000 description 1
- 101100032924 Bacillus subtilis (strain 168) radA gene Proteins 0.000 description 1
- 231100000699 Bacterial toxin Toxicity 0.000 description 1
- 101000609456 Beet necrotic yellow vein virus (isolate Japan/S) Protein P26 Proteins 0.000 description 1
- DWRXFEITVBNRMK-UHFFFAOYSA-N Beta-D-1-Arabinofuranosylthymine Natural products O=C1NC(=O)C(C)=CN1C1C(O)C(O)C(CO)O1 DWRXFEITVBNRMK-UHFFFAOYSA-N 0.000 description 1
- 238000009010 Bradford assay Methods 0.000 description 1
- 101100227322 Caenorhabditis elegans fli-1 gene Proteins 0.000 description 1
- UXVMQQNJUSDDNG-UHFFFAOYSA-L Calcium chloride Chemical compound [Cl-].[Cl-].[Ca+2] UXVMQQNJUSDDNG-UHFFFAOYSA-L 0.000 description 1
- 241000589876 Campylobacter Species 0.000 description 1
- 241000283707 Capra Species 0.000 description 1
- 102000003846 Carbonic anhydrases Human genes 0.000 description 1
- 108090000209 Carbonic anhydrases Proteins 0.000 description 1
- 108010078791 Carrier Proteins Proteins 0.000 description 1
- 102000005600 Cathepsins Human genes 0.000 description 1
- 108010084457 Cathepsins Proteins 0.000 description 1
- 108010039939 Cell Wall Skeleton Proteins 0.000 description 1
- 102000003813 Cis-trans-isomerases Human genes 0.000 description 1
- 108090000175 Cis-trans-isomerases Proteins 0.000 description 1
- KRKNYBCHXYNGOX-UHFFFAOYSA-K Citrate Chemical compound [O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O KRKNYBCHXYNGOX-UHFFFAOYSA-K 0.000 description 1
- 101100423897 Clostridium perfringens (strain 13 / Type A) glnS gene Proteins 0.000 description 1
- 208000035473 Communicable disease Diseases 0.000 description 1
- 241000186216 Corynebacterium Species 0.000 description 1
- 241000699802 Cricetulus griseus Species 0.000 description 1
- 102100033270 Cyclin-dependent kinase inhibitor 1 Human genes 0.000 description 1
- PJWWRFATQTVXHA-UHFFFAOYSA-N Cyclohexylaminopropanesulfonic acid Chemical compound OS(=O)(=O)CCCNC1CCCCC1 PJWWRFATQTVXHA-UHFFFAOYSA-N 0.000 description 1
- 108010068682 Cyclophilins Proteins 0.000 description 1
- 102000001493 Cyclophilins Human genes 0.000 description 1
- FBPFZTCFMRRESA-KVTDHHQDSA-N D-Mannitol Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)[C@H](O)CO FBPFZTCFMRRESA-KVTDHHQDSA-N 0.000 description 1
- QNAYBMKLOCPYGJ-UHFFFAOYSA-N D-alpha-Ala Natural products CC([NH3+])C([O-])=O QNAYBMKLOCPYGJ-UHFFFAOYSA-N 0.000 description 1
- 229930182847 D-glutamic acid Natural products 0.000 description 1
- QIVBCDIJIAJPQS-SECBINFHSA-N D-tryptophane Chemical compound C1=CC=C2C(C[C@@H](N)C(O)=O)=CNC2=C1 QIVBCDIJIAJPQS-SECBINFHSA-N 0.000 description 1
- OUYCCCASQSFEME-MRVPVSSYSA-N D-tyrosine Chemical compound OC(=O)[C@H](N)CC1=CC=C(O)C=C1 OUYCCCASQSFEME-MRVPVSSYSA-N 0.000 description 1
- 108010017826 DNA Polymerase I Proteins 0.000 description 1
- 102000004594 DNA Polymerase I Human genes 0.000 description 1
- 102100024607 DNA topoisomerase 1 Human genes 0.000 description 1
- 102000052510 DNA-Binding Proteins Human genes 0.000 description 1
- 230000004568 DNA-binding Effects 0.000 description 1
- 101710096438 DNA-binding protein Proteins 0.000 description 1
- 108090000626 DNA-directed RNA polymerases Proteins 0.000 description 1
- 102000016911 Deoxyribonucleases Human genes 0.000 description 1
- 108010053770 Deoxyribonucleases Proteins 0.000 description 1
- 229920002307 Dextran Polymers 0.000 description 1
- 101100492392 Didymella fabae pksAC gene Proteins 0.000 description 1
- BWGNESOTFCXPMA-UHFFFAOYSA-N Dihydrogen disulfide Chemical compound SS BWGNESOTFCXPMA-UHFFFAOYSA-N 0.000 description 1
- 108010016626 Dipeptides Proteins 0.000 description 1
- 101100108073 Drosophila melanogaster Actn gene Proteins 0.000 description 1
- 108010000912 Egg Proteins Proteins 0.000 description 1
- 102000002322 Egg Proteins Human genes 0.000 description 1
- 108010067770 Endopeptidase K Proteins 0.000 description 1
- 108010013369 Enteropeptidase Proteins 0.000 description 1
- 102100029727 Enteropeptidase Human genes 0.000 description 1
- 101100012780 Escherichia coli (strain K12) fecA gene Proteins 0.000 description 1
- 241000701959 Escherichia virus Lambda Species 0.000 description 1
- 108060002716 Exonuclease Proteins 0.000 description 1
- 108010074860 Factor Xa Proteins 0.000 description 1
- 241000282326 Felis catus Species 0.000 description 1
- 108010000916 Fimbriae Proteins Proteins 0.000 description 1
- 108010040721 Flagellin Proteins 0.000 description 1
- 241000589599 Francisella tularensis subsp. novicida Species 0.000 description 1
- 102000002464 Galactosidases Human genes 0.000 description 1
- 108010093031 Galactosidases Proteins 0.000 description 1
- 208000007882 Gastritis Diseases 0.000 description 1
- 238000003794 Gram staining Methods 0.000 description 1
- 241000606768 Haemophilus influenzae Species 0.000 description 1
- 241001674326 Helicobacter pylori J99 Species 0.000 description 1
- 108010093488 His-His-His-His-His-His Proteins 0.000 description 1
- 101000830681 Homo sapiens DNA topoisomerase 1 Proteins 0.000 description 1
- 108010021625 Immunoglobulin Fragments Proteins 0.000 description 1
- 102000008394 Immunoglobulin Fragments Human genes 0.000 description 1
- 206010061218 Inflammation Diseases 0.000 description 1
- 102000014150 Interferons Human genes 0.000 description 1
- 108010050904 Interferons Proteins 0.000 description 1
- 102000004388 Interleukin-4 Human genes 0.000 description 1
- 108090000978 Interleukin-4 Proteins 0.000 description 1
- 108091092195 Intron Proteins 0.000 description 1
- ONIBWKKTOPOVIA-BYPYZUCNSA-N L-Proline Chemical compound OC(=O)[C@@H]1CCCN1 ONIBWKKTOPOVIA-BYPYZUCNSA-N 0.000 description 1
- 150000008575 L-amino acids Chemical class 0.000 description 1
- 125000000174 L-prolyl group Chemical group [H]N1C([H])([H])C([H])([H])C([H])([H])[C@@]1([H])C(*)=O 0.000 description 1
- LFZGUGJDVUUGLK-REOHCLBHSA-N L-serine O-sulfate Chemical compound OC(=O)[C@@H](N)COS(O)(=O)=O LFZGUGJDVUUGLK-REOHCLBHSA-N 0.000 description 1
- 125000000510 L-tryptophano group Chemical group [H]C1=C([H])C([H])=C2N([H])C([H])=C(C([H])([H])[C@@]([H])(C(O[H])=O)N([H])[*])C2=C1[H] 0.000 description 1
- GUBGYTABKSRVRQ-QKKXKWKRSA-N Lactose Natural products OC[C@H]1O[C@@H](O[C@H]2[C@H](O)[C@@H](O)C(O)O[C@@H]2CO)[C@H](O)[C@@H](O)[C@H]1O GUBGYTABKSRVRQ-QKKXKWKRSA-N 0.000 description 1
- 108090001090 Lectins Proteins 0.000 description 1
- 102000004856 Lectins Human genes 0.000 description 1
- GDBQQVLCIARPGH-UHFFFAOYSA-N Leupeptin Natural products CC(C)CC(NC(C)=O)C(=O)NC(CC(C)C)C(=O)NC(C=O)CCCN=C(N)N GDBQQVLCIARPGH-UHFFFAOYSA-N 0.000 description 1
- 101710105045 Lipoprotein E Proteins 0.000 description 1
- 239000004425 Makrolon Substances 0.000 description 1
- 241000124008 Mammalia Species 0.000 description 1
- 229930195725 Mannitol Natural products 0.000 description 1
- RJQXTJLFIWVMTO-TYNCELHUSA-N Methicillin Chemical compound COC1=CC=CC(OC)=C1C(=O)N[C@@H]1C(=O)N2[C@@H](C(O)=O)C(C)(C)S[C@@H]21 RJQXTJLFIWVMTO-TYNCELHUSA-N 0.000 description 1
- 241000353097 Molva molva Species 0.000 description 1
- 206010065764 Mucosal infection Diseases 0.000 description 1
- 101100281205 Mus musculus Fli1 gene Proteins 0.000 description 1
- 102000003505 Myosin Human genes 0.000 description 1
- 108060008487 Myosin Proteins 0.000 description 1
- 108700020354 N-acetylmuramyl-threonyl-isoglutamine Proteins 0.000 description 1
- 241000588653 Neisseria Species 0.000 description 1
- 108020004711 Nucleic Acid Probes Proteins 0.000 description 1
- 108010079246 OMPA outer membrane proteins Proteins 0.000 description 1
- 108020005187 Oligonucleotide Probes Proteins 0.000 description 1
- 108010038807 Oligopeptides Proteins 0.000 description 1
- 102000015636 Oligopeptides Human genes 0.000 description 1
- 108010058846 Ovalbumin Proteins 0.000 description 1
- 206010034133 Pathogen resistance Diseases 0.000 description 1
- 102000057297 Pepsin A Human genes 0.000 description 1
- 108090000284 Pepsin A Proteins 0.000 description 1
- 108010033276 Peptide Fragments Proteins 0.000 description 1
- 102000007079 Peptide Fragments Human genes 0.000 description 1
- 108010071384 Peptide T Proteins 0.000 description 1
- 101100226893 Phomopsis amygdali PaP450-2 gene Proteins 0.000 description 1
- 108010065081 Phosphorylase b Proteins 0.000 description 1
- 235000016816 Pisum sativum subsp sativum Nutrition 0.000 description 1
- 229920001213 Polysorbate 20 Polymers 0.000 description 1
- 101710193132 Pre-hexon-linking protein VIII Proteins 0.000 description 1
- ONIBWKKTOPOVIA-UHFFFAOYSA-N Proline Natural products OC(=O)C1CCCN1 ONIBWKKTOPOVIA-UHFFFAOYSA-N 0.000 description 1
- 101710118538 Protease Proteins 0.000 description 1
- 102100037681 Protein FEV Human genes 0.000 description 1
- 101710198166 Protein FEV Proteins 0.000 description 1
- 108010029485 Protein Isoforms Proteins 0.000 description 1
- 102000001708 Protein Isoforms Human genes 0.000 description 1
- 101710130181 Protochlorophyllide reductase A, chloroplastic Proteins 0.000 description 1
- CZPWVGJYEJSRLH-UHFFFAOYSA-N Pyrimidine Chemical compound C1=CN=CN=C1 CZPWVGJYEJSRLH-UHFFFAOYSA-N 0.000 description 1
- 238000012181 QIAquick gel extraction kit Methods 0.000 description 1
- 102000006382 Ribonucleases Human genes 0.000 description 1
- 108010083644 Ribonucleases Proteins 0.000 description 1
- 102400000235 Rimorphin Human genes 0.000 description 1
- 241000283984 Rodentia Species 0.000 description 1
- 108091006627 SLC12A9 Proteins 0.000 description 1
- 229920002305 Schizophyllan Polymers 0.000 description 1
- 108091058545 Secretory proteins Proteins 0.000 description 1
- 102000040739 Secretory proteins Human genes 0.000 description 1
- BQCADISMDOOEFD-UHFFFAOYSA-N Silver Chemical compound [Ag] BQCADISMDOOEFD-UHFFFAOYSA-N 0.000 description 1
- 241000191940 Staphylococcus Species 0.000 description 1
- 101900014382 Staphylococcus haemolyticus Glutamate racemase Proteins 0.000 description 1
- 229920002472 Starch Polymers 0.000 description 1
- 208000007107 Stomach Ulcer Diseases 0.000 description 1
- 241000187747 Streptomyces Species 0.000 description 1
- 241000192560 Synechococcus sp. Species 0.000 description 1
- 108700005078 Synthetic Genes Proteins 0.000 description 1
- 241000255588 Tephritidae Species 0.000 description 1
- BHEOSNUKNHRBNM-UHFFFAOYSA-N Tetramethylsqualene Natural products CC(=C)C(C)CCC(=C)C(C)CCC(C)=CCCC=C(C)CCC(C)C(=C)CCC(C)C(C)=C BHEOSNUKNHRBNM-UHFFFAOYSA-N 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-M Thiocyanate anion Chemical compound [S-]C#N ZMZDMBWJUHKJPS-UHFFFAOYSA-M 0.000 description 1
- 108090000190 Thrombin Proteins 0.000 description 1
- HDTRYLNUVZCQOY-WSWWMNSNSA-N Trehalose Natural products O[C@@H]1[C@@H](O)[C@@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-WSWWMNSNSA-N 0.000 description 1
- 208000025865 Ulcer Diseases 0.000 description 1
- 108020000999 Viral RNA Proteins 0.000 description 1
- 210000001015 abdomen Anatomy 0.000 description 1
- 230000001594 aberrant effect Effects 0.000 description 1
- 238000010521 absorption reaction Methods 0.000 description 1
- QPMSXSBEVQLBIL-CZRHPSIPSA-N ac1mix0p Chemical compound C1=CC=C2N(C[C@H](C)CN(C)C)C3=CC(OC)=CC=C3SC2=C1.O([C@H]1[C@]2(OC)C=CC34C[C@@H]2[C@](C)(O)CCC)C2=C5[C@]41CCN(C)[C@@H]3CC5=CC=C2O QPMSXSBEVQLBIL-CZRHPSIPSA-N 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000021736 acetylation Effects 0.000 description 1
- 238000006640 acetylation reaction Methods 0.000 description 1
- 230000002378 acidificating effect Effects 0.000 description 1
- 150000007513 acids Chemical class 0.000 description 1
- 230000003213 activating effect Effects 0.000 description 1
- 239000000654 additive Substances 0.000 description 1
- 238000001042 affinity chromatography Methods 0.000 description 1
- 238000001261 affinity purification Methods 0.000 description 1
- 238000003314 affinity selection Methods 0.000 description 1
- 239000008272 agar Substances 0.000 description 1
- 230000029936 alkylation Effects 0.000 description 1
- 238000005804 alkylation reaction Methods 0.000 description 1
- 239000013566 allergen Substances 0.000 description 1
- 208000030961 allergic reaction Diseases 0.000 description 1
- 108010013829 alpha subunit DNA polymerase III Proteins 0.000 description 1
- HDTRYLNUVZCQOY-LIZSDCNHSA-N alpha,alpha-trehalose Chemical compound O[C@@H]1[C@@H](O)[C@H](O)[C@@H](CO)O[C@@H]1O[C@@H]1[C@H](O)[C@@H](O)[C@H](O)[C@@H](CO)O1 HDTRYLNUVZCQOY-LIZSDCNHSA-N 0.000 description 1
- WNROFYMDJYEPJX-UHFFFAOYSA-K aluminium hydroxide Chemical compound [OH-].[OH-].[OH-].[Al+3] WNROFYMDJYEPJX-UHFFFAOYSA-K 0.000 description 1
- 229940043376 ammonium acetate Drugs 0.000 description 1
- 235000019257 ammonium acetate Nutrition 0.000 description 1
- 210000004102 animal cell Anatomy 0.000 description 1
- 238000010171 animal model Methods 0.000 description 1
- 239000003242 anti bacterial agent Substances 0.000 description 1
- 230000002788 anti-peptide Effects 0.000 description 1
- 229940124350 antibacterial drug Drugs 0.000 description 1
- 229940088710 antibiotic agent Drugs 0.000 description 1
- 230000005875 antibody response Effects 0.000 description 1
- 238000011203 antimicrobial therapy Methods 0.000 description 1
- 239000012736 aqueous medium Substances 0.000 description 1
- PYMYPHUHKUWMLA-WDCZJNDASA-N arabinose Chemical compound OC[C@@H](O)[C@@H](O)[C@H](O)C=O PYMYPHUHKUWMLA-WDCZJNDASA-N 0.000 description 1
- PYMYPHUHKUWMLA-UHFFFAOYSA-N arabinose Natural products OCC(O)C(O)C(O)C=O PYMYPHUHKUWMLA-UHFFFAOYSA-N 0.000 description 1
- 229940009098 aspartate Drugs 0.000 description 1
- 238000002820 assay format Methods 0.000 description 1
- 238000000376 autoradiography Methods 0.000 description 1
- XYOVOXDWRFGKEX-UHFFFAOYSA-N azepine Chemical compound N1C=CC=CC=C1 XYOVOXDWRFGKEX-UHFFFAOYSA-N 0.000 description 1
- 210000003719 b-lymphocyte Anatomy 0.000 description 1
- 239000000688 bacterial toxin Substances 0.000 description 1
- 239000011324 bead Substances 0.000 description 1
- 229940049706 benzodiazepine Drugs 0.000 description 1
- SRBFZHDQGSBBOR-UHFFFAOYSA-N beta-D-Pyranose-Lyxose Natural products OC1COC(O)C(O)C1O SRBFZHDQGSBBOR-UHFFFAOYSA-N 0.000 description 1
- WQZGKKKJIJFFOK-VFUOTHLCSA-N beta-D-glucose Chemical compound OC[C@H]1O[C@@H](O)[C@H](O)[C@@H](O)[C@@H]1O WQZGKKKJIJFFOK-VFUOTHLCSA-N 0.000 description 1
- IQFYYKKMVGJFEH-UHFFFAOYSA-N beta-L-thymidine Natural products O=C1NC(=O)C(C)=CN1C1OC(CO)C(O)C1 IQFYYKKMVGJFEH-UHFFFAOYSA-N 0.000 description 1
- UCMIRNVEIXFBKS-UHFFFAOYSA-N beta-alanine Chemical compound NCCC(O)=O UCMIRNVEIXFBKS-UHFFFAOYSA-N 0.000 description 1
- 108091008324 binding proteins Proteins 0.000 description 1
- 238000004166 bioassay Methods 0.000 description 1
- 239000012472 biological sample Substances 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 229910052797 bismuth Inorganic materials 0.000 description 1
- JCXGWMGPZLAOME-UHFFFAOYSA-N bismuth atom Chemical compound [Bi] JCXGWMGPZLAOME-UHFFFAOYSA-N 0.000 description 1
- 208000003836 bluetongue Diseases 0.000 description 1
- 230000037396 body weight Effects 0.000 description 1
- 238000009835 boiling Methods 0.000 description 1
- 230000001488 breeding effect Effects 0.000 description 1
- 239000006227 byproduct Substances 0.000 description 1
- 239000001110 calcium chloride Substances 0.000 description 1
- 229910001628 calcium chloride Inorganic materials 0.000 description 1
- 239000001506 calcium phosphate Substances 0.000 description 1
- 229910000389 calcium phosphate Inorganic materials 0.000 description 1
- 235000011010 calcium phosphates Nutrition 0.000 description 1
- 230000021523 carboxylation Effects 0.000 description 1
- 238000006473 carboxylation reaction Methods 0.000 description 1
- 230000015556 catabolic process Effects 0.000 description 1
- 239000003054 catalyst Substances 0.000 description 1
- 150000001768 cations Chemical class 0.000 description 1
- 230000020411 cell activation Effects 0.000 description 1
- 210000000170 cell membrane Anatomy 0.000 description 1
- 210000002421 cell wall Anatomy 0.000 description 1
- 210000004520 cell wall skeleton Anatomy 0.000 description 1
- 210000004671 cell-free system Anatomy 0.000 description 1
- 230000008614 cellular interaction Effects 0.000 description 1
- 230000036755 cellular response Effects 0.000 description 1
- 230000007541 cellular toxicity Effects 0.000 description 1
- 239000001913 cellulose Substances 0.000 description 1
- 229920002678 cellulose Polymers 0.000 description 1
- 210000001175 cerebrospinal fluid Anatomy 0.000 description 1
- 230000003196 chaotropic effect Effects 0.000 description 1
- 238000012412 chemical coupling Methods 0.000 description 1
- 238000002144 chemical decomposition reaction Methods 0.000 description 1
- 239000007795 chemical reaction product Substances 0.000 description 1
- 239000005081 chemiluminescent agent Substances 0.000 description 1
- 235000019219 chocolate Nutrition 0.000 description 1
- 208000023652 chronic gastritis Diseases 0.000 description 1
- 238000007697 cis-trans-isomerization reaction Methods 0.000 description 1
- 238000000975 co-precipitation Methods 0.000 description 1
- 238000012790 confirmation Methods 0.000 description 1
- 230000021615 conjugation Effects 0.000 description 1
- 108091036078 conserved sequence Proteins 0.000 description 1
- 239000000470 constituent Substances 0.000 description 1
- 230000004940 costimulation Effects 0.000 description 1
- 230000009260 cross reactivity Effects 0.000 description 1
- 239000012228 culture supernatant Substances 0.000 description 1
- 210000004748 cultured cell Anatomy 0.000 description 1
- 230000001186 cumulative effect Effects 0.000 description 1
- 125000004122 cyclic group Chemical group 0.000 description 1
- 235000018417 cysteine Nutrition 0.000 description 1
- XUJNEKJLAYXESH-UHFFFAOYSA-N cysteine Natural products SCC(N)C(O)=O XUJNEKJLAYXESH-UHFFFAOYSA-N 0.000 description 1
- 125000000151 cysteine group Chemical class N[C@@H](CS)C(=O)* 0.000 description 1
- 229940104302 cytosine Drugs 0.000 description 1
- 238000006731 degradation reaction Methods 0.000 description 1
- 230000002939 deleterious effect Effects 0.000 description 1
- 238000002716 delivery method Methods 0.000 description 1
- 239000008121 dextrose Substances 0.000 description 1
- 239000003085 diluting agent Substances 0.000 description 1
- 238000010790 dilution Methods 0.000 description 1
- 239000012895 dilution Substances 0.000 description 1
- 239000000539 dimer Substances 0.000 description 1
- 238000006471 dimerization reaction Methods 0.000 description 1
- LOKCTEFSRHRXRJ-UHFFFAOYSA-I dipotassium trisodium dihydrogen phosphate hydrogen phosphate dichloride Chemical compound P(=O)(O)(O)[O-].[K+].P(=O)(O)([O-])[O-].[Na+].[Na+].[Cl-].[K+].[Cl-].[Na+] LOKCTEFSRHRXRJ-UHFFFAOYSA-I 0.000 description 1
- 229940042399 direct acting antivirals protease inhibitors Drugs 0.000 description 1
- KAKKHKRHCKCAGH-UHFFFAOYSA-L disodium;(4-nitrophenyl) phosphate;hexahydrate Chemical compound O.O.O.O.O.O.[Na+].[Na+].[O-][N+](=O)C1=CC=C(OP([O-])([O-])=O)C=C1 KAKKHKRHCKCAGH-UHFFFAOYSA-L 0.000 description 1
- 208000035475 disorder Diseases 0.000 description 1
- PRAKJMSDJKAYCZ-UHFFFAOYSA-N dodecahydrosqualene Natural products CC(C)CCCC(C)CCCC(C)CCCCC(C)CCCC(C)CCCC(C)C PRAKJMSDJKAYCZ-UHFFFAOYSA-N 0.000 description 1
- 231100000673 dose–response relationship Toxicity 0.000 description 1
- 238000009513 drug distribution Methods 0.000 description 1
- 238000007877 drug screening Methods 0.000 description 1
- 239000003596 drug target Substances 0.000 description 1
- 210000001198 duodenum Anatomy 0.000 description 1
- 201000006549 dyspepsia Diseases 0.000 description 1
- 238000013399 early diagnosis Methods 0.000 description 1
- 235000014103 egg white Nutrition 0.000 description 1
- 210000000969 egg white Anatomy 0.000 description 1
- 239000003995 emulsifying agent Substances 0.000 description 1
- 239000000839 emulsion Substances 0.000 description 1
- 244000088681 endo Species 0.000 description 1
- 239000003623 enhancer Substances 0.000 description 1
- 230000002708 enhancing effect Effects 0.000 description 1
- 230000000688 enterotoxigenic effect Effects 0.000 description 1
- 230000007613 environmental effect Effects 0.000 description 1
- 210000002919 epithelial cell Anatomy 0.000 description 1
- 210000003527 eukaryotic cell Anatomy 0.000 description 1
- 102000013165 exonuclease Human genes 0.000 description 1
- 238000001400 expression cloning Methods 0.000 description 1
- 235000013861 fat-free Nutrition 0.000 description 1
- 239000012091 fetal bovine serum Substances 0.000 description 1
- 239000012894 fetal calf serum Substances 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 101150017109 fliA gene Proteins 0.000 description 1
- 239000012530 fluid Substances 0.000 description 1
- 235000013305 food Nutrition 0.000 description 1
- 238000005194 fractionation Methods 0.000 description 1
- 230000005021 gait Effects 0.000 description 1
- 210000004211 gastric acid Anatomy 0.000 description 1
- 201000006585 gastric adenocarcinoma Diseases 0.000 description 1
- 210000001156 gastric mucosa Anatomy 0.000 description 1
- 201000005917 gastric ulcer Diseases 0.000 description 1
- 210000001035 gastrointestinal tract Anatomy 0.000 description 1
- 238000001502 gel electrophoresis Methods 0.000 description 1
- 238000001641 gel filtration chromatography Methods 0.000 description 1
- 238000002523 gel filtration Methods 0.000 description 1
- 238000010353 genetic engineering Methods 0.000 description 1
- 238000012248 genetic selection Methods 0.000 description 1
- 239000011521 glass Substances 0.000 description 1
- 101150103988 gltX gene Proteins 0.000 description 1
- 125000000291 glutamic acid group Chemical group N[C@@H](CCC(O)=O)C(=O)* 0.000 description 1
- 239000011544 gradient gel Substances 0.000 description 1
- 101150077981 groEL gene Proteins 0.000 description 1
- 239000001963 growth medium Substances 0.000 description 1
- 235000009424 haa Nutrition 0.000 description 1
- 230000036541 health Effects 0.000 description 1
- 238000010438 heat treatment Methods 0.000 description 1
- 210000002443 helper t lymphocyte Anatomy 0.000 description 1
- 125000000487 histidyl group Chemical group [H]N([H])C(C(=O)O*)C([H])([H])C1=C([H])N([H])C([H])=N1 0.000 description 1
- 230000028996 humoral immune response Effects 0.000 description 1
- 230000004727 humoral immunity Effects 0.000 description 1
- ZMZDMBWJUHKJPS-UHFFFAOYSA-N hydrogen thiocyanate Natural products SC#N ZMZDMBWJUHKJPS-UHFFFAOYSA-N 0.000 description 1
- 230000002519 immunomodulatory effect Effects 0.000 description 1
- 210000002865 immune cell Anatomy 0.000 description 1
- 230000001900 immune effect Effects 0.000 description 1
- 210000000987 immune system Anatomy 0.000 description 1
- 230000005847 immunogenicity Effects 0.000 description 1
- 238000011532 immunohistochemical staining Methods 0.000 description 1
- 238000001114 immunoprecipitation Methods 0.000 description 1
- 230000003308 immunostimulating effect Effects 0.000 description 1
- 238000009169 immunotherapy Methods 0.000 description 1
- 238000000099 in vitro assay Methods 0.000 description 1
- 238000012606 in vitro cell culture Methods 0.000 description 1
- 238000005462 in vivo assay Methods 0.000 description 1
- 230000000415 inactivating effect Effects 0.000 description 1
- 238000010348 incorporation Methods 0.000 description 1
- 230000004054 inflammatory process Effects 0.000 description 1
- 230000005764 inhibitory process Effects 0.000 description 1
- 239000003999 initiator Substances 0.000 description 1
- 238000002347 injection Methods 0.000 description 1
- 239000007924 injection Substances 0.000 description 1
- 238000002743 insertional mutagenesis Methods 0.000 description 1
- 239000000138 intercalating agent Substances 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 229940079322 interferon Drugs 0.000 description 1
- 229940028885 interleukin-4 Drugs 0.000 description 1
- 230000003834 intracellular effect Effects 0.000 description 1
- 238000007918 intramuscular administration Methods 0.000 description 1
- 238000005342 ion exchange Methods 0.000 description 1
- 238000004255 ion exchange chromatography Methods 0.000 description 1
- 239000003456 ion exchange resin Substances 0.000 description 1
- 229920003303 ion-exchange polymer Polymers 0.000 description 1
- 150000002500 ions Chemical class 0.000 description 1
- VAVAILGEOMLRRK-UHFFFAOYSA-H iron(III) dicitrate(3-) Chemical compound [Fe+3].[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O.[O-]C(=O)CC(O)(CC([O-])=O)C([O-])=O VAVAILGEOMLRRK-UHFFFAOYSA-H 0.000 description 1
- 150000002576 ketones Chemical class 0.000 description 1
- 210000003734 kidney Anatomy 0.000 description 1
- 238000002372 labelling Methods 0.000 description 1
- 150000003951 lactams Chemical group 0.000 description 1
- 239000008101 lactose Substances 0.000 description 1
- 239000002523 lectin Substances 0.000 description 1
- GDBQQVLCIARPGH-ULQDDVLXSA-N leupeptin Chemical compound CC(C)C[C@H](NC(C)=O)C(=O)N[C@@H](CC(C)C)C(=O)N[C@H](C=O)CCCN=C(N)N GDBQQVLCIARPGH-ULQDDVLXSA-N 0.000 description 1
- 108010052968 leupeptin Proteins 0.000 description 1
- 238000007169 ligase reaction Methods 0.000 description 1
- 230000000670 limiting effect Effects 0.000 description 1
- 150000002632 lipids Chemical class 0.000 description 1
- 238000011068 loading method Methods 0.000 description 1
- 101150103033 lpxB gene Proteins 0.000 description 1
- 101150033242 lpxC gene Proteins 0.000 description 1
- 239000000891 luminescent agent Substances 0.000 description 1
- 238000004020 luminiscence type Methods 0.000 description 1
- 210000002751 lymph Anatomy 0.000 description 1
- 229920002521 macromolecule Polymers 0.000 description 1
- ZLNQQNXFFQJAID-UHFFFAOYSA-L magnesium carbonate Chemical compound [Mg+2].[O-]C([O-])=O ZLNQQNXFFQJAID-UHFFFAOYSA-L 0.000 description 1
- 239000001095 magnesium carbonate Substances 0.000 description 1
- 229910000021 magnesium carbonate Inorganic materials 0.000 description 1
- 229910001629 magnesium chloride Inorganic materials 0.000 description 1
- 235000019359 magnesium stearate Nutrition 0.000 description 1
- 239000000594 mannitol Substances 0.000 description 1
- 235000010355 mannitol Nutrition 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 102000006240 membrane receptors Human genes 0.000 description 1
- 108020004084 membrane receptors Proteins 0.000 description 1
- 229910052751 metal Inorganic materials 0.000 description 1
- 239000002184 metal Substances 0.000 description 1
- 230000011987 methylation Effects 0.000 description 1
- 238000007069 methylation reaction Methods 0.000 description 1
- 229960003085 meticillin Drugs 0.000 description 1
- 230000000813 microbial effect Effects 0.000 description 1
- 244000005700 microbiome Species 0.000 description 1
- 238000000386 microscopy Methods 0.000 description 1
- 239000004005 microsphere Substances 0.000 description 1
- JMUHBNWAORSSBD-WKYWBUFDSA-N mifamurtide Chemical compound CCCCCCCCCCCCCCCC(=O)OC[C@@H](OC(=O)CCCCCCCCCCCCCCC)COP(O)(=O)OCCNC(=O)[C@H](C)NC(=O)CC[C@H](C(N)=O)NC(=O)[C@H](C)NC(=O)[C@@H](C)O[C@H]1[C@H](O)[C@@H](CO)OC(O)[C@@H]1NC(C)=O JMUHBNWAORSSBD-WKYWBUFDSA-N 0.000 description 1
- 229960005225 mifamurtide Drugs 0.000 description 1
- 108700007621 mifamurtide Proteins 0.000 description 1
- 235000013336 milk Nutrition 0.000 description 1
- 239000008267 milk Substances 0.000 description 1
- 210000004080 milk Anatomy 0.000 description 1
- 108091005573 modified proteins Proteins 0.000 description 1
- 102000035118 modified proteins Human genes 0.000 description 1
- 238000000329 molecular dynamics simulation Methods 0.000 description 1
- 239000000178 monomer Substances 0.000 description 1
- 229940035032 monophosphoryl lipid a Drugs 0.000 description 1
- 230000004899 motility Effects 0.000 description 1
- 210000003205 muscle Anatomy 0.000 description 1
- 230000036438 mutation frequency Effects 0.000 description 1
- 239000013642 negative control Substances 0.000 description 1
- 230000003472 neutralizing effect Effects 0.000 description 1
- 229930027945 nicotinamide-adenine dinucleotide Natural products 0.000 description 1
- BOPGDPNILDQYTO-NNYOXOHSSA-N nicotinamide-adenine dinucleotide Chemical compound C1=CCC(C(=O)N)=CN1[C@H]1[C@H](O)[C@H](O)[C@@H](COP(O)(=O)OP(O)(=O)OC[C@@H]2[C@H]([C@@H](O)[C@@H](O2)N2C3=NC=NC(N)=C3N=C2)O)O1 BOPGDPNILDQYTO-NNYOXOHSSA-N 0.000 description 1
- JPXMTWWFLBLUCD-UHFFFAOYSA-N nitro blue tetrazolium(2+) Chemical compound COC1=CC(C=2C=C(OC)C(=CC=2)[N+]=2N(N=C(N=2)C=2C=CC=CC=2)C=2C=CC(=CC=2)[N+]([O-])=O)=CC=C1[N+]1=NC(C=2C=CC=CC=2)=NN1C1=CC=C([N+]([O-])=O)C=C1 JPXMTWWFLBLUCD-UHFFFAOYSA-N 0.000 description 1
- 230000009871 nonspecific binding Effects 0.000 description 1
- 238000007899 nucleic acid hybridization Methods 0.000 description 1
- 239000002853 nucleic acid probe Substances 0.000 description 1
- 239000002751 oligonucleotide probe Substances 0.000 description 1
- 238000002515 oligonucleotide synthesis Methods 0.000 description 1
- 101150016099 omcA gene Proteins 0.000 description 1
- 229940126578 oral vaccine Drugs 0.000 description 1
- 210000000056 organ Anatomy 0.000 description 1
- 229940092253 ovalbumin Drugs 0.000 description 1
- 210000001672 ovary Anatomy 0.000 description 1
- 230000002018 overexpression Effects 0.000 description 1
- 238000004806 packaging method and process Methods 0.000 description 1
- 210000000496 pancreas Anatomy 0.000 description 1
- 229940111202 pepsin Drugs 0.000 description 1
- 108010091212 pepstatin Proteins 0.000 description 1
- 229950000964 pepstatin Drugs 0.000 description 1
- 239000000137 peptide hydrolase inhibitor Substances 0.000 description 1
- 210000001322 periplasm Anatomy 0.000 description 1
- 239000008194 pharmaceutical composition Substances 0.000 description 1
- 239000000546 pharmaceutical excipient Substances 0.000 description 1
- 150000004633 phorbol derivatives Chemical class 0.000 description 1
- 239000002644 phorbol ester Substances 0.000 description 1
- 230000001766 physiological effect Effects 0.000 description 1
- 239000006187 pill Substances 0.000 description 1
- 239000013600 plasmid vector Substances 0.000 description 1
- 108010054442 polyalanine Proteins 0.000 description 1
- 229920000515 polycarbonate Polymers 0.000 description 1
- 229920002704 polyhistidine Polymers 0.000 description 1
- 102000054765 polymorphisms of proteins Human genes 0.000 description 1
- 239000000256 polyoxyethylene sorbitan monolaurate Substances 0.000 description 1
- 235000010486 polyoxyethylene sorbitan monolaurate Nutrition 0.000 description 1
- 235000010482 polyoxyethylene sorbitan monooleate Nutrition 0.000 description 1
- 229920000053 polysorbate 80 Polymers 0.000 description 1
- 238000011176 pooling Methods 0.000 description 1
- 230000001124 posttranscriptional effect Effects 0.000 description 1
- 230000001376 precipitating effect Effects 0.000 description 1
- 239000002243 precursor Substances 0.000 description 1
- 239000003755 preservative agent Substances 0.000 description 1
- 238000011085 pressure filtration Methods 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 230000009465 prokaryotic expression Effects 0.000 description 1
- 230000035755 proliferation Effects 0.000 description 1
- 230000001681 protective effect Effects 0.000 description 1
- 230000012846 protein folding Effects 0.000 description 1
- 230000006916 protein interaction Effects 0.000 description 1
- 230000020978 protein processing Effects 0.000 description 1
- 238000000734 protein sequencing Methods 0.000 description 1
- 230000004850 protein–protein interaction Effects 0.000 description 1
- 230000017854 proteolysis Effects 0.000 description 1
- 230000006337 proteolytic cleavage Effects 0.000 description 1
- 239000000612 proton pump inhibitor Substances 0.000 description 1
- 230000002685 pulmonary effect Effects 0.000 description 1
- 150000003254 radicals Chemical class 0.000 description 1
- 230000009257 reactivity Effects 0.000 description 1
- 238000010188 recombinant method Methods 0.000 description 1
- 229940126583 recombinant protein vaccine Drugs 0.000 description 1
- 238000011084 recovery Methods 0.000 description 1
- 230000007115 recruitment Effects 0.000 description 1
- 230000002829 reductive effect Effects 0.000 description 1
- 230000022532 regulation of transcription, DNA-dependent Effects 0.000 description 1
- 230000035806 respiratory chain Effects 0.000 description 1
- 230000028706 ribosome biogenesis Effects 0.000 description 1
- 238000007363 ring formation reaction Methods 0.000 description 1
- 229920002477 rna polymer Polymers 0.000 description 1
- CVHZOJJKTDOEJC-UHFFFAOYSA-N saccharin Chemical compound C1=CC=C2C(=O)NS(=O)(=O)C2=C1 CVHZOJJKTDOEJC-UHFFFAOYSA-N 0.000 description 1
- 210000003296 saliva Anatomy 0.000 description 1
- 235000017709 saponins Nutrition 0.000 description 1
- 230000009863 secondary prevention Effects 0.000 description 1
- 230000003248 secreting effect Effects 0.000 description 1
- 230000035945 sensitivity Effects 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
- 238000010008 shearing Methods 0.000 description 1
- 230000035939 shock Effects 0.000 description 1
- 229910052709 silver Inorganic materials 0.000 description 1
- 239000004332 silver Substances 0.000 description 1
- 210000002027 skeletal muscle Anatomy 0.000 description 1
- 150000003384 small molecules Chemical class 0.000 description 1
- 239000011734 sodium Substances 0.000 description 1
- 235000017557 sodium bicarbonate Nutrition 0.000 description 1
- 229910000030 sodium bicarbonate Inorganic materials 0.000 description 1
- 238000005063 solubilization Methods 0.000 description 1
- 230000007928 solubilization Effects 0.000 description 1
- 229940031439 squalene Drugs 0.000 description 1
- TUHBEKDERLKLEC-UHFFFAOYSA-N squalene Natural products CC(=CCCC(=CCCC(=CCCC=C(/C)CCC=C(/C)CC=C(C)C)C)C)C TUHBEKDERLKLEC-UHFFFAOYSA-N 0.000 description 1
- 238000011272 standard treatment Methods 0.000 description 1
- 239000008107 starch Substances 0.000 description 1
- 235000019698 starch Nutrition 0.000 description 1
- 239000007858 starting material Substances 0.000 description 1
- 238000007619 statistical method Methods 0.000 description 1
- 230000000638 stimulation Effects 0.000 description 1
- 238000003860 storage Methods 0.000 description 1
- 230000002483 superagonistic effect Effects 0.000 description 1
- 239000000829 suppository Substances 0.000 description 1
- 238000010189 synthetic method Methods 0.000 description 1
- 230000008685 targeting Effects 0.000 description 1
- 210000001138 tear Anatomy 0.000 description 1
- 238000012956 testing procedure Methods 0.000 description 1
- 229960004072 thrombin Drugs 0.000 description 1
- 229940104230 thymidine Drugs 0.000 description 1
- 229940113082 thymine Drugs 0.000 description 1
- 238000001890 transfection Methods 0.000 description 1
- 230000001052 transient effect Effects 0.000 description 1
- 238000011269 treatment regimen Methods 0.000 description 1
- QORWJWZARLRLPR-UHFFFAOYSA-H tricalcium bis(phosphate) Chemical compound [Ca+2].[Ca+2].[Ca+2].[O-]P([O-])([O-])=O.[O-]P([O-])([O-])=O QORWJWZARLRLPR-UHFFFAOYSA-H 0.000 description 1
- GPRLSGONYQIRFK-MNYXATJNSA-N triton Chemical class [3H+] GPRLSGONYQIRFK-MNYXATJNSA-N 0.000 description 1
- 230000001810 trypsinlike Effects 0.000 description 1
- 238000010396 two-hybrid screening Methods 0.000 description 1
- 108010087967 type I signal peptidase Proteins 0.000 description 1
- 231100000397 ulcer Toxicity 0.000 description 1
- 238000000108 ultra-filtration Methods 0.000 description 1
- 230000009452 underexpressoin Effects 0.000 description 1
- 210000002700 urine Anatomy 0.000 description 1
- 238000002255 vaccination Methods 0.000 description 1
- 210000003934 vacuole Anatomy 0.000 description 1
- 125000002987 valine group Chemical group [H]N([H])C([H])(C(*)=O)C([H])(C([H])([H])[H])C([H])([H])[H] 0.000 description 1
- 239000003981 vehicle Substances 0.000 description 1
- 108700026220 vif Genes Proteins 0.000 description 1
- 239000000304 virulence factor Substances 0.000 description 1
- 230000007923 virulence factor Effects 0.000 description 1
- 230000000007 visual effect Effects 0.000 description 1
- 238000001262 western blot Methods 0.000 description 1
- 238000009736 wetting Methods 0.000 description 1
- 239000000080 wetting agent Substances 0.000 description 1
- 210000005253 yeast cell Anatomy 0.000 description 1
- 229910052727 yttrium Inorganic materials 0.000 description 1
- DGVVWUTYPXICAM-UHFFFAOYSA-N β‐Mercaptoethanol Chemical compound OCCS DGVVWUTYPXICAM-UHFFFAOYSA-N 0.000 description 1
Classifications
-
- C—CHEMISTRY; METALLURGY
- C07—ORGANIC CHEMISTRY
- C07K—PEPTIDES
- C07K14/00—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof
- C07K14/195—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria
- C07K14/205—Peptides having more than 20 amino acids; Gastrins; Somatostatins; Melanotropins; Derivatives thereof from bacteria from Campylobacter (G)
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61P—SPECIFIC THERAPEUTIC ACTIVITY OF CHEMICAL COMPOUNDS OR MEDICINAL PREPARATIONS
- A61P31/00—Antiinfectives, i.e. antibiotics, antiseptics, chemotherapeutics
-
- C—CHEMISTRY; METALLURGY
- C12—BIOCHEMISTRY; BEER; SPIRITS; WINE; VINEGAR; MICROBIOLOGY; ENZYMOLOGY; MUTATION OR GENETIC ENGINEERING
- C12Q—MEASURING OR TESTING PROCESSES INVOLVING ENZYMES, NUCLEIC ACIDS OR MICROORGANISMS; COMPOSITIONS OR TEST PAPERS THEREFOR; PROCESSES OF PREPARING SUCH COMPOSITIONS; CONDITION-RESPONSIVE CONTROL IN MICROBIOLOGICAL OR ENZYMOLOGICAL PROCESSES
- C12Q1/00—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions
- C12Q1/68—Measuring or testing processes involving enzymes, nucleic acids or microorganisms; Compositions therefor; Processes of preparing such compositions involving nucleic acids
- C12Q1/6876—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes
- C12Q1/6888—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms
- C12Q1/689—Nucleic acid products used in the analysis of nucleic acids, e.g. primers or probes for detection or identification of organisms for bacteria
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
- A61K2039/51—Medicinal preparations containing antigens or antibodies comprising whole cells, viruses or DNA/RNA
-
- A—HUMAN NECESSITIES
- A61—MEDICAL OR VETERINARY SCIENCE; HYGIENE
- A61K—PREPARATIONS FOR MEDICAL, DENTAL OR TOILETRY PURPOSES
- A61K39/00—Medicinal preparations containing antigens or antibodies
Landscapes
- Chemical & Material Sciences (AREA)
- Health & Medical Sciences (AREA)
- Organic Chemistry (AREA)
- Life Sciences & Earth Sciences (AREA)
- Proteomics, Peptides & Aminoacids (AREA)
- General Health & Medical Sciences (AREA)
- Analytical Chemistry (AREA)
- Wood Science & Technology (AREA)
- Molecular Biology (AREA)
- Medicinal Chemistry (AREA)
- Biochemistry (AREA)
- Engineering & Computer Science (AREA)
- Zoology (AREA)
- Biophysics (AREA)
- Genetics & Genomics (AREA)
- Pharmacology & Pharmacy (AREA)
- Oncology (AREA)
- Physics & Mathematics (AREA)
- Public Health (AREA)
- Animal Behavior & Ethology (AREA)
- Biotechnology (AREA)
- Immunology (AREA)
- Microbiology (AREA)
- Nuclear Medicine, Radiotherapy & Molecular Imaging (AREA)
- General Chemical & Material Sciences (AREA)
- Chemical Kinetics & Catalysis (AREA)
- Veterinary Medicine (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Communicable Diseases (AREA)
- Gastroenterology & Hepatology (AREA)
- Peptides Or Proteins (AREA)
- Medicines Containing Antibodies Or Antigens For Use As Internal Diagnostic Agents (AREA)
- Micro-Organisms Or Cultivation Processes Thereof (AREA)
- Preparation Of Compounds By Using Micro-Organisms (AREA)
- Measuring Or Testing Involving Enzymes Or Micro-Organisms (AREA)
- Medicines That Contain Protein Lipid Enzymes And Other Medicines (AREA)
- Investigating Or Analysing Biological Materials (AREA)
Description
WO 97/37044 PCT/US97/05223
NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO HELICOBACTER PYLORI AND VACCINE COMPOSITIONS THEREOF
Background of the Invention
Helicobacter pylori is a gram-negative, S-shaped, microaerophilic bacterium that was discovered and cultured from a human gastric biopsy specimen. (Warren, J.R. and B. Marshall, (1983) Lancet 1: 1273-1275; and Marshall et al., (1984) Microbios Lett. 83-88). H. pylori has been strongly linked to chronic gastritis and duodenal ulcer disease. (Rathbone et al., (1986) Gut 27: 635-641). Moreover, evidence is accumulating for an etiologic role of H. pylori in nonulcer dyspepsia, gastric ulcer disease, and gastric adenocarcinoma. (Blaser M. (1993) Trends Microbiol. 1: 255-260). Transmission of the bacteria occurs via the oral route, and the risk of infection increases with age. (Taylor, D.N. and M. J. Blaser, (1991) Epidemiol. Rev 13: 42-50).
H. pylori colonizes the human gastric mucosa, establishing an infection that usually persists for decades. Infection by H. pylori is prevalent worldwide. Developed countries have infection rates over 50% of the adult population, while developing countries have infection rates reaching 90% of the adults over the age of 20. (Hopkins R. J. and J. G. Morris (1994) Am. J. Med. 97: 265-277).
The bacterial factors necessary for colonization of the gastric environment, and for virulence of this pathogen, are poorly understood. Examples of the putative virulence factors include the following: urease, an enzyme that may play a role in neutralizing gastric acid pH (Eaton et al., (1991) Infect. Immunol. 59: 2470-2475; Ferrero, R.L. and A. Lee (1991) Microb. Ecol. Hlth. Dis. 4: 121-134; Labigne et al., (1991) J. Bacteriol. 173: 1920-1931); the bacterial flagellar proteins responsible for motility across the mucous layer (Hazell et al., (1986) J. Inf Dis. 153: 658-663; Leying et al., (1992) Mol. Microbiol. 6: 2863-2874; and Haas et al., (1993) Mol. Microbiol. 8: 753-760); Vac A, a bacterial toxin that induces the formation of intracellular vacuoles in epithelial cells (Schmitt, W. and R. Haas, (1994) Molecular Microbiol. 12(2): 307-319); and several gastric tissue-specific adhesins (Boren et al., (1993) Science 262: 1892-1895; Evans et al., (1993) J. Bacteriol. 175: 674-683; and Falk et al., (1993) Proc. Natl. Acad. Sci. USA 90: 2035-203).
Numerous therapeutic agents are currently available that eradicate H. pylori infections in vitro. (Huesca et al., (1993) Zbl. Bakt. 280: 244-252; Hopkins, R. J. and J. G. Morris, supra). However, many of these treatments are suboptimally effective in vivo because of bacterial resistance, altered drug distribution, patient non-compliance or poor drug availability. (Hopkins, R. J. and J. G. Morris, supra). Treatment with antibiotics combined with bismuth is part of the standard regimen used to treat H. pylori infection. (Malfertheiner, P. and J. E. Dominguez-Munoz (1993) Clinical Therapeutics 15 Supp. B: 37-48). Recently, combinations of a proton pump inhibitor and a single antibiotic have been shown to ameliorate duodenal ulcer disease. (Malfertheiner, P. and J. E. Dominguez-Munoz supra). However, methods employing antibiotic agents can have the problem of the emergence of bacterial strains which are resistant to these agents. (Hopkins, R. J. and J. G. Morris, supra). These limitations demonstrate that new, more effective methods are needed to combat H. pylori infections in vivo. In particular, the design of new vaccines that may prevent infection by this bacterium is highly desirable.
Summary of the Invention
This invention relates to novel genes, e.g., genes encoding polypeptides such as bacterial surface proteins, from the organism Helicobacter pylori (H. pylori), and other related genes, their products, and uses thereof. The nucleic acids and peptides of the present invention have utility for diagnostics and therapeutics for H. pylori and other Helicobacter species. They can also be used to detect the presence of H. pylori and other Helicobacter species in a sample, and in screening compounds for the ability to interfere with the H. pylori life cycle or to inhibit H. pylori infection. More specifically, this invention features compositions of nucleic acids corresponding to entire coding sequences of H. pylori proteins, including surface or secreted proteins or parts thereof, nucleic acids capable of binding mRNA from H. pylori proteins to block protein translation, and methods for producing H. pylori proteins or parts thereof using peptide synthesis and recombinant DNA techniques. This invention also features antibodies and nucleic acids useful as probes to detect H. pylori infection. In addition, vaccine compositions and methods for the protection against infection by H. pylori are within the scope of this invention.
Detailed Description of the Drawings
Figure 1 is a bar graph that depicts the antibody titer in serum of mice following immunization with specific H. pylori antigens.
Figure 2 is a bar graph that depicts the antibody titer in mucus of mice following immunization with specific H. pylori antigens.
Figure 3 is a bar graph that depicts therapeutic immunization of H. pylori infected mice with specific antigens dissolved in HEPES buffer.
Figure 4 is a bar graph that depicts therapeutic immunization of H. pylori infected mice with specific antigens dissolved in buffer containing DOG.
Figure 5 is a graph depicting the activity of recombinant PPIase.
Figure 6 is a graph depicting PPIase activity in an H. pylori extract.
Figure 7 is a graph depicting a decrease of glutamate racemase activity with L-Serine-O-Sulfate.
Figure 8 depicts the amino acid sequence alignment in a portion of the sequence of 12 H. pylori proteins (depicted in the single letter amino acid code and designated by their amino acid Sequence ID Numbers; shown N-terminal to C-terminal, left to right).
Figure 9 depicts the N-terminal portion of nine H. pylori proteins (depicted in the single letter amino acid code and designated by their amino acid Sequence ID Numbers; shown N-terminal to C-terminal, left to right).
Detailed Description of the Invention
In one aspect, the invention features a recombinant or substantially pure preparation of H. pylori polypeptide of SEQ ID NO: 492. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide of SEQ ID NO: 492, such nucleic acid is contained in SEQ ID NO: 1. The H. pylori polypeptide sequences described herein are contained in the Sequence Listing, and the nucleic acids encoding H. pylori polypeptides are contained in the Sequence Listing.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 492 through SEQ ID NO: 541. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 492 through SEQ ID NO: 541, such nucleic acids are contained in SEQ ID NO: 1 through SEQ ID NO:
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 542 through SEQ ID NO: 591. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 542 through SEQ ID NO: 591, such nucleic acids are contained in SEQ ID NO: 51 through SEQ ID NO: 100.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 592 through SEQ ID NO: 641. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 592 through SEQ ID NO: 641, such nucleic acids are contained in SEQ ID NO: 101 through SEQ ID NO: 150.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 642 through SEQ ID NO: 691. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 642 through SEQ ID NO: 691, such nucleic acids are contained in SEQ ID NO: 151 through SEQ ID NO: 200.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 692 through SEQ ID NO: 741. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 692 through SEQ ID NO: 741, such nucleic acids are contained in SEQ ID NO: 201 through SEQ ID NO: 250.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 742 through SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765 through SEQ ID NO: 791. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 742 through SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765 through SEQ ID NO: 791, such nucleic acids are contained in SEQ ID NO: 251 through SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, and SEQ ID NO: 274 through SEQ ID NO: 300.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 792 through SEQ ID NO: 818 and SEQ ID NO: 820 through SEQ ID NO: 841. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 792 through SEQ ID NO: 818 and SEQ ID NO: 820 through SEQ ID NO: 841, such nucleic acids are contained in SEQ ID NO: 301 through SEQ ID NO: 327 and SEQ ID NO: 329 through SEQ ID NO: 350.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 842 through SEQ ID NO: 846 and SEQ ID NO: 848 through SEQ ID NO: 891. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 842 through SEQ ID NO: 846 and SEQ ID NO: 848 through SEQ ID NO: 891, such nucleic acids are contained in SEQ ID NO: 351 through SEQ ID NO: 364 and SEQ ID NO: 366 through SEQ ID NO: 400.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 892 through SEQ ID NO: 896 and SEQ ID NO: 898 through SEQ ID NO: 941. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 892 through SEQ ID NO: 896 and SEQ ID NO: 898 through SEQ ID NO: 941, such nucleic acids are contained in SEQ ID NO: 401 through SEQ ID NO: 405 and SEQ ID NO: 407 through SEQ ID NO: 450.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 942 through SEQ ID NO: 963 and SEQ ID NO: 966 through SEQ ID NO: 982. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 942 through SEQ ID NO: 963 and SEQ ID NO: 966 through SEQ ID NO: 982, such nucleic acids are contained in SEQ ID NO: 451 through SEQ ID NO: 472 and SEQ ID NO: 475 through SEQ ID NO: 491.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041 through SEQ ID NO: 1087 and SEQ ID NO: 1090. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041 through SEQ ID NO: 1087 and SEQ ID NO: 1090, such nucleic acids are contained in SEQ ID NO: 983, SEQ ID NO: 984, SEQ ID NO: 987 through SEQ ID NO: 1033 and SEQ ID NO: 1036.
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides of SEQ ID NO: 1296 through SEQ ID NO: 1298. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides SEQ ID NO: 1296 through SEQ ID NO: 1298, such nucleic acids are contained in SEQ ID NO: 1293 through SEQ ID NO: 1295.
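The pairing of nucleic acid and polypeptide sequence ranges recited above can be sketched as a small lookup. The sketch is illustrative only and is not part of the disclosure. It assumes that, within each pair of ranges, the nucleic acids and polypeptides correspond position by position, which the text does not state outright. The names PAIRED_RANGES and polypeptide_seq_id are hypothetical. Two groups are left out: the group whose final nucleic acid endpoint is not given above, and the group whose stated nucleic acid and polypeptide ranges differ in length (SEQ ID NO: 351 through SEQ ID NO: 400).

```python
# Each tuple: (first nucleic acid SEQ ID, last nucleic acid SEQ ID,
#              first polypeptide SEQ ID, last polypeptide SEQ ID),
# transcribed from the groups recited above.
PAIRED_RANGES = [
    (1, 1, 492, 492),
    (51, 100, 542, 591),
    (101, 150, 592, 641),
    (151, 200, 642, 691),
    (201, 250, 692, 741),
    (251, 268, 742, 759),
    (270, 270, 761, 761),
    (272, 272, 763, 763),
    (274, 300, 765, 791),
    (301, 327, 792, 818),
    (329, 350, 820, 841),
    (401, 405, 892, 896),
    (407, 450, 898, 941),
    (451, 472, 942, 963),
    (475, 491, 966, 982),
    (983, 984, 1037, 1038),
    (987, 1033, 1041, 1087),
    (1036, 1036, 1090, 1090),
    (1293, 1295, 1296, 1298),
]

# Sanity check: a positional pairing only makes sense if both ranges
# in a tuple have the same length.
for nuc_first, nuc_last, pep_first, pep_last in PAIRED_RANGES:
    if nuc_last - nuc_first != pep_last - pep_first:
        raise ValueError("paired ranges differ in length")


def polypeptide_seq_id(nucleic_seq_id):
    """Return the polypeptide SEQ ID paired with a nucleic acid SEQ ID.

    Returns None when the nucleic acid SEQ ID falls outside every
    transcribed range. Positional correspondence is an assumption.
    """
    for nuc_first, nuc_last, pep_first, _pep_last in PAIRED_RANGES:
        if nuc_first <= nucleic_seq_id <= nuc_last:
            return pep_first + (nucleic_seq_id - nuc_first)
    return None


if __name__ == "__main__":
    for nucleic_id in (1, 270, 269, 1295):
        print(nucleic_id, polypeptide_seq_id(nucleic_id))
```

Under the stated assumption, a nucleic acid SEQ ID that is skipped in the recited ranges (for example, SEQ ID NO: 269) has no paired polypeptide and yields no result.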
In another aspect, the invention features a recombinant or substantially pure preparation of an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides as set forth in the Sequence Listing. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide selected from the group consisting of H. pylori polypeptides as set forth in the Sequence Listing. It should be understood that this invention encompasses each of the H. pylori polypeptides and nucleic acids encoding such polypeptides as identified in the Sequence Listing by a given sequence identification number. For example, a representative H. pylori polypeptide is contained in SEQ ID NO: 494. Therefore, this invention encompasses a recombinant or substantially pure preparation of an H. pylori polypeptide of SEQ ID NO: 494. The invention also includes substantially pure nucleic acid encoding an H. pylori polypeptide of SEQ ID NO: 494.
In another aspect, the invention pertains to any individual H. pylori polypeptide member or nucleic acid encoding such member from the above-identified groups of H. pylori polypeptides (e.g., SEQ ID NO: 542-SEQ ID NO: 591) or nucleic acids (e.g., SEQ ID NO: 51-SEQ ID NO: 100), as well as any subgroups from within the above-identified groups. Furthermore, the subgroups can preferably consist of 1, 3, 5, 10, 20, 30 or 40 members of any of the groups identified above, as well as any combinations thereof. For example, the group consisting of H. pylori polypeptides SEQ ID NO: 692 through SEQ ID NO: 741 can be divided into one or more subgroups as follows: SEQ ID NO: 692-SEQ ID NO: 680; SEQ ID NO: 681-SEQ ID NO: 710; SEQ ID NO: 711-SEQ ID NO: 730; SEQ ID NO: 731-SEQ ID NO: 741; or any combinations thereof.
Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence encoding an H. pylori cell envelope polypeptide or a fragment thereof. Such nucleic acid is selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 1010, SEQ ID NO: 1012, SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, SEQ ID NO: 411, SEQ ID NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 417, SEQ ID NO: 430, SEQ ID NO: 992, SEQ ID NO: 1025, SEQ ID NO: 477, SEQ ID NO: 414, SEQ ID NO: 253, SEQ ID NO: 293, SEQ ID NO: 334, SEQ ID NO: 343, SEQ ID NO: 418, SEQ ID NO: 424, and SEQ ID NO: 443.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 1010, and SEQ ID NO: 1012.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, and SEQ ID NO: 1032.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a C-terminal tyrosine cluster or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, and SEQ ID NO: 1295.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, and SEQ ID NO: 1031.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, SEQ ID NO: 411, SEQ ID NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 417, SEQ ID NO: 430, SEQ ID NO: 992, and SEQ ID NO: 1025.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in outer membrane and cell wall synthesis or a fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ ID NO: 354.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, and SEQ ID NO: 1022.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ ID NO: 463.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion or adhesion or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 281 and SEQ ID NO: 988.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in transport or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 407 and SEQ ID NO: 1017.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori flagellar polypeptide or a fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ ID NO: 477.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a fragment thereof encoded by the nucleic acid comprising a nucleotide sequence of SEQ ID NO: 414.
Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence encoding an H. pylori cytoplasmic polypeptide or a fragment thereof. Such nucleic acid is selected from the group consisting of SEQ ID NO: 470, SEQ ID NO: 1033, SEQ ID NO: 357, SEQ ID NO: 457, SEQ ID NO: 461, SEQ ID NO: 1030, SEQ ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, SEQ ID NO: 1003, SEQ ID NO: 351, SEQ ID NO: 416, SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, SEQ ID NO: 1293, SEQ ID NO: 318, SEQ ID NO: 322, SEQ ID NO: 324, SEQ ID NO: 330, SEQ ID NO: 347, SEQ ID NO: 440, SEQ ID NO: 446, SEQ ID NO: 464, SEQ ID NO: 490, SEQ ID NO: 491, SEQ ID NO: 995, SEQ ID NO: 997, SEQ ID NO: 1005, and SEQ ID NO: 1028.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 470 and SEQ ID NO: 1033.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 357 and SEQ ID NO: 457.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in nucleotide metabolism and transport or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 461 and SEQ ID NO: 1030.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, and SEQ ID NO: 1003.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 351 and SEQ ID NO: 416.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in genome replication, transcription, recombination and repair or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, and SEQ ID NO: 1293.
Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence encoding an H. pylori secreted polypeptide or a fragment thereof. Such nucleic acid is selected from the group consisting of SEQ ID NO: 355, SEQ ID NO: 1006, SEQ ID NO: 257, SEQ ID NO: 258, SEQ ID NO: 260, SEQ ID NO: 261, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 279, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 323, SEQ ID NO: 338, SEQ ID NO: 342, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 356, SEQ ID NO: 358, SEQ ID NO: 359, SEQ ID NO: 360, SEQ ID NO: 361, SEQ ID NO: 362, SEQ ID NO: 363, SEQ ID NO: 367, SEQ ID NO: 370, SEQ ID NO: 371, SEQ ID NO: 373, SEQ ID NO: 377, SEQ ID NO: 378, SEQ ID NO: 379, SEQ ID NO: 380, SEQ ID NO: 388, SEQ ID NO: 390, SEQ ID NO: 394, SEQ ID NO: 395, SEQ ID NO: 396, SEQ ID NO: 401, SEQ ID NO: 403, SEQ ID NO: 405, SEQ ID NO: 408, SEQ ID NO: 420, SEQ ID NO: 426, SEQ ID NO: 428, SEQ ID NO: 429, SEQ ID NO: 432, SEQ ID NO: 439, SEQ ID NO: 442, SEQ ID NO: 451, SEQ ID NO: 471, SEQ ID NO: 478, SEQ ID NO: 488, SEQ ID NO: 987, SEQ ID NO: 990, SEQ ID NO: 991, SEQ ID NO: 993, SEQ ID NO: 1001, SEQ ID NO: 1002, SEQ ID NO: 1007, SEQ ID NO: 1013, SEQ ID NO: 1016, SEQ ID NO: 1018, SEQ ID NO: 1021, and SEQ ID NO: 1026.
In another embodiment, the H. pylori secreted polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof encoded by the nucleic acid selected from the group consisting of SEQ ID NO: 355 and SEQ ID NO: 1006.
Particularly preferred is an isolated nucleic acid comprising a nucleotide sequence encoding an H. pylori cellular polypeptide or a fragment thereof. Such nucleic acid is selected from the group consisting of SEQ ID NO: 256, SEQ ID NO: 267, SEQ ID NO: 282, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 344, SEQ ID NO: 369, SEQ ID NO: 376, SEQ ID NO: 382, SEQ ID NO: 386, SEQ ID NO: 423, SEQ ID NO: 431, SEQ ID NO: 435, SEQ ID NO: 436, SEQ ID NO: 458, SEQ ID NO: 462, SEQ ID NO: 475, SEQ ID NO: 476, SEQ ID NO: 479, SEQ ID NO: 480, SEQ ID NO: 481, SEQ ID NO: 482, SEQ ID NO: 483, SEQ ID NO: 484, SEQ ID NO: 485, SEQ ID NO: 486, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 984, SEQ ID NO: 994, SEQ ID NO: 1024, and SEQ ID NO: 1036.
Particularly preferred is a purified or isolated H. pylori cell envelope polypeptide or a fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, SEQ ID NO: 1066, SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 1046, SEQ ID NO: 1079, SEQ ID NO: 968, SEQ ID NO: 905, SEQ ID NO: 744, SEQ ID NO: 784, SEQ ID NO: 825, SEQ ID NO: 834, SEQ ID NO: 909, SEQ ID NO: 915, and SEQ ID NO: 934.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment thereof selected from the group consisting of SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, and SEQ ID NO: 1066.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a fragment thereof selected from the group consisting of SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, and SEQ ID NO: 1086.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a C-terminal tyrosine cluster or a fragment thereof selected from the group consisting of SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, and SEQ ID NO: 1298.
In another embodiment, the H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof selected from the group consisting of SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, and SEQ ID NO: 1085.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment thereof selected from the group consisting of SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 1046, and SEQ ID NO: 1079.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in outer membrane and cell wall synthesis or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 845.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof selected from the group consisting of SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, and SEQ ID NO: 1076.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 954.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion or adhesion or a fragment thereof selected from the group consisting of SEQ ID NO: 772 and SEQ ID NO: 1042.
In another embodiment, the H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in transport or a fragment thereof selected from the group consisting of SEQ ID NO: 898 and SEQ ID NO: 1071.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori flagellar polypeptide or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 968.
In another embodiment, the H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a fragment thereof comprising an amino acid sequence of SEQ ID NO: 905.
Particularly preferred is a purified or isolated H. pylori cytoplasmic polypeptide or a fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO: 961, SEQ ID NO: 1087, SEQ ID NO: 848, SEQ ID NO: 948, SEQ ID NO: 952, SEQ ID NO: 1084, SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, SEQ ID NO: 1057, SEQ ID NO: 842, SEQ ID NO: 907, SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, SEQ ID NO: 1296, SEQ ID NO: 809, SEQ ID NO: 813, SEQ ID NO: 815, SEQ ID NO: 821, SEQ ID NO: 838, SEQ ID NO: 931, SEQ ID NO: 937, SEQ ID NO: 955, SEQ ID NO: 981, SEQ ID NO: 982, SEQ ID NO: 1049, SEQ ID NO: 1051, SEQ ID NO: 1059, and SEQ ID NO: 1082.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof selected from the group consisting of SEQ ID NO: 961 and SEQ ID NO: 1087.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a fragment thereof selected from the group consisting of SEQ ID NO: 848 and SEQ ID NO: 948.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in nucleotide metabolism and transport or a fragment thereof selected from the group consisting of SEQ ID NO: 952 and SEQ ID NO: 1084.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof selected from the group consisting of SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, and SEQ ID NO: 1057.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof selected from the group consisting of SEQ ID NO: 842 and SEQ ID NO: 907.
In another embodiment, the H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in genome replication, transcription, recombination and repair or a fragment thereof selected from the group consisting of SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, and SEQ ID NO: 1296.
Particularly preferred is a purified or isolated H. pylori secreted polypeptide or a fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO: 846, SEQ ID NO: 1060, SEQ ID NO: 748, SEQ ID NO: 749, SEQ ID NO: 751, SEQ ID NO: 752, SEQ ID NO: 755, SEQ ID NO: 756, SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765, SEQ ID NO: 766, SEQ ID NO: 767, SEQ ID NO: 770, SEQ ID NO: 774, SEQ ID NO: 775, SEQ ID NO: 778, SEQ ID NO: 779, SEQ ID NO: 780, SEQ ID NO: 782, SEQ ID NO: 786, SEQ ID NO: 787, SEQ ID NO: 788, SEQ ID NO: 789, SEQ ID NO: 791, SEQ ID NO: 792, SEQ ID NO: 793, SEQ ID NO: 794, SEQ ID NO: 795, SEQ ID NO: 796, SEQ ID NO: 805, SEQ ID NO: 806, SEQ ID NO: 814, SEQ ID NO: 829, SEQ ID NO: 833, SEQ ID NO: 839, SEQ ID NO: 840, SEQ ID NO: 847, SEQ ID NO: 849, SEQ ID NO: 850, SEQ ID NO: 851, SEQ ID NO: 852, SEQ ID NO: 853, SEQ ID NO: 854, SEQ ID NO: 858, SEQ ID NO: 861, SEQ ID NO: 862, SEQ ID NO: 864, SEQ ID NO: 868, SEQ ID NO: 869, SEQ ID NO: 870, SEQ ID NO: 871, SEQ ID NO: 879, SEQ ID NO: 881, SEQ ID NO: 885, SEQ ID NO: 886, SEQ ID NO: 887, SEQ ID NO: 892, SEQ ID NO: 894, SEQ ID NO: 896, SEQ ID NO: 899, SEQ ID NO: 911, SEQ ID NO: 917, SEQ ID NO: 919, SEQ ID NO: 920, SEQ ID NO: 923, SEQ ID NO: 930, SEQ ID NO: 933, SEQ ID NO: 942, SEQ ID NO: 962, SEQ ID NO: 969, SEQ ID NO: 979, SEQ ID NO: 1041, SEQ ID NO: 1044, SEQ ID NO: 1045, SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: 1056, SEQ ID NO: 1061, SEQ ID NO: 1067, SEQ ID NO: 1070, SEQ ID NO: 1072, SEQ ID NO: 1075, and SEQ ID NO: 1080.
In another embodiment, the H. pylori secreted polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof selected from the group consisting of SEQ ID NO: 846 and SEQ ID NO: 1060.
Particularly preferred is a purified or isolated H. pylori cellular polypeptide or a fragment thereof, wherein the polypeptide is selected from the group consisting of SEQ ID NO: 747, SEQ ID NO: 758, SEQ ID NO: 773, SEQ ID NO: 797, SEQ ID NO: 798, SEQ ID NO: 799, SEQ ID NO: 800, SEQ ID NO: 801, SEQ ID NO: 807, SEQ ID NO: 808, SEQ ID NO: 823, SEQ ID NO: 824, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID NO: 830, SEQ ID NO: 831, SEQ ID NO: 832, SEQ ID NO: 835, SEQ ID NO: 860, SEQ ID NO: 867, SEQ ID NO: 873, SEQ ID NO: 877, SEQ ID NO: 914, SEQ ID NO: 922, SEQ ID NO: 926, SEQ ID NO: 927, SEQ ID NO: 949, SEQ ID NO: 953, SEQ ID NO: 966, SEQ ID NO: 967, SEQ ID NO: 970, SEQ ID NO: 971, SEQ ID NO: 972, SEQ ID NO: 973, SEQ ID NO: 974, SEQ ID NO: 975, SEQ ID NO: 976, SEQ ID NO: 977, SEQ ID NO: 978, SEQ ID NO: 980, SEQ ID NO: 1038, SEQ ID NO: 1048, SEQ ID NO: 1078, and SEQ ID NO: 1090.
In another aspect, the invention pertains to any individual H. pylori polypeptide member or nucleic acid encoding such a member from the above-identified groups of H. pylori polypeptides.
In another aspect, the invention features nucleic acids capable of binding mRNA of H. pylori. Such nucleic acid is capable of acting as antisense nucleic acid to control the translation of mRNA of H. pylori. A further aspect features a nucleic acid which is capable of binding specifically to an H. pylori nucleic acid. These nucleic acids are also referred to herein as complements and have utility as probes and as capture reagents.
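The notion of a complement or antisense nucleic acid can be illustrated with a minimal Python sketch. The function names `reverse_complement` and `antisense_rna` are hypothetical and do not come from the specification; the sketch assumes unambiguous bases only and says nothing about how an actual probe or antisense reagent would be designed.

```python
# Illustrative sketch only: function names are hypothetical, and only the
# unambiguous bases A, C, G, T (DNA) or A, C, G, U (RNA) are handled.
_DNA_COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")
_RNA_COMPLEMENT = str.maketrans("ACGUacgu", "UGCAugca")


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of a DNA strand, read 5' to 3'."""
    return dna.translate(_DNA_COMPLEMENT)[::-1]


def antisense_rna(mrna: str) -> str:
    """Return the RNA strand (5' to 3') that base-pairs with the given mRNA."""
    return mrna.translate(_RNA_COMPLEMENT)[::-1]
```

For example, `reverse_complement("ATGC")` gives `"GCAT"`, and `antisense_rna("AUGC")` gives `"GCAU"`.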
In another aspect, the invention features an expression system comprising an open reading frame corresponding to H. pylori nucleic acid. The nucleic acid further comprises a control sequence compatible with an intended host. The expression system is useful for making polypeptides corresponding to H. pylori nucleic acid.
In another aspect, the invention features a cell transformed with the expression system to produce H. pylori polypeptides.
In another aspect, the invention features a method of generating antibodies against H. pylori polypeptides which are capable of binding specifically to H. pylori polypeptides. Such antibodies have utility as reagents for immunoassays to evaluate the abundance and distribution of H. pylori-specific antigens.
In another aspect, the invention features a method of generating vaccines for immunizing an individual against H. pylori. The method includes: immunizing a subject with an H. pylori polypeptide, a surface or secreted polypeptide, or active portion thereof, and a pharmaceutically acceptable carrier. Such vaccines have therapeutic and prophylactic utilities.
In another aspect, the invention provides a method for generating a vaccine comprising a modified immunogenic H. pylori polypeptide, a surface or secreted polypeptide, or active portion thereof, and a pharmacologically acceptable carrier.
In another aspect, the invention features a method of evaluating a compound, e.g., a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori polypeptide. The method includes: contacting the candidate compound with an H. pylori polypeptide and determining if the compound binds or otherwise interacts with an H. pylori polypeptide. Compounds which bind H. pylori are candidates as activators or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo.
In another aspect, the invention features a method of evaluating a compound, e.g., a polypeptide, e.g., a fragment of a host cell polypeptide, for the ability to bind an H. pylori nucleic acid, DNA or RNA. The method includes: contacting the candidate compound with an H. pylori nucleic acid and determining if the compound binds or otherwise interacts with an H. pylori nucleic acid. Compounds which bind H. pylori are candidates as activators or inhibitors of the bacterial life cycle. These assays can be performed in vitro or in vivo.
The invention features H. pylori polypeptides, preferably a substantially pure preparation of an H. pylori polypeptide, or a recombinant H. pylori polypeptide. In preferred embodiments: the polypeptide has biological activity; the polypeptide has an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid sequence contained in the Sequence Listing; the polypeptide has an amino acid sequence essentially the same as an amino acid sequence in the Sequence Listing; the polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acid residues in length; the polypeptide includes at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acid residues of a polypeptide contained in the Sequence Listing.
In preferred embodiments: the H. pylori polypeptide is encoded by a nucleic acid contained in the Sequence Listing, or by a nucleic acid having at least 60%, 80%, 90%, 95%, 98%, or 99% homology with a nucleic acid shown in the Sequence Listing.
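The percentage thresholds above can be made concrete with a small worked example. The following Python sketch computes percent identity over two sequences that are assumed to be already aligned and of equal length; this is a simplification, since this passage does not fix a particular alignment method or scoring scheme for "homology". The function names and the default threshold are hypothetical.

```python
# Illustrative sketch only. Assumes the two sequences are already aligned,
# gap-free, and of equal length; real homology comparisons use an alignment
# program and may score conservative substitutions differently.
def percent_identity(seq_a: str, seq_b: str) -> float:
    """Percentage of positions at which two equal-length sequences match."""
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


def meets_threshold(seq_a: str, seq_b: str, threshold: float = 60.0) -> bool:
    """True if percent identity is at least `threshold` (default 60%)."""
    return percent_identity(seq_a, seq_b) >= threshold
```

For instance, the five-residue peptides `MKVLA` and `MKVLG` differ at one position, so `percent_identity` returns 80.0 and the pair passes a 60% threshold but not a 90% one.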
In a preferred embodiment, the subject H. pylori polypeptide differs in amino acid sequence at 1, 2, 3, 5, 10 or more residues from a sequence in the Sequence Listing. The differences, however, are such that the H. pylori polypeptide exhibits an H. pylori biological activity, e.g., the H. pylori polypeptide retains a biological activity of a naturally occurring H. pylori enzyme.
In preferred embodiments, the polypeptide includes all or a fragment of an amino acid sequence contained in the Sequence Listing; fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence contained in the Sequence Listing.
In yet other preferred embodiments, the H. pylori polypeptide is a recombinant fusion protein having a first H. pylori polypeptide portion and a second polypeptide portion, the second polypeptide portion having an amino acid sequence unrelated to H. pylori. The second polypeptide portion can be, e.g., any of glutathione-S-transferase, a DNA binding domain, or a polymerase activating domain. In a preferred embodiment, the fusion protein can be used in a two-hybrid assay.
Polypeptides of the invention include those which arise as a result of alternative transcription events, alternative RNA splicing events, and alternative translational and post-translational events.
The invention also encompasses an immunogenic component which includes an H. pylori polypeptide in an immunogenic preparation; the immunogenic component being capable of eliciting an immune response specific for the H. pylori polypeptide, a humoral response, an antibody response, or a cellular response. In preferred embodiments, the immunogenic component comprises at least one antigenic determinant from a polypeptide contained in the Sequence Listing.
In another aspect, the invention provides a substantially pure nucleic acid having a nucleotide sequence which encodes an H. pylori polypeptide. In preferred embodiments: the encoded polypeptide has biological activity; the encoded polypeptide has an amino acid sequence at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous to an amino acid sequence contained in the Sequence Listing; the encoded polypeptide has an amino acid sequence essentially the same as an amino acid sequence in the Sequence Listing; the encoded polypeptide is at least 5, 10, 20, 50, 100, or 150 amino acids in length; the encoded polypeptide comprises at least 5, preferably at least 10, more preferably at least 20, more preferably at least 50, 100, or 150 contiguous amino acids contained in the Sequence Listing.
In preferred embodiments: the nucleic acid is that as shown in the Sequence Listing; the nucleic acid is at least 60%, 70%, 80%, 90%, 95%, 98%, or 99% homologous with a nucleic acid sequence contained in the Sequence Listing.
In a preferred embodiment, the encoded H. pylori polypeptide differs (e.g., by amino acid substitution, addition or deletion of at least one amino acid residue) in amino acid sequence at 1, 2, 3, 5, 10 or more residues from a sequence in the Sequence Listing. The differences, however, are such that the encoded H. pylori polypeptide exhibits an H. pylori biological activity, e.g., the encoded H. pylori enzyme retains a biological activity of a naturally occurring H. pylori enzyme.
In preferred embodiments, the encoded polypeptide includes all or a fragment of an amino acid sequence contained in the Sequence Listing; fused, in reading frame, to additional amino acid residues, preferably to residues encoded by genomic DNA 5' to the genomic DNA which encodes a sequence contained in the Sequence Listing.
In preferred embodiments, the subject H. pylori nucleic acid will include a transcriptional regulatory sequence, e.g., at least one of a transcriptional promoter or transcriptional enhancer sequence, operably linked to the H. pylori gene sequence, e.g., to render the H. pylori gene sequence suitable for expression in a recombinant host cell.
In yet a further preferred embodiment, the nucleic acid which encodes an H. pylori polypeptide of the invention hybridizes under stringent conditions to a nucleic acid probe corresponding to at least 8 consecutive nucleotides of a nucleic acid contained in the Sequence Listing; more preferably to at least 12 consecutive nucleotides of a nucleic acid contained in the Sequence Listing; more preferably to at least consecutive nucleotides of a nucleic acid contained in the Sequence Listing; more preferably to at least 40 consecutive nucleotides of a nucleic acid contained in the Sequence Listing.
In a preferred embodiment, the nucleic acid encodes a peptide which differs by at least one amino acid residue from the sequences shown in the Sequence Listing.
In a preferred embodiment, the nucleic acid differs by at least one nucleotide from a nucleotide sequence shown in the Sequence Listing which encodes amino acids shown in the Sequence Listing.
In another aspect, the invention encompasses: a vector including a nucleic acid which encodes an H. pylori polypeptide or an H. pylori polypeptide variant as described herein; a host cell transfected with the vector; and a method of producing a recombinant H. pylori polypeptide or H. pylori polypeptide variant, including culturing the cell, e.g., in a cell culture medium, and isolating the H. pylori polypeptide or H. pylori polypeptide variant from the cell or from the cell culture medium.
In another aspect, the invention features a purified recombinant nucleic acid having at least 50%, 60%, 70%, 80%, 90%, 95%, 98%, or 99% homology with a sequence contained in the Sequence Listing.
The invention also provides a probe or primer which includes a substantially purified oligonucleotide. The oligonucleotide includes a region of nucleotide sequence which hybridizes under stringent conditions to at least 10 consecutive nucleotides of sense or antisense sequence contained in the Sequence Listing, or naturally occurring mutants thereof. In preferred embodiments, the probe or primer further includes a label group attached thereto. The label group can be, e.g., a radioisotope, a fluorescent compound, an enzyme, and/or an enzyme co-factor. Preferably, the oligonucleotide is at least 10 and less than 20, 30, 50, 100, or 150 nucleotides in length.
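As a rough computational illustration of the "consecutive nucleotides of sense or antisense sequence" criterion, the Python sketch below checks whether an oligonucleotide shares an exact run of at least `n` bases with a reference sequence or with its reverse complement. Exact string matching is only a crude stand-in for hybridization under stringent conditions, which depends on temperature, salt concentration, and mismatch tolerance; the function names and the example sequence are hypothetical.

```python
# Illustrative sketch only: exact matching approximates, but does not model,
# hybridization under stringent conditions. All names are hypothetical.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def _reverse_complement(dna: str) -> str:
    """Reverse complement of an upper-case DNA string."""
    return dna.translate(_COMPLEMENT)[::-1]


def shares_consecutive_run(oligo: str, reference: str, n: int = 10) -> bool:
    """True if some n-base window of `oligo` occurs exactly in `reference`
    (sense) or in its reverse complement (antisense)."""
    oligo, reference = oligo.upper(), reference.upper()
    targets = (reference, _reverse_complement(reference))
    for start in range(len(oligo) - n + 1):
        window = oligo[start:start + n]
        if any(window in target for target in targets):
            return True
    return False


# Made-up reference sequence for demonstration; not taken from the Sequence Listing.
reference = "ATGGCTAGCTAGGATCCGATCGTA"
sense_probe = reference[3:15]                      # 12 bases copied from the sense strand
antisense_probe = _reverse_complement(sense_probe)  # the same region on the other strand
```

With these inputs, both `sense_probe` and `antisense_probe` satisfy the 10-base criterion, while an unrelated oligonucleotide such as `"TTTTTTTTTTTT"` does not.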
The invention further provides nucleic acids, RNA or DNA, encoding a polypeptide of the invention. This includes double stranded nucleic acids as well as coding and antisense single strands.
The H. pylori strain from which genomic sequences have been sequenced has been deposited in the American Type Culture Collection (ATCC) as strain HP-J99.
Included in the invention are: allelic variations; natural mutants; induced mutants; proteins encoded by DNA that hybridizes under high or low stringency conditions to a nucleic acid which encodes a polypeptide as shown in the Sequence Listing (for definitions of high and low stringency see Current Protocols in Molecular Biology, John Wiley & Sons, New York, 1989, 6.3.1-6.3.6, hereby incorporated by reference); and polypeptides specifically bound by antisera to H. pylori polypeptides, especially by antisera to an active site or binding domain of an H. pylori polypeptide. The invention also includes fragments, preferably biologically active fragments. These and other polypeptides are also referred to herein as H. pylori polypeptide analogs or variants.
Putative functions have been determined for several of the H. pylori polypeptides of the invention, as shown in Table 1.
Accordingly, uses of the claimed H. pylori polypeptides in these identified functions are also within the scope of the invention.
In addition, the present invention encompasses H. pylori polypeptides characterized as shown in Table 1 below, including: H. pylori cell envelope proteins, H. pylori secreted proteins, H. pylori cytoplasmic proteins and H. pylori cellular proteins.
Members of these groups were identified by BLAST homology searches and by searches for secretion signal or transmembrane protein motifs. Polypeptides related by significant homology to the polypeptides of Table 1 (as reflected by FASTA comparisons of the amino acid sequences and indicated in many cases in Tables 3-6 below) are also considered to be classified in the manner of the homolog shown in Table 1.
TABLE 1 ORF NAME nt Seq ID# aa Seq ID A.CELL ENVELOPE A.1 Outer membrane A.1.2 Terminal phe residue_ Olael2001 24218781 f2 18 255 746 WO 97/37044 PCTfUS97/05223 -21 O2gei 1622 875260_f3_36 263 754 02gp20706_23632775_ 3_32 266 757 O7apl1015 23938312 f0 2 271 762 05ee10816 4103408 f) 11 277 768 06ap20306_23437632f3_9 280 771 07ap20216 7227202 10 285 776 14ap1081520585777 ci 13 292 783 hplp14013_11726503_c2_20 294 785 hp5piS64i_21698387_c220 299 790 02gp20706_16803513flI 311 802 02gp20706 20365905 f2 8 312 803 02gp20814_3984818_fl _1 313 804 05ae30220_9882767_f2_34 321 812 07ep11916_5913592_0 18 327 818 09ze10333 22460750 f2 6 329 820 14ap10221_13689381_c3 4 331 882 06ce20610 1367157 fl 8.aa 353 844 02ge10116 16803513 C 34 364 855 02ge10116 36367936 ci 92 366 857 05celO910 25598277 f3 3 368 859 06cp11217 4881263 f2 9 375 866 06ep30223_34409437 0 94 384 875 06gp71906_970325_c3_190 391 882 O7ae10923_24426508 fl 1 392 883 09cp10224_1062966 c3 61 397 888 09cp10224_1412715 c3 56 398 889 09cp6 00314562637 c2 93 402 893 09cp61003_240635 87_c 74 404 895 l1ae80818 7952 ci 49 409 900 11ap20714 4960432 c3 97 410 901 11ap20714_7227202 S 43.aa 412 903 hp5p15641_12195281 ci_24 427 918 hp6p109034398263_B6.aa 433 924 hp6p10903 4398263 S 6.aa 434 925 02ae31010 30208317 fi 14 441 932 02cp10615_26573462 cl 45 444 935 03ae10804_12609533 ci_26 445 936 04ep419034101593 f2_ .aa 449 940 05ep10815 26570332_c299 450 941 05ep10815 4719175 ci 83 452 943 06cp30603_679218 f2 34 453 944 llae808i8 19632781 c3 57 466 957 11ap20714_34023312 B46 468 959 11ee11408 4977193 ci 41.aa 469 960 _ael192212586675 f2 1 983 1037 07eeI402 24582670108 985 1039 lap20714 7227202_40 989 1043 hp7e]0192255982 77c2_15 1008 1062 06ep30223_34409437f264 1011 1065 09cp10224_1062966 cl 44 1014 1068 01ce61016_12931513_c2_106 1015 1069 05eplO815 4719175 ci 115 1029 1083 WO 97/37044 WO 9737044PCTfLJS97/05223 22 05ae30220_4-977193_c3_198 1032 1086 A. 1.3 No terminal ph e residue750 02ae11612_247267_f2_27 25975 A. 
1.4 C-terminal tyrosine cluster motif 07ep11916_5273452_c3_31 286 777 07ce11019 22051291 fl_1 326 817 06cp11217 19720300 f3 11 374 865 09cp 10224_429510_-c2_46.aa 399 890 hp4e53394 22864682 c2 86.aa 422 913 06ep10615_961562 12 41 454 945 09cp61003 492187 c2 80.aa- 465 956 l4gp 1 1423__26803 80 1_10_7 998 1052 hp4e53394_19720300 c3 98 1009 1063 hp4e53394 197203000c398 1023 1077 01ce61016 492187 c3120 1294 1297 06ep10615 961562 fi15 1295 1298 Terminal phe residue and C-terminal tyrosine cluster 04ge10816_33726080 c2 29 319 810 06ee10709 21675012 fl 2 325 816 hp~pl5575_33445317 f2 20.aa 425 916 01cp20708 36134808 127 11 437 928 02ae31010 12504512 1328.aa 438 929 04ep41903 26757937 S3 16 447 938 04ep41903_26757937_1S_16 448 939 11ae80818_7290627_c2_51 467 958 l3ae I0610 35912C123 996 1050 01cp20708_13086002 13 27 1027 1081 l4cpI 1908 24218954 ci 68 1031 1085 A. 1.6_Via homolgy______ OlaelIOIO 40688 c2 38 254 745 02gp20814 24415958 S3 9 269 760 O9cpI 1003_5945252 f2 4 328 819 06cp30603_23476568 c2 133.aa 352 843 09cp61003_5945252 f] 5 406 897 14ee41924 235272676c3107 415 906 07ee11402 10759567 c2 86 1019 1073 A. 1.7 Other outer membrane proteins 06ep10615_9842 B3 46 381 872 06gp10409_3398427 12 12 389 880 06ep10615_9842 fl 5 1010 1064 06gp10409_3398427 12_12 1012 1066 A.2 Inner Membrane A.2. 
1 Proteins involved in outer membrane and cell wall synthesis 06ep30223 4698838 Q2 55 354 845 A.2.2 Proteins involved in energy conversion 06ce20610_43,3133801318 372 863 09cp10224_4484718_ci_38 400 891 hp4e13394 5964452 c2 97 421 912 hp4e13394 15828963 c2 90 1022 1076 A.2.3 Proteins involved in cofactor metabolism 06gp71906_25478192 ci 131 463 1954_ WO 97/37044 PCT/US97/05223 -23- A.2.4 Proteins involved in secretion and adhesion 06cp30603_23452_c3_80 281 772 09cp071 3 23452 c3 195 988 1042 Via homolgy llap20714 5271967 cl 60 411 902 A.2.6 Proteins involved in transport 11ae80818 11188791 c3 60 407 898 14cpl1908_25593768_c3 97 1017 1071 A.2.7 Other inner membrane proteins 13ae10712 14100018_f2 12 290 781 hplpl3939 25397327 3 22 417 908 hp5pl5870_14350428 fI_ 1 430 921 06ge20501_ 14100018 cl 34 992 1046 05ae30220 14350428_f3_91 1025 1079 A.3 Flagellar proteins hp4e13394 3368767 cl 80 477 968 A.4 Transport proteins 14ce31519_15635927_ B_15 414 905 Other cell envelope proteins 04cpil202 24256567 c3 117 253 744 29ge3032134157812 f3 10 293 784 29ge30321_12913562fl_ 334 825 hp6pl0233_12273302fl_1 343 834 hp2e0911 24855312_cl69 418 909 hp5p15575_29300311 cl 29 424 915 02ae31010_5085162 cl_47 443 934 B. 
CYTOPLASMIC PROTEINS B.1 I Proteins involved in energy conversion 11ge10308 5256 _f2 _1 470 961 11ge10308 24609417 f2 1 1033 1087 B.2 Proteins involved in amino acid metabolism and transport 06ep11917 24803153 c3 24 357 848 06ep11202 4884677 cl 17 457 948 B.3 Proteins involved in nucleotide metabolism and transport 06ep30223 23476067 cl_ 119 461 952 06ep30223 23476067 cl_ 115 1030 1084 B.4 Proteins involved in cofactor metabolism 07apl 1213_35156577 cl 24 345 836 06ep30223_23557202 c2_130 383 874 06ep30223_5109443_ cl 109 387 878 06epl1202_133293cl 19 455 946 07ee50709 35156577_3_80 1003 1057 Proteins involved in lipid metabolism hp6e12267 14650278_f329 351 842 14ee41924 23834800_f2_32 416 907 B.6 Proteins involved in mRNA translation and ribosome biogenesis 14ee41924 16282067_cl 72 473 964 07eel1402_19565702c2 88 1034 1088 B.7 Proteins involved in genome replication, transcription, recombination and repair" 05ee10816 4687651 cl_22 278 769 WO 97/37044 PCT/US97/05223 -24- 29ge30321 135253 f2 6 335 826 07ap80601 976413 3 9 346 837 14ce2151685786 fl 1 350 841 hp2el0911 _3349cl 63 419 910 06ep30223 16512 c3 160 460 951 14ce61516 13073577 £2 12 472 963 07ee50709 4818967 f2_43 1000 1054 05ae30220 976413 c3 204 1004 1058 07ee50709_4818967 f2 43 1020 1074 hp7e10590_13073577_c3 107 1293 1296 B.8 Other cytoplasmic proteins 04ge10816 22086531 12 10 318 809 05cp21223 4725443 _3 14 322 813 06ap11119_244265080326 324 815 12ge10321 4821082 f3_14 330 821 hp3p10807_189075_f2 4 347 838 02ae31010_2117087 f334 440 931 03ae10804 21698400 c2 32 446 937 06gp71906 25504187 f3_112 464 955 hp7p0290 35156558 f3 15 490 981 hp7p0290 4351718 fl_ 6 491 982 13ae10610 859692 c2_32 995 1049 06ap11119 24426508 3 27 997 1051 hp3ell188 47327_ 2 9 1005 1059 07ee50709 26438968 £2 36 1028 1082 C. 
SECRETED PROTEINS C.1 I Proteins involved in secretion and adhesion 12ap103244805318 f2 3 355 846 12ap10324_4805318_f2_6 1006 1060 C.2 Other secreted proteins 01gp1016_4103403_c2_13 257 748 02ae11612 1074212 fl 1 258 749 02ae11612 23598175 fl 2 260 751 02ael1612 33203250 cl 51 261 752 02gp20706_1203402 c3 58 264 755 02gp20706_15781452 c2_51 265 756 02gp20706 4892558_f3_19 268 759 04cpl 1202_24261588_ f2 23 270 761 05ae30220_21619067 £3_56 272 763 07apI1213 35401528 cl 21 273 764 05ae30220 24882812 c3 103 274 765 05ae30220 25953163 c3 98 275 766 05ee10816 14649077 3 18 276 767 06ap11119_16594193_fl_9 279 770 06cp30603 4689068c3_79 283 774 07aplllll_234693_c3_14 284 775 09cp11003_19532625_ c3 _17 287 778 09cp20502_24001388 _cl 31 288 779 12gp31106_3024126_f2 25 289 780 13ae10712 29569208 c2 27 291 782 hp2p10272 23697200 f3 22 295 786 hp2p10272 26829136 fl 1 296 787 WO 97/37044 PCT/US97/05223 hp5el5211819455 c2 24 297 788 hp5pl5212_34064750 f2_9 298 789 hp6e10967_23476502_12_6 300 791 hp6e]0967 24882750 f2 7 301 792 hp6e12267_4876718f2_23 302 793 hp6e20339 1190660 c2 46 303 794 hp6e20339 21492187 ci 40 304 795 hp6e20339 34024187 ci 37 305 796 04cp11202_16603425 c2 72 314 805 04cp11202_19797128 fl 5 315 806 05ee10816 259703 12 7 323 814 hp2p10272_22692325 S 21 338 829 hp6e20339 24317062 c3 57 342 833 O2gel1622_21695936_cl 54 348 839 12ge10321 24308513 3 20 349 840 14ee41924 2458267_c2 93 356 847 01ce11104 36125337 ci 8 358 849 01ce21104_33203250_c3_87 359 850 02ae31010_34616666 f2 27 360 851 02ae3101035270000_03 33 361 852 02ae31010 36132785 12 29 362 853 02ge10116 15781452 ci 87 363 854 03ae10804_23485968c3_47 367 858 06ce20610_29298537_c2_32 370 861 06ce20610_3913967_c3_36 371 862 06cp11118_212827_cl 17 373 864 06cp30603_21492187_f2_41 377 868 06cp30603_34024187 fl 20 378 869 06cp30603_34024187 fl 20 379 870 06ep10615_14649077_ S_52 380 871 06ep30223_5271902 ci 106 388 879 06gp7190624261588_c2_174 390 881 09ce10413414011 fI_3 394 885 09ce10413 5865665 fl 4 395 886 
09ce52017 29324062 c 21 396 887 09cp216077224187_c2_12 401 892 09cp6100319532625_cl78 403 894 09cp6100324335762 c3 111 405 896 11ae80818_783127 c363 408 899 hp4e13394 35957200 fl 21 420 911 hpSpl5575_6140713_12_18 426 917 hp5p1564124304527_c3 35 428 919 hp5pl564l 25635452_c3_34 429 920 hp6p10606 19546933_c3 31 432 923 02ae31010_16833312_12 19 439 930 02ae31010_36132785_12_29 442 933 05ep10815_4195292_cl84 451 942 12ap10324_13178562_136 471 962 hp2eI0911 4882027_c287 474 965 hp5p15212 6928132 c3 34 478 969 hp7p1029025548812 13 14 488 979 07ee50709 10213593 3 77 986 1040 WO 97/37044 WO 9737044PCT/US97/05223 26 06ep10615_14649077 Q2 30 987 1041 01ce61016_236095800c3139 990 1044 06gp71906_3024126_cl 128- 991 1045 09cp10713 34024187 fl 31 993) 1047 02ap11117 23495187 c3_81 1001 1055 09cp10713 34024187 fi 31 1002 1056 -hple80523_23485968_c2_49 1007 1061 09ce10413 5865665_fl_4 1013 106-7 01ce61016_23609580_c3139 1016 1070 14cp11908_783127_ci 72 1018 1072 hp4e13394_5088562 B3 54 1021 1075 hp8elOO8O 19546933 c2 88 1026 1080 07ee50709 960952fV247 1035 1089 D. 
OTHER CELLULAR PROTEINS Olgel 1619_23711062_c314 256 7 47 02g-p20706 23866562 c2 53 267 758 06cp30603 23476568 ci 44 282 773 01lge10203 35281542 c3 16 306 -797 01ge10203_860166 B3 9 307 798 01ge11619_13788141_c2_11 308 799 01ge11619_24415880_c2_12 3,0-9 800 Olgel 1619 24417813 ci 8 310 801 O4cpI 1202_23553177_ci_75 316 807 04cp11202_23553177_c3_109 31 7 -T08 29ep10720_24220926_V2 8 332 823 29ep10720 24432762 c3 39 333 824 29ge30321_ 21673965_f2_7 3316 827 29ge30321_24336712_fl_5 337 828 hp2p10272_24406280_ci_26 339 830 hp3p10807_29343768_f) 1 340 831 hp3p10807 29352212 f2 5 341 832 02ep20506 24611 325_f_6 344 835 06ae11016_30579712f2_21 369 860 O6cpI 1217_4897077_fI_ 6 376 867 O6epI 1202_26353438_ci22 382 873 06ep30223 4876077 c3 149 386 877 hp5e15044 4554652f3 3 423 914 hp6p10590 23440913 c2 31 431 922 hp6p10904 2214676 ci14 435 926 hp6p10904_23704412_f2 5 43 6 927 O6ep I 1202_792962.c2_26.aa 458 949 06gp71906_15115637_f2_59 462 953 hp3e11188 47327 V2 5 475 966 hp3el1188 5082842_012 76 967 _pp54_0731 c22 479 970 hpp54_51 c2 29 480 971 hpp _9 30203 21 481 972 hpp194_0802 l 6482 973 hpp~2_16 7 B_ 1 483 974 hp6p122443948467 ci 52 484 975 hp6p22217_23470967_fl 4 485 976 hp7e10192 4412568 V2 5 48697 hp7p10287_24611325_c2 24 487 978 hp7pl0290_25585941_f3 12 489 980 02gel0116 23866562c3 146 984 1038 hp4p62853_5914693 c3_52 994 1048 07ce10312_4554652 f3 2 1024 1078 hp6p12244_3948467 c3 88 1036 1090

[In Table 1, "nt" represents nucleotide Seq. ID number and "aa" represents amino acid Seq. ID number]

Definitions

As used herein, the term "comprise" and variations of the term, such as "comprising", "comprises" and "comprised", are not intended to exclude other additives, components, integers or steps.
A purified preparation or a substantially pure preparation of a polypeptide, as used herein, means a polypeptide that has been separated from other proteins, lipids, and nucleic acids with which it naturally occurs. Preferably, the polypeptide is also separated from substances, e.g., antibodies or gel matrix, e.g., polyacrylamide, which are used to purify it. Preferably, the polypeptide constitutes at least 10, 20, 50, 70, 80 or 95% dry weight of the purified preparation. Preferably, the preparation contains: sufficient polypeptide to allow protein sequencing; at least 1, 10, or 100 µg of the polypeptide; at least 1, 10, or 100 mg of the polypeptide.
A purified preparation of cells refers to, in the case of plant or animal cells, an in vitro preparation of cells and not an entire intact plant or animal. In the case of cultured cells or microbial cells, it consists of a preparation of at least 10% and more preferably 50% of the subject cells.
A substantially pure nucleic acid, e.g., a substantially pure DNA, is a nucleic acid which is one or both of the following: not immediately contiguous with both of the coding sequences with which it is immediately contiguous (i.e., one at the 5' end and one at the 3' end) in the naturally-occurring genome of the organism from which the nucleic acid is derived; or which is substantially free of a nucleic acid with which it occurs in the organism from which the nucleic acid is derived. The term includes, for example, a recombinant DNA which is incorporated into a vector, into an autonomously replicating plasmid or virus, or into the genomic DNA of a prokaryote or eukaryote, or which exists as a separate molecule (e.g., a cDNA or a genomic DNA fragment produced by PCR or restriction endonuclease treatment) independent of other DNA sequences. Substantially pure DNA also includes a recombinant DNA which is part of a hybrid gene encoding additional H. pylori DNA sequence.
A "contig" as used herein is a nucleic acid representing a continuous stretch of genomic sequence of an organism.
An "open reading frame", also referred to herein as ORF, is a region of nucleic acid which encodes a polypeptide. This region may represent a portion of a coding sequence or a total sequence.
As used herein, a "coding sequence" is a nucleic acid which is transcribed into messenger RNA and/or translated into a polypeptide when placed under the control of appropriate regulatory sequences. The boundaries of the coding sequence are determined by a translation start codon at the five prime terminus and a translation stop codon at the three prime terminus. A coding sequence can include but is not limited to messenger RNA, synthetic DNA, and recombinant nucleic acid sequences.
A "complement" of a nucleic acid as used herein refers to an anti-parallel or antisense sequence that participates in Watson-Crick base-pairing with the original sequence.
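The definition of a complement can be illustrated with a minimal sketch. It assumes plain DNA strings over the letters A, C, G and T; the function name `reverse_complement` is hypothetical and is not part of the disclosure.

```python
# Watson-Crick pairing for DNA bases (A pairs with T, G pairs with C).
_PAIRS = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(seq: str) -> str:
    """Return the anti-parallel complementary strand, written 5' to 3'."""
    # Complement every base, then reverse so the result reads 5' to 3'.
    return seq.translate(_PAIRS)[::-1]


if __name__ == "__main__":
    print(reverse_complement("ATTGCC"))
```

Reversing after complementing reflects the anti-parallel orientation of the two strands; ambiguity codes and RNA (U) are deliberately left out of this sketch.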
A "gene product" is a protein or structural RNA which is specifically encoded for by a gene.
As used herein, the term "probe" refers to a nucleic acid, peptide or other chemical entity which specifically binds to a molecule of interest. Probes are often associated with or capable of associating with a label. A label is a chemical moiety capable of detection. Typical labels comprise dyes, radioisotopes, luminescent and chemiluminescent moieties, fluorophores, enzymes, precipitating agents, amplification sequences, and the like. Similarly, a nucleic acid, peptide or other chemical entity which specifically binds to a molecule of interest and immobilizes such molecule is referred to herein as a "capture ligand". Capture ligands are typically associated with or capable of associating with a support such as nitro-cellulose, glass, nylon membranes, beads, particles and the like. The specificity of hybridization is dependent on conditions such as the base pair composition of the nucleotides, and the temperature and salt concentration of the reaction. These conditions are readily discernible to one of ordinary skill in the art using routine experimentation.
Homologous refers to the sequence similarity or sequence identity between two polypeptides or between two nucleic acid molecules. When a position in both of the two compared sequences is occupied by the same base or amino acid monomer subunit, e.g., if a position in each of two DNA molecules is occupied by adenine, then the molecules are homologous at that position. The percent of homology between two sequences is a function of the number of matching or homologous positions shared by the two sequences divided by the number of positions compared x 100. For example, if 6 of 10 of the positions in two sequences are matched or homologous then the two sequences are 60% homologous. By way of example, the DNA sequences ATTGCC and TATGGC share 50% homology. Generally, a comparison is made when two sequences are aligned to give maximum homology.
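The percent-homology calculation described above can be sketched as follows. This is a minimal illustration that assumes the two sequences are already aligned and of equal length; the function name `percent_homology` is hypothetical, and finding the alignment that gives maximum homology is outside the scope of the sketch.

```python
def percent_homology(seq_a: str, seq_b: str) -> float:
    """Matching positions divided by positions compared, times 100.

    Assumes the sequences are pre-aligned and of equal length.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("sequences must be aligned to the same length")
    if not seq_a:
        raise ValueError("sequences must not be empty")
    # Count positions occupied by the same base or amino acid.
    matches = sum(a == b for a, b in zip(seq_a.upper(), seq_b.upper()))
    return 100.0 * matches / len(seq_a)


if __name__ == "__main__":
    # The example pair from the text matches at 3 of 6 positions.
    print(percent_homology("ATTGCC", "TATGGC"))
```

For the example pair in the text, positions 3, 4 and 6 match, which gives the stated 50% homology.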
Nucleic acids are hybridizable to each other when at least one strand of a nucleic acid can anneal to the other nucleic acid under defined stringency conditions.
Stringency of hybridization is determined by: (a) the temperature at which hybridization and/or washing is performed; and (b) the ionic strength and polarity of the hybridization and washing solutions. Hybridization requires that the two nucleic acids contain complementary sequences; depending on the stringency of hybridization, however, mismatches may be tolerated. Typically, hybridization of two sequences at high stringency (such as, for example, in a solution of 0.5X SSC, at 65°C) requires that the sequences be essentially completely homologous. Conditions of intermediate stringency (such as, for example, 2X SSC at 65°C) and low stringency (such as, for example, 2X SSC at 55°C) require correspondingly less overall complementarity between the hybridizing sequences. (1X SSC is 0.15 M NaCl, 0.015 M Na citrate).
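As a worked illustration of the salt concentrations implied by these example conditions, the following sketch scales the stated 1X SSC composition by the dilution factor. The names `SSC_1X`, `EXAMPLE_CONDITIONS`, `ssc_molarity` and `describe` are hypothetical, and the low-stringency temperature is read here as 55°C, which is an assumption about the garbled source value.

```python
# 1X SSC composition as stated in the text (molar concentrations).
SSC_1X = {"NaCl": 0.15, "Na citrate": 0.015}

# Example conditions quoted in the text: (SSC fold, temperature in deg C).
# The 55 C value for low stringency is an assumed reading of the source.
EXAMPLE_CONDITIONS = {
    "high": (0.5, 65),
    "intermediate": (2.0, 65),
    "low": (2.0, 55),
}


def ssc_molarity(fold: float) -> dict:
    """Scale the 1X SSC recipe to the requested strength."""
    return {solute: fold * molar for solute, molar in SSC_1X.items()}


def describe(level: str) -> str:
    """Summarize one example condition with its salt molarities."""
    fold, temp_c = EXAMPLE_CONDITIONS[level]
    conc = ssc_molarity(fold)
    return (
        f"{level}: {fold}X SSC ({conc['NaCl']:.3f} M NaCl, "
        f"{conc['Na citrate']:.4f} M Na citrate) at {temp_c} C"
    )


if __name__ == "__main__":
    for level in EXAMPLE_CONDITIONS:
        print(describe(level))
```

Lower salt and higher temperature both raise stringency, which is why the 0.5X SSC, 65°C example is the most demanding of the three.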
The terms peptides, proteins, and polypeptides are used interchangeably herein.
As used herein, the term "surface protein" refers to all surface accessible proteins, e.g. inner and outer membrane proteins, proteins adhering to the cell wall, and secreted proteins.
A polypeptide has H. pylori biological activity if it has one, two and preferably more of the following properties: (1) when expressed in the course of an H. pylori infection, it can promote or mediate the attachment of H. pylori to a cell; (2) it has an enzymatic activity characteristic of an H. pylori protein; or (3) the gene which encodes it can rescue a lethal mutation in an H. pylori gene. A polypeptide has biological activity if it is an antagonist, agonist, or super-agonist of a polypeptide having one of the above-listed properties.
A biologically active fragment or analog is one having an in vivo or in vitro activity which is characteristic of the H. pylori polypeptides shown in the Sequence Listing, or of other naturally occurring H. pylori polypeptides, e.g., one or more of the biological activities described herein. Especially preferred are fragments which exist in vivo, e.g., fragments which arise from post-transcriptional processing or which arise from translation of alternatively spliced RNAs. Fragments include those expressed in native or endogenous cells as well as those made in expression systems, e.g., in CHO cells. Because peptides such as H. pylori polypeptides often exhibit a range of physiological properties and because such properties may be attributable to different portions of the molecule, a useful H. pylori fragment or H. pylori analog is one which exhibits a biological activity in any biological assay for H. pylori activity. Most preferably the fragment or analog possesses 10%, preferably 40%, more preferably or greater of the activity of H. pylori, in any in vivo or in vitro assay.
Analogs can differ from naturally occurring H. pylori polypeptides in amino acid sequence or in ways that do not involve sequence, or both. Non-sequence modifications include changes in acetylation, methylation, phosphorylation, carboxylation, or glycosylation. Preferred analogs include H. pylori polypeptides (or biologically active fragments thereof) whose sequences differ from the wild-type sequence by one or more conservative amino acid substitutions or by one or more non-conservative amino acid substitutions, deletions, or insertions which do not substantially diminish the biological activity of the H. pylori polypeptide. Conservative substitutions typically include the substitution of one amino acid for another with similar characteristics, e.g., substitutions within the following groups: valine, glycine; glycine, alanine; valine, isoleucine, leucine; aspartic acid, glutamic acid; asparagine, glutamine; serine, threonine; lysine, arginine; and phenylalanine, tyrosine. Other conservative substitutions are outlined in Table 2 below.
TABLE 2
CONSERVATIVE AMINO ACID REPLACEMENTS

For Amino Acid    Code    Replace with any of
Alanine           A       D-Ala, Gly, beta-Ala, L-Cys, D-Cys
Arginine          R       D-Arg, Lys, D-Lys, homo-Arg, D-homo-Arg, Met, Ile, D-Met, D-Ile, Orn, D-Orn
Asparagine        N       D-Asn, Asp, D-Asp, Glu, D-Glu, Gln, D-Gln
Aspartic Acid     D       D-Asp, D-Asn, Asn, Glu, D-Glu, Gln, D-Gln
Cysteine          C       D-Cys, S-Me-Cys, Met, D-Met, Thr, D-Thr
Glutamine         Q       D-Gln, Asn, D-Asn, Glu, D-Glu, Asp, D-Asp
Glutamic Acid     E       D-Glu, D-Asp, Asp, Asn, D-Asn, Gln, D-Gln
Glycine           G       Ala, D-Ala, Pro, D-Pro, β-Ala, Acp
Isoleucine        I       D-Ile, Val, D-Val, Leu, D-Leu, Met, D-Met
Leucine           L       D-Leu, Val, D-Val, Leu, D-Leu, Met, D-Met
Lysine            K       D-Lys, Arg, D-Arg, homo-Arg, D-homo-Arg, Met, D-Met, Ile, D-Ile, Orn, D-Orn
Methionine        M       D-Met, S-Me-Cys, Ile, D-Ile, Leu, D-Leu, Val, D-Val
Phenylalanine     F       D-Phe, Tyr, D-Thr, L-Dopa, His, D-His, Trp, D-Trp, Trans-3,4, or cis-3,4, or
Proline           P       D-Pro, L-1-thioazolidine-4-carboxylic acid, D- or L-1-oxazolidine-4-carboxylic acid
Serine            S       D-Ser, Thr, D-Thr, allo-Thr, Met, D-Met, Met(O), D-Met(O), L-Cys, D-Cys
Threonine         T       D-Thr, Ser, D-Ser, allo-Thr, Met, D-Met, Met(O), D-Met(O), Val, D-Val
Tyrosine          Y       D-Tyr, Phe, D-Phe, L-Dopa, His, D-His
Valine            V       D-Val, Leu, D-Leu, Ile, D-Ile, Met, D-Met

Other analogs within the invention are those with modifications which increase peptide stability; such analogs may contain, for example, one or more non-peptide bonds (which replace the peptide bonds) in the peptide sequence. Also included are: analogs that include residues other than naturally occurring L-amino acids, e.g., D-amino acids or non-naturally occurring or synthetic amino acids, e.g., β or γ amino acids; and cyclic analogs.
As used herein, the term "fragment", as applied to an H. pylori analog, will ordinarily be at least about 20 residues, more typically at least about 40 residues, preferably at least about 60 residues in length. Fragments of H. pylori polypeptides can be generated by methods known to those skilled in the art. The ability of a candidate fragment to exhibit a biological activity of H. pylori polypeptide can be assessed by methods known to those skilled in the art as described herein. Also included are H. pylori polypeptides containing residues that are not required for biological activity of the peptide or that result from alternative mRNA splicing or alternative protein processing events.
An "immunogenic component" as used herein is a moiety, such as an H. pylori polypeptide, analog or fragment thereof, that is capable of eliciting a humoral and/or cellular immune response in a host animal.
An "antigenic component" as used herein is a moiety, such as an H. pylori polypeptide, analog or fragment thereof, that is capable of binding to a specific antibody with sufficiently high affinity to form a detectable antigen-antibody complex.
As used herein, the term "transgene" means a nucleic acid (encoding, e.g., one or more polypeptides), which is partly or entirely heterologous, i.e., foreign, to the transgenic animal or cell into which it is introduced, or is homologous to an endogenous gene of the transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is inserted, into the cell's genome in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion results in a knockout). A transgene can include one or more transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be necessary for optimal expression of the selected nucleic acid, all operably linked to the selected nucleic acid, and may include an enhancer sequence.
As used herein, the term "transgenic cell" refers to a cell containing a transgene.
As used herein, a "transgenic animal" is any animal in which one or more, and preferably essentially all, of the cells of the animal include a transgene. The transgene can be introduced into the cell, directly or indirectly by introduction into a precursor of the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a recombinant virus. This molecule may be integrated within a chromosome, or it may be extrachromosomally replicating DNA.
The term "antibody" as used herein is intended to include fragments thereof which are specifically reactive with H. pylori polypeptides.
As used herein, the term "cell-specific promoter" means a DNA sequence that serves as a promoter, regulates expression of a selected DNA sequence operably linked to the promoter, and which effects expression of the selected DNA sequence in specific cells of a tissue. The term also covers so-called "leaky" promoters, which regulate expression of a selected DNA primarily in one tissue, but cause expression in other tissues as well.
Misexpression, as used herein, refers to a non-wild type pattern of gene expression. It includes: expression at non-wild type levels, i.e., over or under expression; a pattern of expression that differs from wild type in terms of the time or stage at which the gene is expressed, e.g., increased or decreased expression (as compared with wild type) at a predetermined developmental period or stage; a pattern of expression that differs from wild type in terms of decreased expression (as compared with wild type) in a predetermined cell type or tissue type; a pattern of expression that differs from wild type in terms of the splicing size, amino acid sequence, post-translational modification, or biological activity of the expressed polypeptide; a pattern of expression that differs from wild type in terms of the effect of an environmental stimulus or extracellular stimulus on expression of the gene, e.g., a pattern of increased or decreased expression (as compared with wild type) in the presence of an increase or decrease in the strength of the stimulus.
As used herein, "host cells" and other such terms denoting microorganisms or higher eukaryotic cell lines cultured as unicellular entities refer to cells which can become or have been used as recipients for a recombinant vector or other transfer DNA, and include the progeny of the original cell which has been transfected. It is understood by individuals skilled in the art that the progeny of a single parental cell may not necessarily be completely identical in genomic or total DNA complement to the original parent, due to accidental or deliberate mutation.
As used herein, the term "control sequence" refers to a nucleic acid having a base sequence which is recognized by the host organism to effect the expression of encoded sequences to which they are ligated. The nature of such control sequences differs depending upon the host organism; in prokaryotes, such control sequences generally include a promoter, ribosomal binding site and terminators; in eukaryotes, generally such control sequences include promoters, terminators and in some instances, enhancers.
The term control sequence is intended to include at a minimum, all components whose presence is necessary for expression, and may also include additional components whose presence is advantageous, for example, leader sequences.
As used herein, the term "operably linked" refers to sequences joined or ligated to function in their intended manner. For example, a control sequence is operably linked to coding sequence by ligation in such a way that expression of the coding sequence is achieved under conditions compatible with the control sequence and host cell.
The metabolism of a substance, as used herein, means any aspect of the expression, function, action, or regulation of the substance. The metabolism of a substance includes modifications, e.g., covalent or non-covalent modifications, of the substance. The metabolism of a substance includes modifications, e.g., covalent or non-covalent modifications, the substance induces in other substances. The metabolism of a substance also includes changes in the distribution of the substance. The metabolism of a substance includes changes the substance induces in the distribution of other substances.
A "sample" as used herein refers to a biological sample, such as, for example, tissue or fluid isolated from an individual (including without limitation plasma, serum, cerebrospinal fluid, lymph, tears, saliva and tissue sections) or from in vitro cell culture constituents, as well as samples from the environment.
The practice of the invention will employ, unless otherwise indicated, conventional techniques of chemistry, molecular biology, microbiology, recombinant DNA, and immunology, which are within the skill of the art. Such techniques are explained fully in the literature. See Sambrook, Fritsch, and Maniatis, Molecular Cloning: A Laboratory Manual, 2nd ed. (1989); DNA Cloning, Volumes I and II (D.N. Glover ed. 1985); Oligonucleotide Synthesis (M.J. Gait ed., 1984); Nucleic Acid Hybridization (B.D. Hames & S.J. Higgins eds. 1984); the series, Methods in Enzymology (Academic Press, Inc.), particularly Vol. 154 and Vol. 155 (Wu and Grossman, eds.); and PCR: A Practical Approach (McPherson, Quirke, and Taylor, eds., 1991).
I. Isolation of Nucleic Acids of H. pylori and Uses Therefor

H. pylori Genomic Sequence

This invention provides nucleotide sequences of the genome of H. pylori which thus comprises a DNA sequence library of H. pylori genomic DNA. The detailed description that follows provides nucleotide sequences of H. pylori, and also describes how the sequences were obtained and how ORFs and protein-coding sequences were identified. Also described are methods of using the disclosed H. pylori sequences in methods including diagnostic and therapeutic applications. Furthermore, the library can be used as a database for identification and comparison of medically important sequences in this and other strains of H. pylori.
To determine the genomic sequence of H. pylori, DNA was isolated from a strain of H. pylori and mechanically sheared by nebulization to a median size of 2 kb.
Following size fractionation by gel electrophoresis, the fragments were blunt-ended, ligated to adapter oligonucleotides, and cloned into each of 20 different pMPX vectors (Rice et al., abstracts of Meeting of Genome Mapping and Sequencing, Cold Spring Harbor, NY, 5/11-5/15, 1994, p. 225) to construct a series of "shotgun" subclone libraries.
DNA sequencing was achieved using multiplex sequencing procedures essentially as disclosed in Church et al., 1988, Science 240:185, and U.S. Patents No. 4,942,124 and 5,149,625. DNA was extracted from pooled cultures and subjected to chemical or enzymatic sequencing. Sequencing reactions were resolved by electrophoresis, and the products were transferred and covalently bound to nylon membranes. Finally, the membranes were sequentially hybridized with a series of labelled oligonucleotides complementary to "tag" sequences present in the different shotgun cloning vectors. In this manner, a large number of sequences could be obtained from a single set of sequencing reactions. The cloning and sequencing procedures are described in more detail in the Exemplification.
Individual sequence reads obtained in this manner were assembled using the FALCON™ program (Church et al., 1994, Automated DNA Sequencing and Analysis, J.C. Venter, ed., Academic Press) and PHRAP (P. Green, Abstracts of DOE Human Genome Program Contractor-Grantee Workshop V, Jan. 1996, p. 157). A resulting assembly of contigs, each representing a continuous stretch of DNA or DNA sequence, was obtained. The average contig length was about 3 kb.
A variety of approaches are used to order the contigs so as to obtain a continuous sequence representing the entire H. pylori genome. Synthetic oligonucleotides are designed that are complementary to sequences at the end of each contig. These oligonucleotides may be hybridized to libraries of H. pylori genomic DNA in, for example, lambda phage vectors or plasmid vectors to identify clones that contain sequences corresponding to the junctional regions between individual contigs. Such clones are then used to isolate template DNA, and the same oligonucleotides are used as primers in polymerase chain reaction (PCR) to amplify junctional fragments, the nucleotide sequence of which is then determined.
The H. pylori sequences were analyzed for the presence of open reading frames (ORFs) comprising at least 180 nucleotides. ORFs of at least 180 nucleotides (based on stop-to-stop codon reads) were predicted. Because the ORFs were analyzed on the basis of stop-to-stop codon reads, it should be understood that these ORFs may not correspond to the ORF of a naturally-occurring H. pylori polypeptide. These ORFs may contain start codons which indicate the initiation of protein synthesis of a naturally-occurring H. pylori polypeptide. Such start codons within the ORFs provided herein can be identified by those of ordinary skill in the relevant art, and the resulting ORF and the encoded H. pylori polypeptide are within the scope of this invention. For example, within the ORFs a codon such as AUG or GUG (encoding methionine or valine) which is part of the initiation signal for protein synthesis can be identified and the ORF modified to correspond to a naturally-occurring H. pylori polypeptide. The predicted coding regions were defined by evaluating the coding potential of such sequences with the program GENEMARK™ (Borodovsky and McIninch, 1993, Comp. Chem. 17:123).
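The stop-to-stop ORF prediction described above can be sketched roughly as follows. This is an illustrative sketch only, not the procedure actually used for the disclosed sequences: it scans the three forward reading frames of a DNA string, reports regions of at least 180 nucleotides that end at an in-frame stop codon, and then looks for the first in-frame ATG or GTG (the DNA equivalents of AUG and GUG). The function names and the choice to let a region begin at the contig start are assumptions.

```python
STOPS = {"TAA", "TAG", "TGA"}
STARTS = {"ATG", "GTG"}  # DNA equivalents of AUG and GUG


def stop_to_stop_orfs(seq: str, min_len: int = 180):
    """Return (frame, start, end) for forward-strand stop-to-stop regions.

    A region begins just after the previous in-frame stop codon (or at the
    frame start, an assumption for contig ends) and ends at the next
    in-frame stop codon; `end` is the index of that stop codon.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        region_start = frame
        for pos in range(frame, len(seq) - 2, 3):
            if seq[pos:pos + 3] in STOPS:
                if pos - region_start >= min_len:
                    orfs.append((frame, region_start, pos))
                region_start = pos + 3
    return orfs


def first_start_codon(seq: str, start: int, end: int):
    """Return the index of the first in-frame ATG/GTG in [start, end), or None."""
    seq = seq.upper()
    for pos in range(start, end - 2, 3):
        if seq[pos:pos + 3] in STARTS:
            return pos
    return None


if __name__ == "__main__":
    demo = "TAA" + "ATG" + "GCT" * 60 + "TAG"
    for frame, start, end in stop_to_stop_orfs(demo):
        print(frame, start, end, first_start_codon(demo, start, end))
```

To cover both strands, the same scan could be run on the reverse complement of the contig. Assessing coding potential, as done with GENEMARK in the text, is a separate step that this sketch does not attempt.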
Other H. pylori Nucleic Acids

The nucleic acids of this invention may be obtained directly from the DNA of the above referenced H. pylori strain by using the polymerase chain reaction (PCR). See "PCR, A Practical Approach" (McPherson, Quirke, and Taylor, eds., IRL Press, Oxford, UK, 1991) for details about the PCR. High fidelity PCR can be used to ensure a faithful DNA copy prior to expression. In addition, amplified products can be checked by conventional sequencing methods. Clones carrying the desired sequences described in this invention may be obtained by screening the libraries by means of the PCR or by hybridization of synthetic oligonucleotide probes to filter lifts of the library colonies or plaques as known in the art (see, Sambrook et al., Molecular Cloning, A Laboratory Manual 2nd edition, 1989, Cold Spring Harbor Press, NY).
It is also possible to obtain nucleic acids encoding H. pylori polypeptides from a cDNA library in accordance with protocols herein described. A cDNA encoding an H. pylori polypeptide can be obtained by isolating total mRNA from an appropriate cell line. Double stranded cDNAs can then be prepared from the total mRNA. Subsequently, the cDNAs can be inserted into a suitable plasmid or viral (e.g., bacteriophage) vector using any one of a number of known techniques. Genes encoding H. pylori polypeptides can also be cloned using established polymerase chain reaction techniques in accordance with the nucleotide sequence information provided by the invention. The nucleic acids of the invention can be DNA or RNA. Preferred nucleic acids are shown in the Sequence Listing.
The nucleic acids of the invention can also be chemically synthesized using standard techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, including solid-phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers (see Itakura et al. U.S. Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein).
Nucleic acids isolated or synthesized in accordance with features of the present invention are useful, by way of example, without limitation, as probes, primers, capture ligands, antisense genes and for developing expression systems for the synthesis of proteins and peptides corresponding to such sequences. As probes, primers, capture ligands and antisense agents, the nucleic acid normally consists of all or part (approximately twenty or more nucleotides for specificity as well as the ability to form stable hybridization products) of the nucleic acids shown in the Sequence Listing. These uses are described in further detail below.
Probes

A nucleic acid isolated or synthesized in accordance with the nucleotide sequences set forth in the Sequence Listing can be used as a probe to specifically detect H. pylori. With the sequence information set forth in the present application, sequences of twenty or more nucleotides are identified which provide the desired inclusivity and exclusivity with respect to H. pylori, and extraneous nucleic acids likely to be encountered during hybridization conditions. More preferably, the sequence will comprise at least twenty to thirty nucleotides to convey stability to the hybridization product formed between the probe and the intended target molecules.
Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be generated by recombinant DNA techniques. Individuals skilled in the art will readily recognize that the nucleic acids, for use as probes, can be provided with a label to facilitate detection of a hybridization product.
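By way of illustration only, the exclusivity criterion described above can be sketched as a simple screen for subsequences of a chosen length that occur in a target sequence but in neither strand of a set of extraneous sequences. The function names, the exact-match criterion and the default length of twenty are assumptions made for this sketch; it does not assess inclusivity across strains or hybridization behavior under any particular stringency.

```python
def kmers(seq, k):
    """Return the set of all length-k substrings of seq (upper-cased)."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def reverse_complement(seq):
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def candidate_probes(target, background, k=20):
    """k-mers of target that match neither strand of any background sequence.

    Exact string matching is an assumption of this sketch; real probe
    selection would also consider near matches and hybridization conditions.
    """
    excluded = set()
    for bg in background:
        excluded |= kmers(bg, k)
        excluded |= kmers(reverse_complement(bg), k)
    return sorted(kmers(target, k) - excluded)
```

A short value of k can be passed when experimenting with toy sequences; for actual probe design the text above contemplates twenty or more nucleotides.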
Nucleic acid isolated or synthesized in accordance with the Sequence Listing can also be useful as probes to detect homologous regions (especially homologous genes) of other Helicobacter species using appropriate stringency hybridization conditions as described herein.
Capture Ligand
For use as a capture ligand, the nucleic acid, selected in the manner described above with respect to probes, can be readily associated with a support. The manner in which nucleic acid is associated with supports is well known. Nucleic acid having twenty or more nucleotides in a sequence contained in the Sequence Listing has utility to separate H. pylori nucleic acids from one another and from the nucleic acid of other organisms. Nucleic acid having twenty or more nucleotides in a sequence shown in the Sequence Listing can also have utility to separate other Helicobacter species from each other and from other organisms. Preferably, the sequence will comprise at least twenty nucleotides to convey stability to the hybridization product formed between the probe and the intended target molecules. Sequences larger than 1000 nucleotides in length are difficult to synthesize but can be generated by recombinant DNA techniques.
Primers
Nucleic acid isolated or synthesized in accordance with the sequences described herein has utility as primers for the amplification of H. pylori nucleic acid. These nucleic acids may also have utility as primers for the amplification of nucleic acids in other Helicobacter species. With respect to polymerase chain reaction (PCR) techniques, nucleic acids of 10-15 nucleotides contained in the Sequence Listing have utility in conjunction with suitable enzymes and reagents to create copies of H. pylori nucleic acid. More preferably, the sequence will comprise twenty or more nucleotides to convey stability to the hybridization product formed between the primer and the intended target molecules. Binding conditions of primers greater than 100 nucleotides are more difficult to control to obtain specificity. High fidelity PCR can be used to ensure a faithful DNA copy prior to expression. In addition, amplified products can be checked by conventional sequencing methods.
The copies can be used in diagnostic assays to detect specific sequences, including genes from H. pylori and/or other Helicobacter species. The copies can also be incorporated into cloning and expression vectors to generate polypeptides corresponding to the nucleic acid synthesized by PCR, as is described in greater detail herein.
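As a non-limiting illustration of how a primer pair delimits the copied region, the following sketch locates a forward primer and the reverse complement of a reverse primer on one strand of a template and returns the expected product. The function names and the requirement of exact, full-length primer matches are assumptions of the sketch; mismatch tolerance and annealing conditions are not modeled.

```python
def reverse_complement(seq):
    """Reverse complement of a DNA string containing only A, C, G, T."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplicon(template, forward, reverse):
    """Return the expected PCR product on the given strand, or None.

    forward matches the template strand as written; reverse is given 5'->3'
    and anneals to that strand, so its reverse complement is searched for
    downstream of the forward primer.
    """
    template = template.upper()
    start = template.find(forward.upper())
    if start == -1:
        return None
    site = reverse_complement(reverse)
    end = template.find(site, start + len(forward))
    if end == -1:
        return None
    return template[start:end + len(site)]
```

The returned string includes both primer sequences, as would the amplified product itself.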
Antisense
Nucleic acid or nucleic acid-hybridizing derivatives isolated or synthesized in accordance with the sequences described herein have utility as antisense agents to prevent the expression of H. pylori genes. These sequences also have utility as antisense agents to prevent expression of genes of other Helicobacter species.
In one embodiment, nucleic acid or derivatives corresponding to H. pylori nucleic acids are loaded into a suitable carrier such as a liposome or bacteriophage for introduction into bacterial cells. For example, a nucleic acid having twenty or more nucleotides is capable of binding to bacterial nucleic acid or bacterial messenger RNA.
Preferably, the antisense nucleic acid is comprised of 20 or more nucleotides to provide necessary stability of a hybridization product of non-naturally occurring nucleic acid and bacterial nucleic acid and/or bacterial messenger RNA. Nucleic acid having a sequence greater than 1000 nucleotides in length is difficult to synthesize but can be generated by recombinant DNA techniques. Methods for loading antisense nucleic acid in liposomes are known in the art as exemplified by U.S. Patent 4,241,046 issued December 23, 1980 to Papahadjopoulos et al.
II. Expression of H. pylori Nucleic Acids
Nucleic acid isolated or synthesized in accordance with the sequences described herein has utility to generate polypeptides. The nucleic acids exemplified in the Sequence Listing or fragments of said nucleic acid encoding active portions of H. pylori polypeptides can be cloned into suitable vectors or used to isolate nucleic acid. The isolated nucleic acid is combined with suitable DNA linkers and cloned into a suitable vector.
The function of a specific gene or operon can be ascertained by expression in a bacterial strain under conditions where the activity of the gene product(s) specified by the gene or operon in question can be specifically measured. Alternatively, a gene product may be produced in large quantities in an expressing strain for use as an antigen, an industrial reagent, for structural studies, etc. This expression can be accomplished in a mutant strain which lacks the activity of the gene to be tested, or in a strain that does not produce the same gene product(s). This includes, but is not limited to, other Helicobacter strains, and other bacterial strains such as E. coli, Nocardia, Corynebacterium, and Streptomyces species. In some cases the expression host will utilize the natural Helicobacter promoter whereas in others, it will be necessary to drive the gene with a promoter sequence derived from the expressing organism (e.g., an E. coli beta-galactosidase promoter for expression in E. coli).
To express a gene product using the natural H. pylori promoter, a procedure such as the following can be used. A restriction fragment containing the gene of interest, together with its associated natural promoter element and regulatory sequences (identified using the DNA sequence data) is cloned into an appropriate recombinant plasmid containing an origin of replication that functions in the host organism and an appropriate selectable marker. This can be accomplished by a number of procedures known to those skilled in the art. It is most preferably done by cutting the plasmid and the fragment to be cloned with the same restriction enzyme to produce compatible ends that can be ligated to join the two pieces together. The recombinant plasmid is introduced into the host organism by, for example, electroporation and cells containing the recombinant plasmid are identified by selection for the marker on the plasmid. Expression of the desired gene product is detected using an assay specific for that gene product.
In the case of a gene that requires a different promoter, the body of the gene (coding sequence) is specifically excised and cloned into an appropriate expression plasmid. This subcloning can be done by several methods, but is most easily accomplished by PCR amplification of a specific fragment and ligation into an expression plasmid after treating the PCR product with a restriction enzyme or exonuclease to create suitable ends for cloning.
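The subcloning step described above can be sketched, purely as an example, by first checking which restriction enzymes do not cut within the coding sequence and then building a primer that carries the chosen recognition site at its 5' end. The enzyme panel, the two-base "GC" clamp and the function names are assumptions introduced for illustration and are not taken from the specification.

```python
# Recognition sequences for a few common enzymes. All four are palindromic,
# so searching one strand is sufficient. The panel itself is illustrative.
SITES = {
    "EcoRI": "GAATTC",
    "BamHI": "GGATCC",
    "HindIII": "AAGCTT",
    "XhoI": "CTCGAG",
}


def enzymes_not_cutting(coding_seq, sites=SITES):
    """Names of enzymes whose site is absent from the coding sequence."""
    seq = coding_seq.upper()
    return [name for name, site in sites.items() if site not in seq]


def tailed_primer(site, annealing_region, clamp="GC"):
    """Primer 5'->3': clamp + restriction site + gene-specific region.

    The clamp is a hypothetical short extension so the site is not at the
    very end of the PCR product; its sequence and length are assumptions.
    """
    return clamp + site + annealing_region.upper()
```

An enzyme returned by `enzymes_not_cutting` can be used to trim the ends of the PCR product without cleaving the insert itself.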
A suitable host cell for expression of a gene can be any procaryotic or eucaryotic cell. For example, an H. pylori polypeptide can be expressed in bacterial cells such as E. coli, insect cells (baculovirus), yeast, or mammalian cells such as Chinese hamster ovary cells (CHO). Other suitable host cells are known to those skilled in the art.
Expression in eucaryotic cells such as mammalian, yeast, or insect cells can lead to partial or complete glycosylation and/or formation of relevant inter- or intra-chain disulfide bonds of a recombinant peptide product. Examples of vectors for expression in yeast S. cerevisiae include pYepSec1 (Baldari et al., (1987) EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, (1982) Cell 30:933-943), pJRY88 (Schultz et al., (1987) Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, CA). Baculovirus vectors available for expression of proteins in cultured insect cells (Sf9 cells) include the pAc series (Smith et al., (1983) Mol. Cell Biol. 3:2156-2165) and the pVL series (Luckow and Summers, (1989) Virology 170:31-39). Generally, COS cells (Gluzman, (1981) Cell 23:175-182) are used in conjunction with such vectors as pCDM8 (Aruffo and Seed, (1987) Proc. Natl. Acad. Sci. USA 84:8573-8577) for transient amplification/expression in mammalian cells, while CHO (dhfr- Chinese Hamster Ovary) cells are used with vectors such as pMT2PC (Kaufman et al. (1987), EMBO J. 6:187-195) for stable amplification/expression in mammalian cells. Vector DNA can be introduced into mammalian cells via conventional techniques such as calcium phosphate or calcium chloride co-precipitation, DEAE-dextran-mediated transfection, or electroporation. Suitable methods for transforming host cells can be found in Sambrook et al. (Molecular Cloning: A Laboratory Manual, 2nd Edition, Cold Spring Harbor Laboratory Press (1989)), and other laboratory textbooks.
Expression in procaryotes is most often carried out in E. coli with either fusion or non-fusion inducible expression vectors. Fusion vectors usually add a number of NH2-terminal amino acids to the expressed target gene. These NH2-terminal amino acids often are referred to as a reporter group. Such reporter groups usually serve two purposes: 1) to increase the solubility of the target recombinant protein; and 2) to aid in the purification of the target recombinant protein by acting as a ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site is introduced at the junction of the reporter group and the target recombinant protein to enable separation of the target recombinant protein from the reporter group subsequent to purification of the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors include pGEX (Amrad Corp., Melbourne, Australia), pMAL (New England Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase, maltose E binding protein, or protein A, respectively, to the target recombinant protein. A preferred reporter group is poly(His), which may be fused to the amino or carboxy terminus of the protein and which renders the recombinant fusion protein easily purifiable by metal chelate chromatography.
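The arrangement of reporter group, cleavage site and target protein described above can be illustrated with a small sketch that assembles an amino-terminal poly(His) fusion and recovers the target portion after cleavage. The recognition sequences shown (IEGR for Factor Xa, DDDDK for enterokinase, both cleaving after the final residue) are commonly cited consensus sites and are not taken from the specification; the initiator methionine, the six-histidine default and the function names are likewise assumptions. Thrombin is omitted because it cleaves within its site and leaves residues on the target.

```python
# Commonly cited recognition sequences; cleavage occurs after the last residue.
CLEAVAGE_SITES = {
    "Factor Xa": "IEGR",
    "enterokinase": "DDDDK",
}


def his_fusion(target, protease="Factor Xa", n_his=6):
    """Amino acid sequence: Met + poly(His) tag + cleavage site + target."""
    return "M" + "H" * n_his + CLEAVAGE_SITES[protease] + target.upper()


def released_target(fusion, protease="Factor Xa"):
    """Sequence downstream of the first occurrence of the cleavage site.

    A target that itself contains the site would also be cut internally;
    that case is not handled in this sketch.
    """
    site = CLEAVAGE_SITES[protease]
    index = fusion.find(site)
    if index == -1:
        raise ValueError("cleavage site not found in fusion protein")
    return fusion[index + len(site):]
```

Because both enzymes in the sketch cut at the carboxy side of their recognition sequence, the released product carries no additional residues from the reporter group.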
Inducible non-fusion expression vectors include pTrc (Amann et al., (1988) Gene 69:301-315) and pET 11d (Studier et al., Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60-89). While target gene expression relies on host RNA polymerase transcription from the hybrid trp-lac fusion promoter in pTrc, expression of target genes inserted into pET 11d relies on transcription from the T7 gn10-lac 0 fusion promoter mediated by coexpressed viral RNA polymerase (T7 gn1). This viral polymerase is supplied by host strains BL21(DE3) or HMS174(DE3) from a resident λ prophage harboring a T7 gn1 under the transcriptional control of the lacUV 5 promoter.
For example, a host cell transfected with a nucleic acid vector directing expression of a nucleotide sequence encoding an H. pylori polypeptide can be cultured under appropriate conditions to allow expression of the polypeptide to occur. The polypeptide may be secreted and isolated from a mixture of cells and medium containing the peptide. Alternatively, the polypeptide may be retained cytoplasmically and the cells harvested, lysed and the protein isolated. A cell culture includes host cells, media and other byproducts. Suitable media for cell culture are well known in the art.
Polypeptides of the invention can be isolated from cell culture medium, host cells, or both using techniques known in the art for purifying proteins including ion-exchange chromatography, gel filtration chromatography, ultrafiltration, electrophoresis, and immunoaffinity purification with antibodies specific for such polypeptides.
Additionally, in many situations, polypeptides can be produced by chemical cleavage of a native protein (e.g., tryptic digestion) and the cleavage products can then be purified by standard techniques.
In the case of membrane bound proteins, these can be isolated from a host cell by contacting a membrane-associated protein fraction with a detergent forming a solubilized complex, where the membrane-associated protein is no longer entirely embedded in the membrane fraction and is solubilized at least to an extent which allows it to be chromatographically isolated from the membrane fraction. Several different criteria are used for choosing a detergent suitable for solubilizing these complexes. For example, one property considered is the ability of the detergent to solubilize the H. pylori protein within the membrane fraction with minimal denaturation of the membrane-associated protein, allowing for the activity or functionality of the membrane-associated protein to return upon reconstitution of the protein. Another property considered when selecting the detergent is the critical micelle concentration (CMC) of the detergent, in that the detergent of choice preferably has a high CMC value allowing for ease of removal after reconstitution. A third property considered when selecting a detergent is the hydrophobicity of the detergent. Typically, membrane-associated proteins are very hydrophobic and therefore detergents which are also hydrophobic, e.g., the Triton series, would be useful for solubilizing the hydrophobic proteins. Another property important to a detergent can be the capability of the detergent to remove the H. pylori protein with minimal protein-protein interaction, facilitating further purification. A fifth property of the detergent which should be considered is the charge of the detergent. For example, if it is desired to use ion exchange resins in the purification process then preferably the detergent should be an uncharged detergent. Chromatographic techniques which can be used in the final purification step are known in the art and include hydrophobic interaction, lectin affinity, ion exchange, dye affinity and immunoaffinity.
One strategy to maximize recombinant H. pylori peptide expression in E. coli is to express the protein in a host bacterium with an impaired capacity to proteolytically cleave the recombinant protein (Gottesman, Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 119-128). Another strategy would be to alter the nucleic acid encoding the H. pylori peptide to be inserted into an expression vector so that the individual codons for each amino acid would be those preferentially utilized in highly expressed E. coli proteins (Wada et al., (1992) Nuc. Acids Res. 20:2111-2118). Such alteration of nucleic acids of the invention can be carried out by standard DNA synthesis techniques.
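The codon alteration strategy can be sketched, by way of example only, as a synonymous recoding of a coding sequence using a caller-supplied table of preferred codons. The standard genetic code is used for translation; the function name and the shape of the preferred-codon table are assumptions, and any table passed in (for instance one derived from the codon usage data cited above) is the caller's responsibility. No actual usage frequencies are embedded in the sketch.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def recode(cds, preferred):
    """Replace each codon with the preferred synonymous codon, if one is given.

    preferred maps a one-letter amino acid (or '*') to a codon. Codons for
    amino acids missing from the map are left unchanged. The encoded
    polypeptide is never altered.
    """
    cds = cds.upper()
    if len(cds) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    recoded = []
    for i in range(0, len(cds), 3):
        codon = cds[i:i + 3]
        amino_acid = CODON_TABLE[codon]
        new_codon = preferred.get(amino_acid, codon)
        if CODON_TABLE[new_codon] != amino_acid:
            raise ValueError(f"{new_codon} does not encode {amino_acid}")
        recoded.append(new_codon)
    return "".join(recoded)
```

The check on each replacement codon guards against an incorrectly constructed preference table changing the protein sequence.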
The nucleic acids of the invention can also be chemically synthesized using standard techniques. Various methods of chemically synthesizing polydeoxynucleotides are known, including solid-phase synthesis which, like peptide synthesis, has been fully automated in commercially available DNA synthesizers (See, Itakura et al. U.S. Patent No. 4,598,049; Caruthers et al. U.S. Patent No. 4,458,066; and Itakura U.S. Patent Nos. 4,401,796 and 4,373,071, incorporated by reference herein).
III. H. pylori Polypeptides
This invention encompasses isolated H. pylori polypeptides encoded by the disclosed H. pylori genomic sequences, including the polypeptides contained in the Sequence Listing. Polypeptides of the invention are preferably at least 5 amino acid residues in length. Using the DNA sequence information provided herein, the amino acid sequences of the polypeptides encompassed by the invention can be deduced using methods well-known in the art. It will be understood that the sequence of an entire nucleic acid encoding an H. pylori polypeptide can be isolated and identified based on an ORF that encodes only a fragment of the cognate protein-coding region. This can be achieved, for example, by using the isolated nucleic acid encoding the ORF, or fragments thereof, to prime a polymerase chain reaction with genomic H. pylori DNA as template; this is followed by sequencing the amplified product.
The polypeptides of the invention can be isolated from wild-type or mutant H. pylori cells or from heterologous organisms or cells (including, but not limited to, bacteria, yeast, insect, plant and mammalian cells) into which an H. pylori nucleic acid has been introduced and expressed. In addition, the polypeptides can be part of recombinant fusion proteins.
H. pylori polypeptides of the invention can be chemically synthesized using commercially automated procedures such as those referenced herein.
Many of the polypeptides of the invention are related to one another. Some of these relationships are described in Tables 3-6 below. All of the polypeptide lengths in Table 3 are from stop codon to stop codon in the nucleotide sequence of H. pylori. As is known in the art, the actual polypeptide lengths are usually shorter than the stop-to-stop codon lengths because a start codon for an initiator charged tRNA usually appears a few nucleotides downstream from the prior stop codon and within a few nucleotides following a ribosome binding site (also known as a "Shine-Dalgarno sequence"). Since most of the ribosome binding sites in H. pylori have many of the same general features of those known in E. coli, one skilled in the art can predict the actual start codon with good reliability from the stop-to-stop nucleotide sequence of an open reading frame.
The polypeptide sequences of SEQ ID NOs:492-743 of this invention represent the stop-to-stop codon lengths of the open reading frames of SEQ ID NOs:1-252. All other polypeptide sequences of this invention represent the predicted start to stop protein lengths from the nucleotide sequences. One skilled in the art can recognize start sites in the stop-to-stop open reading frames of the nucleotide sequences presented herein. In addition, one skilled in the art will occasionally detect alternative start sites, some of which may be utilized in vivo by the cellular machinery. The number of these alternative start sites is sufficiently small that one skilled in the art can readily test them in recombinant expression systems known in the art to determine which ones provide authentic functional protein products.
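The enumeration of possible start sites within a stop-to-stop open reading frame can be sketched as follows. The sketch assumes the input begins in frame immediately after the upstream stop codon and ends with the terminating stop codon, and it treats ATG, GTG and TTG as candidate bacterial start codons; that codon set, the function name and the output format are assumptions of the example. It does not evaluate the ribosome binding site upstream of each candidate, which would be needed to rank them.

```python
START_CODONS = ("ATG", "GTG", "TTG")  # assumed set of candidate start codons


def candidate_starts(orf):
    """List in-frame candidate start codons in a stop-to-stop reading frame.

    Returns tuples (codon_index, codon, polypeptide_length), where the length
    counts residues from the candidate start through the last sense codon.
    Only complete codons are considered.
    """
    orf = orf.upper()
    n_codons = len(orf) // 3
    hits = []
    for index in range(n_codons - 1):  # the final codon is the stop
        codon = orf[3 * index:3 * index + 3]
        if codon in START_CODONS:
            hits.append((index, codon, n_codons - 1 - index))
    return hits
```

Each candidate yields a shorter predicted polypeptide than the stop-to-stop length, consistent with the observation above that actual lengths are usually shorter.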
The relationship between the polypeptides shown in Table 3 can be described as follows. First, all of the polypeptides of Table 3 are at least 90% identical with each other over most of their lengths, and most are over 95% identical with each other.
Second, the stop-to-stop lengths are different for some of the homologous pairs of polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the protein exhibiting utility in this invention; in some cases, the longer polypeptide may exhibit improved utility. Third, some polypeptides in the second column are homologous to two shorter polypeptides in the fifth column.
In all cases, the homology relationships described in Table 3 are highly significant. For example, a typical H. pylori gene product will exhibit amino acid sequence identities of between 92% and 100% among different strains of H. pylori selected from human patients. The nucleotide sequences encoding the related polypeptides of this invention are also very similar to one another. For example, nucleotide probes derived from the coding sequence of a polypeptide of this invention can be used in PCR or hybridization experiments to identify clones carrying the nucleotide sequence encoding the homologous related polypeptide.
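For orientation, the quantities tabulated for each pair of related polypeptides (percent identity over the overlap, and the difference between the two stop-to-stop lengths) can be computed along the following lines. The sketch assumes the overlapping segments have already been aligned without gaps and are of equal length; the alignment method actually used to prepare the tables is not stated here, and the function names are hypothetical.

```python
def percent_identity(segment_a, segment_b):
    """Percent of identical positions between two equal-length aligned segments."""
    if len(segment_a) != len(segment_b) or not segment_a:
        raise ValueError("segments must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(segment_a, segment_b))
    return 100.0 * matches / len(segment_a)


def length_difference(first_length, second_length):
    """Length of the first polypeptide minus that of the second, in residues."""
    return first_length - second_length
```

A negative difference, as for a pair with lengths 148 and 370, indicates that the second polypeptide of the pair is the longer one.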
3 ORF Namne aa Seq ID Length ORF Name aa Seq ID Length Length %ID Overlap Difference Olaell]lO 40688 c2 38 745 261 hp3elIlO75orf3 632 261 0 100% 261 Olael200I 24218781 1 2 18 746 148 O5gpI1190O1orfl 9 553 370 -222 95% 149 Olael200l 24218781 f2 18 746 148 O7gp II187orfl'7 568 195 -47 100% 143 01-ge10203_35281542_c3_16 797 261 0O1ge IO2O3orf7 659 154 107 91% 138 01ge10203_35281542_c316 797 261 OlgeIO2O3orfI4 657 71 19 0 99% 69 0 1ge 10203_860166_f3_9 798 233 OlgelO2O3orf6 658 233 0 100% 233 01ge11619_13788141 c2 II 799 105 11lgelO0309orf4 687 105 0 100% 105 01ge11619_23711062_c3_14 747 319 11lgeIO3O9orfI2 579 319 0 100% 31 01ge11619_24415880_c2_12 800 173 1IIgelO3O9orfS 688 173 0 100% 173 01geI16l9_24417813_ci_8 801 324 11 geIO0309orf24 684 324 0 100% 324 OigplIOI6_4103403_-c2_- 13 748 145 OlgpllOI1 orffi 508 123 22 100% 123 02ae11612 1074212 flI 749 103 O2aelIl612orfl 509 103 0 100% 103 O2ael 1612_22477267_12_27 750 304 03xe I 121orf7 529 152 152 -94% 148 62ael 1612 23598175 fl 2 751 332 O2ae I 162orfI15 510 332 0 100% 332 02ae11612 332031250 ci51 752 509 O5cp11911orf35 549 206 303 99%/ 205 02ep20506 24611325 f2 6 835 256 0O1ce I 153orfI17 500 256 0 100% 256 02ge10116 36367936 ci19 753 173 02ge20116orf25 521 173 0 100% 173 O2gel 1622 21695936 ci 54 83 9 313 B3ee I0216orf57 743 313 0 98% 313 62ge1I1622_875260_3_36 754 523 B~ee 1O2I6orf7 586 523 0 100% 523 02gp20706_1203402_c3_58 755 162 09ge70821Iorf2 574 162 0 100% 162 02gp20706_ 15781452_c2_51 756 97 O6geI10l1l1orfl 7 563 97 0 100% 97 02gp20706_16803513 ftI 802 488 0O1cp 117 1Oorfl 6 652 64 424 98% 63 02gp20706_ 16803513_flI 802 488 0O1cp 117l1Oorf6 655 247 241 99% 247 02gp20706 16803513 ftI 802 488 0O1cpl117I1 orf5 654 146 342 99% 145 02gp20706 20365905 1Q 8 803 198 0O1cp 117I1Oorf 9 656 130 68 95% 130 02gp20706_20365905 12Q 8 803 198 OlcplI17IOorfl8 653 85 113 100% 7 02gp20706_23632775 S3 32 757 230 O6gelO IIlorf4 565 230 0 100%23 TABLE 3 (continued) 02gp20706_23866562_c2_53 758 120 6ge1I01I orfi9 564 84 36 96% 
84 02gp20706_4892558_B_ 19 759 218 Olcp1710orfl 502 218 0 100% 218 02gp20814 24415958f39 760 176 l4eeI0308orf4 594 176 0 99% 176 02gp20814_3984818 f] 1 804 215 14ee10308rf1 593 68 147 97% 67 02gp20814_3984818_fI 1 804 215 14eelI0308orf7 694 155 60 100% 143 O4cpI1202_i6603425_c2 72 805 185 I4cp 10 19orfi 692 161 24 96% 158 04cp11202_19797128 fI 5 806 199 04ge 11713orf1 0 665 132 67 100% 127 04cp1 1202_23553177_ci 75 807 372 O6ap 10209orf4 674 88 284 96% 79 04cp 11202_23553177 c 1 75 807 372 06ap10209orfl 673 186 186 98% 179 O4cpI 1202_23553177_c3_109 808 372 06aplI0209orf4 674 88 284 96% 79 O4cpI 1202_23553177_c3_109 808 372 O6ap I 0209rfI 673 186 186 98% 179 04cp11202 24256567 c3 117 744 536 04ge 11713orf35 601 200 336 99% 195 04cp 1202 24256567 3 117 744 536 04ge 1113orf28 666 122' 414 91% 111 04cp] 12102 24256567 c3 117 744 536 04ge 11713orf36 667 250 286 97% 234 O4cpI 1202_24261588 f2 23 761 96 1 ae1202iorfS 492 96 0 100% 96 04ge10816 22086531 f2 10 809 208 O4ge 1121Oorfl 664 208 0 100% 208 04ge1 0816_ 33726080 c2 29 810 369 l3ae06]Oorfl 690 133 236 98% 120 05ae30220_21619067_fB_56 763 293 05ae20220orf58 544 293 0 100% 293 05ae30220_24882812_c3 103 765 486 05ae2220orf119 542 486 0 100% 486 05ae30220 25953163 c3 98 766 81 05ae20220orf95 545 81 0 100% 81 05ae30220_9882767_f234 812 69 05ae2O22OorfB7 671 69 0 100% 69 05cp21223_4725443 B 14 813 113 hp Ie10506orfS 696 113 0 100% 113 05ee10816 14649077 B 18 767 263 04ce 11617orf4 534 152 111 97% 144 05ee10816 259703 1 7 814 158 O4ceI 16I7orfl0 661 142 16 93% 142 05ee10816_41034081 2 11 768 115 O4ce I1617orfi6 531 115 0 100% 115 05ee]0816 4687651 c 22 769 953 04ce11617orf27 519 676 277 100%/0 676 05ee10816 4687651 ci 22 769 953 O4ce1617orf26 532 238 715 100% 238 06apI1119 16594193 fI 9 770 87 11ge.10309orf7 580 87 0 100% 87 06apI1119 24426508_1 26 815 120 11ge03O9orf28 685 137 -17 100% 121 06ap I H 1924426508 _26 815 120 -hp4e 12063rfI 699 86 34 100% 06ap20306 23437632 B 9 771 291 O4gpl 1803orfl 3 541 243 48 99% 228 
TABLE 3 (continued) 06cp30603_23452 c3 80 772 176 06cp30603orf 16 560 159 17 97% 155 06cp30603_23476568 ci 44 773 442 05cp2O l8orf39 550 442 0 100% 442 06cp30603 4689068 c3 79 774 100 O6cp3O6O3orfl 1 559 100 0 100% 100 06ee10709 21675012 fl 2 816 284 O6ee 10709orf2 675 284 0 100% 284 07ap 1015_23938312 f3 2 762 271 O7ap 110 1orf2 622 95 176 100% 94 07ap1 1015_23938312 f3 2 762 271 07ap 1101 5orf4 624 211 60 99% 177 07ap 11111 234693 c3 14 775 209 O7ap lllllorfl3 566 209 0 100% 209 O7apI 1213_35156577 ci24 836 401 hp2e10911orf35 718 420 -19 99% 399 07ap11213_35401528 ci 21 764 803 hp2e 10911orf25 672 458 345 100% 458 O7ap1 21 3_35401528 ci 21 764 803 hp2e 1091I orf24 620 344 459 100% 344 07ap20216_7227202_3010 776 532 05ap21216orf4 547 529 3 100% 529 07ap80601_976413_9 837 274 O7ap8O6Olorf2 633 268 6 100% 265 O7ce 11019 22051291 fI 1 817 600 17cp2l714orf 678 193 407 99% 190 O7ce 11019_22051291 _f 11 817 600 07cp21714orf3 679 418 182 100% 414 07ep11916_5273452 3_31 777 255 0 1 cp I 14I4orf2 501 255 0 100% 255 07ep 11916_5913592 18 818 272 Ocp1171Oorf 654 146 126 93% 111 O9cp 1003 19532625_c3_ 17 778 357 i4ge 10705orfI 4 599 357 0 99% 357 09cp 1003_5945252_f2_4 819 219 l4ge I 0705orf3 695 219 0 100% 219 09cp20502_24001388ci 31 779 61 14gpI1i820rfl 600 61 0 100% 61 09ze10333 22460750 f2 6 820 729 O3geIO505orf2 660 323 406 97% 312 12ge10321 24308513 f3 20 840 242 O6gpl 1920rfl 1 741 242 0 99% 242 12ge10321 4821082 f3 14 821 172 hp4p I1352orf9 683 115 57 100% 115 12ge10321 4821082 f3 14 821 172 hp4plI1352orfS 701 68 104 92% 64 12gp31106_3024126f225 780 147 I3apl 1SI7orfi 1 585 147 0 100% 147 13ae10712 14100018 f2 12 781 201 l4cpIO705orf4 592 201 0 99% 201 13ae10712 29569208 c2 27 782 264 B3aeI0712orfIS 584 264 0 100% 264 14ap10221 13689381 3 4 822 366 05gp 1901orfI 9 553 370 -4 96% 360 14ap10221 13689381 c3 4 822 366 O6eplIV306rfl 676 66 300 100% 66 14apl0815_20585777_ci 13 783 677 hp3e 10349orf27 628 677 0 100% 677 14ce21516 85786 fl 1 841 700 I4ce21516orfl 533 361 339 
98% 350 14ce21516_85786 fl 1 841 700 04ap20904rt3 740 284 416 95% 271 Ir TABLE 3 (continued) 29ep10720 24220926 f2 8 823 86 hp4el2O63orfl 699 86 0 100% 86 29ep10720 24220926 f2 8 823 86 lge10309orf28 685 137 -51 100% 81 29ept0720 24432762 c3 39 824 227 11ge10309orf39 686 227 0 100% 227 29ge30321_12913562 fl 1 825 79 0OIceI1618orf22 648 79 0 100% 79 29ge30321_135253 f2 6 826 101 10ce 1618orf7 650 101 0 97% 101 29ge3032121673965 f2 7 827 488 0ce11618orf9 651 409 79 100% 409 29ge30321 24336712 fI 5 828 422 01ceI1618orfll 647 188 234 93% 181 29ge30321 24336712 fl 5 828 422 0Ice1618orf27 649 235 187 96% 235 29ge30321 34157812 B 1O 784 342 OceI1618orf24 498 86 256 97% hplp14013 1726503 c2 20 785 149 hpipi40l3orfl7 617 149 0 99% 149 hp2p10272_22692325 f3 21 829 188 09ap20802orf13 680 163 25 99% 149 Ip 2 p] 0 2 72 23697200 f3 22 786 329 09ap20802orf14 570 278 51 100% 276 hp2p10272_24406280 cl 26 830 194 09ap20802orf2 681 194 0 100% 194 hp2p10272_26829136_ft 1 787 343 hp2pIO272rf 625 343 0 100% 343 hp3p10807_189075_f2_4 838 312 hp3plO8O7orf6 728 325 -13 99% 312 hp3p10807 29343768 fl 1 831 187 hp3pl0807orf4 697 187 0 100% 187 hp3p10807_29352212_f2 5 832 230 Ip3p0807orf7 698 230 0 100% 230 hp5e5211 819455 c2 24 788 269 hp5e 15211orf23 641 269 0 100% 269 hp5pl5212_34064750 2 9 789 148 O2ceIO213orfl9 515 148 0 98% 148 hp5p1564I 21698387 c2 20 790 185 hp5p15641orf23 644 185 0 98% 185 hp6e10967 23476502 f2 6 791 165 hp2plO625orf5 626 165 0 99% 165 hp6e10967 24882750 f2 7 792 157 hp2plIO625orf6 627 157 0 100% 157 hp6e12267 14650278 f329 842 277 12ge20305orf26 742 277 0 98% 277 hp6e122674876718 f2 23 793 143 I2gelO30aorf 582 146 -3 98% 146 hp6e20339_ 1190660 c246 794 69 04gp1213orf22 540 69 0 100% 69 hp6e20339_21492187_ c I40 795 106 O4gpI 1213offl4 539 136 -30 99% 106 hp6e2033924317062_c3_57 833 382 O4gpI 123orfS 669 382 0 99% 382 hp6e20339 34024187 cl 37 796 122 O4gpl 12I3orfl 1 538 122 0 100% 122 hppp 1 0233_ 12273302if'I 834 143 07cp 103 12orfS 677 138 5 9 5% 105 WO 97/37044 
Additional relationships between polypeptides of the invention are described in Table 4 below. All of the polypeptide lengths in Table 4 below are measured from stop codon to stop codon in the nucleotide sequence of H. pylori.
The relationship between the polypeptides shown in Table 4 can be described as follows. First, all of the polypeptides of Table 4 are at least 90% identical with each other over most of their lengths, and most are over 95% identical with each other.
Second, the stop-to-stop lengths are different for some of the homologous pairs of polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the protein exhibiting utility in this invention; in some cases, the longer polypeptide may exhibit improved utility. Third, some polypeptides in the second column are homologous to two shorter polypeptides in the fifth column.
In all cases, the homology relationships described in Table 4 are highly significant. For example, a typical H. pylori gene product will exhibit amino acid sequence identities of between 92% and 100% among different strains of H. pylori selected from human patients. The nucleotide sequences encoding the related polypeptides of this invention are also very similar to one another. For example, nucleotide probes derived from the coding sequence of a polypeptide of this invention can be used in PCR or hybridization experiments to identify clones carrying the nucleotide sequence encoding the homologous related polypeptide.
TABLE 4 ORF Name aa Length ORF Name aa Length Over- Change Seq (aa) Seq (aa) Iden- lap in ID# ID# tity (aa) Length Olcell104 36125337 cl 8 849 74 hple10523orf3 612 63 100 63 11 Olce21104_33203250 c3 87 850 509 05cpll911orf35 549 206 100 204 303 O1cp20708_36134808_f2 11 928 230 14eel0419orf5 704 229 100 229 1 02ae31010 12504512_f328.aa 929 490 Olgel0801orf2 514 60 95 56 430 02ae31010 16833312 f2 19 930 181 02cel 1022orf2 530 184 98 181 -3 02ae31010 2117087 f3 34 931 137 07cel0203orf17 635 76 97 77 61 02ae31010_30208317 fl 14 932 343 hplpl3947orflO 717 343 100 343 0 02ae31010_34616666 f2 27 851 184 Olcell618orfl 503 124 99 125 02ae31010_34616666 f2_27 851 184 hpl13947orfll 616 81 94 77 103 02ae31010_35270000 f3_33 852 231 Olepll504orf 5 505 156 99 156 02ae31010_36132785 f2_29 853 438 Olcell618orf3 499 67 98 64 371 02ae31010_36132785 f2 29 853 438 O1cell618orfl3 504 378 97 387 02ae31010 5085162 cl 47 934 416 07cel0203orfll 634 416 100 416 0 02cp10615 26573462_cl 45 935 168 hp3pl0304orf2 727 126 91 129 42 02gel0116 15781452 cl 87 854 97 06gel0115orfl7 563 97 100 97 0 02gelO116_16803513 f2 34 855 488 Olcpll710orfl6 652 64 100 62 424 02gel0116 16803513 f2 34 855 488 Olcpll710orf5 654 146 99 145 342 02gel0116_36367936 cl 92 857 173 02ge20116orf25 521 173 100 173 0 03ae10804_12609533 cl 26 936 257 06eplO306orflO 610 257 100 257 0 WO 97/37044 PCT/US97/05223 -49p6p10903_4398263f36.aa 924 370 O5gpll90orfl9 553 370 100 370 0 hp6p10903 4398263 G 6.aa 924 370 O5gpll9OlorfI9 553 370 100 370 0 03ae10804 21698400 c2 32 937 386 6ep10306orf 1 611 386 100 386 0 03ae10804 23485968 c3 47 858 88 6eplO306orfS, 561 88 100 88 0 4ep41903 26757937 S 16 938 703 9aplO306orf3 711 325 97 319 378 04ep41903 26757937 0 16 938 703 9apI1902rfi 712 280 92 280 423 04ep41903_4101593f2_10.aa 940 1240 I4ee2lll8orfl 706- .365 96 358 875 5ce10910 25598277 0 3 859 258 O7ee1519rf 567 170 96 166 88 05ep10815 26570332 c2 99 941 183 12ge1061Oorf2 702 97 98 97 86 SepIO8154195292 ci 84 942 441 ple10554orfl 715 103 
99 94 338 05ep10815 4719175c ci83 943 663 O6ee10207rfI 606 99 99 99 564 06ae11016 30579712 f2 21 860 239 3ce21717orfl 526 193 99 193 46 6ce20610 1367157 fI 8.aa 844 248 9geOllorf3 714 124 90 121 124 06ce20610 29298537 c2 32 861 234 igp2OIorf4 555 234 100 234 0 6ce20610_3913967_c3_36 862 283 Sgp2OIIIorf6 556 283 100 283 0 6ce20610 4331338 B 18 863 353 O)gp2OIIorf12 554 243 99 245 110 6cp11118_212827 cl 17 864 73 06cpII18orf7 558 73 100 73 0 6cp11217_19720300_B_11 865 308 hp1p3852rf6 614 646 96 278 -338 06cp11217_4881263f2_9 866 93 plpl3852orfl 615 93 100 93 0 6cpl1217 4897077 fl 6 867 100 plpl3852orf4 613 100 100 100 0 06cp30603_21492187_f241 868 106 04gpI1213rfl4 539 136 98 106 6 cp 3060 3 34024187_fl20 869 483 4gp11213orfl1 538 122 100 122 361 6cp30603 34024187 fl 20 869 483 4gpI1213rfS 669 382 95 377 101 6cp30603 679218 Q 34 944 380 p3e10302orfl7 721 231 98 236 149 6ep10615 14649077 B 52 871 300 4ceI1617orf4 534 152 96 145 148 06ep10615_961562 f2_41 945 636 9ep2OIl2orB 713 135 98 87 501 6ep10615 9842 B3 46 872 159 SepI1717orf2 552 192 99 159 -33 6ep11202 133293 c 19 946 167 06cp20302orf3 602 167 100 167 0 06ep11202 26353438 ci 22 873 286 p2eI1858orf6 623 155 100 154 131 6epl1202 26353438 ci 22 873 286 2ae1214orfl 522 109 92 110 .177 6epli202 792962 c2 26.aa 949 151 06cp302orf6 604 66 92 66 6epI1202 4884677_ci_17 948 171 6cp23O2orf5 603 171 99 171 0 06ep11917 24803153_c3_24 848 409 p4e14535orf3 536 253 98 239 156 6ep11917 24803153 c3 24 848 409 p4el4535orf7 730 60 97 60 349 06ep30223 16512_c3_160 951 231 hp3elOl28orfl 720 231 100 231 0 6ep30223 23476067 c 119 952 151 Olep3O520orfS 511 94 93 95 57 6ep30223 23557202 c2 130 874 329 p3e I1024orf49 630 329 100 329 0 06ep30223 34409437 B 94 875 148 p3e Il24orfS 631 148 100 148 0 6ep30223 4698838 f255 845 663 l4gpl2Olorfl3 662 344 100 334 319 06ep30223 4698838_f255 845 663 14gp12015orf16 663 117 99 112 546 6ep30223 4876077 3 149 877 113 p3eI1024orf34 629 113 100 113 0 6ep30223 5109443 ci 109 878 342 O5ee10411orfS 551 293 
95 287 49 06ep30223 5271902c ci106 879 207 p2e10229orf4 619 80 95 38 127 06gp10409 3398427_f2_12 880 115 l3aelOSllorf3 583 115 92 115 0 06gp71906 15115637 f2 59 953 158 p4p11393orf2 733 158 100 158 0 06gp71906 24261588 c2 174 881 96 Olael2O2lorfS 492 96 100 96 0 6-p71906 25478192 c 131 954 236 p3elIl22orBf 723 236 100 236 0 06gp71906 25504187 B 112 955 443 p4p I393rfI 732 169 91 155 274 6gp71906 970325 c3 190 882 158 2ap2III3orf2 512 113 92 98 07ae10923 24426508 fl 1 883 137 lge10309orf28 685 137 100 137 0 7ae10923 24426508 fI 1 883 137 lgel0309orf28 685 137 100 137 0 09ce10413 414011 fi 3 885 169 2celO216orf6 516 172 100 169 -3 WO 97/37044 WO 9737044PCTIUS97/05223 50 09ce10413 5865665 fl 4 886 353 02ce I0216orf7 517 74 100 74 279 09ce52017 293'24062 ci21 887 141 0O1ep3O520orf24 506 141 100 141 0 09cp10224 1062966 c3_61 888 367 O5celIO42Oorf 1 548 194 100 .194 17 3 09cp10224 1412715 c356 889 185 ilcelO5l6orf2O 495 185 100 18 5 0 09cp 10224 4295 10 c2 46. aa 890 316 PicelO5l6orf2I 496 108 98 108 1208 09cp10224 4484718 ci 38 891 581 OlcelOSI6orfl5 493 301 99 297 280 09cp21607_7224187_c2_12 892 72 07gp31516orf9 569 72 1.00 72 0 09cp61003_ 14562637_c2_93 893 106 03gp20123orf3 527 80 98 83 26 09cp61003_19532625_ci78 894 357 l4geiO7O5orfl4 599 357 99 357 0 09cp61003 24063587 ci 74 895 201 11 eelO0423 orfI 577 116 100 '116 09cp61003_24335762_c3_111 896 204 11leelO0423orf4 578 105 100 104 99 09cp61003_492187_c2 80.aa 956 667 06gpl10I1O8orf2 621 266 98 266 401 09cp61003 5945252 fI5 897 219 I.4g&lO7O5orf3 695 219 100 219 0 11ae80818 11188791 c3 60 898 221 11 aplIl9O2orf3 575 185 99 185 .36 11ae80818_196327816357 957 100 P6cpIl1722orfll1 595 100 100 100 0 11ae80818 -7290627 c2 51 958 152 06cplIlI722orfS 598 98 1100 93 54 ]1ae808l8-783127-c3-631 899 182 03ap21820orf4 523 182 100 182 0 11ae80818 7952 ci 49 900 148 03ap21820orf8 524 148 100 148 0 11lap20714 34023312 S3 46 959. 
285 hp3eIOO57orf3 719 285 100 285 0 1lap20714 49 60432 -c3 -97 901 151 14eelIl2l7.orf2 596 100 100 88 51 11ap20714 5271967 ci 60 902 260 W~eeIIl7orf3 597 231 100 227 219 l1ap20714_7227202_tB_43.aa '903 798 05ap2I2I6orf4 547 529 1100 529 269 I1Iap20714 72272020B43 .aa 903 798 65ap21216orf4 547 529 100 529 269 11lee11408_4977193_ci_41.aa 960 497 l4epllllorr3 708. 91 100 91 406 I1Ige1I0308_5256_f2_1 961 71 l11ge I03O8orfl 693 71 100 71 0 12ap10324_13178562f3_6 962 269 12ap10324orf8 700 164 99 164 105 l2ap1 0324 4805318 f2_3 846 326 12ap I0324orf6 581 95 100. 77 231 12ap10324 4805318f2 3 846 326 12ap10324orf5 645 215 99 213 111 14ce31519_156359270B15 905 293 O2celO8O9orf6 518 293 100 293 0 14ce61516 13073577 f2 12 963 784 l4eplIl9O5orfl 3 709 721 100 526 .63 14ee41924 -16282067 -ci 72 964 64 .02ep30607ort32 546 104 97 63 14ee41924 23527267 c3 107 906 2333 2ep30607orf27 520 138 .98 97 14ee41924 23834800 f2 32 907 298 11lce IO9l7orfIO0 639 298 100 298 0 14ee41924 2458267 c2 93 847 1304 02ep30607orf19 590 407 99 397 897 hpIp13939_253973270B22 908 228 06celIO8O8orf2 5571 151 100 145 77 hp2e10911_24855312_ci 69 909 161 .1lcpl2006orfl'l 576 105 95 112 56 hp2e10911 3349 ci 63 910 113 101ce11618orf22 648 79 100 65 34 hp2e10911 4882027 c2 87 965 583 Plcp I 1O8orf6 507 101 99 99 482 hp3 e1118 8473 27f25 966 368 hp3piO8O7orf6 728 325 96 323 43 hp3e1I1188_50828420B12 967 89 06ee11611orfI 608 89 100 89 0 hp4e13394 3368767 ci80 968 804 09aplIl4O6orf2 668 804 100 804 0 hp4e13394_35957200_fl_21 911 233 06eplIlIlO8orfl 7 562 165 100 147 68 hp4e13)394_5964452_c2.97 912 79 02ap71220orf2 513 79 100 79 0 hp4e53)394 22864682 c2 86.aa 913 427 hplpl38S2orf6 614 646 100 422 -219 hp5e15044 4554652 B3 3 914 146 O7cplO3l2orfS 677 138 100 138 8 hp~pl52l2 6928132 c3 34 969 270 1lIcel1O908orfl 682 165 93 169 105 hp5p15575 29300311 ci29 915 152 .14celIO72OorS 591 163 95 152 -11 hp5p15575 33445317fV 20.aa 916 294 hp3p IlO86orfI 636 217 92 202 77 hp5p15575_6140713_V_1]8 917 288 l4ceIO72Oorfl2 589 288 
100 288 0 hp~pl5641 12195281 ci 24 918 92 hp~plIS6l2orf2 643 92 100 92 0 pp6424057_c3 35 919 .139 29ge IO3Oorf4 609 1031 98 102 36 p5p15641 25635452 c3 34 920 192 p9geIO3O7orrB 607 92 100 92 0 WO 97/37044 PCT/US97/05223 -51 hp 5 pl 5 6 4 1_30273312_c2 28 970 301 05cel0613orfl 572 79 91 70 222 hp5p15641 30273312 c2 28 970 301 05ce10613orf2 573 61 100 54 240 hp5p15641 5211687_c2 29 971 297 05apl0914orf3 571 60 98 60 237 hp5pl5870 14350428_fl 1 921 84 05ae20220orf56 543 101 99 79 -17 hp6p10590_23440913_c2 31 922 371 4ee70114orfl0 535 243 98 235 128 hp6p10590_30521093 f2 14 972 138 07cell206orfl 637 107 97 107 31 hp 6 p 106 06 19546933 c3 31 923 283 13epl2003orf21 587 222 92 228 61 hp 6 p 109 04 2214676 cl 14 926 179 hp4pl3446orf3 640 179 99 179 0 hp 6 p 10 9 0 4 2 3 7 0 4 412_f2 5 927 384 hp4p13446orf13 638 384 99 384 0 hp 6 pl0904_7089062 cl 16 973 364 hp4p13446orf5 736 364 100 364 0 hp 6 pl2129_16603417_f3 14 974 358 hp3e0302orf26 722 268 98 266 hp 6 pl 2244 39 4 8467_cl 52 975 476 hp4pl2005orf2 735 108 96 84 368 hp 6 p 222 17 23470967 fl 4 976 272 13eel2016orf7 703 272 100 272 0 hp7el0192 4412568_f2_5 977 291 02cel0114orf3 525 291 100 291 0 hp 7 pl 02 87 24611325 c2_24 978 256 Olcel1513orfl7 500 256 100 256 0 hp 7 pl 0 2 9 0 _25548812_f3_14 979 489 hp3pl0156orf2 724 333 100 333 156 hp 7 p0 2 90_25585941 _f312 980 374 hp3pl0156orf3 725 91 91 87 283 hp7pl0290_35156558_f3_15 981 411 llcpl2002orf3 689 109 96 107 302 hp 7 pl 0290 43 51718 fl 6 982 324 hp3pl0156orf7 726 302 95 299 22 Additional relationships between polypeptides of the invention are described in Table 5 below. All of the polypeptide lengths in Table5 below are measured from start codon to stop codon in the nucleotide sequence of H pylori.
The relationship between the polypeptides shown in Table 5 can be described as follows. First, all of the polypeptides of Table 5 are at least 90% identical with each other over most of their lengths, and most are over 95% identical with each other.
Second, the start-to-stop lengths are different for some of the homologous pairs of polypeptides. In some cases, the shorter polypeptide contains the relevant portion of the protein exhibiting utility in this invention; in some cases, the longer polypeptide may exhibit improved utility. Third, some polypeptides in the second column are homologous to two shorter polypeptides in the fifth column.
In all cases, the homology relationships described in Table 5 are highly significant. For example, a typical H. pylori gene product will exhibit amino acid sequence identities of between 92% and 100% among different strains of H. pylori selected from human patients. The nucleotide sequences encoding the related polypeptides of this invention are also very similar to one another. For example, nucleotide probes derived from the coding sequence of a polypeptide of this invention can be used in PCR or hybridization experiments to identify clones carrying the nucleotide sequence encoding the homologous related polypeptide.
STABLE ORF Name aa Seq Length ORF Name aa Seq Length Length Overlap ID# (aa) ID (aa) Differ. Ident. (aa) 01ce21104 33203250 c387 850 502 02ae11612_33203250_ci51 752 502 0 99.8 502 02ge10116_15781452 ci 87 854 97 02gp20706_15781452 c2 51 756 80 17 100 02ge 10 116_ 16803513f2 34 855 479 02gp20706 16803513flI1 802 486 -7 t00 479 02ge10116_16803513 f2_34 855 479 O7ep] 1916 5913592 S3 18 818 269 210 97.66 128 02ge10116 36367936 ci 92 857 171 02ge10116_36367936_ci_19 753 169 2 100 169 06cp30603 21492187 f2 41 868 89 h p6e20339_21492187_ci_40 795 89 0 97.75 89 06cp30603_23476568 c2 133.aa 843 437 06cp30603_23476568_ci_44 773 436 1 100 436 06cp30603_34024187_fl 20 869 479 hp6e20339_24317062_c3_57 833 357 122 100 357 06cp30603_34024187_fl_20 869 479 hp6e20339_34024187_ci37 796 118 361 100 118 06ep10615_ 14649077_B_52 871 300 05ee10816 14649077 S3 18 767 263 37 96.74 245 06gp71906 24261588 c2 174 881 89 04cp11202_24261588_fQ_23 761 88 1 100 88 07ae10923 24426508 fl I 883 133 06ap11119_24426508_B_26 815 117 16 100 117 67ae 10923_24426-508_fII 883 133 .29ep 10720_24220926_f2_8 823 80 53 100 09cp61,003_19532625 ci 78 894 352 09cp11003_19532625_c3_17 778 333 19 100 3 33 09cp61003_5945252_fI_5 897 211 O9cpI 1003_5945252 Q2 4 819 198 13 100 198 1 lap20714_7227202_1 _43.aa 903 792 07ap20216_72-27202 S3 10 776 533 259 100 527 4ee41924_2458267_-c2_-93 847 1288 07ap11015_23938312 S 2 762 257 1031 100 257 hp2eIO9Il_3349_ci_63 910 110 29ge30321_12913562 fI 1 825 70 40 l00 hp3e1 1188 47327 f2 5 966 368 lhp3p,10807_189075_f2_4 838 312 56 100 312 hp~elSO44_4554652_1B_3 914 114 hp6p10233_12273302flI 834 143 -29 95.58 fill hp5p15870_ 14350428_fI_ 1 921 80 05ae30220 14350428 fI 9 811 748 -668 98.75 hp6p10903_4398263_B_6.aa 924 366 Olael2001 24218.781 f2 18 746 119 247 96.64 11 hp6p]0903 4398263 B 6.aa 924 366 I4ap.10221 13689381 c 2 338 28 9.5 3 hp7p10287 24611325 c2 24 978 256 02ep20506 24611325 1 2 6 835 242 14 100 242 WO 97/37044 PCT/US97/05223 -53- Additional relationships between 
polypeptides of the invention are described in Table 6 below. All of the polypeptide lengths in Table 6 below are measured from stop codon to stop codon in the nucleotide sequence of H. pylori.
The relationship between the polypeptides shown in Table 6 can be described as follows. First, all of the polypeptides of Table 6 are at least 90% identical with each other over most of their lengths, and most are over 95% identical with each other.
Second, the stop-to-stop lengths are different for the homologous pairs of polypeptides.
In some cases, the shorter polypeptide contains the relevant portion of the protein exhibiting utility in this invention; in some cases, the longer polypeptide may exhibit improved utility.
In all cases, the homology relationships described in Table 6 are highly significant. For example, a typical H. pylori gene product will exhibit amino acid sequence identities of between 92% and 100% among different strains of H. pylori selected from human patients. The nucleotide sequences encoding the related polypeptides of this invention are also very similar to one another. For example, nucleotide probes derived from the coding sequence of a polypeptide of this invention can be used in PCR or hybridization experiments to identify clones carrying the nucleotide sequence encoding the homologous related polypeptide.
TABLE 6
ORF Name | aa Seq ID# | Len. (aa) | ORF Name | aa Seq ID# | Len. (aa) | Len. Diff. | Overlap (aa) | Ident. (%)
OlaeI200l_24218781I_1 2_18 746 149 11 ae 11922 125 8667512_1 1037 249 100 149 100.00 02gp20706 23866562_c2_53 758 120 02ge10116 23866562 c3 146 1038 641 521 102 100.00 O7apl 1015 23938312_1B_2 762 271 07ee1 1402_2458267_c3_108 1039 1304 1033 271 100.00 O7apl 1213 35401528_ci 21 764 803 07ee50709_10213593 1 _77 1040 1484 681 726 100.00 05ee10816_14649077_1 _18 767 263 06ep10615 14649077 f2 30 1041 300 37 245 96.74 06cp30603 23452_c3_-80 772 177 09cp10713_23452_c3_195 1042 526 349 177 94.35 07ap20216 7227202_f3_10 776 533 11lap20714_7227202_B40 1043 798 265 533 100.00 O9cpl 1003 195326256c317 778 357 01ce61016 23609580 c3139 1044 343 -14 342 100.00 12gp31106 302412612 25 780 147 06gp71906 3024126 ci 128 1045 405 258 143 100.00 13ae10712 141000181f2 12 781 201 06ge20501_141 00018_-ci 34 1046 553 352 201 100.00 hp6e20339 34024187 -ci 37 796 122 09cp10713 34024187 fI 31 1047 483 361 122 100.00 01ge10203 352815426c316 797 261 hp4p62853_5914693_c3 52 1048 272 11 261 99.62 04ge10816 2208653! 1 121 10 809 208 iiae10610 859692 c2 32 1049 401 193 208 98.56 04ge]0816 33726080 -c2 -29 810 369 13aeI06I0_35912_f2_3 1050 459 90 361 100.00 06ap11119 24426508 B3 26 815 121 06ap1I1I119_244265080B27 1051 137 16 121 100.00 O7cel 1019_22051I291_flI1 817 600 l4gpl 1423 26803801 B 7 1052 712 112 568 99.82 14ap10221 13689381_c3_4 822 366 06ap10609 12586675 Q2 19 1053 375 9 363 96.69 29ge30321 135253_1 2_6 826 101 07ee50709 4818967 12 43 1054 487 386 86 98.84 hp2p10272 22692325_B_321 829 188 02ap11117 23495187 c3 81 1055 253 65 188 99.47 hp6e20339 24317062_c3_57 833 382 09cp10713_34024187 fI_31 1056 483 101 377 96.55 O7apl 1213 35156577_ci__24 836 401 07ee5O709 35156577 B 80 1057 567 166 400 100.00 07ap80601 976413 B3 9 837 274 05ae30220 976413 c3204 1058 384 1t0 274 100.00 hp~IOO71807_1_483 32 p~iI8847327 12 9 1059 368 56 312 100.00 12ap10324 48053.18_12_3 846 327 12ap10324 480531 8 12 6 1060 420 93 327 100.00 03ae10804_23485968_c3_47 858 88 hp.Ie8G523_23485.968_c2_49 1061 223 135 85 
100.00 05ce10910_25598277_1 3_3 859 258 hp7eI0192_25598277_c2_15 1062 422 164 254 100.00 O6cpi 1217 19720300_B_1 865 308 hp4e53194 26209843 c3 98 1063 686 378 278 96.76 06ep10615 9842_B_ 46 872 159 06ep106-15_9842_fI_5 1064 345 186 159 199.37 06ep30223 34409437 B 94 875 148 06ep30223 34409437_12_64 1065 295 147 147 97.28 TABLE 6 (continued) 06gp10409 3398427_ 2 12 880 116 06gp0409 3398427 f2 12 1066 292 176 116 100.00 09ce10413 5865665 fl 4 886 74 09ce10413 5865665 fl 4 1067 353 279 74 100.00 09cp10224 1062966c3_61 888 367 09cp10224_1062966 cl 44 1068 485 118 372 94.09 09cp61003 14562637_c293 893 106 01ce61016 12931513 c2 106 1069 376 270 94 93.62 09cp61003 19532625 cl 78 894 357 Olce61016 23609580 c3 139 1070 343 -14 342 100.00 I1ae80818 11188791 c3 60 898 221 14cpll90825593768_c3 97 1071 538 317 204 99.51 liae80818 783127 c3 63 899 182 14cp11908_783127 cl 72 1072 337 155 167 98.20 14ee41924 23527267 c3 107 906 233 07eeI1402 10759567 c2 86 1073 292 59 232 100.00 hp2el0911 3349 cl _63 910 113 07ee50709 4818967 f2 43 1074 487 374 109 95.41 hp4el3394 35957200_fl_21 911 233 hp4el13394 5088562 f 54 1075 225 -8 215 100.00 hp4el13394 5964452_c2_97 912 79 hp4e13394 15828963 c2 90 1076 135 56 77 100.00 hp4e53394 22864682 c2 86.aa 913 427 hp4e53394 19720300 c3 98 1077 647 220 422 100.00 hp5el15044 4554652_ 3_3 914 146 07ce0312 4554652 f3 2 1078 174 28 146 100.00 hp5p5870 14350428_flI 1 921 84 05ae30220_14350428_f3 91 1079 752 668 84 98.81 hp6p10606 19546933 c331 923 283 hp8el0080 19546933c2 88 1080 428 145 280 99.29 01cp20708_36134808 f2_ 11 928 230 01ce0320 30273587 f3 38 1081 275 45 230 100.00 02ae31010 2117087 f3 34 931 137 07ee50709 26438968 f2 36 1082 265 128 128 98.44 05ep10815_4719175 cl 83 943 663 05epl0815 4719175 cl 115 1083 925 262 653 100.00 06ep30223_23476067 cl 119 952 151 06ep30223 23476067 cl_115 1084 210 59 150 98.67 Ilae80818 7290627 c2 51 958 152 hp7e10590_26172564 cl 68 1085 356 204 141 88.65 lIeel 1408 4977193 ci 41.aa 960 530 05ae30220 4977193 c3 198 
1086 531 1 528 99.05 Ilgel0308_5256_f2_1 961 71 hp7e10557 21698387 fl 1 1087 239 168 69 98.55 14ee41924 16282067 cl 72 964 64 07eel 402 19565702 c2 88 1088 465 401 63 100.00 hp2el0911_4882027_c2_87 965 583 07ee50709 960952 P 47 1089 1213 630 585 97.78 hp6pl2244_3948467 ci 52 975 476 hp6pl12244 3948467 c3 88 1090 508 32 445 100.00 14ce61516 13073577 f2 12 963 785 hp7e10590_13073577 c3 107 1296 897 112 785 100.00 09cp61003 492187 c2_80 956 667 01ce61016 492187 c3 120 1297 668 1 667 100.00 06ep10615 961562 f2 41 945 636 06epl0615_ 961562 fl 15 1298 637 1 636 100.00 WO 97/37044 PCT/US97/05223 56- IV. Identification of Nucleic Acids Encoding Vaccine Components and Targets for Agents Effective Against H. pylori The disclosed H pylori genome sequence includes segments that direct the synthesis of ribonucleic acids and polypeptides, as well as origins of replication, promoters, other types of regulatory sequences, and intergenic nucleic acids. The invention encompasses the identification of nucleic acids encoding immunogenic components of vaccines and targets for agents effective against H. pylori. An important aspect of this identification is to determine the function of the disclosed sequences, which can be achieved using a variety of approaches. Non-limiting examples of these methods are described briefly below.
Homology to known sequences: Computer-assisted comparison of the disclosed H. pylori sequences with previously reported sequences present in publicly available databases is a useful tool for identifying functional H. pylori nucleic acid and polypeptide sequences. It will be understood that protein-coding sequences, for example, may be compared as a whole, and that a high degree of sequence homology between two proteins (such as, for example, >80-90%) at the amino acid level is strongly suggestive that the two proteins also possess some degree of functional homology, such as, for example, among enzymes involved in metabolism, DNA synthesis, or cell wall synthesis, and proteins involved in transport, cell division, etc. In addition, many structural features of particular protein classes have been identified and correlate with specific consensus sequences, such as, for example, binding domains for nucleotides, DNA, metal ions, and other small molecules; sites for covalent modifications such as phosphorylation, acylation, and the like; sites of protein:protein interactions, etc. These consensus sequences may be quite short and thus may represent only a fraction of the entire protein-coding sequence. Identification of such a feature in an H. pylori sequence is therefore useful in determining the function of the encoded protein and identifying potentially useful targets of antibacterial drugs.
Of particular relevance to the present invention are structural features that are common to secretory, transmembrane, and surface proteins, including secretion signal peptides and hydrophobic transmembrane domains. H. pylori proteins identified as containing putative signal sequences and/or transmembrane domains are useful as immunogenic components of vaccines.
Identification of essential genes: Nucleic acids that encode proteins essential for growth or viability of H. pylori are preferred drug targets. H. pylori genes can be tested for their biological relevance to the organism by examining the effect of deleting and/or disrupting the genes, by so-called gene "knockout", using techniques known to those skilled in the relevant art. In this manner, essential genes may be identified.
Strain-specific sequences: Because of the evolutionary relationship between different H. pylori strains, it is believed that the presently disclosed H. pylori sequences are useful for identifying, and/or discriminating between, previously known and new H. pylori strains. It is believed that other H. pylori strains will exhibit at least sequence homology with the presently disclosed sequence, although whether or not this is correct is not essential to the invention. Systematic and routine analyses of DNA sequences derived from samples containing H. pylori strains, and comparison with the present sequence, allow for the identification of sequences that can be used to discriminate between strains, as well as those that are common to all H. pylori strains.
In one embodiment, the invention provides nucleic acids, including probes, and peptide and polypeptide sequences that discriminate between different strains of H. pylori.
Strain-specific components can also be identified functionally by their ability to elicit or react with antibodies that selectively recognize one or more H. pylori strains.
In another embodiment, the invention provides nucleic acids, including probes, and peptide and polypeptide sequences that are common to all H. pylori strains but are not found in other bacterial species.
Specific Example: Determination Of Candidate Protein Antigens For Antibody And Vaccine Development
The selection of candidate protein antigens for vaccine development can be derived from the nucleic acids encoding H. pylori polypeptides. First, the ORF's can be analyzed for homology to other known exported or membrane proteins and analyzed using the discriminant analysis described by Klein, et al. (Klein, P., Kanehisa, M., and DeLisi, C. (1985) Biochimica et Biophysica Acta 815, 468-476) for predicting exported and membrane proteins.
Homology searches can be performed using the BLAST algorithm contained in the Wisconsin Sequence Analysis Package (Genetics Computer Group, University Research Park, 575 Science Drive, Madison, WI 53711) to compare each predicted ORF amino acid sequence with all sequences found in the current GenBank, SWISS-PROT and PIR databases. BLAST searches for local alignments between the ORF and the databank sequences and reports a probability score which indicates the probability of finding this sequence by chance in the database. ORF's with significant homology (e.g., probabilities better than 1 x 10^…) to membrane or exported proteins represent likely protein antigens for vaccine development. Possible functions can be provided to H. pylori genes based on sequence homology to genes cloned in other organisms.
Discriminant analysis (Klein, et al. supra) can be used to examine the ORF amino acid sequences. This algorithm uses the intrinsic information contained in the ORF amino acid sequence and compares it to information derived from the properties of known membrane and exported proteins. This comparison predicts which proteins will be exported, membrane associated or cytoplasmic. ORF amino acid sequences identified as exported or membrane associated by this algorithm are likely protein antigens for vaccine development.
Surface exposed outer membrane proteins are likely to represent the best antigens to provide a protective immune response against H. pylori. Among the criteria that can be used to aid in prediction of these outer membrane proteins is the presence of an amphipathic beta-sheet region at their C-terminus. This region, which has been detected in a large number of outer membrane proteins in Gram negative bacteria, is often characterized by hydrophobic residues (Phe or Tyr) approximately at positions 1, 3, 5, 7 and 9 from the C-terminus (see Figure 8). Importantly, these sequences have not been detected at the C-termini of periplasmic proteins, thus allowing preliminary distinction between these classes of proteins based on primary sequence data. This phenomenon has been reported previously by Struyve et al. (J. Mol. Biol. 218:141-148, 1991).
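The C-terminal pattern described above can be screened for with a few lines of code. The following Python sketch is illustrative only: the function names and the `min_hits` cutoff are hypothetical, and it assumes that position 1 denotes the last residue of the polypeptide. Because the text describes the pattern as approximate, a partial match is scored rather than an exact one.

```python
# Hypothetical helper names; not part of the disclosed method.
MOTIF_POSITIONS = (1, 3, 5, 7, 9)  # counted from the C-terminus; 1 = last residue (assumption)
MOTIF_RESIDUES = frozenset("FY")   # Phe or Tyr in single-letter code


def c_terminal_motif_hits(protein: str) -> int:
    """Count how many of the motif positions carry Phe or Tyr."""
    seq = protein.strip().upper()
    hits = 0
    for pos in MOTIF_POSITIONS:
        # seq[-pos] is the residue 'pos' places from the C-terminus.
        if pos <= len(seq) and seq[-pos] in MOTIF_RESIDUES:
            hits += 1
    return hits


def looks_like_outer_membrane_c_terminus(protein: str, min_hits: int = 4) -> bool:
    """Flag sequences matching most of the pattern.

    min_hits is an assumed cutoff chosen for illustration only.
    """
    return c_terminal_motif_hits(protein) >= min_hits
```

A sequence flagged by such a screen would remain only a candidate; the distinction from periplasmic proteins noted above is described as preliminary.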
Also illustrated in Figure 8 are additional amino acid sequence motifs found in many outer membrane proteins of H. pylori. The amino acid sequence alignment in Figure 8 depicts portions of the sequence of 12 H. pylori proteins (depicted in the single letter amino acid code) labeled with their amino acid Sequence ID Numbers and shown N-terminal to C-terminal, left to right. Six distinct blocks (labeled A through F) of similar amino acid residues are found including the distinctive hydrophobic residues (Phe or Tyr; F or Y according to the single letter code for amino acid residues) frequently found at positions near the C-terminus of outer membrane proteins. The presence of several shared motifs clearly establishes the similarity between members of this group of proteins.
In addition, outer membrane proteins isolated from H. pylori frequently share a motif near the mature N-terminus (i.e., after processing to remove the secretion signal) as illustrated in the blocked amino acid residues in Figure 9. Figure 9 depicts the N-terminal portion of nine H. pylori proteins (designated by their amino acid Sequence ID Numbers and shown N-terminal to C-terminal, left to right).
One skilled in the art would know that these shared sequence motifs are highly significant and establish a similarity among this group of proteins.
Infrequently, it is not possible to distinguish between multiple possible nucleotides at a given position in the nucleic acid sequence. In those cases, the ambiguities are denoted by an extended alphabet as follows:

These are the official IUPAC-IUB single-letter base codes.

Code  Base Description
G     Guanine
A     Adenine
T     Thymine
C     Cytosine
R     Purine (A or G)
Y     Pyrimidine (C or T or U)
M     Amino (A or C)
K     Ketone (G or T)
S     Strong interaction (C or G)
W     Weak interaction (A or T)
H     Not-G (A or C or T)
B     Not-A (C or G or T)
V     Not-T (not-U) (A or C or G)
D     Not-C (A or G or T)
N     Any (A or C or G or T)

The amino acid translations of this invention account for the ambiguity in the nucleic acid sequence by translating the ambiguous codon as a designated letter. In all cases, the permissible amino acid residues at a position are clear from an examination of the nucleic acid sequence based on the standard genetic code.
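As an illustration of how the permissible residues at an ambiguous position follow from the standard genetic code, the Python sketch below expands each IUPAC-IUB code into its possible bases and translates every resulting codon. The function name `permissible_residues` is hypothetical, and only the DNA alphabet is handled (U is not accepted).

```python
from itertools import product

# IUPAC-IUB ambiguity codes (DNA alphabet only).
IUPAC = {
    "G": "G", "A": "A", "T": "T", "C": "C",
    "R": "AG", "Y": "CT", "M": "AC", "K": "GT",
    "S": "CG", "W": "AT", "H": "ACT", "B": "CGT",
    "V": "ACG", "D": "AGT", "N": "ACGT",
}

# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def permissible_residues(codon: str) -> set[str]:
    """Return every amino acid ('*' = stop) an ambiguous codon could encode."""
    codon = codon.upper()
    if len(codon) != 3:
        raise ValueError("a codon has exactly three bases")
    choices = [IUPAC[base] for base in codon]
    return {CODON_TABLE["".join(bases)] for bases in product(*choices)}
```

For example, the codon GCN can only encode alanine because the third position of that codon family is fully degenerate, whereas RAT may encode either asparagine or aspartic acid.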
V. Production of Fragments and Analogs of H. pylori Nucleic Acids and Polypeptides
Based on the discovery of the H. pylori gene products provided in the Sequence Listing, one skilled in the art can alter the disclosed structure (of H. pylori genes), e.g., by producing fragments or analogs, and test the newly produced structures for activity.
Examples of techniques known to those skilled in the relevant art which allow the production and testing of fragments and analogs are discussed below. These, or analogous methods, can be used to make and screen libraries of polypeptides, e.g., libraries of random peptides or libraries of fragments or analogs of cellular proteins, for the ability to bind H. pylori polypeptides. Such screens are useful for discovery of inhibitors of H. pylori.
Generation of Fragments
Fragments of a protein can be produced in several ways, e.g., recombinantly, by proteolytic digestion, or by chemical synthesis. Internal or terminal fragments of a polypeptide can be generated by removing one or more nucleotides from one end (for a terminal fragment) or both ends (for an internal fragment) of a nucleic acid which encodes the polypeptide. Expression of the mutagenized DNA produces polypeptide fragments. Digestion with "end-nibbling" endonucleases can thus generate DNA's which encode an array of fragments. DNA's which encode fragments of a protein can also be generated by random shearing, restriction digestion or a combination of the above-discussed methods.
Fragments can also be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase Fmoc or t-Boc chemistry. For example, peptides of the present invention may be arbitrarily divided into fragments of desired length with no overlap of the fragments, or divided into overlapping fragments of a desired length.
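The division of a peptide into non-overlapping or overlapping fragments of a desired length can be sketched as follows. This is a minimal Python illustration; the function name `split_into_fragments` and its parameters are hypothetical, and the final fragment is simply allowed to be shorter than the requested length.

```python
def split_into_fragments(sequence: str, length: int, overlap: int = 0) -> list[str]:
    """Divide a sequence into fragments of a given length.

    overlap = 0 gives non-overlapping fragments; overlap > 0 makes each
    fragment share that many residues with the previous one.
    """
    if length <= 0:
        raise ValueError("length must be positive")
    if not 0 <= overlap < length:
        raise ValueError("overlap must satisfy 0 <= overlap < length")
    step = length - overlap
    fragments = []
    for start in range(0, len(sequence), step):
        fragments.append(sequence[start:start + length])
        # Stop once a fragment has reached the end of the sequence.
        if start + length >= len(sequence):
            break
    return fragments
```

Calling `split_into_fragments("ABCDEFGHIJ", 4)` yields three fragments with no shared residues, while `overlap=2` yields four fragments, each sharing two residues with its neighbor.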
Alteration of Nucleic Acids and Polypeptides: Random Methods
Amino acid sequence variants of a protein can be prepared by random mutagenesis of DNA which encodes a protein or a particular domain or region of a protein. Useful methods include PCR mutagenesis and saturation mutagenesis. A library of random amino acid sequence variants can also be generated by the synthesis of a set of degenerate oligonucleotide sequences. (Methods for screening proteins in a library of variants are described elsewhere herein.)
PCR Mutagenesis
In PCR mutagenesis, reduced Taq polymerase fidelity is used to introduce random mutations into a cloned fragment of DNA (Leung et al., 1989, Technique 1:11-15). The DNA region to be mutagenized is amplified using the polymerase chain reaction (PCR) under conditions that reduce the fidelity of DNA synthesis by Taq DNA polymerase, e.g., by using a dGTP/dATP ratio of five and adding Mn2+ to the PCR reaction. The pool of amplified DNA fragments is inserted into appropriate cloning vectors to provide random mutant libraries.
Saturation Mutagenesis
Saturation mutagenesis allows for the rapid introduction of a large number of single base substitutions into cloned DNA fragments (Myers et al., 1985, Science 229:242). This technique includes generation of mutations, e.g., by chemical treatment or irradiation of single-stranded DNA in vitro, and synthesis of a complementary DNA strand. The mutation frequency can be modulated by modulating the severity of the treatment, and essentially all possible base substitutions can be obtained. Because this procedure does not involve a genetic selection for mutant fragments, both neutral substitutions and those that alter function are obtained. The distribution of point mutations is not biased toward conserved sequence elements.
Degenerate Oligonucleotides
A library of homologs can also be generated from a set of degenerate oligonucleotide sequences. Chemical synthesis of a degenerate sequence can be carried out in an automatic DNA synthesizer, and the synthetic genes then ligated into an appropriate expression vector. The synthesis of degenerate oligonucleotides is known in the art (see, for example, Narang, SA (1983) Tetrahedron 39:3; Itakura et al. (1981) Recombinant DNA, Proc 3rd Cleveland Sympos. Macromolecules, ed. AG Walton, Amsterdam: Elsevier pp273-289; Itakura et al. (1984) Annu. Rev. Biochem. 53:323; Itakura et al. (1984) Science 198:1056; Ike et al. (1983) Nucleic Acid Res. 11:477). Such techniques have been employed in the directed evolution of other proteins (see, for example, Scott et al. (1990) Science 249:386-390; Roberts et al. (1992) PNAS 89:2429-2433; Devlin et al. (1990) Science 249:404-406; Cwirla et al. (1990) PNAS 87:6378-6382; as well as U.S. Patents Nos. 5,223,409, 5,198,346, and 5,096,815).
Alteration of Nucleic Acids and Polypeptides: Methods for Directed Mutagenesis
Non-random, or directed, mutagenesis techniques can be used to provide specific sequences or mutations in specific regions. These techniques can be used to create variants which include deletions, insertions, or substitutions of residues of the known amino acid sequence of a protein. The sites for mutation can be modified individually or in series, e.g., by (1) substituting first with conserved amino acids and then with more radical choices depending upon results achieved, (2) deleting the target residue, or (3) inserting residues of the same or a different class adjacent to the located site, or combinations of options 1-3.
Alanine Scanning Mutagenesis
Alanine scanning mutagenesis is a useful method for identification of certain residues or regions of the desired protein that are preferred locations or domains for mutagenesis (Cunningham and Wells, Science 244:1081-1085, 1989). In alanine scanning, a residue or group of target residues is identified (e.g., charged residues such as Arg, Asp, His, Lys, and Glu) and replaced by a neutral or negatively charged amino acid (most preferably alanine or polyalanine). Replacement of an amino acid can affect the interaction of the amino acids with the surrounding aqueous environment in or outside the cell. Those domains demonstrating functional sensitivity to the substitutions are then refined by introducing further or other variants at or for the sites of substitution.
Thus, while the site for introducing an amino acid sequence variation is predetermined, the nature of the mutation per se need not be predetermined. For example, to optimize the performance of a mutation at a given site, alanine scanning or random mutagenesis may be conducted at the target codon or region and the expressed desired protein subunit variants are screened for the optimal combination of desired activity.
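The enumeration of alanine-scanning variants described above can be sketched in Python as follows. The function name `alanine_scan` and the variant naming scheme (original residue, 1-based position, "A") are illustrative assumptions; the default target set is the group of charged residues named in the text.

```python
# Charged residues named in the text: Arg, Asp, His, Lys, Glu.
CHARGED = frozenset("RDHKE")


def alanine_scan(protein: str, targets: frozenset = CHARGED) -> list[tuple[str, str]]:
    """List single-substitution variants with each target residue replaced by Ala.

    Returns (label, variant sequence) pairs; labels use 1-based positions.
    """
    variants = []
    for index, residue in enumerate(protein):
        if residue in targets:
            label = f"{residue}{index + 1}A"
            variants.append((label, protein[:index] + "A" + protein[index + 1:]))
    return variants
```

Each variant would then be expressed and screened; the sketch only enumerates the sequences and makes no prediction about which substitutions affect function.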
Oligonucleotide-Mediated Mutagenesis Oligonucleotide-mediated mutagenesis is a useful method for preparing substitution, deletion, and insertion variants of DNA, see, Adelman et al., (DNA 2:183, 1983). Briefly, the desired DNA is altered by hybridizing an oligonucleotide encoding a mutation to a DNA template, where the template is the single-stranded form of a plasmid or bacteriophage containing the unaltered or native DNA sequence of the desired protein. After hybridization, a DNA polymerase is used to synthesize an entire second complementary strand of the template that will thus incorporate the oligonucleotide primer, and will code for the selected alteration in the desired protein DNA. Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will have 12 to 15 nucleotides that are completely complementary to the template on either side of the nucleotide(s) coding for the mutation. This ensures that the oligonucleotide will hybridize properly to the single-stranded DNA template molecule. The oligonucleotides are readily synthesized using techniques known in the art such as that described by Crea et al. (Proc. Natl. Acad. Sci. USA, 75:5765 [1978]).
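The length rule above (a mutated codon flanked on each side by 12 to 15 perfectly matched nucleotides, for a total of at least 25) can be illustrated with a short Python sketch. The function name `design_mutagenic_oligo`, its parameters, and the example sequence are hypothetical. The sketch also assumes the sequence is given as the sense strand, in frame from its first nucleotide, so that the resulting oligonucleotide would anneal to a single-stranded template carrying the complementary strand.

```python
def design_mutagenic_oligo(coding_seq, codon_index, new_codon, flank=15):
    """Build a mutagenic oligonucleotide around one codon.

    coding_seq  : sense-strand DNA, in frame from position 0 (assumption).
    codon_index : 0-based index of the codon to change.
    new_codon   : replacement codon (3 nucleotides).
    flank       : matched nucleotides kept on each side (12 to 15 per the text).
    """
    if len(new_codon) != 3:
        raise ValueError("new_codon must be 3 nucleotides")
    start = codon_index * 3
    end = start + 3
    if start - flank < 0 or end + flank > len(coding_seq):
        raise ValueError("not enough flanking sequence on one side")
    left = coding_seq[start - flank:start]   # matches template upstream
    right = coding_seq[end:end + flank]      # matches template downstream
    return left + new_codon + right


# Hypothetical 16-codon sequence; codon 7 (GAT, Asp) is changed to GCT (Ala).
template = "ATGGCTAAAGGTGAACTGCGTGATCTGAAAGAACGTCTGGCTAAAGGT"
oligo = design_mutagenic_oligo(template, 7, "GCT")
print(len(oligo))  # 33 nucleotides (15 + 3 + 15)
```

With `flank=12` the same call yields a 27-nucleotide oligonucleotide, which still satisfies the 25-nucleotide minimum stated above.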
Cassette Mutagenesis Another method for preparing variants, cassette mutagenesis, is based on the technique described by Wells et al. (Gene, 34:315 [1985]). The starting material is a plasmid (or other vector) which includes the protein subunit DNA to be mutated. The codon(s) in the protein subunit DNA to be mutated are identified. There must be a unique restriction endonuclease site on each side of the identified mutation site(s). If no such restriction sites exist, they may be generated using the above-described oligonucleotide-mediated mutagenesis method to introduce them at appropriate locations in the desired protein subunit DNA. After the restriction sites have been introduced into the plasmid, the plasmid is cut at these sites to linearize it. A double-stranded oligonucleotide encoding the sequence of the DNA between the restriction sites but containing the desired mutation(s) is synthesized using standard procedures. The two strands are synthesized separately and then hybridized together using standard techniques. This double-stranded oligonucleotide is referred to as the cassette. This cassette is designed to have 3' and 5' ends that are compatible with the ends of the linearized plasmid, such that it can be directly ligated to the plasmid. This plasmid now contains the mutated desired protein subunit DNA sequence.
Combinatorial Mutagenesis Combinatorial mutagenesis can also be used to generate mutants (Ladner et al., WO 88/06630). In this method, the amino acid sequences for a group of homologs or other related proteins are aligned, preferably to promote the highest homology possible. All of the amino acids which appear at a given position of the aligned sequences can be selected to create a degenerate set of combinatorial sequences. The variegated library of variants is generated by combinatorial mutagenesis at the nucleic acid level, and is encoded by a variegated gene library. For example, a mixture of synthetic oligonucleotides can be enzymatically ligated into gene sequences such that the degenerate set of potential sequences are expressible as individual peptides, or alternatively, as a set of larger fusion proteins containing the set of degenerate sequences.
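The degenerate set described above can be enumerated at the amino acid level as a Cartesian product over the residues seen in each alignment column. The following Python sketch is illustrative only: the function names and the three-residue example alignment are assumptions, and the sequences are assumed to be pre-aligned to equal length without gap characters.

```python
from itertools import product


def degenerate_positions(aligned):
    """For each alignment column, collect the residues observed at that position."""
    if len({len(s) for s in aligned}) != 1:
        raise ValueError("sequences must be aligned to equal length")
    return [sorted(set(column)) for column in zip(*aligned)]


def combinatorial_library(aligned):
    """Enumerate every sequence in the degenerate set defined by the alignment."""
    return ["".join(combo) for combo in product(*degenerate_positions(aligned))]


# Hypothetical alignment of three short homologous segments.
homologs = ["MKV", "MRV", "MKL"]
print(degenerate_positions(homologs))   # residues allowed at each position
print(combinatorial_library(homologs))  # all members of the degenerate set
```

The size of the set is the product of the number of residues allowed at each position, so it grows quickly as more variable positions are included.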
Other Modifications of H. pylori Nucleic Acids and Polypeptides It is possible to modify the structure of an H. pylori polypeptide for such purposes as increasing solubility or enhancing stability (e.g., shelf life ex vivo and resistance to proteolytic degradation in vivo). A modified H. pylori protein or peptide can be produced in which the amino acid sequence has been altered, such as by amino acid substitution, deletion, or addition as described herein.
An H. pylori peptide can also be modified by substitution of cysteine residues preferably with alanine, serine, threonine, leucine or glutamic acid residues to minimize dimerization via disulfide linkages. In addition, amino acid side chains of fragments of the protein of the invention can be chemically modified. Another modification is cyclization of the peptide.
In order to enhance stability and/or reactivity, an H. pylori polypeptide can be modified to incorporate one or more polymorphisms in the amino acid sequence of the protein resulting from any natural allelic variation. Additionally, D-amino acids, non-natural amino acids, or non-amino acid analogs can be substituted or added to produce a modified protein within the scope of this invention. Furthermore, an H. pylori polypeptide can be modified using polyethylene glycol (PEG) according to the method of A. Sehon and co-workers (Wie et al., supra) to produce a protein conjugated with PEG. In addition, PEG can be added during chemical synthesis of the protein. Other modifications of H. pylori proteins include reduction/alkylation (Tarr, Methods of Protein Microcharacterization, J. E. Silver ed., Humana Press, Clifton NJ 155-194 (1986)); acylation (Tarr, supra); chemical coupling to an appropriate carrier (Mishell and Shiigi, eds, Selected Methods in Cellular Immunology, WH Freeman, San Francisco, CA (1980); U.S. Patent 4,939,239); or mild formalin treatment (Marsh, (1971) Int. Arch. of Allergy and Appl. Immunol., 41:199-215).
To facilitate purification and potentially increase solubility of an H. pylori protein or peptide, it is possible to add an amino acid fusion moiety to the peptide backbone. For example, hexa-histidine can be added to the protein for purification by immobilized metal ion affinity chromatography (Hochuli, E. et al., (1988) Bio/Technology, 6:1321-1325). In addition, to facilitate isolation of peptides free of irrelevant sequences, specific endoprotease cleavage sites can be introduced between the sequences of the fusion moiety and the peptide.
To potentially aid proper antigen processing of epitopes within an H. pylori polypeptide, canonical protease sensitive sites can be engineered between regions, each comprising at least one epitope via recombinant or synthetic methods. For example, charged amino acid pairs, such as KK or RR, can be introduced between regions within a protein or fragment during recombinant construction thereof. The resulting peptide can be rendered sensitive to cleavage by cathepsin and/or other trypsin-like enzymes which would generate portions of the protein containing one or more epitopes. In addition, such charged amino acid residues can result in an increase in the solubility of the peptide.
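As a rough illustration of the dibasic-linker idea, the Python sketch below joins epitope-containing regions with a KK or RR pair and reports where such pairs occur in the result. The helper names and the example regions are hypothetical, and the sketch says nothing about whether a given site would actually be cleaved.

```python
def join_epitope_regions(regions, linker="KK"):
    """Concatenate regions with a charged dibasic pair (KK or RR) between them."""
    if linker not in ("KK", "RR"):
        raise ValueError("linker must be 'KK' or 'RR'")
    return linker.join(regions)


def dibasic_sites(sequence):
    """Return 1-based start positions of every KK or RR pair in the sequence."""
    return [i + 1 for i in range(len(sequence) - 1)
            if sequence[i:i + 2] in ("KK", "RR")]


# Hypothetical epitope-containing regions, for illustration only.
construct = join_epitope_regions(["GSTA", "PLVE", "QNTF"])
print(construct)
print(dibasic_sites(construct))
```

Scanning the joined sequence also flags any dibasic pairs already present inside a region, which may matter when choosing where to place linkers.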
Primary Methods for Screening Polypeptides and Analogs Various techniques are known in the art for screening generated mutant gene products. Techniques for screening large gene libraries often include cloning the gene library into replicable expression vectors, transforming appropriate cells with the resulting library of vectors, and expressing the genes under conditions in which detection of a desired activity, in this case, binding to H. pylori polypeptide or an interacting protein, facilitates relatively easy isolation of the vector encoding the gene whose product was detected. Each of the techniques described below is amenable to high through-put analysis for screening large numbers of sequences created, e.g., by random mutagenesis techniques.
Two Hybrid Systems Two hybrid assays such as the system described above (as with the other screening methods described herein), can be used to identify polypeptides, e.g., fragments or analogs of a naturally-occurring H. pylori polypeptide, of cellular proteins, or of randomly generated polypeptides which bind to an H. pylori protein. (The H. pylori domain is used as the bait protein and the library of variants are expressed as fish fusion proteins.) In an analogous fashion, a two hybrid assay (as with the other screening methods described herein), can be used to find polypeptides which bind an H. pylori polypeptide.
Display Libraries In one approach to screening assays, the candidate peptides are displayed on the surface of a cell or viral particle, and the ability of particular cells or viral particles to bind an appropriate receptor protein via the displayed product is detected in a "panning assay". For example, the gene library can be cloned into the gene for a surface membrane protein of a bacterial cell, and the resulting fusion protein detected by panning (Ladner et al., WO 88/06630; Fuchs et al. (1991) Bio/Technology 9:1370-1371; and Goward et al. (1992) TIBS 18:136-140). In a similar fashion, a detectably labeled ligand can be used to score for potentially functional peptide homologs. Fluorescently labeled ligands, e.g., receptors, can be used to detect homologs which retain ligand-binding activity. The use of fluorescently labeled ligands allows cells to be visually inspected and separated under a fluorescence microscope, or, where the morphology of the cell permits, to be separated by a fluorescence-activated cell sorter.
A gene library can be expressed as a fusion protein on the surface of a viral particle. For instance, in the filamentous phage system, foreign peptide sequences can be expressed on the surface of infectious phage, thereby conferring two significant benefits. First, since these phage can be applied to affinity matrices at concentrations well over 10^13 phage per milliliter, a large number of phage can be screened at one time. Second, since each infectious phage displays a gene product on its surface, if a particular phage is recovered from an affinity matrix in low yield, the phage can be amplified by another round of infection. The group of almost identical E. coli filamentous phages M13, fd, and f1 are most often used in phage display libraries. Either of the phage gIII or gVIII coat proteins can be used to generate fusion proteins without disrupting the ultimate packaging of the viral particle. Foreign epitopes can be expressed at the NH2-terminal end of pIII and phage bearing such epitopes recovered from a large excess of phage lacking this epitope (Ladner et al. PCT publication WO 90/02909; Garrard et al., PCT publication WO 92/09690; Marks et al. (1992) J. Biol. Chem. 267:16007-16010; Griffiths et al. (1993) EMBO J 12:725-734; Clackson et al. (1991) Nature 352:624-628; and Barbas et al. (1992) PNAS 89:4457-4461).
A common approach uses the maltose receptor of E. coli (the outer membrane protein, LamB) as a peptide fusion partner (Charbit et al. (1986) EMBO J. 5, 3029-3037). Oligonucleotides have been inserted into plasmids encoding the LamB gene to produce peptides fused into one of the extracellular loops of the protein. These peptides are available for binding to ligands, e.g., to antibodies, and can elicit an immune response when the cells are administered to animals. Other cell surface proteins, e.g., OmpA (Schorr et al. (1991) Vaccines 91, pp. 387-392), PhoE (Agterberg, et al. (1990) Gene 88, 37-45), and PAL (Fuchs et al. (1991) Bio/Tech 9, 1369-1372), as well as large bacterial surface structures have served as vehicles for peptide display. Peptides can be fused to pilin, a protein which polymerizes to form the pilus, a conduit for interbacterial exchange of genetic information (Thiry et al. (1989) Appl. Environ. Microbiol. 55, 984-993). Because of its role in interacting with other cells, the pilus provides a useful support for the presentation of peptides to the extracellular environment. Another large surface structure used for peptide display is the bacterial motive organ, the flagellum. Fusion of peptides to the subunit protein flagellin offers a dense array of many peptide copies on the host cells (Kuwajima et al. (1988) Bio/Tech. 6, 1080-1083). Surface proteins of other bacterial species have also served as peptide fusion partners. Examples include the Staphylococcus protein A and the outer membrane protease IgA of Neisseria (Hansson et al. (1992) J. Bacteriol. 174, 4239-4245 and Klauser et al. (1990) EMBO J. 9, 1991-1999).
In the filamentous phage systems and the LamB system described above, the physical link between the peptide and its encoding DNA occurs by the containment of the DNA within a particle (cell or phage) that carries the peptide on its surface.
Capturing the peptide captures the particle and the DNA within. An alternative scheme uses the DNA-binding protein LacI to form a link between peptide and DNA (Cull et al. (1992) PNAS USA 89:1865-1869). This system uses a plasmid containing the LacI gene with an oligonucleotide cloning site at its 3'-end. Under the controlled induction by arabinose, a LacI-peptide fusion protein is produced. This fusion retains the natural ability of LacI to bind to a short DNA sequence known as the LacO operator (LacO). By installing two copies of LacO on the expression plasmid, the LacI-peptide fusion binds tightly to the plasmid that encoded it. Because the plasmids in each cell contain only a single oligonucleotide sequence and each cell expresses only a single peptide sequence, the peptides become specifically and stably associated with the DNA sequence that directed their synthesis. The cells of the library are gently lysed and the peptide-DNA complexes are exposed to a matrix of immobilized receptor to recover the complexes containing active peptides. The associated plasmid DNA is then reintroduced into cells for amplification and DNA sequencing to determine the identity of the peptide ligands.
As a demonstration of the practical utility of the method, a large random library of dodecapeptides was made and selected on a monoclonal antibody raised against the opioid peptide dynorphin B. A cohort of peptides was recovered, all related by a consensus sequence corresponding to a six-residue portion of dynorphin B (Cull et al. (1992) Proc. Natl. Acad. Sci. U.S.A. 89:1865-1869). This scheme, sometimes referred to as peptides-on-plasmids, differs in two important ways from the phage display methods. First, the peptides are attached to the C-terminus of the fusion protein, resulting in the display of the library members as peptides having free carboxy termini. Both of the filamentous phage coat proteins, pIII and pVIII, are anchored to the phage through their C-termini, and the guest peptides are placed into the outward-extending N-terminal domains. In some designs, the phage-displayed peptides are presented right at the amino terminus of the fusion protein (Cwirla, et al. (1990) Proc. Natl. Acad. Sci. U.S.A. 87, 6378-6382). A second difference is the set of biological biases affecting the population of peptides actually present in the libraries. The LacI fusion molecules are confined to the cytoplasm of the host cells.
The phage coat fusions are exposed briefly to the cytoplasm during translation but are rapidly secreted through the inner membrane into the periplasmic compartment, remaining anchored in the membrane by their C-terminal hydrophobic domains, with the N-termini, containing the peptides, protruding into the periplasm while awaiting assembly into phage particles. The peptides in the LacI and phage libraries may differ significantly as a result of their exposure to different proteolytic activities. The phage coat proteins require transport across the inner membrane and signal peptidase processing as a prelude to incorporation into phage. Certain peptides exert a deleterious effect on these processes and are underrepresented in the libraries (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251). These particular biases are not a factor in the LacI display system.
The number of small peptides available in recombinant random libraries is enormous. Libraries of 10^7-10^9 independent clones are routinely prepared. Libraries as large as 10^11 recombinants have been created, but this size approaches the practical limit for clone libraries. This limitation in library size occurs at the step of transforming the DNA containing randomized segments into the host bacterial cells. To circumvent this limitation, an in vitro system based on the display of nascent peptides in polysome complexes has recently been developed. This display library method has the potential of producing libraries 3-6 orders of magnitude larger than the currently available phage/phagemid or plasmid libraries. Furthermore, the construction of the libraries, expression of the peptides, and screening, is done in an entirely cell-free format.
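To put these library sizes in context, the number of possible peptides of a given length can be compared with the number of independent clones. The Python sketch below is a back-of-the-envelope calculation, assuming 20 amino acids at every position; the function names are hypothetical. The coverage figure is only an upper bound, since it ignores duplicate clones and codon bias.

```python
def peptide_space(length, alphabet_size=20):
    """Number of distinct peptides of the given length."""
    return alphabet_size ** length


def max_coverage(library_size, length):
    """Upper bound on the fraction of distinct sequences a library can contain."""
    return min(1.0, library_size / peptide_space(length))


for n in (6, 10, 12):
    print(n, peptide_space(n), max_coverage(10**11, n))
```

Under these assumptions a library of 10^11 clones could in principle contain every hexapeptide, but at most about 1% of all decapeptides, which is one way to see why larger cell-free libraries are of interest.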
In one application of this method (Gallop et al. (1994) J. Med. Chem. 37(9):1233-1251), a molecular DNA library encoding 10^12 decapeptides was constructed and the library expressed in an E. coli S30 in vitro coupled transcription/translation system. Conditions were chosen to stall the ribosomes on the mRNA, causing the accumulation of a substantial proportion of the RNA in polysomes and yielding complexes containing nascent peptides still linked to their encoding RNA.
The polysomes are sufficiently robust to be affinity purified on immobilized receptors in much the same way as the more conventional recombinant peptide display libraries are screened. RNA from the bound complexes is recovered, converted to cDNA, and amplified by PCR to produce a template for the next round of synthesis and screening. The polysome display method can be coupled to the phage display system. Following several rounds of screening, cDNA from the enriched pool of polysomes was cloned into a phagemid vector. This vector serves as both a peptide expression vector, displaying peptides fused to the coat proteins, and as a DNA sequencing vector for peptide identification. By expressing the polysome-derived peptides on phage, one can either continue the affinity selection procedure in this format or assay the peptides on individual clones for binding activity in a phage ELISA, or for binding specificity in a competition phage ELISA (Barret, et al. (1992) Anal. Biochem 204, 357-364). To identify the sequences of the active peptides one sequences the DNA produced by the phagemid host.
Secondary Screening of Polypeptides and Analogs The high through-put assays described above can be followed by secondary screens in order to identify further biological activities which will, e.g., allow one skilled in the art to differentiate agonists from antagonists. The type of secondary screen used will depend on the desired activity that needs to be tested. For example, an assay can be developed in which the ability to inhibit an interaction between a protein of interest and its respective ligand can be used to identify antagonists from a group of peptide fragments isolated through one of the primary screens described above.
Therefore, methods for generating fragments and analogs and testing them for activity are known in the art. Once the core sequence of interest is identified, it is routine for one skilled in the art to obtain analogs and fragments.
Peptide Mimetics of H. pylori Polypeptides The invention also provides for reduction of the protein binding domains of the subject H. pylori polypeptides to generate mimetics, e.g. peptide or non-peptide agents.
The peptide mimetics are able to disrupt binding of a polypeptide to its counter ligand, e.g., in the case of an H. pylori polypeptide binding to a naturally occurring ligand. The critical residues of a subject H. pylori polypeptide which are involved in molecular recognition of a polypeptide can be determined and used to generate H. pylori-derived peptidomimetics which competitively or noncompetitively inhibit binding of the H. pylori polypeptide with an interacting polypeptide (see, for example, European patent applications EP-412,762A and EP-B31,080A).
For example, scanning mutagenesis can be used to map the amino acid residues of a particular H. pylori polypeptide involved in binding an interacting polypeptide, and peptidomimetic compounds (e.g., diazepine or isoquinoline derivatives) can be generated which mimic those residues in binding to an interacting polypeptide, and which therefore can inhibit binding of an H. pylori polypeptide to an interacting polypeptide and thereby interfere with the function of the H. pylori polypeptide. For instance, non-hydrolyzable peptide analogs of such residues can be generated using benzodiazepine (e.g., see Freidinger et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), azepine (e.g., see Huffman et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), substituted gamma lactam rings (Garvey et al. in Peptides: Chemistry and Biology, G.R. Marshall ed., ESCOM Publisher: Leiden, Netherlands, 1988), ketomethylene pseudopeptides (Ewenson et al. (1986) J Med Chem 29:295; and Ewenson et al. in Peptides: Structure and Function (Proceedings of the 9th American Peptide Symposium) Pierce Chemical Co. Rockland, IL, 1985), β-turn dipeptide cores (Nagai et al. (1985) Tetrahedron Lett 26:647; and Sato et al. (1986) J Chem Soc Perkin Trans 1:1231), and β-aminoalcohols (Gordon et al. (1985) Biochem Biophys Res Commun 126:419; and Dann et al. (1986) Biochem Biophys Res Commun 134:71).
VI. Vaccine Formulations for H. pylori Nucleic Acids and Polypeptides This invention also features vaccine compositions for protection against infection by H. pylori, a gram-negative spiral microaerophilic bacterium, or for treatment of H. pylori infection. In one embodiment, the vaccine compositions contain one or more immunogenic components such as a surface protein from H. pylori, or portion thereof, and a pharmaceutically acceptable carrier. Nucleic acids within the scope of the invention are exemplified by the nucleic acids shown in the Sequence Listing which encode H. pylori surface proteins. However, any nucleic acid encoding an immunogenic H. pylori protein, or portion thereof, which is capable of expression in a cell, can be used in the present invention. These vaccines have therapeutic and prophylactic utilities.
One aspect of the invention provides a vaccine composition for protection against infection by H. pylori which contains at least one immunogenic fragment of an H. pylori protein and a pharmaceutically acceptable carrier. Preferred fragments include peptides of at least about 10 amino acid residues in length, preferably about 10-20 amino acid residues in length, and more preferably about 12-16 amino acid residues in length.
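Candidate fragments in the preferred length range can be enumerated from a full-length sequence with a sliding window. The Python sketch below is illustrative only: the function name, the default window of 15 residues (chosen here from within the 12-16 range above), and the step of 5 residues are assumptions, not values given in the specification.

```python
def overlapping_peptides(protein, length=15, step=5):
    """Return (1-based start, peptide) pairs for windows of a fixed length.

    length : residues per fragment (the text prefers roughly 10-20, ideally 12-16).
    step   : offset between successive windows (hypothetical choice).
    """
    if length <= 0 or step <= 0:
        raise ValueError("length and step must be positive")
    if len(protein) < length:
        return []
    return [(start + 1, protein[start:start + length])
            for start in range(0, len(protein) - length + 1, step)]


# Hypothetical 30-residue sequence, for illustration only.
for start, peptide in overlapping_peptides("MKTAYIAKQRQISFVKSHFSRQLEERLGLI"):
    print(start, peptide)
```

A smaller step gives more overlap between neighbouring fragments at the cost of more peptides to produce and screen.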
Immunogenic components of the invention can be obtained, for example, by screening polypeptides recombinantly produced from the corresponding fragment of the nucleic acid encoding the full-length H. pylori protein. In addition, fragments can be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase Fmoc or t-Boc chemistry.
In one embodiment, immunogenic components are identified by the ability of the peptide to stimulate T cells. Peptides which stimulate T cells, as determined by, for example, T cell proliferation or cytokine secretion are defined herein as comprising at least one T cell epitope. T cell epitopes are believed to be involved in initiation and perpetuation of the immune response to the protein allergen which is responsible for the clinical symptoms of allergy. These T cell epitopes are thought to trigger early events at the level of the T helper cell by binding to an appropriate HLA molecule on the surface of an antigen presenting cell, thereby stimulating the T cell subpopulation with the relevant T cell receptor for the epitope. These events lead to T cell proliferation, lymphokine secretion, local inflammatory reactions, recruitment of additional immune cells to the site of antigen/T cell interaction, and activation of the B cell cascade, leading to the production of antibodies. A T cell epitope is the basic element, or smallest unit of recognition by a T cell receptor, where the epitope comprises amino acids essential to receptor recognition (e.g., approximately 6 or 7 amino acid residues). Amino acid sequences which mimic those of the T cell epitopes are within the scope of this invention.
In another embodiment, immunogenic components of the invention are identified through genomic vaccination. The basic protocol is based on the idea that expression libraries consisting of all or parts of a pathogen genome, e.g., an H. pylori genome, can confer protection when used to genetically immunize a host. This expression library immunization (ELI) is analogous to expression cloning and involves reducing a genomic expression library of a pathogen, e.g., H. pylori, into plasmids that can act as genetic vaccines. The plasmids can also be designed to encode genetic adjuvants which can dramatically stimulate the humoral response. These genetic adjuvants can be introduced at remote sites and act as well extracellularly as intracellularly.
This is a new approach to vaccine production that has many of the advantages of live/attenuated pathogens but no risk of infection. An expression library of pathogen DNA is used to immunize a host thereby producing the effects of antigen presentation of a live vaccine without the risk. For example, in the present invention, random fragments from the H. pylori genome or from cosmid or plasmid clones, as well as PCR products from genes identified by genomic sequencing, can be used to immunize a host. The feasibility of this approach has been demonstrated with Mycoplasma pulmonis (Barry et al., Nature 377:632-635, 1995), where even partial expression libraries of Mycoplasma pulmonis, a natural pathogen in rodents, provided protection against challenge from the pathogen.
ELI is a technique that allows for production of a non-infectious multipartite vaccine, even when little is known about the pathogen's biology, because ELI uses the immune system to screen candidate genes. Once isolated, these genes can be used as genetic vaccines or for development of recombinant protein vaccines. Thus, ELI allows for production of vaccines in a systematic, largely mechanized fashion.
Screening immunogenic components can be accomplished using one or more of several different assays. For example, in vitro, peptide T cell stimulatory activity is assayed by contacting a peptide known or suspected of being immunogenic with an antigen presenting cell which presents appropriate MHC molecules in a T cell culture.
Presentation of an immunogenic H. pylori peptide in association with appropriate MHC molecules to T cells in conjunction with the necessary costimulation has the effect of transmitting a signal to the T cell that induces the production of increased levels of cytokines, particularly of interleukin-2 and interleukin-4. The culture supernatant can be obtained and assayed for interleukin-2 or other known cytokines. For example, any one of several conventional assays for interleukin-2 can be employed, such as the assay described in Proc. Natl. Acad. Sci. USA, 86:1333 (1989), the pertinent portions of which are incorporated herein by reference. A kit for an assay for the production of interferon is also available from Genzyme Corporation (Cambridge, MA).
Alternatively, a common assay for T cell proliferation entails measuring tritiated thymidine incorporation. The proliferation of T cells can be measured in vitro by determining the amount of 3H-labeled thymidine incorporated into the replicating DNA of cultured cells. Therefore, the rate of DNA synthesis and, in turn, the rate of cell division can be quantified.
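Thymidine incorporation is typically read out as counts per minute (cpm) from replicate wells. One common way to summarize such counts, which the specification itself does not prescribe and is assumed here purely for illustration, is a stimulation index: the mean cpm of peptide-stimulated wells divided by the mean cpm of unstimulated control wells. The function name and the example counts below are hypothetical.

```python
from statistics import mean


def stimulation_index(stimulated_cpm, control_cpm):
    """Ratio of mean cpm in stimulated wells to mean cpm in control wells."""
    control = mean(control_cpm)
    if control <= 0:
        raise ValueError("control counts must have a positive mean")
    return mean(stimulated_cpm) / control


# Hypothetical triplicate readings (counts per minute).
si = stimulation_index([3000, 3200, 2800], [1000, 900, 1100])
print(si)
```

Any threshold used to call a peptide stimulatory would need to be set and validated for the particular assay; none is implied by this sketch.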
Vaccine compositions of the invention containing immunogenic components (e.g., H. pylori polypeptide or fragment thereof or nucleic acid encoding an H. pylori polypeptide or fragment thereof) preferably include a pharmaceutically acceptable carrier. The term "pharmaceutically acceptable carrier" refers to a carrier that does not cause an allergic reaction or other untoward effect in patients to whom it is administered.
Suitable pharmaceutically acceptable carriers include, for example, one or more of water, saline, phosphate buffered saline, dextrose, glycerol, ethanol and the like, as well as combinations thereof. Pharmaceutically acceptable carriers may further comprise minor amounts of auxiliary substances such as wetting or emulsifying agents, preservatives or buffers, which enhance the shelf life or effectiveness of the antibody.
For vaccines of the invention containing H. pylori polypeptides, the polypeptide is coadministered with a suitable adjuvant.
It will be apparent to those of skill in the art that the therapeutically effective amount of DNA or protein of this invention will depend, inter alia, upon the administration schedule, the unit dose of antibody administered, whether the protein or DNA is administered in combination with other therapeutic agents, the immune status and health of the patient, and the therapeutic activity of the particular protein or DNA.
Vaccine compositions are conventionally administered parenterally, by injection, either subcutaneously or intramuscularly. Methods for intramuscular immunization are described by Wolff et al. (1990) Science 247:1465-1468 and by Sedegah et al. (1994) Immunology 91:9866-9870. Other modes of administration include oral and pulmonary formulations, suppositories, and transdermal applications.
Oral immunization is preferred over parenteral methods for inducing protection against infection by H. pylori (Czinn et al. (1993) Vaccine 11:637-642). Oral formulations include such normally employed excipients as, for example, pharmaceutical grades of mannitol, lactose, starch, magnesium stearate, sodium saccharine, cellulose, magnesium carbonate, and the like.
The vaccine compositions of the invention can include an adjuvant, including, but not limited to aluminum hydroxide; N-acetyl-muramyl-L-threonyl-D-isoglutamine (thr-MDP); N-acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637, referred to as nor-MDP); N-acetylmuramyl-L-alanyl-D-isoglutaminyl-L-alanine-2-(1'-2'-dipalmitoyl-sn-glycero-3-hydroxyphosphoryloxy)-ethylamine (CGP 19835A, referred to as MTP-PE); RIBI, which contains three components from bacteria (monophosphoryl lipid A, trehalose dimycolate, and cell wall skeleton; MPL + TDM + CWS) in a 2% squalene/Tween 80 emulsion; and cholera toxin. Others which may be used are nontoxic derivatives of cholera toxin, including its B subunit, and/or conjugates or genetically engineered fusions of the H. pylori polypeptide with cholera toxin or its B subunit, procholeragenoid, fungal polysaccharides, including schizophyllan, muramyl dipeptide, muramyl dipeptide derivatives, phorbol esters, labile toxin of E. coli, non-H. pylori bacterial lysates, block polymers or saponins.
Other suitable delivery methods include biodegradable microcapsules or immuno-stimulating complexes (ISCOMs) or liposomes, genetically engineered attenuated live vectors such as viruses or bacteria, and recombinant (chimeric) virus-like particles, e.g., bluetongue. The amount of adjuvant employed will depend on the type of adjuvant used. For example, when the mucosal adjuvant is cholera toxin, it is suitably used in an amount of 5 µg to 50 µg, for example 10 µg to 35 µg. When used in the form of microcapsules, the amount used will depend on the amount employed in the matrix of the microcapsule to achieve the desired dosage. The determination of this amount is within the skill of a person of ordinary skill in the art.
Carrier systems in humans may include enteric release capsules protecting the antigen from the acidic environment of the stomach, and including H. pylori polypeptide in an insoluble form as fusion proteins. Suitable carriers for the vaccines of the invention are enteric coated capsules and polylactide-glycolide microspheres. Suitable diluents are 0.2 N NaHCO3 and/or saline.
Vaccines of the invention can be administered as a primary prophylactic agent in adults or in children, as a secondary prevention, after successful eradication of H. pylori in an infected host, or as a therapeutic agent with the aim of inducing an immune response in a susceptible host to prevent infection by H. pylori. The vaccines of the invention are administered in amounts readily determined by persons of ordinary skill in the art.
Thus, for adults a suitable dosage will be in the range of 10 µg to 10 g, preferably 10 µg to 100 mg, for example 50 µg to 50 mg. A suitable dosage for adults will also be in the range of 5 µg to 500 mg. Similar dosage ranges will be applicable for children. Those skilled in the art will recognize that the optimal dose may be more or less depending upon the patient's body weight, disease, the route of administration, and other factors.
Those skilled in the art will also recognize that appropriate dosage levels can be obtained based on results with known oral vaccines such as, for example, a vaccine based on an E. coli lysate (6 mg dose daily up to total of 540 mg) and with an enterotoxigenic E. coli purified antigen (4 doses of 1 mg) (Schulman et al., J. Urol.
150:917-921 (1993); Boedecker et al., American Gastroenterological Assoc. 999:A-222 (1993)). The number of doses will depend upon the disease, the formulation, and efficacy data from clinical trials. Without intending any limitation as to the course of treatment, the treatment can be administered over 3 to 8 doses for a primary immunization schedule over 1 month (Boedeker, American Gastroenterological Assoc.
888:A-222 (1993)).
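By way of illustration only, the reference regimens cited above reduce to simple arithmetic. The sketch below is not part of the disclosure; the function name `doses_needed` is hypothetical, and the only figures used are the per-dose and cumulative amounts quoted in the text.

```python
from fractions import Fraction


def doses_needed(dose_mg, total_mg):
    """Return how many equal doses of dose_mg are needed to reach total_mg (rounded up)."""
    if dose_mg <= 0 or total_mg < 0:
        raise ValueError("dose must be positive and total non-negative")
    # Exact fractions avoid floating-point drift for amounts such as 0.5 mg.
    ratio = Fraction(str(total_mg)) / Fraction(str(dose_mg))
    # Ceiling division on the exact ratio.
    return -(-ratio.numerator // ratio.denominator)


# E. coli lysate regimen cited in the text: 6 mg daily up to a total of 540 mg.
lysate_doses = doses_needed(6, 540)

# Enterotoxigenic E. coli purified antigen regimen cited in the text: 4 doses of 1 mg.
antigen_total_mg = 4 * 1
```

On these figures the lysate regimen corresponds to 90 daily doses, whereas the purified antigen regimen delivers 4 mg in total.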
It will be apparent to those skilled in the art that some of the vaccine compositions of the invention are useful only for preventing H. pylori infection, some are useful only for treating H. pylori infection, and some are useful for both preventing and treating H. pylori infection. In a preferred embodiment, the vaccine composition of the invention provides protection against H. pylori infection by stimulating humoral and/or cell-mediated immunity against H. pylori. It should be understood that amelioration of any of the symptoms of H. pylori infection is a desirable clinical goal, including a lessening of the dosage of medication used to treat H. pylori-caused disease.
VII. Antibodies Reactive With H. pylori Polypeptides
The invention also includes antibodies specifically reactive with the subject H. pylori polypeptide. Anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (see, for example, Antibodies: A Laboratory Manual, ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal such as a mouse, a hamster or rabbit can be immunized with an immunogenic form of the peptide.
Techniques for conferring immunogenicity on a protein or peptide include conjugation to carriers or other techniques well known in the art. An immunogenic portion of the subject H. pylori polypeptide can be administered in the presence of adjuvant. The progress of immunization can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other immunoassays can be used with the immunogen as antigen to assess the levels of antibodies.
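The titer monitoring described above can be sketched as follows. This is an illustrative assumption rather than a protocol taken from the disclosure: the cutoff rule (mean of pre-immune wells plus three standard deviations) is one common convention, and the function name `endpoint_titer` and all readings are hypothetical.

```python
from statistics import mean, stdev


def endpoint_titer(readings, negative_controls, n_sd=3.0):
    """Return the reciprocal of the highest dilution whose signal exceeds the cutoff.

    readings: list of (reciprocal_dilution, optical_density) pairs.
    negative_controls: optical densities from pre-immune serum wells.
    Returns None when no dilution is above the cutoff.
    """
    # Assumed cutoff convention: negative-control mean plus n_sd standard deviations.
    cutoff = mean(negative_controls) + n_sd * stdev(negative_controls)
    positive = [dilution for dilution, od in readings if od > cutoff]
    return max(positive) if positive else None


# Hypothetical two-fold dilution series from one immunized animal.
series = [(100, 1.90), (200, 1.40), (400, 0.85), (800, 0.40), (1600, 0.12), (3200, 0.07)]
pre_immune = [0.05, 0.07, 0.06, 0.06]
titer = endpoint_titer(series, pre_immune)
```

Repeating the calculation on sera drawn at successive time points gives a simple way to follow the progress of immunization.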
In a preferred embodiment, the subject antibodies are immunospecific for antigenic determinants of the H. pylori polypeptides of the invention, e.g. antigenic determinants of a polypeptide shown in the Sequence Listing, or a closely related human or non-human mammalian homolog (e.g., 90% homologous, more preferably at least homologous). In yet a further preferred embodiment of the invention, the anti-H. pylori antibodies do not substantially cross react (i.e., react specifically) with a protein which is, for example, less than 80 percent homologous to a sequence shown in the Sequence Listing. By "not substantially cross react", it is meant that the antibody has a binding affinity for a non-homologous protein which is less than 10 percent, more preferably less than 5 percent, and even more preferably less than 1 percent, of the binding affinity for a protein contained in the Sequence Listing. In a most preferred embodiment, there is no crossreactivity between bacterial and mammalian antigens.
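The affinity thresholds stated above can be expressed as a small classification routine. The following is a sketch under stated assumptions: affinities are taken to be association-type constants in the same units (larger meaning tighter binding), and the function name `cross_reactivity_tier` and its labels are hypothetical.

```python
def cross_reactivity_tier(target_affinity, off_target_affinity):
    """Classify off-target binding relative to binding to the intended protein.

    Both arguments are assumed to be association-type affinities in the same
    units. Returns a label for the strictest threshold from the text that is met.
    """
    if target_affinity <= 0:
        raise ValueError("target affinity must be positive")
    percent = 100.0 * off_target_affinity / target_affinity
    if percent < 1:
        return "even more preferred (<1%)"
    if percent < 5:
        return "more preferred (<5%)"
    if percent < 10:
        return "not substantially cross-reactive (<10%)"
    return "cross-reactive (>=10%)"


# Hypothetical antibody: 1e9 per molar for the target, 2e7 per molar off-target (2%).
example_tier = cross_reactivity_tier(1e9, 2e7)
```

If affinities are instead reported as dissociation constants, the ratio would need to be inverted before applying the same thresholds.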
The term antibody as used herein is intended to include fragments thereof which are also specifically reactive with H. pylori polypeptides. Antibodies can be fragmented using conventional techniques and the fragments screened for utility in the same manner as described above for whole antibodies. For example, F(ab')2 fragments can be generated by treating antibody with pepsin. The resulting F(ab')2 fragment can be treated to reduce disulfide bridges to produce Fab' fragments. The antibody of the invention is further intended to include bispecific and chimeric molecules having an anti-H. pylori portion.
Both monoclonal and polyclonal antibodies (Ab) directed against H. pylori polypeptides or H. pylori polypeptide variants, and antibody fragments such as Fab' and F(ab')2, can be used to block the action of H. pylori polypeptide and allow the study of the role of a particular H. pylori polypeptide of the invention in aberrant or unwanted intracellular signaling, as well as the normal cellular function of H. pylori, e.g., by microinjection of anti-H. pylori polypeptide antibodies of the present invention.
Antibodies which specifically bind H. pylori epitopes can also be used in immunohistochemical staining of tissue samples in order to evaluate the abundance and pattern of expression of H. pylori antigens. Anti-H. pylori polypeptide antibodies can be used diagnostically in immuno-precipitation and immuno-blotting to detect and evaluate H. pylori levels in tissue or bodily fluid as part of a clinical testing procedure. Likewise, the ability to monitor H. pylori polypeptide levels in an individual can allow determination of the efficacy of a given treatment regimen for an individual afflicted with such a disorder. The level of an H. pylori polypeptide can be measured in cells found in bodily fluid, such as in urine samples, or can be measured in tissue, such as produced by gastric biopsy. Diagnostic assays using anti-H. pylori antibodies can include, for example, immunoassays designed to aid in early diagnosis of H. pylori infections. The present invention can also be used as a method of detecting antibodies contained in samples from individuals infected by this bacterium using specific H. pylori antigens.
Another application of anti-H. pylori polypeptide antibodies of the invention is in the immunological screening of cDNA libraries constructed in expression vectors such as λgt11, λgt18-23, λZAP, and λORF8. Messenger libraries of this type, having coding sequences inserted in the correct reading frame and orientation, can produce fusion proteins. For instance, λgt11 will produce fusion proteins whose amino termini consist of β-galactosidase amino acid sequences and whose carboxy termini consist of a foreign polypeptide. Antigenic epitopes of a subject H. pylori polypeptide can then be detected with antibodies, as, for example, by reacting nitrocellulose filters lifted from infected plates with anti-H. pylori polypeptide antibodies. Phage, scored by this assay, can then be isolated from the infected plate. Thus, the presence of H. pylori gene homologs can be detected and cloned from other species, and alternate isoforms (including splicing variants) can be detected and cloned.
VIII. Kits Containing Nucleic Acids, Polypeptides or Antibodies of the Invention
The nucleic acid, polypeptides and antibodies of the invention can be combined with other reagents and articles to form kits. Kits for diagnostic purposes typically comprise the nucleic acid, polypeptides or antibodies in vials or other suitable vessels.
Kits typically comprise other reagents for performing hybridization reactions, polymerase chain reactions (PCR), or for reconstitution of lyophilized components, such as aqueous media, salts, buffers, and the like. Kits may also comprise reagents for sample processing such as detergents, chaotropic salts and the like. Kits may also comprise immobilization means such as particles, supports, wells, dipsticks and the like.
Kits may also comprise labeling means such as dyes, developing reagents, radioisotopes, fluorescent agents, luminescent or chemiluminescent agents, enzymes, intercalating agents and the like. With the nucleic acid and amino acid sequence information provided herein, individuals skilled in the art can readily assemble kits to serve their particular purpose. Kits further can include instructions for use.
IX. Drug Screening Assays Using H. pylori Polypeptides
By making available purified and recombinant H. pylori polypeptides, the present invention provides assays which can be used to screen for drugs which are either agonists or antagonists of the normal cellular function, in this case, of the subject H. pylori polypeptides, or of their role in intracellular signaling. Such inhibitors or potentiators may be useful as new therapeutic agents to combat H. pylori infections in humans. A variety of assay formats will suffice and, in light of the present inventions, will be comprehended by the skilled artisan.
In many drug screening programs which test libraries of compounds and natural extracts, high throughput assays are desirable in order to maximize the number of compounds surveyed in a given period of time. Assays which are performed in cell-free systems, such as may be derived with purified or semi-purified proteins, are often preferred as "primary" screens in that they can be generated to permit rapid development and relatively easy detection of an alteration in a molecular target which is mediated by a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be generally ignored in the in vitro system, the assay instead being focused primarily on the effect of the drug on the molecular target as may be manifest in an alteration of binding affinity with other proteins or change in enzymatic properties of the molecular target. Accordingly, in an exemplary screening assay of the present invention, the compound of interest is contacted with an isolated and purified H. pylori polypeptide.
Screening assays can be constructed in vitro with a purified H. pylori polypeptide or fragment thereof, such as an H. pylori polypeptide having enzymatic activity, such that the activity of the polypeptide produces a detectable reaction product.
The efficacy of the compound can be assessed by generating dose response curves from data obtained using various concentrations of the test compound. Moreover, a control assay can also be performed to provide a baseline for comparison. Suitable products include those with distinctive absorption, fluorescence, or chemi-luminescence properties, for example, because detection may be easily automated. A variety of synthetic or naturally occurring compounds can be tested in the assay to identify those which inhibit or potentiate the activity of the H. pylori polypeptide. Some of these active compounds may directly, or with chemical alterations to promote membrane permeability or solubility, also inhibit or potentiate the same activity (e.g., enzymatic activity) in whole, live H. pylori cells.
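One way a dose response curve might be reduced to a single potency figure is sketched below. It is a minimal illustration under stated assumptions, not the assay of the invention: the signal is assumed to be proportional to the amount of detectable reaction product, the control assay supplies the uninhibited baseline, and the half-maximal concentration is estimated by linear interpolation on a log10 concentration axis. The function names and all readings are hypothetical.

```python
import math


def percent_inhibition(signal, control_signal):
    """Inhibition of product formation relative to the no-compound control."""
    return 100.0 * (1.0 - signal / control_signal)


def interpolate_ic50(concentrations, signals, control_signal):
    """Estimate the concentration giving 50% inhibition.

    Interpolates linearly in log10(concentration) between the two
    neighbouring points that bracket 50%; returns None if no pair does.
    """
    points = sorted(zip(concentrations, signals))
    curve = [(math.log10(c), percent_inhibition(s, control_signal)) for c, s in points]
    for (x0, y0), (x1, y1) in zip(curve, curve[1:]):
        if y0 <= 50.0 <= y1 and y1 > y0:
            frac = (50.0 - y0) / (y1 - y0)
            return 10 ** (x0 + frac * (x1 - x0))
    return None


# Hypothetical absorbance readings for one test compound (micromolar doses).
doses_um = [0.1, 1.0, 10.0, 100.0]
absorbance = [0.98, 0.80, 0.30, 0.05]
ic50_um = interpolate_ic50(doses_um, absorbance, control_signal=1.0)
```

A potentiator would yield negative inhibition values under this convention, so the same data structure can be used with the bracketing test adapted accordingly.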
This invention is further illustrated by the following examples which should not be construed as limiting. The contents of all references and published patent applications cited throughout this application are hereby incorporated by reference.
Other Embodiments
Many of the nucleic acids and corresponding polypeptides of the invention were disclosed previously in the parent applications, U.S.S.N. 08/761,318, filed December 6, 1996 (Attorney Docket No.: GTN-009CP2), U.S.S.N. 08/736,905, filed October 1996 (Attorney Docket No.: GTN-010CP) and U.S.S.N. 08/738,859, filed October 28, 1996 (Attorney Docket No.: GTN-009CP), which are a continuation-in-part of U.S.S.N.
08/625,811, filed March 29, 1996 (Attorney Docket No.: GTN-009), and U.S.S.N.
08/758,731, filed April 2, 1996 (Attorney Docket No.: GTN-010). The correlation between sequence identification numbers in the above-identified parent applications and sequence identification numbers provided herein is outlined in Table 7 below.
TABLE 7
ORF Name | Source | Parent nt Seq ID# | Parent aa Seq ID# | CIP1 nt Seq ID# | CIP1 aa Seq ID# | Current nt Seq ID# | Current aa Seq ID#
01ae12021orf5 009CP1 5 867 3 1095 1 492 01cel0516orfl5 009CP1 15 877 10 1102 2 493 Olcel0516orfl9 009CPI 18 880 13 1105 3 494 Olcel0516orf20 009CP1 19 881 14 1106 4 495 Olcel0516orf21 009CP1 20 882 15 1107 5 496 01ce10516orf24 009CP1 21 883 16 1108 6 497 01ce11618orf24 009CP1 36 898 30 1122 7 498 Olcell618orf3 009CP1 37 899 31 1123 8 499 01cell513orf17 010CPI 44 1957 35 1554 9 500 Olcpll414orf2 009CP1 44 906 36 1128 10 501 1Ocpl17l1orfl 009CP1 46 908 37 1129 11 502 Olcell618orfl 010CP1 51 1964 38 1557 12 503 Olcell618orfl3 010CP1 54 1967 40 1559 13 504 01epll504orf5 009CP1 65 927 52 1144 14 505 01ep30520orf24 009CP1 67 929 54 1146 15 506 Olcpl1108orf6 01CP1 74 1987 54 1573 16 507 Olgp11016orfl3 009CPI 78 940 63 1155 17 508 02aell612orfl 009CP1 97 959 79 1171 18 509 02ae11612orfl5 009CPI 99 961 80 1172 19 510 Olep30520orf8 010CP1 121 2034 86 1605 20 511 02ap21113orf2 009CPI 112 974 88 1180 21 512 02ap71220orf2 009CP1 114 976 90 1182 22 513 Olgel0801orf2 01CPI 133 2046 91 1610 23 514 O2ce 10213 orfl 9 009CPI 116 1 978 92 1184- 24 515 02ceI0216orf6 009CPI1 123 985 96 1188 25 516 O2ceI 02I6orf7 009CP 1 124 986 97 1189 26 517 02ce 10809orf6 009CP1 130 992 103 1195 27 518 04ce1I1617orf27 OOICP7 1087 1538 119 372 28 519 02ep30607orf27 oo9CPI 158 1020 120 1212 2-9 520 02ge20116orf25 009CP 1 164 1026 125 1217 30 521 O2ae21214orfl OIOCP 193 2106 129 1648 31 522 03ap21820orf4 009CP1 181 1043 139 1231 32 523 03ap21820orf8 009CP1 183 1045 141 1233 33 524 O2ce 10ll4orf3 OIOCPI 206 2119 142 1661- 3 4 525 03ce2l7l7orfl oo9CPI 186 1 048 144 1236 35 526 03gp20123orf3 009CPI 196 1058 152 1244 36 527 O2ceI0216orf4 OOCPI 225 2138 152 1671 37 528 O3xelI1215orf7 009CPI 199 1061 154 1246 38 529 O2ceI1022orf2 OIOCP] 232 2145 159 1678 39 530 O4celI1617orfI6 009CP 1 207 1069 161 1253 40 531 04ce 11617orf26 009CPI 211 1073 162 1254 41
532 l4ce2 516orfl 001CP7 1306 1757 163 416 42 533 O4ce I1617orf4 009CPI 214 1076 164 1256 43 534 O4ee7OIl4orflO 00CP7 220 1082 169 1261 44 535 hp4e14535orf3 009CP 1 1414 1865 181 434 45 536 O4ge I1713orf9 009CP1 2 46 1108 185 1277 4 6 537 O4gp I1213orfil I009CPI 247 1109 186 1278 47 538 O4gpI1213orfl4 oo9CPI 248 1110 187 1279 48 539 04gp1I1213orf22 009CPI 29 1111 188 1280 49 540 04gp 11803orf13 009CP 1 254 1116 191 1283 50 541 05ae2220orf1 19 009CPI 263 1125 195 1287 51 542 05ae2220orf56 009CPI1 270 1132 197 1289 52 543 05ae2220orf58 009CP1 271 1133 198 1290 5-3 544 05ae20220orf95 OIOCPI 280 1142 201 1293 54 545 02ep30607orf32 oo9CPI -297 2210 204 1723 55 546 05ap21216orf4 009CPI 290 1152- 206 1298 56 547 I42Orfl o9CPI 291 1153 207 1299 57 548 O5cp11911orf35 009CPI 297 1159 209 F1301 58 549 05cp20518orf39 009CPI 305 1167 212 1304 59 550 05ee10411orf5 o9CPI 308 1170 213 1305 60 551 O5eplI1717orf2 009CPI 314 1176 219 1311 61 552 11901orfl9 009CP1 324 1186 225 1317 62 553 05gp2OlI1orf12 OO9CPI 326 1188 227 1319 63 554 05gp2OIIIorf4 009CP1 328- -1190 229 1321 64 555 05gp2OIIIorf6 009CPI 329 1191 230 1322 65 556 O6celO8O8orf2 009CP 1 343 1205 243 1335 66 557 O6cpI 1 I8orf7 009CPI 352 1214 251 1343 67 558 O6cp30603orfl 5 009CPI 364 12 26 22 135 6 559 06cp30603orfl 6 009CPI 365 1227 1 263 1355 69 560 06eplO306orfS 009CPI 373 1235 266 1358 70 561 O6eplII108orfI7 009CPI 378 1240 270 13-62 71 562 O6ge1I 1orfI7 009CPI 386 1 1248 277 1369 72 563 O6ge10115orfI9 009CPI 3 87 1249 278 1370 73 564 O6ge10 15orf4 009CPI 388 1250 279 1371 74 565 O7aplllllorfl3 009CPI 408 1270 1 297 1389 75 566 WO 97/37044 PCT/US97/05223 -79- 07ee11519orfl 009CPI 422 1284 304 1396 76 567 07gpll807orfl7 009CPI 448 1310 323 1415 77 568 07gp315lI6orf9 009CPI 453 1315 326 1418 78 569 09ap20802orf14 OIOCP1 465 1327 337 1429 79 570 05apl0914orf3 OIOCPI 569 2482 345 1864 80 571 05ceI06l3orfl O0OCP1 586 2499 349 1868 81 572 05cel0613orf2 009CPI 587 2500 350 1869 82 573 09ge70821orf2 009CP1 491 1353 354 1446 83 
574 11lapll902orfD 009CP 1 497 1359 357 1449 84 575 11cpl2006orfl3 009CPI 521 1383 381 1473 85 576 Ileel0423rfl 009CP1 530 1392 390 1482 86 577 lleeI0423orf4 009CPI 531 1393 391 1483 87 578 SIlgel03090orfl2 009CP1 534 1396 394 1486 88 579 llgel0309orf7 009CPI 551 1413 395 1487 89 580 12ap10324orf6 009CP 1 565 1427 398 1490 90 581 12gel0305orf5 009CP1 573 1435 403 1495 91 582 13ael05llorf3 009CP1 587 1449 410 1502 92 583 13ael0712orfl5 009CPI 592 1454 413 1505 93 584 I3apll517orfl5 009CPI 599 1461 418 1510 94 585 13eel02160rf7 009CP1 610 1472 422 1514 95 586 13epl2003orf21 009CP1 625 1487 435 1527 96 587 14ae21813orf2 009CPI 630 1492 438 1530 97 588 14cel0720orfI2 001CCP6 631 1493 439 1531 98 589 02ep30607orfl9 009CP1 1065 1516 440 1372 99 590 14cel0720rf3 009CPI 632 1494 440 1532 100 591 14cpl0705orf4 009CP1 645 1507 447 1539 101 592 14eel0308orfl 009CP1 649 1511 451 1543 102 593 14ee0308orf4 O1OCP1 651 1513 452 1544 103 594 06cplI 1722orfl I 009CP1 739 2652 454 1973 104 595 14eell217orf2 009CPI 655 1517 456 1548 105 596 14eel 1217orf3 O1OCPI 656 1518 457 1549 106 597 06cp11722orf5 009CP 1 745 2658 460 1979 107 598 14ge07050orf4 009CPI 660 1522 461 1553 108 599 14gplI 1820orfl 001CP6 663 1525 464 1556 109 600 04gel l713orf35 OIOCPI 1098 1549 466 1398 110 601 06cp20302orf3 O0OCPI 751 2664 466 1985 111 602 06cp20302orf5 OIOCPI 752 2665 467 1986 112 603 06cp20302orf6 O01CP1 753 2666 468 1987 113 604 06cp20302orf7 O0IOCPl 754 2667 469 1988 114 605 06eeI0207orfl 009CP1 763 2676 471 1990 115 606 29ge10307orf3 O0IOCP1 679 1541 475 1567 116 607 06ee 161lorfl 009CPI 770 2683 475 1994 117 608 29ge10307rf4 O0OCP1 680 1542 476 1568 118 609 06epIO306orf0 O0IOCPI 773 2686 477 1996 119 610 06ep 10306orfl1 009CP1 774 2687 478 1997 120 611 hpleI05230orf3 009CPI 687 1549 481 1573 121 612 hplpl3852orf4 009CP 1 700 1562 494 1586 122 613 hplpl3852orf6 009CPI1 701 1563 495 1587 123 614 hplpl3852orf7 009CPI 702 1564 496 1588 124 615 hplpI3947orflI 009CP1 708 1570 502 1594 125 616 
hplpl40l3orfi7 009CPI 715 1577 508 1600 126 617 hp2el02290orf2 009CP1 716 1578 509 1601 127 618 WO 97/37044 PCT/US97/05223 hp2e I 0229orf4 009CP1 717 1579 510 1602 128 619 hp2e 10911lorf24 OIOCP1 722 1584 515 1607 129 620 06gp 101080orf2 001CP6 816 2729 516 2035 130 621 07apll 015orf2 009CP 1PI 1167 1618 520 1452 131 622 hp2ell 858orf6 001CP6 730 1592 520 1612 132 623 07ap 1015orf4 009CPI1 1168 1619 521 1453 133 624 hp2p l0272orfl 009CP1 733 1595 522 1614 134 625 hp2pl0625orf5 009CP1 742 1604 525 1617 135 626 hp2p 10625orf6 009CP 1PI 743 1605 526 1618 136 627 hp3e10349rf27 009CPI 751 1613 530 1622 137 628 hp3el 1024orf34 009CPI 1 763 1625 539 1631 138 629 hp3el1024orf49 009CP1 767 1629 543 1635 139 630 hp3el 1024orf5 009CP1 768 1630 544 1636 140 631 hp3ell075orf3 OOCPI 773 1635 546 1638 141 632 07ap8060Olorf2 O01CP1 863 2776 548 2067 142 633 07ce 10203orfl O0OCP1 871 2784 554 2073 143 634 07cel0203orfl7 009CPI 873 2786 556 2075 144 635 hp3pI1086rf O0IOCPI 796 1658 561 1653 145 636 07cel1206orfl 009CPI 1 884 2797 567 2086 146 637 hp4p I 3446orfl 3 009CP1 808 1670 571 1663 147 638 IceI0917orfl0 009CP1 498 1360 572 1504 148 639 hp4p13446orf3 009CPII 809 1671 572 1664 149 640 15211 orf23 OIOCPI 827 1689 581 1673 150 641 07cpl 1213orfl 009CPI 905 2818 584 2103 151 642 hp5pl5612orf2 009CPI 848 1710 594 1686 152 643 hp5pl5641orf23 001CP6 851 1713 596 1688 153 644 12ap 10324orf5 OIOCP1 1265 1716 598 1530 154 645 07ep30818orf4 009CPI 932 2845 600 2119 155 646 Olcell618orfl 009CP1 52 1965 607 1699 156 647 OIceI1618orf22 009CP1 61 1974 608 1700 157 648 OIce I618orf27 009CPI 62 1975 609 1701 158 649 OlceII618orf7 009CPI 67 1980 610 1702 159 650 OIcelI6180rf9 009CP1 68 1981 611 1703 160 651 Olcpl171Oorfl6 009CP 1 84 1997 612 1704 161 652 Olcpil710orfl8 009CP1 86 1999 613 1705 162 653 01cpl1710OorfS 009CP1 90 2003 615 1707 163 654 Olcp11710Oorf6 009CP1 91 2004 616 1708 164 655 Olep 11710rf9 009CP 1 93 2006 617 1709 165 656 Olgel0203orfl4 009CPI 125 2038 618 1710 166 657 
01ge0I203orf6 009CPI 128 2041 619 1711 167 658 Olgel0203orf7 009CPI 129 2042 620 1712 168 659 03gel0505orf2 009CPI 376 2289 644 1736 169 660 04ce I617orfl0 001ICP6 404 2317 645 1737 170 661 14gpl2015orfl3 001CP6 1330 1781 648 1580 171 662 14gpl2015orfl6 009CP1 1332 1783 650 1582 172 663 04ge 11210orfl 009CP1 449 2362 651 1743 173 664 04ge ll713orfl 0 009CPI 455 2368 653 1745 174 665 04ge11713orf28 009CPI 462 2375 655 1747 175 666 04gel 713orf36 OIOCP1 464 2377 656 1748 176 667 09ap 1406orf2 009CP 1 1014 2927 657 2176 177 668 04gp11213orf5 001CP6 487 2400 661 1753 178 669 29geIOIlIlorfl 009CP1 1353 1804 669 1601 179 670 WO 97/37044 PCT/US97/05223 -81 05ae202200rf37 001CP6 548 2461 675 1767 180 671 hp2el09llorf2S 009CP 1 1377 1828 686 1618 181 672 06apI0209rfl 009CP 1 697 2610 690 1782 182 673 06apIO209orf4 009CPI 699 2612 691 1783 183 674 06eel07090orf2 009CPI1 767 2680 695 1787 184 675 06epl03O6orfl 009CPI 772 2685 696 1788 185 676 07cpl0312orf5 009CPI 898 2811 703 1795 186 677 07cp21714orfl 009CP1 908 2821 706 1798 187 678 07cp21714orf3 009CPI 910 2823 708 1800 .188 679 09ap20802orfl3 009CP 1 1025 2938 713 1805 189 680 09ap20802orf2l O0CPI 1031 2944 714 1806 190 681 11cel0908orfl 001CP6 1100 3013 716 2235 191 682 hp4pl 1352orf9 009CP1 1419 1870 718 1650 192 683 llgel0309orf24 009CPI 1155 3068 725 1817 193 684 llgel03090rf28 009CPI 1157 3070 727 1819 194 685 llgel0309orf39 009CP1 1164 3077 731 1823 195 686 lgel0309orf4 009CP1 1165 3078 732 1824 196 687 lgel0309orf5 OIOCPI 1166 3079 733 1825 197 688 11cpl2002orf3 009CP 1 1129 3042 745 2264 198 689 13ael0610orfl O0OCP1 1245 3158 747 1839 199 690 IleeI0423orf2 009CP1 1137 3050 753 2272 .200 691 14cpl0119orfl O0OCPI 1349 3262 758 1850 201 692 1lgel0308orfl 009CP1 1145 3058 761 2280 202 693 14eeIO308orf7 009CP1 1375 3288 762 1854 203 694 14gel0705orf3 009CPI1 1411 3324 763 1855 204 695 hplel0506orf5 009CP1 1495 3408 766 1858 205 696 hp3pl0807orf4 009CPI 1746 3659 775 1867 206 697 hp3pl0807orf7 009CP1 1749 3662 776 1868 
207 698 hp4el2063orfl OIOCP1 1754 3667 777 1869 208 699 12ap10324orf8 009CPI 1200 3113 778 2297 209 700 hp4plI352orf5 O0IOCPI 1783 3696 782 1874 210 701 12gel0610orf2 O0IOCP1 1217 3130 784 2303 211 702 13eel20l6orf7 01OCPI 1312 3225 844 2363 212 703 14eeI0419orf5 010CPI 1384 3297 898 2417 213 704 14ee 11217orf4 OIOCP 1390 3303 904 2423 214 705 14ee2lll8orfl O0IOCP1 1391 3304 905 2424 215 706 14ee21118orf2 OIOCP1 1392 3305 906 2425 216 707 14eplI15orf3 010OCP1 1397 3310 911 2430 217 708 14ep ll905orfl3 OIOCP1 1400 3313 914 2433 218 709 1l4ep 1l905orf9) OIOCP1 1408 3321 922 2441 219 710 29ap10306rf3 O1OCP1 1462 3375 957 2476 220 711 29apIl902orfl OIOCPI 1463 3376 958 2477 221 712 29ep20112orf2 OIOCP1 1469 3382 964 2483 222 713 29geIOIllorf3 01OCPI 1471 3384 966 2485 223 714 hplel0554orfl 010OCP1 1500 3413 982 2501 224 715 hplplO543orf4 OIOCP 1506 3419 988 2507 225 716 hplpl3947orfl0 010ICPI 1552 3465 1025 2544 226 717 hp2el091 Iorf35 010CPI1 1592 3505 1047 2566 227 718 hp3e00570orf3 OIOCPI 1617 3530 1062 2581 228 719 hp3el0128orfl OIOCP1 1619 3532 1064 2583 229 720 hp3el0302orfl7 OIOCP1 1627 3540 1072 2591 230 721 hp3el0302orf26 OIOCPI 1633 3546 1078 2597 231 722 WO 97/37044 PCT/US97/05223 -82hp3elIIl22orf3 OIOCPI 1693 3606 1127 2646 232 723 hp3pl0156orf2 OIOCP1 1715 3628 1141 2660 233 724 hp3plOlS56orf3 010CP 1716 3629 1142 2661 234 725 hp3plOI56orf7 0OICP1 1718 3631 1144 2663 235 726 hp3pl0304orf2 O1OCP1 1724 3637 1148 2667 236 727 hp3plO8O7orf6 OIOCPI 1748 3661 1160 2679 237 728 hp3p1086orf2 OIOCPI 1753 3666 1164 2683 238 729 hp4el4535orfl OIOCPI 1776 3689 1178 2697 239 730 hp4e14535orf8 OIOCPI 1777 3690 1179 2698 240 731 hp4p 1393rfI O1OCPI 1785 3698 1182 2701 241 732 hp4p 113 93 orf2 OIOCPI 1786 3699 1183 2702 242 733 hp4p 11393orf6 O1OCPI 1790 3703 1187 2706 243 734 hp4pl200rf2 OIOCP1 1792 3705 1189 2708 244 735 hp4pl3446orfS OOCPI 1806 3719 1203 2722 245 736 hp~pl5580orfl O1OCP1 1889 3802 1258 2777 246 737 hp5pl5653rf OIOCPI 1898 3811 1264 2783 247 738 
hp5pI5653orf2 O1OCP1 1899 3812 1265 2784 248 739 04ap20904orf3 O1OCPI 205 1067 1281 2800 249 740 O6gpI 192OrfllI O1OCP1 398 1260 1288 2807 250 741 12ge20305orf26 OIOCPI 581 1443 1301 2820 251 742 l3ee0216orfS7 009CP1 609 1471 1303 2822 252 743 04cp1120224256567_c3_117 009CP1 783 1715 253 744 OlaellOlO_40688_c2 38 009CP1 790 1882 254 745 01ae2001_24218781-f2 18 009CPI 791 1883 255 746 01ge1161923711062c3_14 009CP1 795 1887 256 747 Olgpl016_4103403_c2_13 009CP1 799 1891 257 748 02ae11612_1074212fl_1 009CP1 800 1892 258 749 02ae11612_22477267-f2 27 009CP1 802 1894 259 750 02ae11612_23598175 fl 2 009CPI 803 1895 260 751 02ae11612_33203250 cl 51 009CPI 804 1896 261 752 02ge10116 36367936 c19 009CP1 807 1899 262 753 02ge11622 875260 S 36 009CPI 809 1901 263 754 02gp20706 1203402 c3 58 009CP1 810 1902 264 755 02gp20706 15781452 c2 51 009CPI 813 1905 265 756 02gp20706_23632775_13 32 009CPI 815 1907 266 757 02gp20706 23866562.c2 53 009CP1 816 1908 267 758 02gp20706_48925580B_19 009CPI 821 1913 268 759 02gp2081424415958f39 009CPI 824 1916 269 760 O4cpl 1202_24261588_f2_23 OIOCP1 827 1919 270 761 O7ap11015_23938312 -S2 009CP1 831 1763 271 762 05ae30220 21619067 f 56 009CP1 835 1927 272 763 07ap1121335401528_c21 009CPI 835 1767 273 764 05ae30220 24882812 c3 103 009CPL1 837 1929 274 765 05ae30220 25953163 c3 98 009CP1 839 1931 275 766 05ee10816 14649077 B 18 009CP1 841 1933 276 767 05ee10816 4103408 f2 11 009CPI 843 1935 277 768 05ee10816 4687651 ci 22 009CPI 844 1936 278 769 06ap11119 16594193 fl 9 009CPI 845 1937 279 770 06ap20306234337632_ 3 9 009CPI 851 1943 280 771 06cp30603 23452 c380 009CPI 854 1946 281 772 06cp30603 23476568 ci 44 009CP1 855 1947 282 773 06cp30603 4689068 c3 79 009CPI 857 1949 283 774 WO 97/37044 PCTIUS97/05223 83 07apl 1111_234693_c3 14 009CP1 860 1952 284 775 07ap20216 72272020f3 10 009CP 865 1957 285 776 07ep1 1916_5273452_c3 1 009CPI 866 1958 286 777 09cp11003 19532625 c3 17 009CP1 869 1961 287 778 09cp20502_24001388 cl 31 009CPI 871 1963 288 779 
12gp31106 3024126 f2 25 009CP1 876 1968 289 780 13ae10712_14100018 f2_12 009CP1 879 1971 290 781 13ae10712 29569208 c2 27 009CPI 880 1972 291 782 l4ap10815_20585777_ci_13 009CPI 884 1976 292 783 29ge30321_34157812_f3_10 009CP1 886 1978 293 784 hplp14013_11726503_c2_20 009CPI 888 1980 294 785 hp2p10272_23697200_S_22 009CPI 891 1983 295 786 hp2p10272_26829136_ft_1 009CP1 894 1986 296 787 hp5el5211_819455_c2_24 009CP1 897 1989 297 788 hp5pl52l2_34064750_f29 009CPI 901 1993 298 789 hp5p15641_21698387_c220 009CPI 904 1996 299 790 hp6e0967_23476502 V2 6 009CP1 908 2000 300 791 hp6e10967 24882750Vf2 7 009CP1 909 2001 301 792 hp6e12267 4876718 f2_23 009CPI 912 2004 302 793 hp6e20339 1190660 c246 009CPI 913 2005 303 794 hp6e20339 21492187 ci 40 009CP 1 914 2006 304 795 hp6e20339 34024187 ci 37 009CPI 916 2008 305 796 Olge]0203 35281542c3_.16 009CP1 930 2022 306 797 Olge10203 860166 S3 9 009CPI 931 2023 307 798 Olge11619 13788141_c2_11 009CPI 932 2024 308 799 Olgel1619 24415880 c2_12 009CPI 934 2026 309 800 01ge11619 24417813 ci 8 009CPI 935 2027 310 801 02gp20706_16803513_fi_1 009CPI 946 2038 311 802 02gp20706_20365905 f2_8 009CP1 947 2039 312 803 02gp20814 3984818 fl 1 009CPI_ 949 2041 313 804 04cp11202 16603425 c2 72 009CPI 955 2047 314 805 04cp11202 19797128 fl 5 009CP1. 
956 2048 315 806 04cp11202 23553177ci 75 009CPI 957 2049 316 807 O4cp 1 202 23553177 c3 109 009CPI 958 2050 317 808 04ge10816 22086531 f2 10 009CPI 963 2055 318 809 04ge10816 33726080_c2_29 009CPI 964 2056 319 810 05ae30220 14350428 fl 9 009CPI 969 2061 320 811 05ae30220 9882767 f2 34 009CP 1 978 2070 321 812 05cp21223 47254430 14 009CPI 981 2073 322 813 05ee10816 259703 V 7 009CP1 982 2074 323 814 06ap11119 24426508 3_26 009CP1 984 2076 324 815 06ee10709 21675012 fi 2 009CP1 993 2085 325 816 07ce11019_22051291 f I1 009CPI 1004 2096 326 817 07ep11916 5913592_3_18 009CPI 1006 2098 327 818 09cp11003 5945252_V4 009CPI 1013 2105 328 819 09ze10333_22460750_V_6 009CPI 1014 2106 329 820 12ge10321 4821082_£314 009CPI 1028 2120 330 821 14ap10221_13689381_c3_4 009CPI 1035 2127 331 822 29ep10720 24220926 f2 8 009CPI 1046 2138 332 823 29ep10720 24432762 c3 39 009CP1 1047 2139 333 824 29ge30321 12913562 fl 1 009CP 1050 2142 334 825 29ge30321 135253 f 6 009CPI 1051 2143 335 826 WO 97/37044 PCTIUS97/05223 -84- 29ge30321 21673965 f2 7 009CP1 1052 2144 336 827 29ge30321 24336712 fl 5 009CP 1 1053 2145 337 828 hp2p1027222692325 03 21 009CP_ 1059 2151 338 829 hp2p10272 24406280 ci 26 009CP1 1060 2152 339 830 hp3p10807 29343768 fI_ 009CP1 1061 2153 340 831 hp3p0807 29352212 f2 5 009CP1 1062 2154 341 832 hp6e20339 24317062 c3 57 009CP1 1078 2170 342 833 hp6p10233_12273302 flI 009CP 1087 2179 343 834 02ep20506 24611325 B 6 OIOCPI 1341 2860 344 835 07ap11213 35156577 ci 24 O1OCPI 1410 2929 345 836 07ap80601 976413 3-9 O1OCPl 1414 2933 346 837 hp3p10807 189075 f2 4 O1OCPI 1464 2983 347 838 02ge11622 21695936 ci 54 OIOCPI 1493 3012 348 839 12ge10321 243085130B 20 OIOCP1 1504 3023 349 840 14ce21516 85786 fl I OIOCPI 1508 3027 350 841 hp6e2267 14650278 B 29 OIOCPI 1517 3036 351 842 06cp30603 23476568 c2 133.aa 352 843 06ce20610_1367157 fl 8.aa 353 844 06ep30223 4698838 B f55 354 845 12ap103244805318_B2_3_ 355 846 14ee41924 2458267 c2 93 356 847 O6epl191724803153 c3 24 357 848 
01ce11104_36125337 ci_8 358 849 01ce21104_33203250_c3_87 359 850 02ae31010_34616666 f2 27 360 851 02ae31010_3527000013_33 361 852 02ae31010 36132785 f2 29 362 853 02ge10116 15781452 ci 87 363 854 02ge10116 16803513 B2 34 364 855 02ge10116 16803513 B 34 365 856 02ge10116 36367936 ci 92 366 857 03ae10804 23485968 c3 47 367 858 05ce10910 25598277 B 3 368 859 06ae11016_30579712f2_21 369 860 06ce20610 29298537 c2 32 370 861 06ce20610_3913967_c3_36 371 862 06ce20610 4331338 f3 18 372 863 06cp11118212827_ci_17 373 864 06cp11217_19720300_B_11 374 865 O6cpl1217 4881263 f2 9 375 866 06cp112174897077_ fl6 376 867 06cp30603 21492187 f2_41 377 868 06cp30603_34024187_fI 20 378 869 06cp30603 34024187fi 20 379 870 06ep10615 14649077fB 52 380 871 06ep10615_9842_fl_46 381 872 06ep 1202 26353438 ci 22 382 873 06ep30223_23557202_c2_130 383 874 06ep30223 34409437f3 94 384 875 06ep30223 4698838 B2 55 385 876 06ep30223 4876077 c3 149 386 877 06ep30223 5109443 ci 109 387 878 WO 97137044 WO 9737044PCTIUS97/05223 85 06ep30223_5271902_ci_106 388 879 06gp10409 3398427 f2 12 389 880 06gp71906_24261588 c2_174 881 06gp71906 970325 03 190 391 882 O7ae 10923_24426508 f] 1 883 O7ae 10923 24426508 f] 1 884 09ce 104 13_ 414011_fI_-3 885 09ce10413_5865665 f1_4 395 886 09ce52017 29324062 ci21 396 887 09cp10224_1062966_c3_61 397 888 09cp10224_1412715_c3_56 398 889 09cp 10224 4295 10 c2 46.aa ___399 890 09cp10224 4484718 ci 38 891 09cp21607 7224187 c2 12 401 892 09cp61003 14562637 c2 93 402 893 09cp6 1003 19532625 -ci 78 894 09cp61003 24063587 ci 74 895 09cp61003_24335762_c3_111 405 896 09cp61003_5945252_fl_5 406 897 11ae80818 11188791 c3 60 407 898 11ae80818 783127 c3 63 408 899 11ae80818 7952 ci 49 409 900 11ap20714 4960432 c3 97 410 901 l1ap20714 5271967 ci 60 411 902 11 ap20714_7227202f1343.aa ___412 903 11ap20714 7227202 S3 43.aa 413 904 14ce3 1519 15635927 S3 15 414 905 14ee41924_23527267_c3107 906 14ee41924 23834800 1 2 -32 907 hplp13939 25397327 130 22 417 908 hp2elO9ll 24855312 ci 69 418 909 
hp2eIO9lI_3349_ci_63 910 hp4e13394 35957200 fI21 ___420 911 hp4e13394_5964452_c2_97 912 hp4e53394 22864682 c2 86.aa 913 hp5e15044 4554652_-1 3 423 914 29300311 ci 29 ___424 915 hp5p15575 33445317 1 2 -20.aa 916 hp5p15575 6140713 f2 18 426 1917 hp5p]5641 12195281 ci 24 427 1918 hp5p15641 24304527_c3_35 428 919 hp5p15641_25635452_c3_34 920 hp5p15870 14350428 flI1 430 921 hp6plO59O_23440913_c2_31 431 922 hp6p10606_19546933_c3_31 432 923 hp6p10903_43 98263f 3_6.aa 433 924 hp6p 10903_4398263_13_6.aa 434 925 hp6p10904_2214676_ci14 435 926 hp6p10904_23704412_f2_5 436 927 01cp20708 36134808 f2 11 437 928 02ae31010 1-2504512 13 28.aa 438 929 02ae31010 16833312 1219 439 930 WO 97/37044 PCT/US97/05223 -86- 02ae31010 211708734 440 931 2ae31010 30208317fl_ 441 932 02ae31010 36132_85 Q 29 442 933 2ae31010_5085162 cl 47 443 934 02cpO615 26573462_ __45 444 935 03ae_0804 12609533cl26 445 936 03ae1080 4 _21698400_c2_32 446 937 04ep419 0 3 26757937_B_16 447 938 04ep41903 26757937 03 16 448 939 04ep41903 4101593 f2 10.aa 449 940 05ep0815 26570332 c2 99 450 941 05ep10815 4195292 ci 84 451 942 05ep10815 4719175 ci 83_ 452 943 06cp30603 679218 f234 453 944 06ep10615 961562_12_41 454 945 O6ep 1202 133293 c19 455 946 06ep11202 26353438 ci 22 456 947 O6epl1202 4884677 ci_17 457 948 06ep11202 792962 c2 26.aa 458 949 06ep11917 24803153 c3 24 459 950 06ep30223 16512 c30 160 460 951 06ep30223 23476067 ci 119 461 952 06gp71906 15115637 f2_59 462 953 06gp71906 25478192 cl 131 463 954 06gp71906 25504187 £3 112 464 955 09cp6003 492187 c2 80.aa 465 956 11ae80818 19632781 c3 57 466 957 11ae80818 7290627_c2_51 467 958 11ap20714 34023312_£3_46 468 959 Ileel1408_4977193 cl 41.aa 469 960 11ge0308 5256 f2_1 470 961 12ap10324 13178562_S3 6 471 962 14ce61516 13073577 12 12 472 963 14ee41924 16282067 ci 72 473 964 hp2eIO9ll 4882027 c2 87 474 965 hp3e1188_47327f2_5 475 966 hp3e11188_5082842_S12 476 967 hp4e13394 3368767 ci 80 477 968 hp5p5212 6928132 03 34 478 969 hp5p1564130273312 c2 28 479 970 
hp5p15641_5211687_c2_29 480 971 hp6plO59O30521093_2_14 481 972 hp6p10904_7089062_ci 16 482 973 hp6p12129 16603417 £3 14 483 974 hp6p2244_3948467_ci_52 484 975 hp6p22217 23470967 fl 4 485 976 hp7e10192 4412568 12 5 486 977 hp7p10287_24611325_c224 487 978 hp7p10290 25548812 £3-14 488 979 hp7p0290 25585941 £3 12 489 980 hp7p10290 35156558 £3 15 490 981 hp7p10290 4351718 fl 6 491 982 PCTIUS97/05223 WO 97/37044 -87- IlaeII922 12586675f2_983 1 1037 02gel _23 8665 62 0 146 984 1038 07eeII402 2458267 0 108 985 1039 07ee50709 10213593 77 986 1040 06ep]0615 14649077 f2 30 987 1041 09cpIO713 23452 0 195 988 1042 11ap20714 7227202 f3 40 989 1043 01ce6101 6 _23609580c3_139 990 1044 06gp7190 6 024 1 26 cl 128 991 1045 O6ge205011410001 8 ci 34 992 1046 09cp10713 3 4 0 2 4 1 8 7 fl 31 993 1047 hp4p62853 5914693 c3 52 994 1048 13ae1061 0 _859692_c2_32_ 995 1049 13ae10610_3591212_3 996 1050 06ap1111 9 2 4 4 2 6 5 08 f 27 997 1051 14gp114 23 2680380 l 13 7 998 1052 06ap1060 9 25866 7 5 f2 19 999 1053 07ee50709 4818967f2 43 1000 1054 02ap1111723 495 1 8 7 c3 81 _1001 1055 09cp10713 3 4024 1 8 7 fl 31 1002 1056 07ee5070935156577 f 3 80 1003 1057 05ae30220_976413_c3 204 1004 1058 hp3e11188 47327 f29 1005 1059 12ap10324 4805318 f26 1006 1060 hple80523 234859 68 c2 49 1007 1061 hp7e10192 25598277_c2_15 1008 1062 hp4e53394 26209843 c3 98 1009 1063 06ep10615 9842 fl 5 1010 1064 06ep3022 3 34409437 2 64 1011 1065 06gp10409 3398427 2 12 1012 1066 09ce0413 5865665 fl 4 1013 1067 09cp10224 1062966 ci 44 1014 1 1068 01ce61016 12931513 c2 106 1015 1069 Olce61016 23609580 c3 139 1016 1070 14cp11908_25593768_c3_97 1017 1071 14cp11908_783127 cl 72 1018 1072 07ee11402 10759567 c2 86 1019 1073 07ee5070 9 4818967 2 43 1020 1074 hp4e13394 5088562 13 54 1021 1075 hp4e13394 15828963 c2 90 1022 1076 hp4e53394 19720300 c3 98 1023 1077 07ce10312 4554652 f3 2 1024 1078 05ae30220 14350428 3 91 1 1025 1 1079 hp8e0080 19546933 c2 88 1026 1080 01ce10320 30273587 S 38 1027 1081 07ee50709_26438968f12_36 1082 
05ep10815 4719175 ci 115 1029 1083 06ep3223 23476067_ci_115 1030 1084 hp7e10590_26172564 cl 68 1031 1085 05ae30220_4977193_c3_198 1032 1086 Inl 187 I hp7e10557_21698387fI._ 07ee11402_19565702c2_88 vv 1034 1088 WO 97/37044 PCT/US97/05223 -88- 07ee50709 960952 f2 47 1035 1089 hp6p12244 3948467 c3 88 1036 1090 hp7e10590 13073577 c3 107 1293 1296 Olce61016 492187 c3 120 1294 1297 06ep10615_961562 fl_15 1295 1298
EXEMPLIFICATION
I. Cloning and Sequencing of H. pylori DNA
H. pylori chromosomal DNA was isolated according to a basic DNA protocol outlined in Schleif R.F. and Wensink, Practical Methods in Molecular Biology, p. 98, Springer-Verlag, NY, 1981, with minor modifications. Briefly, cells were pelleted and resuspended in TE (10 mM Tris, 1 mM EDTA, pH 7.6), and GES lysis buffer (5.1 M guanidinium thiocyanate, 0.1 M EDTA, pH 8.0, 0.5% N-laurylsarcosine) was added. The suspension was chilled and ammonium acetate (NH4Ac) was added to a final concentration of 2.0 M. DNA was extracted, first with chloroform, then with phenol-chloroform, and re-extracted with chloroform. DNA was precipitated with isopropanol, washed twice with 70% EtOH, dried and resuspended in TE.
Following isolation, whole genomic H. pylori DNA was nebulized (Bodenteich et al., Automated DNA Sequencing and Analysis (Venter, ed.), Academic Press, 1994) to a median size of 2000 bp. After nebulization, the DNA was concentrated and separated on a standard 1% agarose gel. Several fractions, corresponding to approximate sizes of 900-1300 bp, 1300-1700 bp, 1700-2200 bp and 2200-2700 bp, were excised from the gel and purified by the GeneClean procedure (Bio 101, Inc.).
The purified DNA fragments were then blunt-ended using T4 DNA polymerase.
The healed DNA was then ligated to unique BstXI-linker adapters in 100-1000 fold molar excess. These linkers are complementary to the BstXI-cut pMPX vectors, while the overhang is not self-complementary. Therefore, the linkers will not concatemerize, nor will the cut vector religate to itself easily. The linker-adapted inserts were separated from the unincorporated linkers on a 1% agarose gel and purified using GeneClean. The linker-adapted inserts were then ligated to each of the 20 pMPX vectors to construct a series of "shotgun" subclone libraries. The vectors contain an out-of-frame lacZ gene at the cloning site which becomes in-frame in the event that an adapter-dimer is cloned, allowing these clones to be avoided by their blue color.
All subsequent steps were based on the multiplex DNA sequencing protocols outlined in Church G.M. and Kieffer-Higgins, Science 240:185-188, 1988. Only major modifications to the protocols are highlighted. Briefly, each of the 20 vectors was then transformed into DH5α competent cells (Gibco/BRL, DH5α transformation protocol). The libraries were assessed by plating onto antibiotic plates containing ampicillin, methicillin and IPTG/Xgal. The plates were incubated overnight at 37°C.
Successful transformants were then used for plating of clones and pooling into the multiplex pools. The clones were picked and pooled into 40 ml growth medium cultures. The cultures were grown overnight at 37°C. DNA was purified using the Qiagen Midi-prep kits and Tip-100 columns (Qiagen, Inc.). In this manner, 100 µg of DNA was obtained per pool. Fifteen 96-well plates of DNA were generated to obtain a 5-10 fold sequence redundancy with 250-300 base average read lengths.
These purified DNA samples were then sequenced using multiplex DNA sequencing based on chemical degradation methods (Church G.M. and Kieffer-Higgins, Science 240:185-188, 1988) or by Sequitherm (Epicentre Technologies) dideoxy sequencing protocols. The sequencing reactions were electrophoresed and transferred onto nylon membranes by direct transfer electrophoresis from 40 cm gels (Richterich P. and Church, Methods in Enzymology 218:187-222, 1993) or by electroblotting (Church, supra). 24 samples were run per gel. 45 successful membranes were produced by chemical sequencing and 8 were produced by dideoxy sequencing. The DNA was covalently bound to the membranes by exposure to ultraviolet light, and hybridized with labeled oligonucleotides complementary to tag sequences on the vectors (Church, supra).
The membranes were washed to rinse off non-specifically bound probe, and exposed to X-ray film to visualize individual sequence ladders. After autoradiography, the hybridized probe was removed by incubation at 65°C, and the hybridization cycle was repeated with another tag sequence until the membrane had been probed 38 times for chemical sequencing membranes and 10 times for the dideoxy sequencing membranes.
Thus, each gel produced a large number of films, each containing new sequencing information. Whenever a new blot was processed, it was initially probed for an internal standard sequence added to each of the pools.
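The figures quoted above can be cross-checked with a short calculation. The following is a minimal sketch only: it assumes that every probing of a membrane yields one film carrying one readable ladder per sample, and it uses an H. pylori genome size of about 1.65 Mb, a value that is not stated in this text and is supplied here purely as an illustrative assumption.

```python
# Counts quoted in the text above.
SAMPLES_PER_GEL = 24
CHEMICAL = {"membranes": 45, "probings": 38}
DIDEOXY = {"membranes": 8, "probings": 10}

# ASSUMPTION (not from the text): approximate H. pylori genome size in bp.
ASSUMED_GENOME_BP = 1_650_000


def films(run):
    """Number of films: one film per membrane per probing cycle."""
    return run["membranes"] * run["probings"]


def total_ladders():
    """Total sequence ladders, assuming one ladder per sample per film."""
    return (films(CHEMICAL) + films(DIDEOXY)) * SAMPLES_PER_GEL


def fold_redundancy(read_length, genome_bp=ASSUMED_GENOME_BP):
    """Average sequence redundancy for a given average read length."""
    return total_ladders() * read_length / genome_bp


if __name__ == "__main__":
    print("films:", films(CHEMICAL) + films(DIDEOXY))
    print("ladders:", total_ladders())
    for length in (250, 300):
        print(length, "bases ->", round(fold_redundancy(length), 1), "fold")
```

Under these assumptions the estimate falls inside the 5-10 fold redundancy range stated for the project; a different genome size or a lower fraction of readable ladders would shift it accordingly.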
Digital images of the films were generated using a laser-scanning densitometer (Molecular Dynamics, Sunnyvale, CA). The digitized images were processed on computer workstations (VaxStation 4000's) using the program REPLICA™ (Church et al., Automated DNA Sequencing and Analysis (Venter, ed.), Academic Press, 1994).
Image processing included lane straightening, contrast adjustment to smooth out intensity differences, and resolution enhancement by iterative gaussian deconvolution.
The sequences were then automatically picked in REPLICA™ and displayed for interactive proofreading before being stored in a project database. The proofreading was accomplished by a quick visual scan of the film image followed by mouse clicks on the bands of the displayed image to modify the base calls. For typical sequences derived by chemical sequencing, the error rate of the REPLICA™ base calling software was with most errors occurring near the end of a sequence read. Many of the sequence errors could be detected and corrected because multiple sequence reads covering the same portion of the genomic DNA provide adequate sequence redundancy for editing. Each sequence automatically received a number corresponding to (microtiter plate and probe information) and lane set number (corresponding to microtiter plate columns). This number serves as a permanent identifier of the sequence, so it is always possible to identify the origin of any particular sequence without recourse to a specialized database.
Routine assembly of H. pylori sequences was done using the program FALCON (Church, Church et al., Automated DNA Sequencing and Analysis (Venter, ed.), Academic Press, 1994). This program has proven to be fast and reliable for most sequences. The assembled contigs were displayed using a modified version of GelAssemble, developed by the Genetics Computer Group (GCG) (Devereux et al., Nucleic Acid Res. 12:387-95, 1984), that interacts with REPLICA™. This provided an integrated editor that allows multiple sequence gel images to be instantaneously called up from the REPLICA™ database and displayed, permitting rapid scanning of contigs and proofreading of gel traces where discrepancies occurred between different sequence reads in the assembly.
II. Identification, Cloning and Expression of H. pylori Nucleic Acids
Expression and purification of the H. pylori polypeptides of the invention can be performed essentially as outlined below.
To facilitate the cloning, expression and purification of membrane and secreted proteins from H. pylori, a gene expression system, such as the pET System (Novagen), for cloning and expression of recombinant proteins in E. coli, is selected. Also, a DNA sequence encoding a peptide tag, the His-Tag, is fused to the 3' end of DNA sequences of interest in order to facilitate purification of the recombinant protein products. The 3' end is selected for fusion in order to avoid alteration of any 5' terminal signal sequence.
The exception to the above is ppiB, a gene cloned for use as a control in the expression studies. The sequence for H. pylori ppiB contains a DNA sequence encoding a His-Tag fused to the 5' end of the full length gene, because the protein product of this gene does not contain a signal sequence and is expressed as a cytosolic protein.
PCR Amplification and Cloning of Nucleic Acids Containing ORFs for Membrane and Secreted Polypeptides from H. pylori
Nucleic acids chosen (for example, from the nucleic acids set forth in the Sequence Listing) for cloning from the J99 strain of H. pylori are prepared for amplification cloning by polymerase chain reaction (PCR). Synthetic oligonucleotide primers specific for the 5' and 3' ends of open reading frames (ORFs) are designed and purchased from GibcoBRL Life Technologies (Gaithersburg, MD, USA). All forward primers (specific for the 5' end of the sequence) are designed to include an NcoI cloning site at the extreme 5' terminus. These primers are designed to permit initiation of protein translation at a methionine residue followed by a valine residue and the coding sequence for the remainder of the native H. pylori DNA sequence. All reverse primers (specific for the 3' end of any H. pylori ORF) include an EcoRI site at the extreme terminus to permit cloning of each H. pylori sequence into the reading frame of the pET-28b. The pET-28b vector provides sequence encoding an additional 20 carboxy-terminal amino acids including six histidine residues (at the extreme C-terminus), which comprise the His-Tag. An exception to the above, as noted earlier, is the vector construction for the ppiB gene. A synthetic oligonucleotide primer specific for the 5' end of the ppiB gene encodes a BamHI site at its extreme 5' terminus and the primer for the 3' end of the ppiB gene encodes a XhoI site at its extreme 5' terminus.
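The primer layout described above can be illustrated with a short sketch. It is not the primer set actually used: the ORF in the usage note is a made-up sequence, the annealing length is an arbitrary parameter, and the sketch assumes that the native sequence resumes at the second codon of the ORF and that the native stop codon is omitted so that translation can continue into the vector-encoded His-Tag. Reading-frame adjustment at the EcoRI junction is not modelled.

```python
NCOI_SITE = "CCATGG"    # contains the ATG start codon; its final G begins the next codon
ECORI_SITE = "GAATTC"
VALINE_CODONS = {"GTT", "GTC", "GTA", "GTG"}
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def forward_primer(orf, anneal_len=18, valine_codon="GTG"):
    """NcoI site at the extreme 5' terminus, then Met-Val, then native sequence.

    ASSUMPTION: the native start codon is replaced by the ATG inside the
    NcoI site and the native sequence resumes at codon 2.
    """
    if valine_codon not in VALINE_CODONS:
        raise ValueError("valine_codon must be one of GTT, GTC, GTA, GTG")
    native = orf.upper()[3:3 + anneal_len]
    # CCATGG already supplies ATG plus the G of the valine codon.
    return NCOI_SITE + valine_codon[1:] + native


def reverse_primer(orf, anneal_len=18):
    """EcoRI site followed by the reverse complement of the ORF 3' end.

    ASSUMPTION: a terminal stop codon, if present, is dropped.
    """
    orf = orf.upper()
    body = orf[:-3] if orf[-3:] in STOP_CODONS else orf
    return ECORI_SITE + reverse_complement(body[-anneal_len:])
```

For a hypothetical ORF such as ATGAAAAAGCTTTTAGCGGCTATTGGCGTTTTATCTGCGTAA, calling forward_primer(orf, anneal_len=12) and reverse_primer(orf, anneal_len=12) gives a forward primer that begins CCATGGTG (NcoI site, Met, Val) and a reverse primer that begins with GAATTC.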
Genomic DNA prepared from the J99 strain of H. pylori is used as the source of template DNA for PCR amplification reactions (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). To amplify a DNA sequence containing an H. pylori ORF, genomic DNA (50 nanograms) is introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers (forward and reverse primers) complementary to and flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 100 microliters.
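The final concentrations above translate into pipetting volumes once stock concentrations are fixed. The sketch below uses the relation C1 x V1 = C2 x V2; every stock concentration and the template volume are assumptions chosen for illustration and are not given in the text, and the reaction buffer, which the text does not list, is folded into the make-up volume.

```python
def stock_volume_ul(final_conc, stock_conc, final_volume_ul):
    """Volume of stock needed so that final_conc is reached in final_volume_ul.

    Both concentrations must be in the same unit (C1 * V1 = C2 * V2).
    """
    if stock_conc <= 0 or final_conc > stock_conc:
        raise ValueError("stock must be positive and at least as concentrated as the target")
    return final_conc * final_volume_ul / stock_conc


def pcr_mix(final_volume_ul=100.0, template_ul=1.0):
    """Pipetting scheme for one reaction; all stock values are ASSUMED."""
    mix = {
        "MgCl2 (25 mM stock)": stock_volume_ul(2.0, 25.0, final_volume_ul),
        "forward primer (10 uM stock)": stock_volume_ul(1.0, 10.0, final_volume_ul),
        "reverse primer (10 uM stock)": stock_volume_ul(1.0, 10.0, final_volume_ul),
        "dNTP mix (10 mM each stock)": stock_volume_ul(0.2, 10.0, final_volume_ul),
        "polymerase (5 U/ul stock)": 2.5 / 5.0,   # 2.5 units per reaction
        "template (50 ng)": template_ul,
    }
    # Reaction buffer is not specified in the text; it is part of this make-up volume.
    mix["water and buffer to volume"] = final_volume_ul - sum(mix.values())
    return mix


if __name__ == "__main__":
    for component, volume in pcr_mix().items():
        print(f"{component:32s} {volume:6.2f} ul")
```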
Upon completion of thermal cycling reactions, each sample of amplified DNA is washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, USA). All amplified DNA samples are subjected to digestion with the restriction endonucleases NcoI and EcoRI (New England BioLabs, Beverly, MA, USA) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). DNA samples are then subjected to electrophoresis on 1.0% NuSieve (FMC BioProducts, Rockland, ME, USA) agarose gels. DNA is visualized by exposure to ethidium bromide and long-wave UV irradiation. DNA contained in slices isolated from the agarose gel is purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA, USA).
Cloning of H. pylori Nucleic Acids Into an Expression Vector
The pET-28b vector is prepared for cloning by digestion with the endonucleases NcoI and EcoRI (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). In the case of cloning ppiB, the pET-28a vector, which encodes a His-Tag that can be fused to the 5' end of an inserted gene, is used, and the cloning site is prepared for cloning with the ppiB gene by digestion with BamHI and XhoI restriction endonucleases.
Following digestion, DNA inserts are cloned (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) into the previously digested pET-28b expression vector, except for the amplified insert for ppiB, which is cloned into the pET-28a expression vector. Products of the ligation reaction are then used to transform the BL21 strain of E. coli (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994) as described below.
Transformation Of Competent Bacteria With Recombinant Plasmids
Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), are transformed with recombinant pET expression plasmids carrying the cloned H. pylori sequences according to standard methods (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994). Briefly, 1 microliter of ligation reaction is mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse, after which samples are incubated in 0.45 milliliters SOC medium (yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and mM glucose) at 37°C with shaking for 1 hour. Samples are then spread on LB agar plates containing 25 microgram/ml kanamycin sulfate for growth overnight.
Transformed colonies of BL21 are then picked and analyzed to evaluate cloned inserts as described below.
Identification Of Recombinant Expression Vectors With H. Pylori Nucleic Acids
Individual BL21 clones transformed with recombinant pET-28b-H. pylori ORFs are analyzed by PCR amplification of the cloned inserts using the same forward and reverse primers, specific for each H. pylori sequence, that were used in the original PCR amplification cloning reactions. Successful amplification verifies the integration of the H. pylori sequences in the expression vector (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., eds., 1994).
Isolation and Preparation of Nucleic Acids From Transformants
Individual clones of recombinant pET-28b vectors carrying properly cloned H. pylori ORFs are picked and incubated in 5 ml of LB broth plus 25 microgram/ml kanamycin sulfate overnight. The following day plasmid DNA is isolated and purified using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA).
Expression Of Recombinant H. Pylori Sequences In E. coli
The pET vector can be propagated in any E. coli K-12 strain, e.g. HMS174, HB101, JM109, DH5, etc., for the purpose of cloning or plasmid preparation. Hosts for expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase.
T7 RNA polymerase is induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying its gene of interest. Strains used include: BL21(DE3) (Studier, Rosenberg, A.H., Dunn, and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89).
To express recombinant H. pylori sequences, 50 nanograms of plasmid DNA isolated as described above is used to transform competent BL21(DE3) bacteria as described above (provided by Novagen as part of the pET expression system kit). The lacZ gene (beta-galactosidase) is expressed in the pET-System as described for the H. pylori recombinant constructions. Transformed cells are cultured in SOC medium for 1 hour, and the culture is then plated on LB plates containing 25 micrograms/ml kanamycin sulfate. The following day, bacterial colonies are pooled and grown in LB medium containing kanamycin sulfate (25 micrograms/ml) to an optical density at 600 nm of 0.5 to 1.0 O.D. units, at which point 1 millimolar IPTG is added to the culture for 3 hours to induce gene expression of the H. pylori recombinant DNA constructions. After induction of gene expression with IPTG, bacteria are pelleted by centrifugation in a Sorvall RC-3B centrifuge at 3500 x g for 15 minutes at 4°C. Pellets are resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE buffer). Cells are then centrifuged at 2000 x g for 20 min at 4°C. Wet pellets are weighed and frozen at -80°C until ready for protein purification.
III. Purification Of Recombinant Proteins From E. Coli
Analytical Methods
The concentrations of purified protein preparations are quantified spectrophotometrically using absorbance coefficients calculated from amino acid content (Perkins, S.J. 1986 Eur. J. Biochem. 157, 169-180). Protein concentrations are also measured by the method of Bradford, M.M. (1976) Anal. Biochem. 72, 248-254, and Lowry, Rosebrough, Farr, A.L. Randall, R.J. (1951) J. Biol. Chem. 193, pages 265-275, using bovine serum albumin as a standard.
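The idea of deriving an absorbance coefficient from amino acid content and then converting an absorbance reading into a concentration can be sketched as follows. This is an illustration only: it does not reproduce the coefficients of the Perkins method cited above, but substitutes the commonly quoted per-residue values at 280 nm for tryptophan, tyrosine and cystine (5500, 1490 and 125 per M per cm), and the example sequence in the usage note is hypothetical.

```python
# ASSUMED per-residue molar absorbance values at 280 nm (M^-1 cm^-1);
# a stand-in for, not a copy of, the coefficients in the cited method.
EPSILON_280 = {"W": 5500, "Y": 1490}
EPSILON_CYSTINE = 125


def molar_absorbance(sequence, disulfides=0):
    """Estimate the molar absorbance coefficient at 280 nm from composition."""
    sequence = sequence.upper()
    total = sum(sequence.count(res) * eps for res, eps in EPSILON_280.items())
    return total + disulfides * EPSILON_CYSTINE


def molar_concentration(a280, sequence, path_cm=1.0, disulfides=0):
    """Beer-Lambert law: concentration (mol/l) = A / (epsilon * path length)."""
    epsilon = molar_absorbance(sequence, disulfides)
    if epsilon == 0:
        raise ValueError("no Trp, Tyr or cystine: A280 cannot be used for this protein")
    return a280 / (epsilon * path_cm)
```

For a hypothetical tagged peptide MKWYYHHHHHH (one Trp, two Tyr), molar_absorbance gives 8480, so an A280 of 0.848 in a 1 cm cuvette corresponds to roughly 100 micromolar. Proteins lacking Trp and Tyr need one of the dye-based assays mentioned above instead.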
SDS-polyacrylamide gels (12% or 4.0 to 25% acrylamide gradient gels) are purchased from BioRad (Hercules, CA, USA), and stained with Coomassie blue. Molecular weight markers include rabbit skeletal muscle myosin (200 kDa), E. coli β-galactosidase (116 kDa), rabbit muscle phosphorylase B (97.4 kDa), bovine serum albumin (66.2 kDa), ovalbumin (45 kDa), bovine carbonic anhydrase (31 kDa), soybean trypsin inhibitor (21.5 kDa), egg white lysozyme (14.4 kDa) and bovine aprotinin kDa).
1. Purification of soluble proteins
All steps are carried out at 4°C. Frozen cells are thawed, resuspended in volumes of lysis buffer (20 mM Tris, pH 7.9, 0.5 M NaCl, 5 mM imidazole with glycerol, 0.1% β-mercaptoethanol, 200 ml lysozyme, 1 mM phenylmethylsulfonyl fluoride (PMSF), and 10 µg/ml each of leupeptin, aprotinin, pepstatin, L-1-chloro-3-[4-tosylamido]-7-amino-2-heptanone (TLCK), L-1-chloro-3-[4-tosylamido]-4-phenyl-2-butanone (TPCK), and soybean trypsin inhibitor), and ruptured by several passages through a small volume microfluidizer (Model M-110S, Microfluidics International Corporation, Newton, MA). The resultant homogenate is made 0.1% Brij 35, and centrifuged at 100,000 x g for 1 hour to yield a clear supernatant (crude extract).
Following filtration through a 0.8 µm Supor filter (Gelman Sciences, FRG), the crude extract is loaded directly onto a Ni2+-nitrilotriacetate-agarose (NTA) column with a milliliter bed volume (Hochuli, Döbeli, and Schacher, A. (1987) J. Chromatography 411, 177-184) pre-equilibrated in lysis buffer containing 10% glycerol, 0.1% Brij 35 and 1 mM PMSF. The column is washed with 250 ml (50 bed volumes) of lysis buffer containing 10% glycerol and 0.1% Brij 35, and bound proteins are eluted with sequential steps of lysis buffer containing 10% glycerol, 0.05% Brij 35, 1 mM PMSF, and either 20, 100, 200, or 500 mM imidazole. Fractions are monitored by absorbance at 280 nm, and peak fractions are analyzed by SDS-PAGE.
2. Purification of insoluble proteins from inclusion bodies
The following steps are carried out at 4°C. Cell pellets are resuspended in lysis buffer with 10% glycerol, 200 ml lysozyme, 5 mM EDTA, 1 mM PMSF and 0.1% β-mercaptoethanol. After passage through the cell disrupter, the resulting homogenate is made 0.2% deoxycholate, stirred 10 minutes, then centrifuged at 20,000 x g for 30 min.
The pellets are washed with lysis buffer containing 10% glycerol, 10 mM EDTA, 1% Triton X-100, 1 mM PMSF and 0.1% β-mercaptoethanol, followed by several washes with lysis buffer containing 1 M urea, 1 mM PMSF and 0.1% β-mercaptoethanol. The resulting white pellet is composed primarily of inclusion bodies, free of unbroken cells and membranous materials.
Dialysis and concentration of protein samples
Urea is removed slowly from the protein samples by dialysis against Tris-buffered saline (TBS; 10 mM Tris pH 8.0, 150 mM NaCl) containing 0.5% deoxycholate (DOC) with sequential reduction in urea as follows: 6 M, 4 M, 3 M, 2 M, 1 M, 0.5 M and finally TBS without any urea. Each dialysis step is conducted for a minimum of 4 hours at room temperature.
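The stepwise urea reduction above implies a minimum overall dialysis time, which the following sketch tabulates. It assumes that each listed urea concentration, including the final urea-free TBS, is a separate dialysis buffer held for exactly the 4 hour minimum; longer steps would only lengthen the schedule.

```python
UREA_STEPS_M = [6, 4, 3, 2, 1, 0.5, 0]   # molar urea in each successive dialysis buffer
MIN_HOURS_PER_STEP = 4


def dialysis_schedule(steps=UREA_STEPS_M, hours_per_step=MIN_HOURS_PER_STEP):
    """Return (start_hour, end_hour, urea_molar) for each dialysis buffer."""
    schedule = []
    start = 0
    for urea in steps:
        schedule.append((start, start + hours_per_step, urea))
        start += hours_per_step
    return schedule


def minimum_total_hours(steps=UREA_STEPS_M, hours_per_step=MIN_HOURS_PER_STEP):
    """Shortest possible duration if every step runs for the minimum time."""
    return len(steps) * hours_per_step


if __name__ == "__main__":
    for start, end, urea in dialysis_schedule():
        print(f"{start:2d}-{end:2d} h: TBS + 0.5% DOC, {urea} M urea")
    print("minimum total:", minimum_total_hours(), "h")
```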
After dialysis, samples are concentrated by pressure filtration using Amicon stirred-cells. Protein concentrations are measured using the methods of Perkins (1986 Eur. J. Biochem. 157, 169-180), Bradford ((1976) Anal. Biochem. 72, 248-254) and Lowry ((1951) J. Biol. Chem. 193, pages 265-275).
IV. Assessment Of The Antigenicity Of Outer Membrane Localized Antigens Of H. pylori
Purification of outer membranes from H. pylori can be performed essentially by following the protocol outlined below.
H. pylori strains J99 (ATCC# 55679) and Ah244 are grown on chocolate blood agar containing 5% (vol/vol) horse blood, at 37°C in an atmosphere containing CO2 for 48 h. Bacteria are harvested by suspension in 20 mM Tris, pH 7.5. The cells are collected by centrifugation at 12,000 x g for 20 min at 4°C and washed 3 times with mM Tris, pH 7.5. Cells are suspended in 20 mM Tris, pH 7.5 and broken by sonication on ice (eight bursts of 30 s at 60 watts with 60 s pauses between bursts).
DNase (0.1 mg) and RNase (0.5 mg) are added to the cell suspension, and the mixture is incubated for 30 minutes at room temperature. The cell suspension is centrifuged at 12,000 x g for 20 min at 4°C. The supernatant is retained and centrifuged again. Total membranes are collected from the supernatant by centrifugation at 40,000 x g for minute, at 4°C. The pellet is washed twice in 20 mM Tris, pH 7.5. The protein content is assayed using the Bradford protein assay, with bovine serum albumin (BSA) as a standard. The suspension is then adjusted to 1 mg protein/ml. The membranes are solubilized by adding N-lauryl-sarcosine to this suspension in a ratio of 6 mg of N-lauryl-sarcosine per mg of protein. The suspension is incubated for minutes at room temperature in the presence of N-lauryl-sarcosine. Outer membranes are collected by centrifugation at 40,000 x g for 30 minutes at 4°C. The pellet is washed 3 times with Milli Q quality water, aliquoted and stored at -20°C until use.
Identification Of Outer Membrane Antigens of H. pylori
Outer membrane antigens can be identified using the protocol outlined below.
Proteins are separated on sodium dodecylsulfate polyacrylamide gels (SDS-PAGE) according to the method described by Laemmli, U.K. (1970) Nature (London) Volume 227, 680-685. Samples are prepared by suspension in standard treatment buffer and heated at 100°C for 10 min. Approximately 1-5 mg of protein is loaded per well on 8 x 10 cm minigels (0.75 mm). The separated proteins are then transferred to PVDF membranes as described below.
Electroblotting of separated proteins to PVDF membranes is performed in a Bio-Rad Mini-Trans Blot Electrophoretic Transfer cell. The PVDF membrane Immobilon-PSQ is employed. Electroblotting is carried out for 60 min at 50 V using CAPS transfer buffer (10 mM 3-[cyclohexylamino]-1-propanesulfonic acid, 10% methanol). The membrane is stained with 0.2% Ponceau S and destained with Milli Q quality water.
Antigens within the preparation are then identified using western immunoblotting. After electroblotting, non-specific binding sites of the PVDF membrane are blocked with 5% non-fat dry milk in 10 mM Tris-HCl-0.9% NaCl, pH. The membrane is incubated with an appropriate dilution of normal mouse serum in mM Tris-HCl-0.9% NaCl-0.5% Tween 20-0.5% BSA, pH 7.5, for 2 h at 37°C and then washed three times with 10 mM Tris-HCl-0.9% NaCl-0.5% Tween 20, pH (TTBS). Alkaline phosphatase conjugated anti-mouse Ig from goat is then added in mM Tris-HCl-0.9% NaCl-0.5% Tween 20-0.5% BSA, pH 7.5 and incubated for 1 h at room temperature. After this incubation, the membrane is washed three times in TTBS.
The reactive bands are revealed using 5-bromo-4-chloro-3-indolyl phosphate (Bio-Rad) as the Alkaline phosphatase substrate and Nitro Blue Tetrazolium (Bio-Rad) as the color development reagent.
For amino acid microsequencing, proteins that are identified as immunoreactive are cut from fresh, unreacted Immobilon membranes and microsequenced at the Worcester Foundation microsequencing facility. Membranes from which the protein bands are cut are then subjected to western immunoblot as described above to confirm that the appropriate band has been excised.
V. Analysis Of H. Pylori Proteins As Vaccine Candidates
To investigate the immunomodulatory effect of H. pylori proteins, a mouse/H. pylori model was used. This model mimics the human H. pylori infection in many respects. The focus is on the effect of oral immunization in H. pylori infected animals in order to test the concept of therapeutic oral immunotherapy.
Animals
Female SPF BALB/c mice were purchased from Bomholt Breeding center (Denmark). They were kept in ordinary Makrolon cages with free supply of water and food. The animals were 4-6 weeks old at arrival.
Infection
After a minimum of one week of acclimatization, the animals were infected with a type 2 strain (VacA negative) of H. pylori (strain 244, originally isolated from an ulcer patient). In our hands, this strain has earlier proven to be a good colonizer of the mouse stomach. The bacteria were grown overnight in Brucella broth supplemented with 10% fetal calf serum, at 37°C in a microaerophilic atmosphere (10% CO2, 5% O2). The animals were given an oral dose of omeprazole (400 µmol/kg) and, 3-5 h after this, an oral inoculation of H. pylori in broth (approximately 10^8 cfu/animal). Positive take of the infection was checked in some animals 2-3 weeks after the inoculation.
Antigens
Recombinant H. pylori antigens were chosen based on their association with externally exposed H. pylori cell membrane. These antigens were selected from the following groups: Outer Membrane proteins; Periplasmic/Secreted proteins; Outer Surface proteins; and Inner Membrane proteins. All recombinant proteins were constructed with a hexa-His tag for purification reasons, and the non-Helicobacter pylori control protein (β-galactosidase from E. coli; LacZ) was constructed in the same way.
All antigens were given in a soluble form, i.e. dissolved in either a HEPES buffer or in a buffer containing 0.5% Deoxycholate (DOC).
The antigens are listed in Table 8 below.
Table 8
Helicobacter pylori proteins

Outer membrane proteins
  Protein 1
  Protein 2
  Protein 3
  Protein 4
  Protein
Periplasmic/Secreted proteins
  Protein 6
Other cell envelope proteins
  Protein 7
  Protein 8
Flagella-associated proteins
  Protein 9
Control proteins
  β-galactosidase (LacZ)

Immunizations
Ten animals in each group were immunized 4 times over a 34 day period (day 1, 25 and 35). Purified antigens in solution or suspension were given at a dose of 100 µg/mouse. As an adjuvant, the animals were also given 10 µg/mouse of Cholera toxin (CT) with each immunization. Omeprazole (400 µmol/kg) was given orally to the animals 3-5 h prior to immunization as a way of protecting the antigens from acid degradation. Infected control animals received HEPES buffer + CT or DOC buffer + CT. Animals were sacrificed 2-4 weeks after final immunization. A general outline of the study is shown in Table 9 below.
Table 9
Study outline, therapeutic immunization: Mice were all infected with H. pylori strain Ah244 at day

Substance                            Mouse strain (n=10)   Dose/mouse   Dates for dosing
1.  Controls, PBS                    BALB/c                0.3 ml       0, 14, 24, 34
2.  Cholera toxin, 10 µg             BALB/c                0.3 ml       0, 14, 24, 34
3.  Protein 1, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
4.  Protein 5, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
5.  Protein 10, 100 µg + CT 10 µg    BALB/c                0.3 ml       0, 14, 24, 34
6.  Protein 9, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
7.  Protein 2, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
8.  Protein 6, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
9.  Protein 4, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
10. Protein 7, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
11. Protein 8, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34
12. Protein 3, 100 µg + CT 10 µg     BALB/c                0.3 ml       0, 14, 24, 34

Analysis of infection
Mucosal infection: The mice were sacrificed by CO2 and cervical dislocation.
The abdomen was opened and the stomach removed. After cutting the stomach along the greater curvature, it was rinsed in saline. The mucosa from the antrum and corpus of an area of 25 mm² was scraped separately with a surgical scalpel. The mucosa scraping was suspended in Brucella broth and plated onto Blood Skirrow selective plates. The plates were incubated under microaerophilic conditions for 3-5 days and the number of colonies was counted. The identity of H. pylori was ascertained by urease and catalase test and by direct microscopy or Gram staining.
The urease test was performed essentially as follows. The reagent, Urea Agar Base Concentrate, was purchased from DIFCO Laboratories, Detroit, MI (Catalog 0284-61-3). Urea agar base concentrate was diluted 1:10 with water. 1 ml of the diluted concentrate was mixed with 100-200 µl of actively growing H. pylori cells.
Color change to magenta indicated that cells were urease positive.
The catalase test was performed essentially as follows. The reagent, Tetramethyl-p-Phenylenediamine, was purchased from Sigma, St. Louis, MO (Catalog T3134). A solution of the reagent (1% w/v in water) was prepared. H. pylori cells were swabbed onto Whatman filter paper and overlaid with the 1% solution. Color change to dark blue indicated that the cells were catalase positive.
Serum antibodies: Serum was prepared from all mice from blood drawn by heart puncture. Serum antibodies were identified by regular ELISA techniques, where the specific antigens of Helicobacter pylori were plated.
Mucosal antibodies: Gentle scrapings of a defined part of the corpus and of 4 cm of duodenum were performed in 50% of the mice in order to detect the presence of antibodies in the mucus. The antibody titers were determined by regular ELISA technique as for serum antibodies.
Statistical analysis: Wilcoxon-Mann-Whitney sign rank test was used for determination of significant effects of the antigens on Helicobacter pylori colonization.
P<0.05 was considered significant. Because the antrum is the major colonization site for Helicobacter, most emphasis was put upon changes in the antral colonization.
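A rank-based comparison of colonization between an immunized group and a control group can be sketched as below. The sketch assumes the two-sample Mann-Whitney (Wilcoxon rank-sum) form of the test, since the groups are independent, and uses a normal approximation with tie and continuity corrections rather than exact tables; the colony counts in the usage note are invented for illustration and are not study data.

```python
import math


def midranks(values):
    """Rank values from 1..n, giving tied values the average of their ranks."""
    order = sorted(range(len(values)), key=values.__getitem__)
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = average
        i = j + 1
    return ranks


def mann_whitney_u(x, y):
    """Return (U, two-sided p) using the normal approximation."""
    pooled = list(x) + list(y)
    n1, n2, n = len(x), len(y), len(x) + len(y)
    ranks = midranks(pooled)
    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)
    # Tie correction for the variance of U.
    counts = {}
    for value in pooled:
        counts[value] = counts.get(value, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    variance = n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1)))
    if variance == 0:
        return u, 1.0
    z = max(abs(u - n1 * n2 / 2) - 0.5, 0.0) / math.sqrt(variance)
    return u, math.erfc(z / math.sqrt(2))
```

With hypothetical antral counts such as control = [120, 95, 150, 110, 130, 140, 105, 125, 135, 115] and treated = [10, 0, 25, 5, 40, 0, 15, 30, 20, 8], the two groups do not overlap, U is 0 and the p value is well below the 0.05 threshold used here. For groups of 8-10 animals an exact test would be preferable where a statistics package is available.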
Results
Antibodies in sera: All antigens tested given together with CT gave rise to a measurable specific titer in serum. The highest responses were seen with Proteins 3, 4, 9, 1, and 7 (see Figure 1).
Antibodies in mucus: In the mucus scrapings, specific antibodies against all antigens tested were seen. By far the strongest response was seen with Protein 6, followed by 1, 3, and 9 (see Figure 2).
Therapeutic immunization effects: All control animals (BALB/c mice) were well colonized with H. pylori (strain AH244) in both antrum and corpus of the stomach. Of the antigens tested, 3 proteins (Proteins 4, 7, and 1) gave a good and significant reduction and/or eradication of the H. pylori infection. The degree of colonization of the antrum was lower following immunization with Proteins 8, 9, and 3 compared to control. The effect of Proteins 5, 2, and 6 did not differ from control. The control protein lacZ, i.e. the non-H. pylori protein, had no eradication effect and in fact showed higher Helicobacter colonization than the HEPES CT control. All data are shown in Figures 3 and 4 for proteins dissolved in HEPES and DOC respectively. Data are shown as geometric mean values. n = 8-10; Wilcoxon-Mann-Whitney sign rank test, p<0.05; x/10 = number of mice showing eradication of H. pylori over the total number of mice examined.
The data presented indicate that all of the H. pylori associated proteins included in this study, when used as oral immunogens in conjunction with the oral adjuvant CT, resulted in stimulation of an immune response as measured by specific serum and mucosal antibodies. A majority of the proteins led to a reduction, and in some cases complete clearance, of the colonization of H. pylori in this animal model. It should be noted that the reduction or clearance was due to heterologous protection rather than homologous protection (the polypeptides were based on the H. pylori J99 strain sequence and used in the therapeutic immunization studies against a different (AH244) challenge strain), indicating the vaccine potential against a wide variety of H. pylori strains.
The highest colonization in the antrum was seen in animals treated with the non-Helicobacter protein LacZ, indicating that the effects seen with the Helicobacter pylori antigens were specific.
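Because the colonization data are summarized as geometric means, a short sketch of that summary may help. The counts below are hypothetical and the function name is an assumption of this example; how zero counts (eradicated animals) were handled is not stated in the text, so the sketch simply requires positive values.

```python
from math import exp, log


def geometric_mean(counts):
    """Geometric mean of strictly positive values: exp of the mean of the logs."""
    if not counts or any(c <= 0 for c in counts):
        raise ValueError("geometric mean needs a non-empty list of positive values")
    return exp(sum(log(c) for c in counts) / len(counts))


# Hypothetical colony counts spanning several orders of magnitude.
print(geometric_mean([1, 10, 100]))
```

The geometric mean is less dominated by a few heavily colonized animals than the arithmetic mean, which is a common reason for using it with colony counts.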
Taken together these data strongly support the use of these H. pylori proteins in a pharmaceutical formulation for the use in humans to treat and/or prevent H. pylori infections.
VI. Sequence Variance Analysis of genes in Helicobacter pylori strains
Four genes were cloned and sequenced from several strains of H. pylori to compare the DNA and deduced amino acid sequences. This information was used to determine the sequence variation between the H. pylori strain, J99, and other H. pylori strains isolated from human patients.
Preparation of Chromosomal DNA.
Cultures of H. pylori strains (as listed in Table 12) were grown in BLBB (1% Tryptone, 1% Peptamin, 0.1% Glucose, 0.2% Yeast Extract, 0.5% Sodium Chloride, Fetal Bovine Serum) to an OD600 of 0.2. Cells were centrifuged in a Sorvall RC-3B at 3500 x g at 4°C for 15 minutes and the pellet resuspended in 0.95 ml of 10 mM Tris-HCl, 0.1 mM EDTA. Lysozyme was added to a final concentration of 1 mg/ml, along with SDS to 1% and RNase A and T1 to 0.5 mg/ml and 5 units/ml respectively, and the sample was incubated at 37°C for one hour. Proteinase K was then added to a final concentration of 0.4 mg/ml and the sample was incubated at 55°C for more than one hour. NaCl was added to the sample to a concentration of 0.65 M, mixed carefully, and 0.15 ml of CTAB in 0.7 M NaCl (final is 1% CTAB/70 mM NaCl) was added, followed by incubation at 65°C for 20 minutes. At this point, the samples were extracted with chloroform:isoamyl alcohol, extracted with phenol, and extracted again with chloroform:isoamyl alcohol. DNA was precipitated with either EtOH (1.5 x volumes) or isopropanol (0.6 x volumes) at -70°C for 10 minutes, washed in 70% EtOH and resuspended in TE.
PCR Amplification and cloning.
Genomic DNA prepared from twelve strains of Helicobacter pylori was used as the source of template DNA for PCR amplification reactions (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To amplify a DNA sequence containing an H. pylori ORF, genomic DNA (10 nanograms) was introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers (forward and reverse primers, see Table 10) complementary to and flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and 0.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of microliters in duplicate reactions.
Table 10. Oligonucleotide primers used for PCR amplification of H. pylori DNA sequences.

Outer membrane proteins, with forward primer (5' to 3') and reverse primer (5' to 3'):

Protein 11 (for strains AH4, AH15, AH61, 5294, 5640, AH18, and AH244)
Forward primer: 5'-TTAACCATGGTGAAAAGC GATA-3' (SEQ ID NO:1091)
Reverse primer: CTTTAG-3' (SEQ ID NO:1092)

Protein 11 (for strains 5155, 7958, AH24, and J99)
Forward primer: 5'-TTAACCATGGTGAAAAGC GATA-3' (SEQ ID NO:1093)
Reverse primer: CAATC-3' (SEQ ID NO:1094)

Protein 12
Forward primer: 5'-ATATCCATGGTGAGTTTG ATGA-3' (SEQ ID NO:1095)
Reverse primer: TTGCCA-3' (SEQ ID NO:1096)

Protein 13
Forward primer: 5'-AATTCCATGGCTATCCAA ATCCG-3' (SEQ ID NO:1097)
Reverse primer: GTAGTATT-3' (SEQ ID NO:1098)

Protein 14
Forward primer: 5'-GATACCATGGAATTTATGA AAAAG-3' (SEQ ID NO:1099)
Reverse primer: AGTTATAC-3' (SEQ ID NO:1100)

The following thermal cycling conditions were used to obtain amplified DNA products for each ORF using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:
Sequences for Proteins 12 and 14: Denaturation at 94°C for 2 min; 2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min; 23 cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min. Reactions were concluded at 72°C for 6 minutes.
Sequence for Protein 11 for strains AH5, 5155, 7958, AH24, and J99: Denaturation at 94°C for 2 min; 2 cycles at 94°C for 15 sec, 30°C for 15 sec and 72°C for 1.5 min; cycles at 94°C for 15 sec, 55°C for 15 sec and 72°C for 1.5 min. Reaction was concluded at 72°C for 6 minutes.
Sequences for Protein 11 and Protein 13 for strains AH4, AH15, AH61, 5294, 5640, AH18, and Hp244: Denaturation at 94°C for 2 min; 2 cycles at 94°C for 15 sec, 30°C for 20 sec and 72°C for 2 min; cycles at 94°C for 15 sec, 55°C for 20 sec and 72°C for 2 min. Reactions were concluded at 72°C for 8 minutes.
Upon completion of thermal cycling reactions, each pair of samples was combined and used directly for cloning into the pCR cloning vector as described below.
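A cycling program of this kind can be written down as data and its programmed hold time summed. The sketch below does so for the Protein 12 and 14 program, the only one above whose cycle counts are all legible. The function name and data layout are assumptions of this example, and temperature ramp times are ignored, so real run times would be longer.

```python
def program_seconds(initial_hold, cycle_blocks, final_hold):
    """Sum the programmed hold times of a PCR program, in seconds.

    cycle_blocks is a list of (number_of_cycles, (step_seconds, ...)) tuples.
    """
    cycling = sum(n * sum(steps) for n, steps in cycle_blocks)
    return initial_hold + cycling + final_hold


# Proteins 12 and 14: 2 min denaturation, 2 low-stringency cycles,
# 23 cycles at 55 degrees C annealing, 6 min final extension.
proteins_12_14 = program_seconds(
    initial_hold=120,
    cycle_blocks=[(2, (15, 15, 90)), (23, (15, 15, 90))],
    final_hold=360,
)
print(proteins_12_14 / 60, "minutes of programmed holds")
```

For this program the holds add up to 3480 seconds, i.e. 58 minutes before ramping is taken into account.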
Cloning of H. pylori DNA sequences into the pCR TA cloning vector.
All amplified inserts were cloned into the pCR 2.1 vector by the method described in the Original TA cloning kit (Invitrogen, San Diego, CA). Products of the ligation reaction were then used to transform the TOP10F' (INVαF' in the case of H. pylori sequence 350) strain of E. coli as described below.
Transformation of competent bacteria with recombinant plasmids
Competent bacteria, E. coli strain TOP10F' or E. coli strain INVαF', were transformed with recombinant pCR expression plasmids carrying the cloned H. pylori sequences according to standard methods (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). Briefly, 2 microliters of micromolar BME was added to each vial of 50 microliters of competent cells.
Subsequently, 2 microliters of ligation reaction was mixed with the competent cells and incubated on ice for 30 minutes. The cells and ligation mixture were then subjected to a "heat shock" at 42°C for 30 seconds and placed on ice for an additional 2 minutes, after which samples were incubated in 0.45 milliliters SOC medium (yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB agar plates containing 25 micrograms/ml kanamycin sulfate or 100 micrograms/ml ampicillin for growth overnight. Transformed colonies of TOP10F' or INVαF' were then picked and analyzed to evaluate cloned inserts as described below.
Identification of recombinant PCR plasmids carrying H. pylori sequences
Individual TOP10F' or INVαF' clones transformed with recombinant pCR-H. pylori ORFs were analyzed by PCR amplification of the cloned inserts using the same forward and reverse primers, specific for each H. pylori sequence, that were used in the original PCR amplification cloning reactions. Successful amplification verified the integration of the H. pylori sequences in the cloning vector (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994).
Individual clones of recombinant pCR vectors carrying properly cloned H. pylori ORFs were picked for sequence analysis. Sequence analysis was performed on ABI Sequencers using standard protocols (Perkin Elmer) using vector-specific primers (as found in pCRII or pCR2.1, Invitrogen, San Diego, CA) and sequencing primers specific to the ORF as listed in Table 11 below.
Table 11. Oligonucleotide primers used for sequencing of H. pylori DNA sequences.

Outer membrane proteins, with forward primers (5' to 3') and reverse primers (5' to 3'):

Protein 11
Forward primers:
5'-CCCTTCATTTJTAGAAATCG-3' (SEQ ID NO:1101)
5'-ATTTCAACCAATTCAATGCG-3' (SEQ ID NO:1102)
5'-GCCCCTTTTGATTTGAAGCT-3' (SEQ ID NO:1103)
AAGT-3' (SEQ ID NO:1104)
ATCG-3' (SEQ ID NO:1105)
AGT-3' (SEQ ID NO:1106)
GT-3' (SEQ ID NO:1107)
Reverse primers:
5'-CTTTGGGTAAAAACGCATC-3' (SEQ ID NO:1108)
5'-CGATCTTTGATCCTAATTCA-3' (SEQ ID NO:1109)
(SEQ ID NO:1110)

Protein 12
Forward primers:
5'-TTGAACACTTTTGATTATG CGG-3' (SEQ ID NO:1111)
(SEQ ID NO:1112)
Reverse primers:
3' (SEQ ID NO:1113)
3' (SEQ ID NO:1114)

Protein 13
Forward primers:
5'-CTTATGGGGGTATTGTC-3' (SEQ ID NO:1115)
5'-AGCATGTGGGTATCCAGC-3' (SEQ ID NO:1116)
Reverse primers:
5'-AGGTTGTTGCCTAAAGACT-3' (SEQ ID NO:1117)
5'-CTGCCTCCACCTTTGATC-3' (SEQ ID NO:1118)

Protein 14
Forward primers:
5'-ACCAATATCAATTGGCACT-3' (SEQ ID NO:1119)
5'-ACTTGGAAAAGCTCTGCA-3' (SEQ ID NO:1120)
AG-3' (SEQ ID NO:1123)
(SEQ ID NO:1124)
Reverse primers:
5'-CTTGCTTGTCATATCTAGC-3' (SEQ ID NO:1121)
5'-GTTGAAGTGTTGGTGCTA-3' (SEQ ID NO:1122)
3' (SEQ ID NO:1125)
TTGTC-3' (SEQ ID NO:1126)

Vector primers
5'-GTAAAACGACGGCCAG-3' (SEQ ID NO:1127)
5'-CAGGAAACAGCTATGAC-3' (SEQ ID NO:1128)

Results
To establish the PCR error rate in these experiments, five individual clones of Protein 11, prepared from five separate PCR reaction mixtures from H. pylori strain J99, were sequenced over a total length of 897 nucleotides for a cumulative total of 4485 bases of DNA sequence. DNA sequence for the five clones was compared to the DNA sequence of Protein 11 obtained previously by a different method, random shotgun cloning and sequencing. The PCR error rate for the experiments described herein was determined to be 2 base changes out of 4485 bases, which is equivalent to an estimated error rate of less than or equal to 0.04%.
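The error-rate arithmetic above can be reproduced in a few lines. This is only a check of the stated figures (5 clones, 897 nucleotides each, 2 base changes); the function name is an assumption of this example.

```python
def pcr_error_rate(base_changes, clones, nucleotides_per_clone):
    """Return (total bases sequenced, error rate in percent)."""
    total_bases = clones * nucleotides_per_clone
    return total_bases, 100.0 * base_changes / total_bases


total, rate_percent = pcr_error_rate(base_changes=2, clones=5, nucleotides_per_clone=897)
print(total, round(rate_percent, 3))
```

The total is 4485 bases and the rate comes out at roughly 0.045%, in line with the rounded figure of about 0.04% quoted in the text.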
DNA sequence analysis was performed on four different open reading frames identified as genes and amplified by PCR methods from a dozen different strains of the bacterium Helicobacter pylori. The deduced amino acid sequences of three of the four open reading frames that were selected for this study showed statistically significant BLAST homology to defined proteins present in other bacterial species. Those ORFs included: Protein 11, homologous to the val A B genes encoding an ABC transporter in F. novicida; Protein 12, homologous to lipoprotein e (P4) present in the outer membrane of H. influenzae; Protein 13, homologous to fecA, an outer membrane receptor in iron (III) dicitrate transport in E. coli. Protein 14 was identified as an unknown open reading frame, because it showed low homology with sequences in the public databases.
To assess the extent of conservation or variance in the ORFs across various strains of H. pylori, changes in DNA sequence and the deduced protein sequence were compared to the DNA and deduced protein sequences found in the J99 strain of H. pylori (see Table 12 below). Results are presented as percent identity to the J99 strain of H. pylori sequenced by random shotgun cloning. To control for any variations in the J99 sequence, each of the four open reading frames was cloned and sequenced again from the J99 bacterial strain and that sequence information was compared to the sequence information that had been collected from inserts cloned by random shotgun sequencing of the J99 strain. The data demonstrate that there is variation in the DNA sequence ranging from as little as 0.12% difference (Protein 14, J99 strain) to approximately 7% change (Protein 11, strain AH5). The deduced protein sequences show either no variation (Protein 14, strains AH18 and AH24) or up to as much as 7.66% amino acid changes (Protein 11, strain AH5).

TABLE 12
Multiple Strain DNA Sequence Analysis of H. pylori Vaccine Candidates

J99 Protein                  11        11        12        12        13        13        14        14
Length of region sequenced   248 a.a.  746 nt.   232 a.a.  696 nt.   182 a.a.  548 nt.   273 a.a.  819 nt.
Strain tested                AA        Nuc.      AA        Nuc.      AA        Nuc.      AA        Nuc.
                             identity  identity  identity  identity  identity  identity  identity  identity
J99                          100.00%   100.00%   100.00%   100.00%   100.00%   100.00%   99.63%    99.88%
AH244                        95.16%    95.04%    n.d.      n.d.      99.09%    96.71%    98.90%    96.45%
AH4                          95.97%    95.98%    97.84%    95.83%    n.d.      n.d.      97.80%    95.73%
AH5                          92.34%    93.03%    98.28%    96.12%    98.91%    96.90%    98.53%    95.73%
                             95.16%    94.91%    97.41%    95.98%    99.82%    97.99%    99.63%    96.09%
AH61                         n.d.      n.d.      97.84%    95.98%    99.27%    97.44%    n.d.      n.d.
AH24                         94.75%    95.04%    97.84%    95.40%    99.27%    96.71%    100.00%   96.46%

n.d. = not done

VII. Experimental Knock-Out Protocol for the Determination of Essential H. pylori Genes as Potential Therapeutic Targets
Therapeutic targets were chosen from genes whose protein products appear to play key roles in essential cell pathways such as cell envelope synthesis, DNA synthesis, transcription, translation, regulation and colonization/virulence.
The protocol for the deletion of portions of H. pylori genes/ORFs and the insertional mutagenesis of a kanamycin-resistance cassette was modified from previously published methods (Labigne-Roussel et al., 1988, J. Bacteriology 170, pp. 1704-1708; Cover et al., 1994, J. Biological Chemistry 269, pp. 10566-10573; Reyrat et al., 1995, Proc. Natl. Acad. Sci. 92, pp. 8768-8772).
Identification and Cloning of H. pylori Gene Sequences
The sequences of the genes or ORFs (open reading frames) selected as knock-out targets were identified from the H. pylori genomic sequence and used to design primers to specifically amplify the genes/ORFs. All synthetic oligonucleotide primers (Table 13) were designed with the aid of the OLIGO program (National Biosciences, Inc., Plymouth, MN 55447, USA), and were purchased from Gibco/BRL Life Technologies (Gaithersburg, MD, USA). Specific primers (F1 and R1) were chosen which flanked most or all of the ORF, depending on its size. If the ORF was smaller than 800 to 1000 base pairs, flanking primers were chosen outside of the open reading frame.
TABLE 13
Oligonucleotide Sequences for Knock-Out Genes/ORFs
Each entry gives the gene name and accession number, followed by the cloning primers (F1, R1), the deletion-creating primers (F2, R2) and the targeting primers (F3, R3).

rnh (P23329)
F1: TTGCCCCATCGTATTGATAGA (SEQ ID NO:1129)
R1: AGAGCGTATTTCACCCGAAAG (SEQ ID NO:1130)
F2: TCTTGCATCTTAATCCACTCC (SEQ ID NO:1131)
R2: CGGGTCAAAACGACCACTTAA (SEQ ID NO:1132)
F3: AATCCGTTTCGCTAATTTAGT (SEQ ID NO:1133)
R3: AACACTTCAATTTCCTCTATA (SEQ ID NO:1134)

ppiB (P29820)
F1: TGGTATAAGGATTTGAATGGA (SEQ ID NO:1135)
R1: TTGACTAAACACATGCGAGAA (SEQ ID NO:1136)
F2: ATAGAGAGCGTTGTGTTTAGC (SEQ ID NO:1137)
R2: CCTTTATTGGTTTTGATCGTG (SEQ ID NO:1138)
F3: ATGTCCGTTGTCTGTATGGAA (SEQ ID NO:1139)
R3: TAGGGTGTCTAGGGATTUGAT (SEQ ID NO:1140)

tsf (P34828)
F1: GCGTTTGGCTTCTTCGTTGTC (SEQ ID NO:1141)
R1: GAAATGGAAAATAGCGGTCAA (SEQ ID NO:1142)
F2: GCAAATCCCCAGCCACTTCC (SEQ ID NO:1143)
R2: GTGGCTAAAAATGAGGGCTT (SEQ ID NO:1144)
F3: GTTAGGAAATTAGAAATCATTG (SEQ ID NO:1145)
R3: GCTAAAACTTCATCGCTCAAT (SEQ ID NO:1146)

MurD (P14900)
F1: GTTGGGCAGAAAATAAGGTGA (SEQ ID NO:1147)
R1: CAAACAAACCTGACAAGAAAC (SEQ ID NO:1148)
F2: CATTGATGCCTAAAACTTCG (SEQ ID NO:1149)
R2: CGTGGTGGTTTTCCCGTTAG (SEQ ID NO:1150)
F3: GGGGCATTGTGTFTGTTTTT (SEQ ID NO:1151)
R3: TGGTCTATCATGCGAATTAT (SEQ ID NO:1152)

MurE (P22188)
F1: GCGTTTGGGGATTTGATGTTC (SEQ ID NO:1153)
R1: CGCGCTAGAGGCTTGTAAAA (SEQ ID NO:1154)
F2: GCCCTGATCCATTCCCCCCT (SEQ ID NO:1155)
R2: CTGTTTTTAGCGTCCCTGTA (SEQ ID NO:1156)
F3: GGCGTTATTAAGCGACATCG (SEQ ID NO:1157)
R3: TTTCACCGGCAATTTTAGGC (SEQ ID NO:1158)

AlgA (P07874)
F1: GCGTTTTGAUTCTGTCTGTTA (SEQ ID NO:1159)
R1: GTAAAAACACCGCTAACGCAT (SEQ ID NO:1160)
F2: GCGTGTTTTCTAAGGGTTCA (SEQ ID NO:1161)
R2: GGAATTTTAACGCTCTTTTT (SEQ ID NO:1162)
F3: AAATCTCTGTGGGCTTAGTG (SEQ ID NO:1163)
R3: AATCAAAAACAAGAGCGTGG (SEQ ID NO:1164)

metL (P19358)
F1: GCCCCAGCCCCATAATACAAA (SEQ ID NO:1165)
R1: GGAGGGCGCAATTAAACATCG (SEQ ID NO:1166)
F2: TCACGCTTTCTAAATCATCA (SEQ ID NO:1167)
R2: CGCTAATCACATCCTTTCTT (SEQ ID NO:1168)
F3: TGCCCAAAAATCCACTAACG (SEQ ID NO:1169)
R3: AACGGGTTTGACACTGATGA (SEQ ID NO:1170)

fusA (X16278)
F1: GAATrGCGGTGGTrTTTAGAGAG (SEQ ID NO:1171)
R1: GCGTTTTTAAGACTGAATACA (SEQ ID NO:1172)
F2: GGGCGATGTGATTGGCGATT (SEQ ID NO:1173)
R2: GGATAGCCTGCCAAAACGCC (SEQ ID NO:1174)
F3: AAGTTTATGCGGGCGAGATT (SEQ ID NO:1175)
R3: GGAGCAATCAGCCATTTTTC (SEQ ID NO:1176)

FlgE (U09549)
F1: CTAGCGATTCAAGGCGATGG (SEQ ID NO:1177)
R1: CGGCCTCCTTCAAACACAIT (SEQ ID NO:1178)
F2: AGCGGGCAGTTTAGGACCAC (SEQ ID NO:1179)
R2: GCATTGATCGCATTTTTAGCC (SEQ ID NO:1180)
F3: ACGGGTTAGCAGGGCAGAAT (SEQ ID NO:1181)
R3: CAAAAGAGGCGGGTTCATGC (SEQ ID NO:1182)

FliM, set A (M37691)
F1: TTTAGAAGTCGTTGATGAGA (SEQ ID NO:1183)
R1: CATACACGCTCACTTCATCG (SEQ ID NO:1184)
F2: AGTGTGGTCGCCTGTGGTGGAG (SEQ ID NO:1185)
R2: CCCCTAATAGTCTGTCAATCAT (SEQ ID NO:1186)
F3: CAAAAGATTGAAGCAGAAGAGT (SEQ ID NO:1187)
R3: AATGGTTTTCCTATACCCTTGA (SEQ ID NO:1188)

FliM, set B (M37691)
F1: GAGAGCAAATCCTTATCCAG (SEQ ID NO:1189)
R1: CATACACGCTCACTTCATCG (SEQ ID NO:1190)
F2: AGTGTGGTCGCCTGTGGTGGAG (SEQ ID NO:1191)
R2: CCCCTAATAGTCTGTCAATCAT (SEQ ID NO:1192)
F3: CAAAAGATTGAAGCAGAAGAGT (SEQ ID NO:1193)
R3: AATGGTTTTCCTATACCCTTGA (SEQ ID NO:1194)

MurC (U32794)
F1: TTGAAACCCCAAAAGTTTTAC (SEQ ID NO:1195)
R1: TCAACTGATAGGTAATATCCC (SEQ ID NO:1196)
F2: GCTAGGATTTATGCCAAT11'A (SEQ ID NO:1197)
R2: TACGAGACAAAATAGGGATTT (SEQ ID NO:1198)
F3: ATAGGCATGCAGAATTTTTCC (SEQ ID NO:1199)
R3: CCATTACATTTCGCCTC (SEQ ID NO:1200)

dnaE (M10040)
F1: CGATAGATATTGTAGAAGTCA (SEQ ID NO:1201)
R1: GGGCTTGTATTCATTTTGTAA (SEQ ID NO:1202)
F2: GTTTTAAAAACGCCATAGCCA (SEQ ID NO:1203)
R2: TTCTAAAAGGTGGTAATCTTC (SEQ ID NO:1204)
F3: TAAGTCAAGCCATAAAACCAAA (SEQ ID NO:1205)
R3: TTTTGGGGTAAAAAGGCTGAA (SEQ ID NO:1206)

SerS (X05017)
F1: ATCTTTTTGCCCTTGCTCATA (SEQ ID NO:1207)
R1: AGACAGCACCAGTTTGATAAA (SEQ ID NO:1208)
F2: CAGCCACACTTCAATGTCTAT (SEQ ID NO:1209)
R2: GTAAGGCGTTAGAAAAATACC (SEQ ID NO:1210)
F3: GCCCCATTAAAATCCTTTTCT (SEQ ID NO:1211)
R3: AAAGGATACAAGGGGGA (SEQ ID NO:1212)

gly (P00960)
F1: CTCGCTCCATTTTATCTTTTA (SEQ ID NO:1213)
R1: TTTTTTAGGGAGGATTGAGAT (SEQ ID NO:1214)
F2: TGTTTGGAAATGCTGGTGATC (SEQ ID NO:1215)
R2: CTTTTGGGGGAGTTTGACAAG (SEQ ID NO:1216)
F3: TTTGATAAACGCCCACTTTTT (SEQ ID NO:1217)
R3: TTTCAAAACGCTACCTTTTG (SEQ ID NO:1218)

Gltx (L14580)
F1: TCTATFTCTTTrTGATGCTCTCT (SEQ ID NO:1219)
R1: ATAATGAGTITTGATCGTTACG (SEQ ID NO:1220)
F2: ACAATAATAGGCTTTGTCTTC (SEQ ID NO:1221)
R2: AATTAGCCCTTAAAATAGATG (SEQ ID NO:1222)
F3: AACAACCGCTAAAATCAAAC (SEQ ID NO:1223)
R3: CTTCAGCGATACTAAAAGAT (SEQ ID NO:1224)

Sig28 (fliA) (M37691)
F1: TAGGGGCGATTGAAAACAGC (SEQ ID NO:1225)
R1: GCTGGATAAGGATTTGCTCT (SEQ ID NO:1226)
F2: TTTTTGGGGGTATGCTAAAA (SEQ ID NO:1227)
R2: GGCTGGTAAATACTGGATAG (SEQ ID NO:1228)
F3: AGGCTATTCAAGGTGGCTAAA (SEQ ID NO:1229)
R3: ATTCTCATCAACGACTTCTAAA (SEQ ID NO:1230)

Sig54 (M73443)
F1: GCAGTTGGCGGTATTTGGTG (SEQ ID NO:1231)
R1: GAGAGCGAAGTTTATGAGAA (SEQ ID NO:1232)
F2: TGATTGTTGGGTAGCTCTCA (SEQ ID NO:1233)
R2: AAAATCGGTCTGATGCTCTTA (SEQ ID NO:1234)
F3: CTTTTCCTTTCGCTTGAAGA (SEQ ID NO:1235)
R3: AAAACAAACGCATCAAAAAT (SEQ ID NO:1236)

MurI (U12405)
F1: TTTCAAGGCGAGGAGGCAGAT (SEQ ID NO:1237)
R1: GCACAAAGACCCCACCACGAT (SEQ ID NO:1238)
F2: CGCCCGAATGGATGAGTAGG (SEQ ID NO:1239)
R2: TGCAACAAAAATACGCCCTT (SEQ ID NO:1240)
F3: TTTTTAAGGGCGTATTTTTGT (SEQ ID NO:1241)
R3: TGGGTTTTAAGGAATGTGATG (SEQ ID NO:1242)

dnaB (D26185)
F1: CGCGCTCAAAATCCCTAAAT (SEQ ID NO:1243)
R1: GGCCCAHTCTTTCGGATATT (SEQ ID NO:1244)
F2: GCCCCATTCCTGTTTTTAGC (SEQ ID NO:1245)
R2: CGCTTTAACGCTCCTTTCAC (SEQ ID NO:1246)
F3: AGCGTTTTTGTAAGGGGGTAT (SEQ ID NO:1247)
R3: TCCCTATCATAGCGTTAGTGC (SEQ ID NO:1248)

MurG (D10602)
F1: CTTAGGGGTTTTTAGCATGAA (SEQ ID NO:1249)
R1: GCACAATTCCCACACGCTGC (SEQ ID NO:1250)
F2: CCAAAGCTAAAGCGGTGTTTT (SEQ ID NO:1251)
R2: GCTCATGGATATAAAGGGGTATT (SEQ ID NO:1252)
F3: CTTAGCCCCTTTAGTGTTTA (SEQ ID NO:1253)
R3: CGCAAAAGGGTAGGGGATAA (SEQ ID NO:1254)

lpxC (U32794)
F1: TTTTATTTTTAGAAACGAATC (SEQ ID NO:1255)
R1: CAAACTTATCGCCCTCTCTA (SEQ ID NO:1256)
F2: AAAGATAACGCTAGGATTTCTAC (SEQ ID NO:1257)
R2: TAATTCTACAGAGTGGTTAATGG (SEQ ID NO:1258)
F3: GCGGTCATGGAATTTTTAGA (SEQ ID NO:1259)
R3: ATTCAAAGAAAGCTGGCTGTCT (SEQ ID NO:1260)

kdtA (M86305)
F1: GCTTGTGGGGGTTGTTTAT (SEQ ID NO:1261)
R1: GAACCCCCTAAAATGACAAT (SEQ ID NO:1262)
F2: ACCATGCTCATTAACGCTAGG (SEQ ID NO:1263)
R2: GTAAGTTTGAGCGGCTAATTC (SEQ ID NO:1264)
F3: AAAAAGAAAGAAGAACTCGTG (SEQ ID NO:1265)
R3: AAAGATACTCCCCTGTGATTFA (SEQ ID NO:1266)

lpxB (U09549)
F1: CAAAGAAACGCAAAATACAG (SEQ ID NO:1267)
R1: GCATGGTATTCAGCGTTTTC (SEQ ID NO:1268)
F2: GCAGCGGCACAGCGACTTTAG (SEQ ID NO:1269)
R2: CGCCCCCAAAAAGTCGCAGTA (SEQ ID NO:1270)
F3: AAAGGTTTGAAACAAGAAATCT (SEQ ID NO:1271)
R3: CACTTGAGCGTTAGCAACAAT (SEQ ID NO:1272)

KO 24
F1: GGGGTGTTTGAAATTGATAGA (SEQ ID NO:1273)
R1: CGCTAGGAGAAAGGAAGGAAA (SEQ ID NO:1274)
F2: CAAGGGCGTTTTTTGGGGTAT (SEQ ID NO:1275)
R2: GOGATTOTTACAGGAAAAGAT (SEQ ID NO:1276)
F3: GGAATACAATAACGCATAAAT (SEQ ID NO:1277)
R3: GCCTTTTTAGACAACCCTACT (SEQ ID NO:1278)

KO 26
F1: CCCCTAAACTCAAATCTCAAT (SEQ ID NO:1279)
R1: GCTAGAAATGCCATGAGAAAG (SEQ ID NO:1280)
F2: GCGATTATGGGGTATTTATTG (SEQ ID NO:1281)
R2: TTATTGTGGAGTTGCTTGTCA (SEQ ID NO:1282)
F3: TATGCGGCTCATCCTATTAAA (SEQ ID NO:1283)
R3: CCCTAAATCCAAATCAAGCAG (SEQ ID NO:1284)

Genomic DNA prepared from the Helicobacter pylori HpJ99 strain (ATCC 55679) was used as the source of template DNA for amplification of the ORFs by PCR (polymerase chain reaction) (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). For the preparation of genomic DNA from H. pylori, see Example I. PCR amplification was carried out by introducing 10 nanograms of genomic HpJ99 DNA into a reaction vial containing 10 mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2, 2 micromolar synthetic oligonucleotide primers (forward = F1 and reverse = R1), 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP), and 1.25 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 40 microliters. The PCR was carried out with Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cyclers. The thermal cycling conditions used to obtain amplified DNA products for each knock-out target are shown in Table 14.
TABLE 14
PCR Conditions

MurC: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 48°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
Sig54: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 50°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
LpxC: Denaturation at 94°C for 2 min; 32 cycles of 94°C for 15 sec, 50°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
KO24, KO26, KO27: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 50.5°C for 20 sec, 72°C for 2 min; final extension at 72°C for 20 minutes.
KO29, 26kDa Protein: Denaturation at 94°C for 2 min; 28 cycles of 94°C for 15 sec, 50.5°C for 20 sec, 72°C for 2 min; final extension at 72°C for 20 minutes.
DnaE, Glycyl: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 51°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
Gltx: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 51°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
KO28: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 51°C for 15 sec, 72°C for 2 min; final extension at 72°C for 20 minutes.
KO30: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 51.5°C for 15 sec, 72°C for 1 min 45 sec; final extension at 72°C for 20 minutes.
MurI, MurG: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
DnaB, KdtA, LpxB: Denaturation at 94°C for 2 min; 27 cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
Tsf, FlgE, FliM, Sig28, MurB: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
PpiB: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 52°C for 15 sec, 72°C for 2 min 30 sec; final extension at 72°C for 20 minutes.
MurD, MurE, AlgA, MetL, FusA, SerS, Rnh: Denaturation at 94°C for 2 min; cycles of 94°C for 15 sec, 55°C for 15 sec, 72°C for 1 min 30 sec; final extension at 72°C for 20 minutes.
Upon completion of thermal cycling reactions, each sample of amplified DNA was visualized on a 2% TAE agarose gel stained with Ethidium Bromide (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) to determine that a single product of the expected size had resulted from the reaction. Amplified DNA was then washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, USA).
PCR products were cloned into the pT7Blue T-Vector (catalog #69820-1, Novagen, Inc., Madison, WI, USA) using the TA cloning strategy (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The ligation of the PCR product into the vector was accomplished by mixing a 6 fold molar excess of the PCR product, 10 ng of pT7Blue-T vector (Novagen), 1 microliter of T4 DNA Ligase Buffer (New England Biolabs, Beverly, MA, USA), and 200 units of T4 DNA Ligase (New England Biolabs) into a final reaction volume of 10 microliters. Ligation was allowed to proceed for 16 hours at 16°C.
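The "6 fold molar excess" can be turned into a mass of insert with the usual proportionality between mass and length for double-stranded DNA. The sketch below is illustrative only: the vector length of 2887 bp is an assumed figure for pT7Blue that does not appear in the text, the 1000 bp insert is hypothetical, and the function name is invented for this example.

```python
def insert_mass_ng(vector_ng, vector_bp, insert_bp, molar_ratio):
    """Mass of insert (ng) giving the requested insert:vector molar ratio.

    For double-stranded DNA, moles are proportional to mass / length, so
    insert_ng = vector_ng * (insert_bp / vector_bp) * molar_ratio.
    """
    return vector_ng * (insert_bp / vector_bp) * molar_ratio


VECTOR_BP = 2887   # assumed pT7Blue length; not given in the text
INSERT_BP = 1000   # hypothetical PCR product length

needed = insert_mass_ng(vector_ng=10, vector_bp=VECTOR_BP, insert_bp=INSERT_BP, molar_ratio=6)
print(round(needed, 1), "ng of PCR product for 10 ng of vector")
```

Under these assumptions roughly 21 ng of a 1 kb product would correspond to a 6:1 molar ratio with 10 ng of vector; shorter inserts need proportionally less mass.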
Ligation products were electroporated (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994) into electroporation-competent XL-1 Blue or DH5-α E. coli cells (Clontech Lab., Inc. Palo Alto, CA, USA).
Briefly, 1 microliter of ligation reaction was mixed with 40 microliters of electrocompetent cells and subjected to a high voltage pulse (25 microFarads, 2.5 kV, 200 ohms), after which the samples were incubated in 0.45 ml SOC medium (yeast extract, 2% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and mM glucose) at 37°C with shaking for 1 hour. Samples were then spread onto LB (g/l bacto tryptone, 5 g/l bacto yeast extract, 10 g/l sodium chloride) plates containing 100 micrograms/ml of Ampicillin, 0.3% X-gal, and 100 micrograms/ml IPTG. These plates were incubated overnight at 37°C. Ampicillin-resistant colonies with white color were selected, grown in 5 ml of liquid LB containing 100 micrograms/ml of Ampicillin, and plasmid DNA was isolated using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, USA).
To verify that the correct H. pylori DNA inserts had been cloned, these pT7Blue plasmid DNAs were used as templates for PCR amplification of the cloned inserts, using the same forward and reverse primers (F1 and R1) used for the initial amplification of the J99 H. pylori sequence. Recognition of the primers and a PCR product of the correct size, as visualized on a 2% TAE, ethidium bromide stained agarose gel, were confirmation that the correct inserts had been cloned. Two to six such verified clones were obtained for each knock-out target, and frozen at -70°C for storage. To minimize errors due to PCR, plasmid DNA from these verified clones was pooled and used in subsequent cloning steps.
The sequences of the genes/ORFs were again used to design a second pair of primers (F2 and R2) which flanked the region of H. pylori DNA to be either interrupted or deleted (up to 250 basepairs) within the ORFs but were oriented away from each other. The pool of circular plasmid DNAs of the previously isolated clones was used as template for this round of PCR. Since the orientation of amplification of this pair of deletion primers was away from each other, the portion of the ORF between the primers would not be included in the resultant PCR product. The PCR product was a linear piece of DNA with H. pylori DNA at each end and the pT7Blue vector backbone between them which, in essence, resulted in the deletion of a portion of the ORFs. The PCR product was visualized on a 1% TAE, ethidium bromide stained agarose gel to confirm that only a single product of the correct size had been amplified.
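The geometry of this outward-facing ("inverse") PCR step can be illustrated with a toy string model. The sketch below treats the circular plasmid as a Python string and the region between the F2 and R2 primers as a slice to be dropped. The sequence, coordinates and function name are hypothetical, and primer annealing itself is not modeled.

```python
def inverse_pcr_product(circular_plasmid, del_start, del_end):
    """Linear product of outward-facing primers flanking [del_start, del_end).

    Amplification starts just after the deleted region, runs around the
    circle, and stops just before it, so the deleted slice is excluded.
    """
    if not 0 <= del_start < del_end <= len(circular_plasmid):
        raise ValueError("deletion coordinates must lie within the plasmid")
    return circular_plasmid[del_end:] + circular_plasmid[:del_start]


# Toy circular molecule: positions 3-5 ("CCC") stand in for the deleted ORF segment.
plasmid = "AAACCCGGGTTT"
product = inverse_pcr_product(plasmid, 3, 6)
print(product, len(plasmid) - len(product))
```

The linear product carries the ORF remnants at its two ends with the vector backbone between them, and is shorter than the plasmid by exactly the length of the deleted segment.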
A Kanamycin-resistance cassette (Labigne-Roussel et al., 1988, J. Bacteriology 170, pp. 1704-1708) was ligated to this PCR product by the TA cloning method used previously (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). The Kanamycin cassette containing a Campylobacter kanamycin resistance gene was obtained by carrying out an EcoRI digestion of the recombinant plasmid pCTB8:kan (Cover et al., 1994, J. Biological Chemistry 269, pp. 10566-10573). The proper fragment (1.4 kb) was separated on a 1% TAE gel and isolated using the QIAquick gel extraction kit (Qiagen, Gaithersburg, MD, USA). The fragment was end repaired using the Klenow fill-in protocol, which involved mixing 4 µg of the DNA fragment, 1 microliter of dATP, dGTP, dCTP, dTTP at 0.5 mM, 2 microliters of Klenow Buffer (New England Biolabs) and 5 units of DNA Polymerase I Large (Klenow) Fragment (New England Biolabs) into a 20 microliter reaction, incubating at 30°C for 15 min, and inactivating the enzyme by heating to 75°C for minutes. This blunt-ended Kanamycin cassette was then purified through a Qiaquick column (Qiagen, Gaithersburg, MD, USA) to eliminate nucleotides. The "T" overhang was then generated by mixing 5 micrograms of the blunt-ended kanamycin cassette, mM Tris pH 8.3, 50 mM KCl, 2 mM MgCl2, 5 units of DNA Polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA), and 20 microliters of 5 mM dTTP in a 100 microliter reaction and incubating the reaction for 2 hours at 37°C. The "Kan-T" cassette was purified using a QIAquick column (Qiagen, Gaithersburg, MD, USA).
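The final nucleotide concentrations implied by the volumes above follow from the dilution relation C1·V1 = C2·V2. The sketch below simply evaluates that relation for the two reactions described; the function name is an assumption of this example.

```python
def final_concentration(stock_conc, stock_volume, final_volume):
    """Final concentration after dilution: C2 = C1 * V1 / V2 (same units as stock_conc)."""
    return stock_conc * stock_volume / final_volume


# Klenow fill-in: 1 microliter of 0.5 mM dNTPs in a 20 microliter reaction.
klenow_dntp_mM = final_concentration(0.5, 1, 20)
# T-tailing: 20 microliters of 5 mM dTTP in a 100 microliter reaction.
tailing_dttp_mM = final_concentration(5, 20, 100)
print(klenow_dntp_mM, tailing_dttp_mM)
```

This gives 0.025 mM (25 micromolar) of each dNTP in the fill-in reaction and 1 mM dTTP in the tailing reaction.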
The PCR product of the deletion primers (F2 and R2) was ligated to the Kan-T cassette by mixing 10 to 25 ng of deletion primer PCR product, 50 to 75 ng Kan-T cassette DNA, 1 microliter 10x T4 DNA Ligase reaction mixture, and 0.5 microliter T4 DNA Ligase (New England Biolabs, Beverly, MA, USA) in a 10 microliter reaction and incubating for 16 hours at 16°C.
The ligation products were transformed into XL-1 Blue or DH5-α E. coli cells by electroporation as described previously. After recovery in SOC, cells were plated onto LB plates containing 100 micrograms/ml Ampicillin and grown overnight at 37°C. These plates were then replica plated onto plates containing 25 micrograms/ml Kanamycin and allowed to grow overnight. Resultant colonies had both the Ampicillin resistance gene present in the pT7Blue vector, and the newly introduced Kanamycin resistance gene.
Colonies were picked into LB containing 25 micrograms/ml Kanamycin and plasmid DNA was isolated using the Qiagen miniprep protocol (Qiagen, Gaithersburg, MD, USA).
Several tests by PCR amplification were conducted on these plasmids to verify that the Kanamycin cassette was inserted in the H. pylori gene/ORF, and to determine the orientation of the insertion of the Kanamycin-resistance gene relative to the H. pylori gene/ORF. To verify that the Kanamycin cassette was inserted into the H. pylori sequence, the plasmid DNAs were used as templates for PCR amplification with the set of primers (F1 and R1) originally used to clone the H. pylori gene/ORFs. The correct PCR product was the size of the deleted gene/ORF increased by the addition of the 1.4 kilobase Kanamycin cassette. To avoid potential polar effects of the kanamycin resistance cassette on H. pylori gene expression, the orientation of the Kanamycin resistance gene with respect to the knock-out gene/ORF was determined and both orientations were eventually used in H. pylori transformations (see below). To determine the orientation of insertion of the kanamycin resistance gene, primers were designed from the ends of the kanamycin resistance gene ("Kan-1", 5'-ATCTTACCTATCACCTCAAAT-3' (SEQ ID NO:1285), and "Kan-2", 5'-AGACAGCAACATCTTTGTGAA-3' (SEQ ID NO:1286)). By using each of the cloning primers (F1 and R1) in conjunction with each of the Kan primers (4 combinations of primers), the orientation of the Kanamycin cassette relative to the H. pylori sequence was determined. Positive clones were classified as either in the "A" orientation (the same direction of transcription was present for both the H. pylori gene and the Kanamycin resistance gene), or in the "B" orientation (the direction of transcription for the H. pylori gene was opposite to that of the Kanamycin resistance gene). Clones which shared the same orientation (A or B) were pooled for subsequent experiments and independently transformed into H. pylori.
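The logic of the four-primer-combination test can be sketched with toy sequences. Everything below is hypothetical: the flank and cassette sequences are invented, the primer names `kan_out_left` and `kan_out_right` stand in for outward-facing cassette primers (the direction of the actual Kan-1 and Kan-2 primers is not stated in the text), and which amplification pattern corresponds to the "A" or "B" orientation is an assumption of this toy model.

```python
def revcomp(seq):
    """Reverse complement of a DNA string."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def amplifies(template, primer_a, primer_b):
    """True if the two primers anneal to opposite strands with 3' ends facing each other."""
    def one_way(fwd, rev):
        i = template.find(fwd)           # forward primer matches the top strand
        j = template.find(revcomp(rev))  # reverse primer matches the bottom strand
        return i != -1 and j != -1 and i + len(fwd) <= j
    return one_way(primer_a, primer_b) or one_way(primer_b, primer_a)


# Invented toy sequences: ORF flanks and a resistance cassette.
left, right = "ATGGCTAAACGT", "TTTGGGCCCTAA"
cassette = "GATTACAGGCTCTTCA"
f1 = left[:6]                          # cloning primer F1 (top strand)
r1 = revcomp(right[-6:])               # cloning primer R1 (bottom strand)
kan_out_left = revcomp(cassette[4:12])  # reads out of the cassette start
kan_out_right = cassette[8:16]          # reads out of the cassette end

constructs = {
    "A": left + cassette + right,           # cassette inserted as written
    "B": left + revcomp(cassette) + right,  # cassette inverted
}
pairs = {
    "F1+left": (f1, kan_out_left), "F1+right": (f1, kan_out_right),
    "R1+left": (r1, kan_out_left), "R1+right": (r1, kan_out_right),
}
pattern = {
    name: sorted(k for k, (p, q) in pairs.items() if amplifies(seq, p, q))
    for name, seq in constructs.items()
}
print(pattern)
```

In this model each orientation yields products for exactly two of the four primer pairs, and the two orientations give complementary patterns, which is why running all four combinations is enough to classify a clone.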
Transformation of Plasmid DNA into H. pylori cells
Two strains of H. pylori were used for transformation: ATCC 55679, the clinical isolate which provided the DNA from which the H. pylori sequence database was obtained, and AH244, an isolate which had been passaged in, and had the ability to colonize, the mouse stomach. Cells for transformation were grown at 37°C, 10% CO2, 100% humidity, either on Sheep-Blood agar plates or in Brucella Broth liquid. Cells were grown to exponential phase and examined microscopically to determine that the cells were "healthy" (actively moving cells) and not contaminated. If grown on plates, cells were harvested by scraping cells from the plate with a sterile loop, suspended in 1 ml of Brucella Broth, spun down (1 minute, top speed in an Eppendorf microfuge) and resuspended in 200 microliters of Brucella Broth. If grown in Brucella Broth liquid, cells were centrifuged (15 minutes at 3000 rpm in a Beckman TJ6 centrifuge) and the cell pellet resuspended in 200 microliters of Brucella Broth. An aliquot of cells was taken to determine the optical density at 600 nm, in order to calculate the concentration of cells.
An aliquot (1 to 5 OD600 units/25 microliters) of the resuspended cells was placed onto a prewarmed Sheep-Blood agar plate, and the plate was further incubated at 37°C, 6% CO2, 100% humidity for 4 hours. After this incubation, 10 microliters of plasmid DNA (100 micrograms per microliter) was spotted onto these cells. A positive control (plasmid DNA with the ribonuclease H gene disrupted by the kanamycin resistance gene) and a negative control (no plasmid DNA) were done in parallel. The plates were returned to 37°C, 6% CO2 for an additional 4 hours of incubation. Cells were then spread onto that plate using a swab wetted in Brucella Broth, and grown for 20 hours at 37°C, 6% CO2. Cells were then transferred to a Sheep-Blood agar plate containing micrograms/ml Kanamycin, and allowed to grow for 3 to 5 days at 37°C, 6% CO2, 100% humidity. If colonies appeared, they were picked and regrown as patches on a fresh Sheep-Blood agar plate containing 25 micrograms/ml Kanamycin.
Three sets of PCR (three tests) were done to verify that the transformant colonies had arisen from homologous recombination at the proper chromosomal location. The template for PCR (DNA from the colony) was obtained by a rapid boiling DNA preparation method. An aliquot of the colony (a stab of the colony with a toothpick) was introduced into 100 microliters of 1% Triton X-100, 20 mM Tris, pH 8.5, and boiled for 6 minutes. An equal volume of phenol-chloroform was added and the mixture was vortexed. The mixture was microfuged for 5 minutes and the supernatant was used as DNA template for PCR with combinations of the following primers to verify homologous recombination at the proper chromosomal location.
TEST 1. PCR with F1 and R1 primers (cloning primers originally used to amplify the gene/ORF). A positive result of homologous recombination at the correct chromosomal location should show a single PCR product whose size is expected to be the size of the deleted gene/ORF increased by the 1.4 kilobase Kanamycin cassette. A PCR product of just the size of the gene/ORF was proof that the gene had not been knocked out and that the transformant was not the result of homologous recombination at the correct chromosomal location.
TEST 2. PCR with F3 (primer designed from sequences upstream of the gene/ORF), and either primer Kan-1 or Kan-2 (primers designed from the ends of the kanamycin resistance gene), depending on whether the plasmid DNA used was of "A" or "B" orientation. A positive result of homologous recombination at the correct chromosomal location of the sequences of the gene/ORFs upstream from the kanamycin resistance gene should show a single PCR product, the expected size being the distance from the location of F3 to the insertion site of the kanamycin resistance gene. No PCR product, or PCR product(s) of incorrect size(s), would prove that the plasmid had not been integrated at the correct site and that the gene had not been knocked out.
TEST 3. PCR with R3 (primer designed from sequences downstream of the gene/ORF) and either primer Kan-1 or Kan-2, depending on whether the plasmid DNA used was of "A" or "B" orientation. A positive result of homologous recombination at the correct chromosomal location downstream from the kanamycin resistance gene would show a single PCR product, the expected size being the distance from the insertion site of the kanamycin resistance gene to the downstream location of R3. Again, no PCR product, or PCR product(s) of incorrect size(s), would prove that the plasmid had not been integrated at the correct site and that the gene had not been knocked out.
Genes that are not essential for survival in vitro normally resulted in many transformants as observed for the positive control of ribonuclease H gene. Any transformants showing positive results for all three tests above would result in the conclusion that the gene was not essential for survival in vitro.
Genes that are essential for survival in vitro normally showed very few transformants. All transformants would be screened. A negative result of any of the three above tests for each transformant would lead to the conclusion that the gene had not been disrupted, and that the gene was essential for survival in vitro.
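The decision rule in the two preceding paragraphs can be sketched as a small function. This is only an illustration of the logic as stated in the text; the names Transformant and classify_gene are hypothetical and do not come from the specification.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Transformant:
    """PCR outcome for one kanamycin-resistant colony (True = expected product seen)."""
    test1: bool  # F1 + R1: gene/ORF size plus the 1.4 kb cassette
    test2: bool  # F3 + Kan primer: upstream junction
    test3: bool  # R3 + Kan primer: downstream junction


def classify_gene(transformants: List[Transformant]) -> str:
    """Apply the rule from the text: any colony positive in all three tests
    means the gene was disrupted and is therefore not essential in vitro."""
    if not transformants:
        # No colonies at all: the text handles this case separately by
        # checking plasmid uptake with TEST 2 and TEST 3 before plating.
        return "no colonies"
    if any(t.test1 and t.test2 and t.test3 for t in transformants):
        return "not essential"
    # Every screened colony failed at least one test: gene not disrupted.
    return "essential"


if __name__ == "__main__":
    colonies = [Transformant(True, True, True), Transformant(True, False, True)]
    print(classify_gene(colonies))
```

All transformants have to be screened before a gene is called essential, which is why the function inspects the whole list rather than stopping at the first negative colony.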
In the event that no colonies resulted from two independent transformations while the positive control with the disrupted ribonuclease H plasmid DNA produced transformants, the plasmid DNA was further analyzed by PCR on DNA from transformant populations prior to plating for colony formation, to verify that it could enter the cells and undergo homologous recombination at the correct site. Briefly, plasmid DNA was incubated according to the transformation protocol described above. DNA was extracted from the H. pylori cells immediately after incubation with the plasmid DNA, and the DNA was used as template for the above TEST 2 and TEST 3. Positive results of TEST 2 and TEST 3 would verify that the plasmid DNA could enter the cells and undergo homologous recombination at the correct chromosomal location. If TEST 2 and TEST 3 are positive, then failure to obtain viable transformants indicates that the gene is essential and that cells suffering a disruption in that gene are incapable of colony formation.
Genes used in these experiments have been found to be essential or non-essential, or are still in progress, as indicated in the following table.

TABLE: Summary of knock-out genes/ORFs

Gene            Accession Number   Pathway                                      Status
rnh             P23329             Transcription                                Not essential
ppiB            P29820             Translation                                  Not essential
tsf             P34828             Translation                                  Essential
MurD            P14900             Cell envelope                                Essential
MurE            P22188             Cell envelope                                Essential
AlgA            P07874             Virulence/Colonization                       Not essential
metL            P19358             Amino acid biosynthesis, aspartate family    Not essential
fusA            X16278             Translation                                  Essential
FlgE            U09549             Virulence/Colonization                       Not essential, motility impaired
FliM            M37691             Virulence/Colonization                       Not essential, motility impaired
MurC            U32794             Cell envelope                                Essential
dnaE            M10040             DNA replication                              Essential
serS            X05017             Translation                                  Essential
gly             P00960             Translation                                  Essential
gltX            L14580             Translation                                  Essential
sig28 (fliA)    M37691             Regulatory functions                         Not essential, motility impaired
sig54                              Regulatory functions                         Not essential, motility impaired
MurI            U12405             Cell envelope                                Essential
dnaB            D26185             DNA replication                              In progress
MurG            D10602             Cell envelope                                In progress
lpxC/envA       U32794             Cell envelope                                In progress
kdtA            M86305             Cell envelope                                In progress
lpxB            U09549             Cell envelope                                In progress
KO 24                              thiolase-like                                In progress
KO 26                              histone-like                                 In progress
KO 27                              respiratory chain NADH dehydrogenase-like    In progress

VIII. Cloning, purification and characterization of the gene encoding the peptidyl-prolyl cis-trans isomerase of H. pylori
The Helicobacter pylori genome contains an open reading frame (ORF) of 170 amino acids that was found to have homology with the Synechococcus sp. (strain PCC 7942) ppi gene (NCBI Accession number P29820). Therefore, to evaluate whether this ORF encoded a protein with PPIase activity, the gene was isolated by polymerase chain reaction (PCR) amplification cloning, overexpressed in E. coli, and the protein purified to homogeneity. To facilitate purification, a polyhistidine tag was added to the N-terminus of this ORF. A simple assay using PPIase to evaluate protein folding function was developed for future use as a high-throughput drug screen.
Currently, the class of PPIases is divided into three unrelated families: the cyclophilins, the FK506-binding proteins (FKBPs) and the parvulins. Although PPIase mutants have been reported from yeast and fruit fly, attempts to isolate disruption mutants in Escherichia coli were unsuccessful (Shieh, et al. (1989) Nature 338:67-70). This suggests that this activity is essential for viability in bacteria.
Cloning, expression and protein purification
To facilitate the cloning, expression and purification of ppi from H. pylori, a powerful gene expression system, the pET System, was used for cloning and expression of recombinant ppi in E. coli. In this study, the sequence for H. pylori ppi contains a DNA sequence encoding a His-Tag fused to the 5' end of the full length gene, because the protein product of this gene does not contain a signal sequence and is expressed as a cytosolic protein.
A synthetic oligonucleotide primer CT-3' (SEQ ID NO:1287)) specific for the 5' end of the ppi gene encoded a BamHI site at its extreme 5' terminus, and a primer (5'-TATCTCGAGTTATAGAGAAGGGC-3' (SEQ ID NO:1288)) specific for the 3' end of the ppi gene encoded a XhoI site at its extreme terminus. Genomic DNA prepared from the J99 strain of Helicobacter pylori was used as the source of template DNA for PCR amplification reactions (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To amplify a DNA sequence containing the H. pylori ppi gene, genomic DNA nanograms) was introduced into a reaction vial containing 2 mM MgCl2, 1 micromolar synthetic oligonucleotide primers (forward and reverse primers) complementary to and flanking a defined H. pylori ORF, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and 2.5 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 100 microliters.
The following thermal cycling conditions were used to obtain amplified DNA products for each ORF using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:
Conditions for amplification of H. pylori ppiB:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 32°C for 15 sec and 72°C for 1.5 min,
cycles at 94°C for 15 sec, 56°C for 15 sec and 72°C for 1.5 min.
Reactions were concluded at 72°C for 6 minutes.
Upon completion of thermal cycling reactions, the amplified DNA was washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, USA). The amplified DNA sample was subjected to digestion with the restriction endonucleases BamHI and XhoI (New England BioLabs, Beverly, MA, USA) (Current Protocols in Molecular Biology, Ibid). The DNA was subjected to electrophoresis on a NuSieve (FMC BioProducts, Rockland, ME, USA) agarose gel. DNA was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA, USA). Cloning, transformation, expression and purification of the PPI gene were carried out essentially as described in Examples II and III above.
Assay for PPIase activity
The assay for PPIase was essentially as described by Fischer (Fischer, et al. (1984) Biomed. Biochim. Acta 43:1101-1111). The assay measures the cis-trans isomerization of the Ala-Pro bond in the test peptide N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide (Sigma S-7388, lot 84H5805). The assay is coupled with α-chymotrypsin, which cleaves the test peptide only when the Ala-Pro bond is in trans. The conversion of the test peptide to the trans isomer in the assay is followed at 390 nm on a Beckman Model DU-650 spectrophotometer. The data were collected every second with an average scanning time of 0.5 second. Assays were carried out in 35 mM Hepes, pH 8.0, in a final volume of 400 µl, with 10 µM α-chymotrypsin (type 1-5 from bovine pancreas, Sigma C-7762, lot 23H7020) and nM PPIase. To initiate the reaction, 10 µl of the substrate (2 mM N-succinyl-Ala-Ala-Pro-Phe-p-nitroanilide in DMSO) was added to 390 µl of reaction mixture at room temperature.
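As a worked check of the assay set-up, the following sketch computes the final substrate concentration and DMSO content when the reaction is started. It assumes the volumes above are microliters (10 added to 390, giving 400 in total) and that the substrate stock is 2 mM; the function name is illustrative only.

```python
def final_concentration(stock_conc: float, added_volume: float, final_volume: float) -> float:
    """Concentration after diluting `added_volume` of stock into `final_volume`.
    Units are whatever the caller uses consistently."""
    return stock_conc * added_volume / final_volume


# Assumed reading of the text: 10 µl of 2 mM substrate in DMSO added to 390 µl.
SUBSTRATE_STOCK_UM = 2000.0   # 2 mM expressed in µM
ADDED_UL = 10.0
FINAL_UL = 390.0 + ADDED_UL

substrate_um = final_concentration(SUBSTRATE_STOCK_UM, ADDED_UL, FINAL_UL)
dmso_percent = 100.0 * ADDED_UL / FINAL_UL

if __name__ == "__main__":
    print(f"final substrate: {substrate_um:.1f} µM, DMSO: {dmso_percent:.1f}% v/v")
```

Under these assumptions the reaction starts at 50 µM substrate in 2.5% (v/v) DMSO.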
PPIase assay in crude bacterial extract.
A 50 ml culture of Helicobacter pylori (strain J99) in Brucella broth was harvested at mid-log phase (OD600 of 1) and resuspended in lysis buffer with the following protease inhibitors: 1 mM PMSF, and 10 µg/ml of each of aprotinin, leupeptin, pepstatin, TLCK, TPCK, and soybean trypsin inhibitor. The suspension was subjected to 3 cycles of freeze-thaw (15 minutes at -70°C, then 30 minutes at room temperature), followed by sonication (three 20 second bursts). The lysate was centrifuged (12,000 x g for 30 minutes) and the supernatant was assayed for PPIase activity.
Results
PPI from H. pylori was expressed in E. coli using the pET-28b expression vector from Novagen (cat 69868-1). The expressed recombinant protein was isolated from the soluble fraction of bacterial cells that had been disrupted by cavitation in a Microfluidics Cell disruption chamber. Expression of recombinant PPI yielded 100 mg of protein. The recombinant protein could be purified to homogeneity by Ni2+ chelate chromatography and gel filtration. On sodium dodecyl sulfate polyacrylamide gels, the recombinant protein migrates as a single band at 21 kDa, in accordance with the predicted molecular weight of 20,975 deduced from the gene sequence.
The PPIase activity was assayed using the chromogenic tetrapeptide substrate succinyl-Ala-Ala-Pro-Phe-p-nitroanilide. An initial velocity of 4.9 µmole/min/mg protein was measured with the purified enzyme (Figure). This corresponds to a kcat of 1.6 sec^-1, which is similar to the one obtained for the E. coli PPIase (Liu, J. and Walsh, C.T. (1990) Proc. Natl. Acad. Sci. USA 87:4028-4032) and the one from porcine kidney (Fischer, G. (1989) Nature 337:476-478).
The recombinant protein has a high catalytic efficiency of 2.06 x 10^9 M^-1 s^-1 when the assay is measured at 25°C. This value is one to two orders of magnitude higher than that observed for other characterized PPIases. However, in those studies, the PPIase assay was conducted at 10°C, which may account for the discrepancy. The catalytic efficiency is very close to the 1 x 10^8 to 1 x 10^9 M^-1 s^-1 upper diffusional limit for "kinetically perfect" enzymes (Albery, W.J. and Knowles, J.R. (1976) Biochemistry 15:5631-5640) and suggests that, by at least one measure, the H. pylori PPIase is a highly effective catalyst in the cis-trans isomerization of the Ala-Pro bond in the oligopeptide substrate.
The presence of PPIase was also determined in an H. pylori extract. As with the assay for the recombinant protein, PPIase activity was detected, and was dependent on the concentration of extract added (Figure 6).
These results show that PPIase activity can be measured on either H. pylori extracts or on the recombinant protein expressed in E. coli. The high catalytic efficiency also demonstrates that H. pylori enzymes, such as PPIase, can be expressed at high levels and in an active form in E. coli. Such high yields of purified proteins provide for the design of various high throughput drug screening assays.
IX. Cloning, purification, and characterization of the gene encoding the glutamate racemase of H. pylori
The Helicobacter pylori genome contains an open reading frame (ORF) of 255 amino acids that was found to have homology to the Staphylococcus haemolyticus glutamate racemase gene (dga) (NCBI Accession number U12405) and to the E. coli murI gene, which encodes glutamate racemase activity in that organism. To evaluate whether this H. pylori ORF encodes a protein with glutamate racemase activity, the gene was isolated by polymerase chain reaction (PCR) amplification cloning, overexpressed in E. coli, and the protein purified to apparent homogeneity. A simple assay for glutamate racemase activity, resulting in the isomerization of D-glutamic acid to L-glutamic acid, was developed to facilitate purification and for future use as a high-throughput drug screen.
The ORF in H. pylori has been found by gene disruption studies to be essential for viability of H. pylori cells in laboratory culture (see Example VII above). Therefore, inhibition of the enzymatic activity would be expected to be lethal for the organism, and such inhibitors may have utility in antimicrobial therapy of human infectious diseases.
Cloning of H. pylori murI gene encoding glutamate racemase
A 765 base pair DNA sequence encoding the murI gene of H. pylori was isolated by polymerase chain reaction (PCR) amplification cloning. A synthetic oligonucleotide primer (5'-AAATAGTCATATGAAAATAGGCGTTTTTG (SEQ ID NO:1289)) encoding an NdeI restriction site and the 5' terminus of the murI gene and a primer (SEQ ID NO:1290) encoding an EcoRI restriction site and the 3' end of the murI gene were used to amplify the murI gene of H. pylori, using genomic DNA prepared from the J99 strain of H. pylori as the template DNA for the PCR amplification reactions (Current Protocols in Molecular Biology, John Wiley and Sons, Inc., F. Ausubel et al., editors, 1994). To amplify a DNA sequence containing the murI gene, genomic DNA (25 nanograms) was introduced into each of two reaction vials containing 1.0 micromole of each synthetic oligonucleotide primer, 2.0 mM MgCl2, 0.2 mM of each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP), and 1.25 units of heat stable DNA polymerase (Amplitaq, Roche Molecular Systems, Inc., Branchburg, NJ, USA) in a final volume of 50 microliters.
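The restriction sites engineered into the cloning primers can be checked directly against the primer sequences quoted in Examples VIII and IX. The sketch below is a minimal illustration; the recognition sequences are the standard ones for each enzyme, and the function and dictionary names are hypothetical.

```python
from typing import Dict

# Standard recognition sequences for the enzymes named in these examples.
RECOGNITION_SITES: Dict[str, str] = {
    "BamHI": "GGATCC",
    "XhoI": "CTCGAG",
    "NdeI": "CATATG",
    "EcoRI": "GAATTC",
    "ScaI": "AGTACT",
}


def sites_in_primer(primer: str) -> Dict[str, int]:
    """Return {enzyme: 0-based start index} for every site found in the primer."""
    seq = primer.upper()
    return {
        enzyme: seq.find(site)
        for enzyme, site in RECOGNITION_SITES.items()
        if site in seq
    }


# Primer sequences as quoted in the text (SEQ ID NO:1289 and SEQ ID NO:1288).
MURI_FORWARD = "AAATAGTCATATGAAAATAGGCGTTTTTG"
PPI_REVERSE = "TATCTCGAGTTATAGAGAAGGGC"

if __name__ == "__main__":
    print("murI forward:", sites_in_primer(MURI_FORWARD))
    print("ppi reverse:", sites_in_primer(PPI_REVERSE))
```

The murI forward primer carries the NdeI site (CATATG), whose ATG doubles as the start codon, and the ppi reverse primer carries the XhoI site (CTCGAG), in agreement with the descriptions in the text.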
The following thermal cycling conditions were used to obtain amplified DNA products for the murI gene using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:
Conditions for amplification of H. pylori murI:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 30 sec and 72°C for 15 sec,
23 cycles at 94°C for 15 sec, 53°C for 30 sec and 72°C for 15 sec.
Reactions were concluded at 72°C for 20 minutes.
Upon completion of thermal cycling reactions, the amplified DNA was washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, USA). The amplified DNA sample was subjected to digestion with the restriction endonucleases NdeI and EcoRI (New England Biolabs, Beverly, MA, USA) (Current Protocols in Molecular Biology, Ibid). The DNA samples from each of two reaction mixtures were pooled and subjected to electrophoresis on a 1.0% SeaPlaque (FMC BioProducts, Rockland, ME, USA) agarose gel. DNA was visualized by exposure to ethidium bromide and long wave UV irradiation. Amplified DNA encoding the H. pylori murI gene was isolated from agarose gel slices and purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA, USA).
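The murI cycling program above can be written down as data, which makes it easy to total the programmed hold time. This is a sketch only: the Stage class and function name are hypothetical, and the total ignores ramp times between temperatures, which depend on the instrument.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Stage:
    cycles: int
    steps: List[Tuple[int, int]]  # (temperature in °C, hold time in seconds)


# murI program as given in the text.
MURI_PROGRAM = [
    Stage(1, [(94, 120)]),                       # initial denaturation, 2 min
    Stage(2, [(94, 15), (30, 30), (72, 15)]),    # 2 low-stringency cycles
    Stage(23, [(94, 15), (53, 30), (72, 15)]),   # 23 cycles at 53°C annealing
    Stage(1, [(72, 1200)]),                      # final extension, 20 min
]


def total_hold_seconds(program: List[Stage]) -> int:
    """Sum of all programmed hold times, excluding temperature ramps."""
    return sum(
        stage.cycles * sum(seconds for _, seconds in stage.steps)
        for stage in program
    )


if __name__ == "__main__":
    seconds = total_hold_seconds(MURI_PROGRAM)
    print(f"{seconds} s of programmed holds ({seconds / 60:.0f} min)")
```

For this program the holds add up to 2820 seconds, or 47 minutes, before ramping is taken into account.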
Cloning of H. pylori DNA sequences into the pET-23 prokaryotic expression vector.
The pET-23b vector can be propagated in any E. coli K-12 strain, e.g., HMS174, HB101, JM109, DH5α, etc., for the purpose of cloning or plasmid preparation. Hosts for expression include E. coli strains containing a chromosomal copy of the gene for T7 RNA polymerase. These hosts are lysogens of bacteriophage DE3, a lambda derivative that carries the lacI gene, the lacUV5 promoter and the gene for T7 RNA polymerase. T7 RNA polymerase is induced by addition of isopropyl-β-D-thiogalactoside (IPTG), and the T7 RNA polymerase transcribes any target plasmid, such as pET-28b, carrying its gene of interest. Strains used in our laboratory include: BL21(DE3) (Studier, F.W., Rosenberg, Dunn, and Dubendorff, J.W. (1990) Meth. Enzymol. 185, 60-89).
The pET-23b vector (Novagen, Inc., Madison, WI, USA) was prepared for cloning by digestion with NdeI and EcoRI (Current Protocols in Molecular Biology, Ibid). Following digestion, the amplified, agarose gel-purified DNA fragment carrying the murI gene was cloned (Current Protocols in Molecular Biology, Ibid) into the previously digested pET-23b expression vector. Products of the ligation reaction were then used to transform the BL21(DE3) strain of E. coli.
Transformation of competent bacteria with recombinant plasmids
Competent bacteria, E. coli strain BL21 or E. coli strain BL21(DE3), were transformed with recombinant pET23-murI expression plasmid carrying the cloned H. pylori sequence according to standard methods (Current Protocols in Molecular Biology, Ibid). Briefly, 1 microliter of ligation reaction was mixed with 50 microliters of electrocompetent cells and subjected to a high voltage pulse, after which samples were incubated in 0.45 milliliters of SOC medium (yeast extract, 2.0% tryptone, 10 mM NaCl, 2.5 mM KCl, 10 mM MgCl2, 10 mM MgSO4 and 20 mM glucose) at 37°C with shaking for 1 hour. Samples were then spread on LB agar plates containing 100 micrograms/ml ampicillin for growth overnight. Transformed colonies of BL21 were then picked and analyzed to evaluate cloned inserts as described below.
Identification of recombinant pET expression plasmids carrying H. pylori sequences
Individual BL21 clones transformed with recombinant pET-23-murI were analyzed by PCR amplification of the cloned inserts using the same forward and reverse primers, specific for each H. pylori sequence, that were used in the original PCR amplification cloning reactions. Successful amplification verified the integration of the H. pylori sequences in the expression vector (Current Protocols in Molecular Biology, Ibid).
Isolation and Preparation of plasmid DNA from BL21 transformants
Colonies carrying pET-23-murI vectors were picked and incubated in 5 ml of LB broth plus 100 micrograms/ml ampicillin overnight. The following day plasmid DNA was isolated and purified using the Qiagen plasmid purification protocol (Qiagen Inc., Chatsworth, CA, USA).
Cloning and expression of the E. coli groE operon
It has been demonstrated that coexpression of the E. coli murI gene with the genes in the E. coli groE operon reduces the formation of insoluble inclusion bodies containing recombinant glutamate racemase (Ashiuchi, Yoshimura, Kitamura, Kawata, Nagai, Gorlatov, Esaki, N. and Soda, 1995, J. Biochem. 117, 495-498). The groE operon encodes two proteins, GroES (97 amino acids) and GroEL (548 amino acids), which are molecular chaperones. Molecular chaperones cooperate to assist the folding of new polypeptide chains (Ulrich Hartl, 1996, Nature London 381, pp. 571-580).
The 2210 bp DNA sequence encoding the groE operon of E. coli (NCBI Accession number X07850) was isolated by polymerase chain reaction (PCR) amplification cloning. A synthetic oligonucleotide primer AATTTTTTTTCT-3' (SEQ ID NO:1291)) encoding an EcoRI restriction site and the terminus of the groE operon containing the endogenous promoter region of the groE operon and a primer (5'-ATAAGTACTTGTGAATCTTATACTAG (SEQ ID NO:1292)) encoding a ScaI restriction site and the 3' end of the groEL gene contained in the groE operon were used to amplify the groE operon of E. coli, using genomic DNA prepared from E. coli strain MG1655 as the template DNA for the PCR amplification reactions (Current Protocols in Molecular Biology, Ibid). To amplify a DNA sequence containing the E. coli groE operon, genomic DNA (12.5 nanograms) was introduced into each of two reaction vials containing 0.5 micromoles of each synthetic oligonucleotide primer, 1.5 mM MgCl2, 0.2 mM each deoxynucleotide triphosphate (dATP, dGTP, dCTP, dTTP) and 2.6 units of heat stable DNA polymerases (Expanded High Fidelity PCR System, Boehringer Mannheim, Indianapolis, Indiana) in a final volume of 50 microliters. The following thermal cycling conditions were used to obtain amplified DNA products for the groE operon using a Perkin Elmer Cetus/GeneAmp PCR System 9600 thermal cycler:
Conditions for amplification and cloning of the E. coli groE operon:
Denaturation at 94°C for 2 min,
2 cycles at 94°C for 15 sec, 30°C for 30 sec and 72°C for 2 min,
23 cycles at 94°C for 15 sec, 55°C for 30 sec and 72°C for 2 min.
Reactions were concluded at 72°C for 8 minutes.
Upon completion of thermal cycling reactions, the amplified DNA was washed and purified using the Qiaquick Spin PCR purification kit (Qiagen, Gaithersburg, MD, USA). The amplified DNA sample was subjected to digestion with the restriction endonucleases EcoRI and ScaI (New England Biolabs, Beverly, MA, USA) (Current Protocols in Molecular Biology, Ibid). The DNAs from each of two reaction mixtures were pooled and subjected to electrophoresis in a 1.0% SeaPlaque (FMC BioProducts, Rockland, ME, USA) agarose gel. DNA was visualized by exposure to ethidium bromide and long wave UV irradiation. DNA contained in slices isolated from the agarose gel was purified using the Bio 101 GeneClean Kit protocol (Bio 101, Vista, CA, USA).
A DNA fragment, EcoRI to ScaI, containing the E. coli groE operon was cloned into the corresponding sites of the pACYC184 expression vector (New England Biolabs, Beverly, MA, USA) to make pACYC184-groE. The BL21(DE3) strain of E. coli was transformed with pACYC-groE. A tetracycline-resistant transformant overexpressing proteins of Mr 14,000 (GroES) and Mr 60,000 (GroEL) was isolated.
Transformation of E. coli strain BL21(DE3) carrying the pACYC-groE plasmid of E. coli.
Competent bacteria derived from a clone of strain BL21(DE3) carrying the pACYC-groE plasmid were transformed with 50 nanograms of pET23-murI plasmid DNA, isolated as described above (Current Protocols in Molecular Biology, Ibid). A clone of BL21(DE3) carrying both the pACYC-groE expression plasmid and the pET-23-murI plasmid was isolated and used for expression of recombinant glutamate racemase as described below.
Expression of recombinant H. pylori murI
A bacterial clone of BL21(DE3) carrying both the pACYC-groE expression plasmid and the pET-23-murI plasmid was cultured in LB broth supplemented with mM D,L-glutamic acid, 100 micrograms/ml ampicillin and 10 micrograms/ml tetracycline at 30°C until an optical density at 600 nm of 0.5 to 1.0 O.D. units was reached, at which point isopropyl-beta-D-thiogalactoside (IPTG) was added to the culture at a final concentration of 1.0 mM. Cells were cultured overnight to induce gene expression of the H. pylori recombinant DNA constructions.
After induction of gene expression with IPTG, bacteria were pelleted by centrifugation in a Sorvall RC-3B centrifuge at 3000 x g for 20 minutes at 4°C. Pellets were resuspended in 50 milliliters of cold 10 mM Tris-HCl, pH 8.0, 0.1 M NaCl and 0.1 mM EDTA (STE buffer). Cells were then centrifuged at 2000 x g for 20 min at 4°C. Pellets were weighed (average wet weight 6 grams/liter) and processed to purify recombinant protein as described below.
Purification of soluble glutamate racemase
All steps were carried out at 4°C. Cells were suspended in 4 volumes of lysis buffer (50 mM potassium phosphate, pH 7.0, 100 mM NaCl, 2 mM EDTA, 2 mM EGTA, 10% glycerol, 10 mM D,L-glutamic acid, 0.1% β-mercaptoethanol, 200 µg/ml lysozyme, 1 mM PMSF, and 10 µg/ml each of leupeptin, aprotinin, pepstatin, L-1-chloro-3-[4-tosylamido]-7-amino-2-heptanone (TLCK), L-1-chloro-3-[4-tosylamido]-4-phenyl-2-butanone (TPCK), and soybean trypsin inhibitor) and ruptured by three passages through a small volume microfluidizer (Model M-10S, Microfluidics International Corporation, Newton, MA). The resultant homogenate was diluted with 1 volume of buffer A (10 mM Tris-HCl pH 7.0, 0.1 mM EGTA, 10% glycerol, 1 mM D,L-glutamic acid, 1 mM PMSF, 0.1% beta-mercaptoethanol), made 0.1% in Brij-35, and centrifuged (100,000 x g, 1 h) to yield a clear supernatant (crude extract).
After filtration through a 0.80-µm filter, the extract was loaded directly onto a 20-ml Q-Sepharose column pre-equilibrated in buffer A containing 100 mM NaCl and 0.02% Brij-35. The column was washed with 100 ml (5 bed volumes) of buffer A containing 100 mM NaCl and 0.02% Brij-35, then developed with a 100-ml linear gradient of increasing NaCl (from 100 to 500 mM) in buffer A. A band of Mr 28,000 corresponding to glutamate racemase, the product of the recombinant H. pylori murI gene, eluted at a gradient concentration of approximately 200-280 mM NaCl. Individual column fractions were then characterized for glutamate racemase activity (see below for description of assay) and the protein profile of the fractions was analyzed on 12% acrylamide SDS-PAGE gels.
Fractions containing glutamate racemase were pooled, brought to 70% saturation with solid (NH4)2SO4, stirred for 20 min, and then centrifuged at 27,000 x g for 20 min.
The resulting pellet was resuspended in lysis buffer to a final volume of 8 ml and loaded directly onto a 350-ml column (2.2 x 92 cm) of Sephacryl S-100HR gel filtration medium equilibrated in buffer B (10 mM Hepes pH 7.5, 150 mM NaCl, 0.1 mM EGTA, glycerol, 1 mM D,L-glutamic acid, 0.1 mM PMSF, 0.1% beta-mercaptoethanol) and run at 30 ml/h. Fractions found to contain glutamate racemase activity were pooled, 0.5 volume of buffer C (10 mM Tris pH 7.5, 0.1 mM EGTA, 10% glycerol, 1 mM D,L-glutamic acid, 0.1 mM PMSF, 0.1% β-mercaptoethanol) was added (to reduce the NaCl concentration to 100 mM), and the pool was loaded onto a MonoQ 10/10 high-pressure liquid chromatography column equilibrated in buffer C containing 100 mM NaCl. The column was washed with 5 bed volumes of this buffer and developed with a 40 ml linear gradient of increasing NaCl (from 100 to 500 mM). Glutamate racemase eluted as a sharp peak at 310 mM NaCl. Fractions containing glutamate racemase activity were pooled, concentrated by dialysis against storage buffer [50% glycerol, 10 mM 3-(N-morpholino)propanesulfonic acid (MOPS) pH 7.0, 150 mM NaCl, 0.1 mM EGTA, 0.02% Brij-35, 1 mM dithiothreitol] and stored at -20°C.
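The salt adjustment before the MonoQ step follows from a simple mixing calculation: pooled gel filtration fractions in buffer B (150 mM NaCl) plus 0.5 volume of salt-free buffer C. The sketch below assumes the pooled fractions are at the buffer B salt concentration and that volumes are additive; the function name is illustrative.

```python
def mixed_concentration(conc_a: float, vol_a: float, conc_b: float, vol_b: float) -> float:
    """Concentration after mixing two solutions, assuming additive volumes."""
    return (conc_a * vol_a + conc_b * vol_b) / (vol_a + vol_b)


# 1 volume of pooled fractions in buffer B (150 mM NaCl, assumed unchanged by
# the column) mixed with 0.5 volume of buffer C, which contains no NaCl.
nacl_after_dilution_mm = mixed_concentration(150.0, 1.0, 0.0, 0.5)

if __name__ == "__main__":
    print(f"NaCl after dilution: {nacl_after_dilution_mm:.0f} mM")
```

The result, 100 mM, matches the NaCl concentration of the buffer in which the MonoQ column was equilibrated.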
Assays for glutamate racemase activity.
Conversion of D-glutamate to L-glutamate (two enzyme coupled assay)
The activity of glutamate racemase, the interconversion of the enantiomers of glutamic acid, was measured using D-glutamic acid as substrate. The method of Gallo and Knowles (Gallo, K.A. and Knowles, 1993, Biochemistry 32, 3981-3990), which was used to measure the glutamate racemase activity of Lactobacillus fermenti, was adapted for the measurement of glutamate racemase activity of the H. pylori murI gene product isolated as a recombinant protein from E. coli. In this assay, the activity of glutamate racemase is linked to an OD change in the visible range through a series of coupled reactions catalyzed by L-glutamate dehydrogenase (reduction of NAD to NADH) and diaphorase (reduction of the dye p-iodonitrotetrazolium violet, INT). Initial rates were determined by following the increase in absorbance at 500 nm in a reaction volume of 200 µl containing 50 mM Tris-HCl, pH 7.8, 4% v/v glycerol, mM NAD, 2 mM INT, 60 Units/ml L-glutamate dehydrogenase, 5 Units/ml diaphorase, and varying concentrations of either substrate (from 0.063 mM to 250 mM D-glutamic acid) or purified enzyme (from 1 µg to 50 µg). After a preincubation of all reagents except either the substrate (D-glutamic acid) or the enzyme (murI gene product) for a period of 5 minutes, reactions were initiated by adding the missing ingredient (either the enzyme or the substrate, as required), and the increase in optical density at 500 nm was measured in a Microplate Spectrophotometer System (Molecular Devices, Spectra MAX 250). Measurements were followed for 20 minutes, and initial velocities were derived by calculating the maximum slope for the absorbance increases. The coupled reactions can be summarized as shown below:
1) D-glutamate <-> L-glutamate (glutamate racemase)
2) L-glutamate + H2O + NAD+ -> 2-oxoglutarate + NH3 + NADH (L-glutamate dehydrogenase)
3) NADH + INT -> NAD+ + formazan (color) (diaphorase)
Conversion of D-glutamate to L-glutamate (single enzyme coupled assay)
In this assay, the conversion of D-glutamic acid to L-glutamic acid is coupled to the conversion of L-glutamic acid and NAD+ by L-glutamate dehydrogenase to 2-oxoglutarate, ammonia, and NADH. The production of NADH (the reduction of NAD+ to NADH) is measured as an increase of absorbance at 340 nm at 37°C. The standard assay mixture (adapted from Choi, Esaki, Yoshimura, and Soda, 1991, Protein Expression and Purification 2, 90-93) contained 10 mM Tris-HCl, pH 7.5, 5 mM NAD+, 5 Units/ml L-glutamate dehydrogenase, varying concentrations of the substrate D-glutamic acid (0.063 mM to 250 mM), and the purified recombinant H. pylori enzyme glutamate racemase (1 µg to 50 µg). The reaction was started by the addition of either the substrate D-glutamic acid or the recombinant glutamate racemase after a preincubation at 37°C for 5 minutes with all of the other assay ingredients. The change in absorbance at 340 nm was measured in a Spectra MAX 250. Initial velocities were derived from the initial slopes. The coupled reactions can be summarized as shown below:
1) D-glutamate <-> L-glutamate (glutamate racemase)
2) L-glutamate + H2O + NAD+ -> 2-oxoglutarate + NH3 + NADH (L-glutamate dehydrogenase)
Results
1) Expression of the H. pylori murI gene in E. coli cells
To examine its biochemical properties, the H. pylori glutamate racemase was overexpressed in E. coli and purified. In the presence of the E. coli chaperones GroES and GroEL, the glutamate racemase was expressed as a soluble protein. About 20 mg of soluble MurI was produced per liter of culture, as judged by the intensity of the protein band after SDS-PAGE. No band corresponding to the molecular weight of MurI protein was seen in control gel lanes containing extracts from cells transformed with the pET vector lacking a murI insert. Addition of 1 mM DL-glutamic acid during cultivation of the expressing cells increased the apparent expression level by about five-fold.
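The initial-velocity determination used in the assays above (the maximum slope of the absorbance increase) can be sketched as follows. This is an illustration only, not the software used with the Spectra MAX 250: the window size, the example readings, the optical path length, and the reaction volume applied to the 340 nm assay are assumptions chosen for the example. The molar absorption coefficient of NADH at 340 nm (6220 M^-1 cm^-1) is the standard literature value.

```python
NADH_EXT_COEFF = 6220.0  # M^-1 cm^-1 at 340 nm (standard literature value)


def max_slope(times_min, absorbances, window=3):
    """Largest least-squares slope (absorbance units per minute) found over
    any run of `window` consecutive readings."""
    if len(times_min) != len(absorbances) or len(times_min) < window:
        raise ValueError("need equally long series with at least `window` readings")
    best = float("-inf")
    for i in range(len(times_min) - window + 1):
        t = times_min[i:i + window]
        a = absorbances[i:i + window]
        t_mean = sum(t) / window
        a_mean = sum(a) / window
        num = sum((ti - t_mean) * (ai - a_mean) for ti, ai in zip(t, a))
        den = sum((ti - t_mean) ** 2 for ti in t)
        best = max(best, num / den)
    return best


def nadh_rate_nmol_per_min(slope_per_min, volume_ul, path_cm):
    """Convert an A340 slope into nmol NADH formed per minute (Beer-Lambert)."""
    molar_per_min = slope_per_min / (NADH_EXT_COEFF * path_cm)
    return molar_per_min * volume_ul * 1e-6 * 1e9


# Hypothetical readings (minutes, A340); not data from the source.
times = [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]
a340 = [0.100, 0.105, 0.125, 0.150, 0.170, 0.180]

slope = max_slope(times, a340)
# Assumed 200 ul well volume and 0.5 cm path length (both hypothetical here).
rate = nadh_rate_nmol_per_min(slope, volume_ul=200.0, path_cm=0.5)
print(f"max slope = {slope:.4f} A/min, rate = {rate:.2f} nmol NADH/min")
```

Taking the maximum windowed slope rather than the first few points tolerates a short lag phase, which is common in coupled assays while the coupling enzymes reach steady state.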
2) Purification of recombinant H. pylori MurI protein
MurI was purified by cation exchange chromatography and gel filtration. Upon SDS-PAGE analysis, the purified protein migrated as a single polypeptide species with an apparent mass of 29 kDa, which is consistent with the predicted mass of 28,858 Da.
3) Kinetic properties of recombinant H. pylori MurI enzyme
Kinetic constants for recombinant glutamate racemase were estimated by assaying its activity at various concentrations of protein and D-glutamic acid as described above.
Purified recombinant H. pylori glutamate racemase exhibits a Vmax of ~300 nmoles/min/mg protein (kcat of 8.6 min^-1) and a Km of ~100 µM for D-glutamate.
Although the Vmax value is lower than that observed for highly purified glutamate racemase from some other bacterial species, its Km for D-glutamic acid is higher than that observed for the enzyme from most other species, resulting in a catalytic efficiency (kcat/Km) which is typical of purified preparations from E. coli and P. Pentococcus.
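The relationship between the reported kinetic constants can be reproduced with a short calculation. The sketch below assumes one active site per 28,858 Da polypeptide and uses the approximate Vmax (300 nmoles/min/mg) and Km (100 µM) values reported above; the function names are illustrative.

```python
def kcat_per_min(vmax_nmol_per_min_per_mg, mass_da):
    """Turnover number (min^-1) from a specific activity and a molecular mass.

    nmol/min/mg equals 1e-6 mol/min/g; multiplying by g/mol gives min^-1.
    Assumes one active site per polypeptide of the given mass.
    """
    return vmax_nmol_per_min_per_mg * 1e-6 * mass_da


def michaelis_menten(substrate_uM, vmax, km_uM):
    """Initial velocity predicted by the Michaelis-Menten equation."""
    return vmax * substrate_uM / (km_uM + substrate_uM)


kcat = kcat_per_min(300.0, 28858.0)            # about 8.66 min^-1
efficiency = kcat / 100e-6 / 60.0              # kcat/Km in M^-1 s^-1
v_at_km = michaelis_menten(100.0, 300.0, 100.0)  # half of Vmax at [S] = Km

print(f"kcat = {kcat:.2f} min^-1")
print(f"kcat/Km = {efficiency:.0f} M^-1 s^-1")
print(f"v at [S] = Km: {v_at_km:.0f} nmol/min/mg")
```

The computed turnover number of about 8.66 min^-1 is consistent with the kcat of 8.6 min^-1 given in the text.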
4) Characterization of MurI: Inhibition by L-serine-O-sulfate
The H. pylori glutamate racemase was tested for inactivation with a suicide inhibitor, L-serine-O-sulfate, which is known to inhibit MurI from E. coli. The enzyme was incubated in the presence of 20 mM L-serine-O-sulfate, and at different time intervals, aliquots were removed to determine residual activity. The initial velocity of purified recombinant H. pylori MurI protein was determined in the single enzyme coupled assay following incubation with the inhibitor L-serine-O-sulfate (LSOS) for the times indicated on the x-axis. The control was incubated in an identical manner but without LSOS. As shown in Figure 7, the H. pylori glutamate racemase can be readily inactivated by the inhibitor.
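A time course of residual activity such as the one described above is commonly summarized by an apparent inactivation rate constant. The sketch below assumes pseudo-first-order inactivation, which the source does not state, and uses synthetic values generated inside the example rather than the data of Figure 7; all names are illustrative.

```python
import math


def inactivation_rate(times_min, residual_fraction):
    """Apparent first-order inactivation constant (min^-1).

    Fits ln(residual activity / control activity) against time by least
    squares and returns the negated slope.
    """
    y = [math.log(f) for f in residual_fraction]
    n = len(times_min)
    t_mean = sum(times_min) / n
    y_mean = sum(y) / n
    num = sum((t - t_mean) * (yi - y_mean) for t, yi in zip(times_min, y))
    den = sum((t - t_mean) ** 2 for t in times_min)
    return -num / den


# Synthetic time course generated with a known constant (hypothetical values).
times = [0.0, 5.0, 10.0, 20.0]
k_true = 0.1
fractions = [math.exp(-k_true * t) for t in times]

k_obs = inactivation_rate(times, fractions)
half_life = math.log(2) / k_obs
print(f"k_obs = {k_obs:.3f} min^-1, half-life = {half_life:.1f} min")
```

Residual activities are expressed relative to the control incubated without LSOS, so that any loss of activity unrelated to the inhibitor is factored out.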
Future application of the glutamate racemase activity in high throughput drug screening assays.
The assays for measurement of H. pylori glutamate racemase activity described above have been carried out in 96-well plates in which multiple reactions were conducted simultaneously. Measurements of activity in a multi-well format are readily amenable to scale-up to permit rapid analysis of numerous compounds for inhibition of the glutamate racemase activity. Compounds which inhibit the activity of glutamate racemase may have application as novel antibiotics and may be suitable for the treatment and eradication of bacterial (e.g., H. pylori) infections in humans. Known inhibitors of glutamate racemase, such as L-serine-O-sulfate, can be used to calibrate high throughput screens of new compound libraries to facilitate identification of new compounds with properties suitable for in vivo human therapeutics.
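Scoring of a multi-well screen of the kind described above can be sketched as follows. The well labels, rates, and the 50% hit threshold are hypothetical values chosen for illustration; the source does not specify a scoring scheme.

```python
def percent_inhibition(rate_with_compound, rate_uninhibited, rate_background=0.0):
    """Percent inhibition of the initial rate relative to an uninhibited control."""
    span = rate_uninhibited - rate_background
    if span <= 0:
        raise ValueError("uninhibited rate must exceed the background rate")
    return 100.0 * (1.0 - (rate_with_compound - rate_background) / span)


def flag_hits(rates_by_well, rate_uninhibited, threshold_pct=50.0):
    """Return the wells whose inhibition meets or exceeds the threshold."""
    return sorted(
        well
        for well, rate in rates_by_well.items()
        if percent_inhibition(rate, rate_uninhibited) >= threshold_pct
    )


# Hypothetical initial rates (absorbance units per minute) for three wells.
rates = {"A1": 0.020, "A2": 0.004, "A3": 0.011}
hits = flag_hits(rates, rate_uninhibited=0.020)
print(hits)
```

A well containing a known inhibitor such as L-serine-O-sulfate can be scored by the same function to confirm that the assay responds as expected on each plate.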
EQUIVALENTS
Those skilled in the art will recognize, or be able to ascertain using no more than routine experimentation, many equivalents to the specific embodiments and methods described herein. Such equivalents are intended to be encompassed by the scope of the following claims.
SEQUENCE LISTING
GENERAL INFORMATION:
APPLICANT:
NAME: Astra Aktiebolag
STREET: S-151
CITY: Sodertalje
STATE:
COUNTRY: Sweden
POSTAL CODE (ZIP):
(ii) TITLE OF INVENTION: NUCLEIC ACID AND AMINO ACID SEQUENCES RELATING TO HELICOBACTER PYLORI AND VACCINE COMPOSITIONS THEREOF
(iii) NUMBER OF SEQUENCES: 1298
(iv) CORRESPONDENCE ADDRESS:
ADDRESSEE: LAHIVE COCKFIELD
STREET: 60 State Street, Suite 510
CITY: Boston
STATE: Massachusetts
COUNTRY: USA
ZIP: 02109-1875
COMPUTER READABLE FORM:
MEDIUM TYPE: CD/ROM ISO9660
COMPUTER:
OPERATING SYSTEM:
SOFTWARE:
(vi) CURRENT APPLICATION DATA:
APPLICATION NUMBER:
FILING DATE:
(vii) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/625,811
FILING DATE: 29-MAR-1996
(viii) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/758,731
FILING DATE: 02-APR-1996
(ix) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/736,905
FILING DATE: 25-OCT-1996
PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/738,859
FILING DATE: 28-OCT-1996
(xi) PRIOR APPLICATION DATA:
APPLICATION NUMBER: US 08/761,318
FILING DATE: 06-DEC-1996
(xii) ATTORNEY/AGENT INFORMATION:
NAME: Mandragouras, Amy E.
REGISTRATION NUMBER: 36,207
REFERENCE/DOCKET NUMBER: GTN-009C2PC
(xiii) TELECOMMUNICATION INFORMATION:
TELEPHONE: (617)227-7400
TELEFAX: (617)227-5941
INFORMATION FOR SEQ ID NO:1: SEQUENCE CHARACTERISTICS: LENGTH: 288 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...288 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:
GAGAGGATAA AAAGGAGAGC GGTGATGAAA AAAATCGTTG TGAGTTTATG CGTGGCGTTA 60
GGTTTTTTAA GCGCGGATCC AGCGCAGGCC AATAAAGCGA TCAGTGATGC GGATTTGATT 120
GAAGAGATAA GGGACTTGAA AAAAATCATC AGCGCACAAA ACACGGAGAT TAACCAATTA 180
AGAAAAGTGC AAGAAGTCTT ATCTGGGCAA TTAGGGGACA TGCGTAAGGA TATATTAAGC 240
ACTAGAGATT ACTGTATCAG CTTAAGGCCT TATATTTATA ATTGGCGC 288
INFORMATION FOR SEQ ID NO:2: SEQUENCE CHARACTERISTICS: LENGTH: 903 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...903 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:2:
AACGCTTGGA ATGATTTCGT TAAAAGCGGA TCTTTAGGGC CTTTTAGCAA TGGTTATTAT 60
GAGCATAAAA CTATTCGTTT AAACCCAGAA CAAAATTTAA TCGTCTTAAG CCACTACCTC 120
AAGCTTTTAG AAATCCAAAG GGAAGCGGCG AAAATGACCG CTATTTTTGG GGCCAAACAA 180
CCTCACCCAC AAAGCCTAAC GGTGGGGGGT GTTACGAGTG TCATGGATAT ATTGGATCCA 240
ACAAGGTTGG CTGAATGGAA GAGCAAGTTT GAAGTGGTGG CTAATTTTAT CAACCATGCT 300
TACTACCCTG
GGCTGTGGTT
CTTTTGAGTA
AGTTTGATTA
CAACTCCACC
AGCGTGGGGA
TCTTGGATAA
GTAGTGGTAG
AAAGACACGA
TGTATTGAAG
ATTTGGTGAT
TAAGGAATTT
GTGGGGTGGT
AAGAAGAGGT
CTTATGACGG
TTGAAAATAA
AATCGCCCAG
GTTTAGCGGC
AACTGCCTTT
CTAAGACTAT
GGCAGGCGAA
TATCGCTTAT
GCTTGATGGG
TACGCATTCT
GCAAACTA.AC
AATAATCCCT
ATACGATAGT
GAAAAACCCT
AGAGGCGTTG
CGCTGATAAT
ATGTTCGCTA
GAAGAAGTGT
GATATTTCTA
TGGTATCAAT
CCGCATTATA
GCTAAAGTGC
AAGCCCATGG
TATGTTACTG
TTTTCAACGC
GGTCTTTTGG
ACGAACCATC
TGCTTGGGA.A
AATTACACCC
ATGAAGACAC
CCGGTTTAA.A
TTGACACTA-A
AAGTAGGGCC
AAGTGGCTAC
TTGGGCGCAC
CGTTTGACGC
TGTTATTAAA
GGATAAATAC
CATTGATGAA
TAA.AGAAGTG
AGACCCGAG
GGATAAATAT
TTTA.AGTTCC
TAAATTTTTA
AGCTGCAAGG
GTTAGTGGAA
INFORMATION FOR SEQ ID NO:3: SEQUENCE CHARACTERISTICS: LENGTH: 507 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...507 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:
GGGATTAACA
AGGATTGAGG
CTTTTTAGGG
GCTCAAAGGA
GAAAACGCTT
ATGGCGTTGC
TGCGATATTT
TACAGCCCTT
GGAATGATTT
TGTCAAAAAA
TGATCGTAGA
GGCTAGAAAC
TTTGTGGGGT
TAGGAATCAC
TTTTTCACGA
TGAGCACTTT
ATCCCATTAA
CGTTAAAAGC
AATCGTAGTC
TGATGATAAT
GATCATTAAG
ATGCACTTAT
TCCCCCATTA
TCATGTGGTG
AAAAGCTGAT
TACCGGTGCG
GGATCTT
GATCCTATCA
GTGATCACCG
GGCAGAGACC
TCGCATTATA
AACGCGCAAT
CATTTCTATA
CCCATTCAAG
GGGCGAATTA
CTAGGATTGA
ATGCGTTTTC
CACGAGATGC
AAGCCGGTGT
TGGTGCGATC
CTTTGCATGG
CAGCAAAACT
AAAGCGGTTC
GGGGCATTTA
TTCTTCTACG
AGGCTTTATC
TACTGCGGTA
TTTGATGAAC
GCTTGATTGG
TTCTTTCAAA
TAAAACGCTT
120 180 240 300 360 420 480 507
INFORMATION FOR SEQ ID NO:4: SEQUENCE CHARACTERISTICS: LENGTH: 555 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...555 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4:
TGGCTATGGT
CTTTTTGGCG
TTTTTCCCTA
ATCACTTCGT
GGCTCAGTTT
GCGCATGAAG
AAAACTTTTA
TCAAGCGAAA
GCATGGGGAG
TCTTATAAGG
TACACCAAAG
ATGAAGGGAT
GCGTGGATAT
ATGAAAAGGT
ATGCCTTTGA
TGGAAATGCT
TCGTGGGGCT
TGTTAAACGC
TTAAAATGCA
GTTTT
AATGAGTCAA
TGGGGTGCAT
TGTGGATGGG
TTTGATTTTA
TTTTAAGGAC
GCACACTTTA
TGTGCCTTTT
TTTAGAAACA
ACGCACCGAT
AAAATCCTAA TTTTAGGCAT CTAGCCCACT ACCTCAAAAG GGGACTATGG CTCAGCAACT GATTGCGTGA GCGCTAAAGG GCTCCTAAAG AAATCACATG AGGCTCACGG AGTTTTTAGG GTGATAGGGA GCGAGACCAC GCCCTAAAAG CCATAGAAAC CATATTGCCC TAGATTGTAT
TGGCAATATC
GAATTTTTCT
CATTCCTTTA
CGTTGAAATT
GGCTGGGAGT
GGATTTGCCT
TTTCAAGCTT
CCAACTCAAC
CGCTGAACTC
120 180 240 300 360 420 480 540 555
INFORMATION FOR SEQ ID NO:5: SEQUENCE CHARACTERISTICS: LENGTH: 324 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5:
TATCAAATCT
TCTCTCTCGT
GAAGGAAGCT
TTAAATCAAG
GATGTGCAAA
GGAAATGGTG
TAAACCAAAG AAAAACCATG AAAAAAGTCC TCTTACTAAC TCTCTCTCTC TTTGGCTCCA CGCTGAAAGG AACGGATTTT ATTTAGGTTT AAATTTTGCA ATATTCAAGG ACAAGGTAGC ATCGGCGAAA AAGCTTCAGC AGAAAACGCC CGATCAATAA CGCACAAAAT TCATTATTCC CTAACACACA AGCCATAAGA ACGCCTTAAA TGCAGTGAAA GATTCAAACA AAATCGCTAA CCGATTCGCA GATCGGGCGG TATT INFORMATION FOR SEQ ID NO:6: SEQUENCE CHARACTERISTICS: LENGTH: 384 base pairs TYPE: nucleic acid WO 97/37044 PCT/US97/05223 136 STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...384 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:6:
CGCGTTAGTG
AATCAAGAAT
GTGCGTATTA
GCAGGGCCTA
AAAATCGCTG
TGCATCGCAT
GTAGAGCCTA
GAAATCTAAA GAGCGATCAA AGCACTTGTG CTCCTTATCA TATTGATAAA ATAAAGGGCG CTACATTGGT CAAGTGCCAA GGGGCATGTT AAGCCATTGG AAAACGGCGT GGTGGAAAAT TATCAAGCGG TAGTGCCTTC CACTTGGAAT GAGATTCTAA AAATCAAAGG GGGGCTTATG AAATGAGCTT GATTGGCACT ATTTAACCCA GCCTTTAGAA ATCATTAGGA CTATCCATTC TTTTGATCCA GCTCAGTGCA TGTGATGGAT TTTAAAGGGC AGTCTTTGAA CGAGTTCAAA ATTTCGCTAA ATTC INFORMATION FOR SEQ ID NO:7: SEQUENCE CHARACTERISTICS: LENGTH: 258 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...258 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:7:
TTATACCAGA
GAATTTTTGG
AGGTGCGTTA
GCTAAGGCCA
ATCATTAAAA
ACGACAAGGG TTTTAAAACA GAACTGCGCA TTTTGAGCGT GTTTATCGTG TGAATATTCT AGGGTTTATG TTGGCTCATA TGCTTCATTT CTGGTTTTTA AAGCTTTAGC GTGGCTCATG AAAACTTTTG ATAGGCGCCG TTATTTTGAC ATTTGGATTT TGTGTTTGGA GATTCTAAAA GCGAAGAAGA GAAAAAAAGG
AAGGGGTA
INFORMATION FOR SEQ ID NO:8: SEQUENCE CHARACTERISTICS: LENGTH: 201 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...201 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8:
TACAAAAAAG
TTGCTTTTTA
CTGTTGAGTT
AATCTCTATA
GAAATCAAAT AATGAATATT CAAACAAAGA AAAGATTTTT AGCAAATTTA GCCTGTTTTC TTGCCTTAAG GCTGAAACCC TTTCAGAAGA CCATCAAATC CAGACGCTTT CCATAGAGGG GATTTTGCCA CCGCTCAAAA AGGCTATATG GAGCAAACCA A 120 180 201 INFORMATION FOR SEQ ID NO:9: SEQUENCE CHARACTERISTICS: LENGTH: 768 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...768 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: TTGTGTTTGA AACTCTTAAT GCCACTTGTC TAGGAGCAGC CGCAAGGAAA ATTTGGAAGT TACAACTCAG GATACTATGA TCCAATAACG ACTACCCTAT GATTATTTGG AATTGTTTGA AGGGTGTATG AATTGAGCGA GGCGAACCTC TCAAAATCAC
TTGGAATTTT
TATTTATAGC
GGTTTTTAAA
AGAGACTAAA
TATTTTGCCT
AGGGCATGCG
GTTTTTGGAT
ATGGCATTCT
AAGGAGCATT
AACGCATCGC
AAAGACCAGA
AAAGTCGTTT
AGCGGTTCAT
GAATTTAACA
AAAAAATTGC
AATTGCCATG
CTTTGAAAGT CAATTTCTTT TTAACGCTAT CAAATTACTC CATGTTGTGG CCAGCCAAGC TATACAATAT CAAACTTTAT GCACAGGGAT GATGCGGCAT TGGTTAAAGA TTTTTGCTCT AAGTCAAATA CGAAGATAAG CCTTAAGGGT GGCTAAAGTG 120 180 240 300 360 420 480 WO 97/37044 PCT/US97/05223 138 A7TGACTCGO CGAAAAATCT CATCAGACAG CTTAAAAATG TGGAACTCAT TGAATTGGAA AAGAAGAAG IkATGCTGCGG GTTTGGGGGG ACTTTTTCAG TTAAAGAGCC TGA-AATTTCA OCGGTTATGG TCAAAGAAAA GATTAAAGAC ATAGAGAGCC GTCATGTGOA TGTGATCGTT CCAGCGOATO CTGGGTOCTT OATGAATATC AGCACCGCTA TGCAAAAAAT GGGCTCTTTG AC.AAACCCA TGCATTTTTA TOACTTTTTA GCCTCAAGGC TTGGACTT INFORMATION FOR SEQ ID NO:l0: SEQUENCE CHARACTERISTICS: LENGTH: 765 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .765 (xi) SEQUENCE DESCRIPTION: SEQ ID 540 600 660 720 768
ATGAAAAAGA
GGCGCGTTTT
ACCCCAAGCG
OCTCCAGCCC
GTAGATGCGA
ACTTATGGAT
ATCATGGATG
TATAACTTCT
TTAGGAGGGG
TGCAACAACA
GAATTTGGTT
CCTTTATTTA
TATAAA AGGA TTTTTTTAGG TATGGCATTA GCCTTTAGTG TAGGAGGGGG GTTTCAATAT TCTAATTTAG CTAACAATAA CACCCCGATA AACACTTCAA AAGAAACGCC AAGCGTGATC AACACCAATA TGGCAGGGTA TAAGTGGTTC TTTGGCAAAA ACTACAGCTA TAACCATGCG AATTTGAGCT GCGCATCTCA AGTGAATAAC TTCACTTATG ATGAAAGCAA AGAGGGCTAT AACACAGCAG ATTCGTTTAT CGTTCAAGGA GAGAGCTACT CCGCCGGCTG TTCAGCGAGC ATGAACACGA TTAGGAGCAA TTTCTCTAAA CACAGCGGGA CCAACCAATT CTATAAAGAA AGGGGCGTAG ATTTCTCTAT CTATTTTAAC TACATGATCA
TGTCCATGGC
AAAACCAAAA
TGTTTGGCAA
ATTACGGGCA
CCAAACGCTT
TTGTAGGCAG
GCGTGGGCTT
GGTTGTTCGT
TGAAATCTCA
GCTACTTCCA
TTGAAGTGGG
ATGGATCGGT
ACCTC
AGAAAAAAGC
CACCACCCGC
CAATCAAGCG
AATGTATGGG
TGGCTTTAGG
TAAGCTTGGA
TGATGCGCTC
GGGCTTTGGA
GATGCAAATT
AATGCCTGTG
CTTTAAATTG
AGATGTGTTC
120 180 240 300 360 420 480 540 600 660 720 765
INFORMATION FOR SEQ ID NO:11: SEQUENCE CHARACTERISTICS: LENGTH: 654 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...654 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:11:
GCATGTGAAA
GTTTTTATTT
GATTACCCCA
TTTAACGCTC
CAAATCGCTT
TTTTTTTCTT
ACGATTGAGc
TTGATTTTAG
GCTCTCAGCG
CAAGAAGATA
CTTATGGAGG
TTAGAGCATG
TACTAGGGTG
AAAACAGCCC
ATTTTGTGCC
ATTTTTTAAA
TTGAAGAGAG
CTA.ACACTGA
AATGTGTCCC
TGCCTATTGA
AATCCTATCA
GTTTAGAAAA
GGCGTTTGAT
TCAGTTTTTC
TTTAAAAACC
GCCCTTTTAC
GGATAAAAGC
TCCTAAGGAT
CCCAAAAGCC
ACAAACCGCT
TTACGCCAAT
TAACGCTTTA
GCGTTTGAAC
AGAATGTTAA
AACAAAACGA
GCTTCCACTT
CAAAAAGAAT
GCTCTCACTT
TTAAAAGCCA
GTCATGCGTT
TGCCCATTTG
CGTTTGGGCG
ATCAAAGCCC
GCTATAGAAA
GGGTTTTAAG
CTCTGCATTT
TAACCCCCCC
TTAAAAAAGC
TCAATATTTC
TTAAAGAAAG
TTTTAAACCT
ACACCCTTTT
ATAACCCCTC
TTAATAAGGC
ATGCGGCATG
CGTTGGTGTT
AAAATATAAG
TAAAATCTTT
GCTCGCCCAG
AGGCAATGTT
GCTTAAAAAG
TCAAGCGAGC
AATCCCCACC
TCTTTTTCCC
TTACTATTCT
GCTT
INFORMATION FOR SEQ ID NO:12: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 372 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1. 372 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: ACGATCCAAG AAAGGCTTTA ACTTTAAAAC TCATTAAAGA GTTTTGCAAG GGCTTTTGAA GACCA-ACAAA GCGTGGTGGT TTTCAAAACA AACTAA.ACCC GACGGGTGGT TTAATTTTGT GAGTCTATTA AA
CCGCCATGA.A
AGCCAAAAAG
TATTGTGCCG
TAGCGGTAA-A
CATGTTTGAT
CTCCACCAAC
ATCAGCCGCT
CGTTTGAATT
GATCTTATCA
ACCCCCTCTA
TATTCTAGGG
TTTTCTAATT
TGCAGGTAAA
ATAACGATGA
CCATTAATAG
AAGAAGCCTT
CGGAATTTTT
CCTTACTGAT
GACTGATGAA
TATACGAGAT
CATTGAAATA
TTACTTTTTG
CCCTTTAAGC
AAAAAACCCG
120 180 240 300 360 372
INFORMATION FOR SEQ ID NO:13: SEQUENCE CHARACTERISTICS: LENGTH: 1134 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1134 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:13:
ATCTCTATAG
AGTTTAGGGG
AATCGTAACG
ATTGATAAGG
GACAATGTGT
TTGAATAAGT
TATTTTTTAC
TATGGTTGCT
CTTGATTTGG
GCCCAATTTT
GCAGAATTAT
TTTGATCAAG
TTAGGCTTAG
AAAGAAGAGA
TGGCTTGCTA
GGGTATTCCT
GCCTTAGCGT
AAATTAGGGA
CAAAACGAAC
AGCAAACCA.A
ACATTAAA.AC
ATGTTTCTAT
CGATTGAACT
TAGGGACTTT
TTTATAACCA
AAAACCGCAA
CAGAGCAATT
CTAAAACGAC
ACATAGGGGT
TCCCCTTTGA
CTTCCAAACA
AGGCGATTTA
TGTTGCCCAT
AAACTAAAGA
TAATAGATTA
TAGATTCTAA
ATTGTTTGGA
CCGAATTGAA
TAAAGTGGTG
CGCTATGCAT
CAATA.AGATT
ATTACACAAG
GTATTTGACT
AGTGCATGAT
AAAAGAGGGT
GTGCCAAAAA
TTTCGCTCGT
ATTAATCTTG
CAGGCGTTTG
GGCTTCTTTA
TCATTATGAA
CATTCAAAAA
TAAAGAAGAC
TGACATGGAT
CTCAGTGCTT
GGCTAAAAAA
AGAACACAAT
TATGCTAAAG
TTAGCCATGC
TTAGTGGATG
ATTCGTAAGG
CAAAAGCGTT
*GAAGACAGCC
TTGGTTCTGT
GCGCTCAACA
TTGTATGAAA
TTAAAAGAGT
TTGCTAGACT
ATCTATCAAG
AGCTTGAGCG
TTAGAGCAAG
GCGCAAGACG
GTTAAA.AGGG
TATTTGGATT
ATCTTTTCTA
AAAATCATTC
AGGCGGCCAT
TTTATCAAAA
GCTATGCACA
AAGAAAAAAC
TGGATAAGGC
TAGAAAAACT
TGCAATCTCA
CCTTCACGCA
AAAACCCTAT
TTGATAAGGC
TATACACCGC
AAkAGAAAAGA
CGAATAAGAA
CCACCAAAGA
CTTTCTTTTA
GCATGGATCT
CTTTAGCATG
GCATCGCTAA
AAGAATGCAA
TTCAGCGGCG
AATCACCAAT
AATGGGGCAG
CATAGCCACA
TTTCCCGTTG
CATTACGATC
TATAGACAGG
ATTTAACGAG
TGTTCAAPAC
CCAGCAAATC
GCAAAAAAAA
CCCTAAATTC
AAAGCTCACC
GCGCCAAGCA
TAATTTTTTA
TGTGAGGAAA
GGGTTATTAC
AGAGCTTATC
GAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1134
INFORMATION FOR SEQ ID NO:14: SEQUENCE CHARACTERISTICS: LENGTH: 468 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...468 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:14:
TTTTACACTG
CAAC-CGCTCT
TCTAAAAAAG
AGGATCTTGC
AAAGAAAAAG
ACCTTACAAA
TTAAGACAAA
GATTCTAAAT
GCAAAAAGGG
TTGGCGAAGA
CCGAATTGAA
AAACCGAAAA
AAATAGACGA
CAGAAGAAAA
TCAAGCAGGC
CGGCTCTGAT
GGCTTGCATG
AACCGCGCAA
AGAGGATTTG
CGCCCGCCTT
AAAACTGAA.A
AAAACGCCTT
TAAAGACAGC
TTTAGAAAAT
CGTAAGATCT
GAATTGTTGC
CGCCAATTGA
TTAGATGAAA
AATTTAGCCG
AAAAATTTGA
AAGATTGGCG
TTACCCACTC
TGTTAATGGG
AATGCTCTGC
GTGAAAAAGA
AAAGCGATCT
CTAAAGAAGA
TAGAAGAAAA
AGACTTATTC
AAAAC OCT
CTTGATTTTA
GATTTTTGAA
GCAGTCTTTG
GTTAAACAAG
ACCCTTTAAA
TGAAGGCATT
TAAAATGAAA
120 180 240 300 360 420 468 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 423 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genornic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .423 (xi) SEQUENCE DESCRIPTION: SEQ ID AAAAGGCTTT GTGATAGGCT TTGGAGCGGG GGATATTACC GTAATGGGCG TAGCGGTTGT TTTATTTTTA ACGCTAATTT GATTTTGGTT TAGCTAGCCC CAAACAAAAG ATTTTAGCTT GGAGCGAGCA TCAGCGTTTA TACTTATAAA CAAAACCAAC TTGCAAAGAG CGTTTTTAAG GGGGGAAACC TTGTTGTGTA CAAACC'rTTA ATTTAGTGAG CGGGACTTTG AGCTTTTTAG AAAGATGTTC TTGTGGATTT GGATTCTTGT CAGACGCTTC
CCT
INFORMA'TION FOR SEQ ID NO:16: SEQUENCE CHARACTERISTICS: LENGTH: 303 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic)
TATCAGTTGA
TATTGTTTTT
TTTTAATCGT
AAAACCAACA
AAGGCATCAA
GCAAAAAACA
AAAAAGATCC
GAGGCGAAAT
AGTTTTAAGG
AGGGATTATA
AGAAATCGCT
AGTCAATAAC
AACCCCTATG
CTTAATCCAA
(iii) HYPOTHETICAL: NO PCT/US97/05223 WO 97/37044 (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...303 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: GCAAAAAATA CTATTTTTAA TAACGCTAAT TTTAACAACA AATTCTAGCG CGACCACTTC GTTTGTGGGG GATTTCACTA ATCGCTGGGA ACGCTGTTTT TGGGAACTCT ACTAATGGCT AATAATACCG GCTCTGTTAA TATTGCAGGG AATGCAACCT AGCCCTACGA ACACAGGCGT GAAAGGGAAA GTTACTCTCA
TGA
GCACTTCTTT TAATTTCAAT ACGCTAATTC AAATTTGCAA CTCAAAATAC CGCTAATTTT TTGATAACGT GGTATTTAAC ATAACATCAC TTTAAAAACT INFORMATION FOR SEQ ID NO:17: SEQUENCE CHARACTERISTICS: LENGTH: 369 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...369 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:17:
TGGGCGATGA
GAGATAAAGG
CATCAAATTG
GCGATTCAAA
AATCAAGAGT
CAATTGCAAG
TTAGGGAGT
ATGTTTGGGT GTATAGGCCT TTGTTGGCTT TTATGGATAA ACAGCTTGGC TAAAATCAAA ACGGATAACA CCCAAAGCGT AGACTCTTCT TAAAGAAGCC GCTGAAAAGC GCAGGGAAAT AAGCTACAGA GTCTTATGAC GCTGTGATCA AGCAAAAGGA TTGAAGCGTT CGCAAAGCAA TTACAAAATG AAAAACAAAT CGCAAATGAC GGTATTTGAA GACGAGTTAA ACAAGCGTGT
CAGACAGGCA
GGAGATTGGC
GCTAGCAGAA
GAACGAACTC
CTTAAAAGAG
GGCTATGGGT
120 180 240 300 360 369 INFORMATION FOR SEQ ID NO:18: SEQUENCE CHARACTERISTICS: LENGTH: 309 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular WO 97/37044 PCTfUS97/05223 143 (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...309 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: AAGGGGGGAT TTATATCGGC AAAGAGTTGT TTAAGCATGG GATGAAATCA AAAACAACAA AGCGCGAGAT TTTTTTTATA TTTTTCCTCT TACTGCTTGG GTTTGGGTAT TATTTAGGGA TCTTTAGAGG TCTATTTGGA TTTAAGAGAC AAGCATGAAC GAATTGCAAA GCAAGAATGT GCGCTTGCAA AAGCGTTTGT
CCTAGAGAT
CTAGTGGCCT
GCCATAGCTC
AGTTGCTTTT
GATTGCAACA
TTGAGTTGAG
TTTTGAAAAC
CCTTATTGTC
TGGGGGTTCT
AGAAATCACC
GGAATTACGG
120 180 240 300 309
INFORMATION FOR SEQ ID NO:19: (i) SEQUENCE CHARACTERISTICS: LENGTH: 996 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...996 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:19:
TTATACTATA
CTTAGTTTTG
TTTTTAATGG
ATGGCTAATC
AGCTGTAATT
GATGGCGTGG
AGGAATTATC
AAAGACTTTC
AGCTGCGGTA
AAAGCCCTTT
GTAGGGTATA
AATTTTAAAA
ATAGCCAATT
TAAGGAATGA
CAAGCTGTTT
AAGCCCTTAA
TGAGGATGGG
ATCAGAATAT
TCGCTTGCGC
CAAAGGCTAT
GTTTGGGTTT
CTTTTTCTAA
TGTATAAGAG
GAGGGTGCCA
AAATTGGGGT
GATGATAAAG
TGGTCATTTG
GAGAGGGGAT
GGTTGGTTGC
TTCAAAAGCC
GAGTTTAGGC
CTATTATTAT
TATGTATTTC
ATACGCTTGC
CGCCAAAGGC
TTTAAAAGAC
TTTACTGATT
AGTTGGACTA
GTGGCTACAA
TACCATAGAG
ACGAGTTTAG
GTTTTTTATT
TCTATGTATG
AGGAGAGGGT
AATGGTACGG
AGTTTGAATT
GTGGAGAAGG
GGGGCGAGCT
TGTCTTTGTG
AAAAGTGGTT
CCGGTGAGAA
CGGTGGCTTT
GCTCTATGTA
ACAGAAGAGG
AAGATGGCGA
GCCACTTAAA
GCGTTAAACA
ATGGCATTAG
ATTTGAAAAA
GTGTGAGCTT
CGAGCTTTGG
TTTGATTTTA
ATATTTTAAG
TTATAAGAGG
TGAATATGGC
GTGTAATTTG
TGGCGTGCAA
AGGTGGGGTG
AAATTACGCC
TTGTAACTTT
AGCCCTTGCG
GGGATACTTG
120 180 240 300 360 420 480 540 600 660 720 WO 97/37044 PCT/US97/05223 144 TATGAAGCCG GTATGGATGT CAAACAAAAT GAAGAGCAGG CTTTGAATCT TTATAAAAAG 780 GGTTGCTCTT TAAAAGAAGG GAGCGGTTGT CATAATGTGG CGGTGATGTA TTACACGGGT 840 AAGGGAGCTC CAAAAGATTT GGAGAAAGCC ACTTCATATT ATAAGAAAGG GTGCGCTTTA 900 GGCTTTAGTG GTAGTTGTAA GGTGTTAGAA GTGATTGGCA AGGAGTCTGA TAATTTGCAA 960 GATGACGCGC AAAACGACAC GCAAGATAGC GTGCAA 996 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 282 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...282 (xi) SEQUENCE DESCRIPTION: SEQ ID ATTAAGGTGA GGAGCGTCAA AAAGACTGAG CGAGGCTTAT TGATTGAGAA ATTCACTTCT CAAGGCGAAT TAGTGCCTTT AGAAATTGTG GTAGAAACGA TCCTTTCAGA GATTAAAAGC 120 TCTAGTAAAG GGATCATTTT AATTGATGGC TATCCCAGGA GCGTGGAGCA AATGCAAGCT 180 TTGGATAAGG AATTGAACGC TCCAAATGAA GTGATTTTAA AAAGCGTGAT TGAAGTAGAA 240 GTGAGCGAAA ACACCGCTAA AGAAAGGGTT TTAGGGCAGT TT 282 INFORMATION FOR SEQ ID NO:21: SEQUENCE CHARACTERISTICS: LENGTH: 339 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...339 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: WO 97/37044 PCT/US97/05223 145 CACTTTGTTT TTAGGGGCGA TTTTTGTTGG GCTTATCGCT ATTACTCCAT AGATAAGGAG AGAGAGACTC AGAGGCATGA CAGCTCTTAC ATTGTGATAG ACGAATTAGT GGGCATGTGC 120 CTTGCGATGG CGATTAACGG ATTATCGTTA GTGGGTGTGA TCTTGAGTTT TATCTTTTTA 180 GGATGCTATG ATATTACTAA ACCCTCACTC ATTGGCAAAA TAGACAAAGA AGTTAAAGGG 240 GGCTTAGGGG TTGTGGCTGA TGACGCTTTA GCCGGGGTTT TAGCCGGATT GAGCACGTTA 300 TTAGCCATCA ATATTTTAGG ATTTTTTAAC ATTAAGTTT 339 INFORMATION FOR SEQ ID NO:22: SEQUENCE CHARACTERISTICS: LENGTH: 237 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular 
(ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: ATGTGGCCTG TGGCTCTCAA GCAGCCTAAT AGGGTGTCCC ATCATTTTTA TATCATGGCC ATGCTTTTTA TTTTATTTGA TGTAGAAATC GTTTTCATGT TCCCTTGGGC GATTGATTTT 120 AAAAAGTTAG GCTTGTTTGG GCTCGTTGAA ATGCTAGGCT TTGTCTTCTT TTTGGCAATC 180 GGTTTTATTT ACGCTTTAAA GCGAAACGCT TTGAGCTGGC AAAAATTAGA GGTGAAA 237 INFORMATION FOR SEQ ID NO:23: SEQUENCE CHARACTERISTICS: LENGTH: 180 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...180 WO 97/37044 PCT/US97/05223 146 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: TTTGATTTGG GCGTGCGCAC CAACTTTGCA AAAACCAATT TCAATAAGCA CAGATTAGAC CAAGGGATAG AATTTGGGGT GAAAATCCCT GTTATCGCTC ATAAATATTT TGCAACCCAA GGCTCAAGCG CGAGCTATAT GAGGAATTTT AGCTTCATAT GTGGGCTATT CAGTCGGTTT INFORMATION FOR SEQ ID NO:24: SEQUENCE CHARACTERISTICS: LENGTH: 444 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...444 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 120 180
TTTGGTTCAA
GGCGCAGTTG
TTAGGCGCTA
AACCCTAGGG
ACCTTTTATT
GATTTTAAAA
CTGAATTTAG
GAGCGTAAGG
GAGCTTTTAG AAGAATTTTT ACAAAGTGGG TTTTAATGCG TTTATTGATC GCGCTAGTTT AAGAAGCGGA TTTCATCTCT GATTGGGAAT GTGTTGCGTG CGCCAAATGC CATGGCATTA ATGAAAAAGG CGAAAAAAAA ATCCTTTACG CCTTTAAAGA CGCCTTGAGT TTGGGTTTTT AAGAAATCCA AGCGATTTAC CTTTACATCA ATCCTTCCAA GCCT
GCGAAAGAGA
TGTTCTTATG
ACGGACTGGC
AAGGCGAACA
CCCCTAAAAT
GTATGATGCC
CCTCTTTAGG
TTTTAGAAAA
GTGGTTAAAT
CCTTTATAAA
ACAAGAAATC
CAACCATTTG
TACATACAAC
GCATAAAGAC
120 180 240 300 360 420 444 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 516 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature WO 97/37044 PCT/US97/05223 147 LOCATION 1...516 (xi) SEQUENCE DESCRIPTION: SEQ ID AAAAATAGGC ATAAACTCTT ATACCTTCTA CTTAAAAACC TTAATTTTTT AAAAACCATT TCCACAATTT TTACACAAAA GAGGGTTATC ATTCGTTCGC AACAAGGATT TTCTTGTTAT 120 CTTAATTTAA AGGTCAAAAC GATGAAAAAG TTAGCCGCTT TATTTTTAGT AAGCGCGTTG 180 GGGGTTATGA GTTTGAACGC ATGGGAGCAA ACCCTAAAAG CTAATGATTT GGAAGTGAAA 240 ATCAAATCCG TGGGCAACCC CATTAAAGGC GATAACACTT TTGTGCTTAG CCCCACTTTA 300 AAAGGTAAGG CTTTAGAAAA AGCTATCGTT AGGGTGCAGT TTATGATGCC TGAAATGCCC 360 GGCATGCCAG CGATGAAAGA AATGGCGCAA GTGAGTGAAA AAAACGGCCT TTATGAAGCT 420 AAAACCAATC TTTCCATGAA CGGGACATGG CAGGTTAGGG TGGATATTAA ATCCAAAGAA 480 GGCCAAGTTT ATCGCACTAA AACAAGCCTG GATTTA 516 INFORMATION FOR SEQ ID NO:26: SEQUENCE CHARACTERISTICS: LENGTH: 222 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: TGCTTATTTG TCCTTACAAA ATCTCAAAGG ATTAGAATGA AACGGCTTTT ATTGTTAGCC TTGGCCCTCT TTTTTAGCCT CTCATGCACT AACGCTCAAG AAATTAAAGA AACTCAAGAG 120 ACTAAAAAA CTAAAGAAAC TAAAAGCCAA ACCCGTTTTA ACATTTCCAC CACTAAGGTC 180 ATAGAAAAAG AATTTTCTCA AAGCCGGCGC TATTACGCGC TT 222 INFORMATION FOR SEQ ID NO:27: SEQUENCE CHARACTERISTICS: LENGTH: 879 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: WO 97/37044 WO 9737044PCTJUS97/05223 148 ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .879 (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO:27:
ATAA.AGTTTG
GGGCTTTTTA
TCTGATAGCC
AAGGTGGGGG
CAAGGCTATG
AAGATTGAGT
GTGGATATTA
GCTAATCCGT
ATAGAAGAGC
TTCACTAAAA
TTAGCCCTTT
TGGGTGAAAC
ATCGCTCCAG
GATTCCCTCA
TATGGCGATG
CTAAAAAGAT
AAATGTGGGG
ATAAAGAAAA
TTTTTAGCGA
ATGTAGTCAT
TTATCCCTGT
TCATGGCTAA
ATATGAAAGT
TGAAAGATAA
ATTACCCCAA
TAPLACAATAA
AACACCCTGA
CGATTAAAAA
TTTCTAGCGA
GAATCAAACC
AGTTTTAATT
GCTATTTTTA
GAAGGACGCT
TAAGCCTCCT
CGCTAAACGC
AGAAGCCTCA
TTTCACGCGC
CGCTTTGGGG
AGAGTTGATC
TATCAAACTT
GGCCATCGCA
ATTTAAATTA
AGGCAACCCT
CTTTTTAAAA
GGAAGAAATT
TTAAAAAAAG
GTTTTAATCG
TTAGAAGTCA
TTTGGCTCTG
ATGGCCCTTG
GCTAGGGTGG
ACTAAAGAAA
GTGATTTCTA
GTGAATAAAG
TTGAAATTTG
TTAGCCCATG
GGCATTACAA
AAGCTTTTAG
GAAGCTTATC
ATTTTTGAA
GTGGTTTTAT
CTTTAGTCTT
TTAAACAAAG
TGGATTCTAA
ATTTATTGGG
AATTTTTAAA
GAGAAAAAGT
AAGATGGGGT
GCACGACAGC
AACAAAACAC
ATAACACTTT
GCCTTGGCGA
AGTGGTTGAA
AAGAGACTTT
GAAAACAAAC
TAACGCATGC
GGGGGTTTTA
AGGGAACTAT
CGATGAAAAT
AGCCAATAAA
CGTGGATTTC
CATTAAAAAT
GGATTTTTAT
AGAGACTTTT
GTTGCTCGCT
TAAGGATGTG
TAACGAGATT
AGAGCCTATT
120 180 240 300 360 420 480 540 600 660 720 780 840 879
INFORMATION FOR SEQ ID NO:28: SEQUENCE CHARACTERISTICS: LENGTH: 2028 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2028 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:28:
GCTTTTGAAG
AGTTGTTTGG
ACGCCTTTAA
CAAATGTTTG
TTGAACAAAG
TTTAAAAACA
TTTAAAGAGC
GGGCATCGCT
TTTTTAACTA
TATCTTAACG
AATTAGAGCC
GTTTAGGGAC
ATCAAGGAGC
AAGGCTTTTG
AGCAGCAAGA
GCCCCTTGAA
AAAAGGATTT
TGAAAGCCTC
AGCCCATTGA
AGCAAGAAAA
TTTGAGTTTT
AAAATTTAGC
GATTAAAGTG
CACTTATAAT
CGCTCTTTTG
ACGCCCCTGG
GAGCGATTAC
AAGTTTGAGC
AGAAGTCTAT
AAAGATCGCT
TCCTTCAATT
TTAGATATTA
ATTTTTGGGT
GGCATTGACA
TATGGGAATG
AAAGGCATTA
ATGAGCGAAA
GTCCAAGTCG
CATTTTTTTA
GAACCCATTT
CGCCTAAAGG
GTAAGATTTT
ATAACCGCAG
GCGCGCTTTG
GCACTGAAAT
TCCAAATCGC
AAACCTGTTC
CTGGCTTGAA
ATGATCCCAC
TAAAAGAGAT
GGCGTGCGAA
AGATCCTAAC
TTATTACGCT
TTTCAACGAA
CAGCTTTCAT
TTATGACATG
TTCGTGTAAT
AATGGCGGAT
GCATTTTAAC
TTTAGAAAGG
GTGTTTTTTT
ATTAGCGAG
GGGGTTT.TGT
C77CATCAACA
GATAA-AGAGA
CATGGGGGTG
ACCGCCTTGT
GAAAAGCATT
CAAATCCCCT
CTGATTTTAC
CAAAGCTTGA
GATCAAGCCC
GATGAAATCA
AGCCGTTTCA
ATTAAAATAG
GCTAAATACA
TTGAACATGA
AAGTTAAAAA
ACTTTAAGTG
ACAGGCAAAA
AATCACCTTT
GAGCATAATT
GGGGATAAGG
GAAAAAACCC
TATACGATGT
GGGAGAGCCA
ATGTTTTAGA
CCCTTAGGAA
CGATTA-AGCA
AAGTGGTTTT
ATCTCAACGG
TTTTAGAAAT
TAAAACAATT
AAACCCTTTT
ATGGGGTGGA
CCATAGGCXA
GGATTTTATT
GCTTTAATGT
AAATGCACTT
ACCCCCAAAC
GCGTGGAAGA
CGCTTATAGA
GGGGGGAGGC
CCCTTTATAT
TACAAGTCTT
TAGACATCAT
GCGGGAAAGT
AAAGCTATAC
GGGGCTAGGG
AAGGATACGA
CGAGCCTAGC
TTTACAAAAA
TGCGGATTTT
TAGCGGGAGC
CACTAAAAAG
TAAAAATGTC
GGTGTGCATT
ACCCACCGCT
GATTGTAGGG
AACCCCACGA
TGCCGAGCAA
TAAAGGAGGG
TTTGCCTGAT
TTTAGAAATC
AGCTTATGAA
TGTGGGCTTA
TCAAAGGATC
TTTAGATGAG
GCATTCTTTG
CAAAAACGCT
CATTGCGAGC
GGGAAA.ATTT
TATTTGACTT
ATCGCCAGTC
ATTGGCTTGC
AAGGGGAACA
GTTGTGGATA
GTAAAAGATT
ATTGAGCGCC
AATATCAATA
ACGGGGGTGA
CAAACCCTTT
TTGGAGTATT
AGCAACCCCG
AAA.GAAGCTA
CGGTGTGAGA
GTGTTAGTCC
AAGGTGAAAG
TTTTTTGCTA
GGCTATATCA
AAATTGGCTA
CCTACCACCG
GTGGCGTTAG
GACTACATTA
GGCACGCCTT
TTA.GCTTTGG
TGGGGAGGGA
AAATCGGGAG
ATGAAAAAGA
CGCTCATTGT
TTGGGCCAAA
TAT TOCAAAA
CCAAATTTGA
ACATTA.AGAA
GCGGGAGCGG
TAAACCATGC
TGGATAAAGT
CCACTTACAC
AAATTTTAGG
AATGCCAAGG
AATGCGATAG
GCAAATCCAT
AATTCCCTAA
CTTTAGGGCA
AAGAATTGAG
GTTTGCATTT
GCAATTCCAT
TAGACATGGG
TAGAAGTGGC
AATTGAAA
TGCGCGALACG
CGGTTTGACA
CACGCTCAAA
CGTAGAGCAT
GGCTGGAAGG
TAACCATTCT
ACCCCCTAAA
TTTGAGCGTT
TAAAAGCTCG
TAAAAAAAAT
GATTTATTTA
GGGAGTGATG
CTATAGCACG
CGATGGGGAT
CTGTAAGGGC
TGCTGATGTG
AATCGCTGTG
AAACGCTACG
TAAAAAAGAC
TGAAGATGTG
GCTAGTGATT
GCCTGATGGG
GCAAAATTGC
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2028 INFORMATION FOR SEQ ID NO:29: SEQUENCE CHARACTERISTICS: LENGTH: 414 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...414 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:29:
ATTATTATTG
TTGGATAATT
ATTCGCATTT
GTGGGGATCT
CTTGCCGCAA
AGGGATCAAT
ATATTTACTT
ACTATTCAAG
TCATCTCACG
ATGGCTATGA
CGCGCACTTG
GCACCGGTAA
CTTGCATGGC
GCCTGATTAT
CCAGTATTTT
ATACGCGCAA
AGTGGGTGGG
GCCCACCACT
TGTTTTTATC
TTTAGCCGCT
GGGACAGCTG
AGCGGGAGGG
CGCTTGATCG
ACTTTTAGAT
AGGGGGTATT
ATTAAATTGG
TTGTTACCGG
AAAAACCCAA
CTGCGGCGTT
TAACCAATTT
ACAAGGGCGT
TAATGGCGGA
ATTACACCAT
TTTGGATTAT
AACCCCTACC
CTATCAAGCT
GAGTCAAGCG
GAGTTTGAAT
CAGCTATGAG
TTCCAAAAAC
TGCGGGTTTG
GATT
120 180 240 300 360 414 INFORMATION FOR SEQ ID WO 97/37044 PCT/US97/05223 150 SEQUENCE CHARACTERISTICS: LENGTH: 519 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...519 (xi) SEQUENCE DESCRIPTION: SEQ ID CAAGGAATGG GCTTGAAAAA TCTCTCAACA TGTGTGAGCA ATTTTAATGA AGACACTTAC CAAGCCAGTA GGAAAGGCGA AATCACCCAA ACGCATTTAA ACGATGTGGA TAGCGGCACT ATTTTCACGC AAAATAACGA CTGGATAGAT ACAAAACCTA CAGGATCAGA ACCTTTATGG GGCATTTTAG AAACCACGAA CCGGTGGAGC GATTATTTAG CGGTGCAAGA AGCCAAACTA GTTTTTAATT TCGCTTACCA AGTCCCCTTA
CTTCTGGTGT
ACGCTAGACT
GATAATGTGC
TACTACGATC
GATGGCTATA
GTGCGAGAGA
CGAGCCTTTT
GAGCTTGATG
CCTCAATTT
TTTTATTCTT TTGTTTGGGG TAGTTTTAGA AAAAAAGATC CTATCATCAC GGCTATCGCT ATGAGTATTT TTTAGTGGAG TTTCTTATGA ACTTTTTGGC TTACAAGAGA TGAATTTGAT TGATCGCTTT TGACAAATTG CCTATAGTTT AGGCAAAATT 120 180 240 300 360 420 480 519 INFORMATION FOR SEQ ID NO:31: SEQUENCE CHARACTERISTICS: LENGTH: 327 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...327 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: CAAAAAGAAG TATCTAACTA TGAGGAATCC ATTTTAGTCA ACACTCCTTA TGCAAGGGTT AATATTCTCG CGATTGAAAA AAATGAAAGT CCTATTGAAT TACTGGCTCC GGTAGATTTA 120 WO 97/37044 PCT/US97/05223 151 GTTACCGCTT TGAGCGATTT GATGCTAGGA GGTGAGGGGG CGAGCAAGGA AGAAATGGAT 180 AATGACGATT TAGACGCTTT TAAAGAAATG GCTTCTAATA TTTTTGGTGC GATCGCTACA 240 AGCTTGAAGT CTCAAGAATT GCTCCCTAAA CTCAATTTCA CCACCACGAA CGCTGAAATC 300 GCTAAAGAGC TTCCTAAAAA AGAAGAT 327 INFORMATION FOR SEQ ID NO:32: SEQUENCE CHARACTERISTICS: LENGTH: 546 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...546 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: ATTTGTTTAT TAAAGGAAAA AATCATGTCA AATAGCATGT TGGATAAAAA TAAAGCGATT CTTACAGGGG GTGGGGCTTT ATTGTTAGGG CTAATCGTGC TTTTTTATTT GGCTTATCGC 120 CCTAAGGCTG AAGTGTTGCA AGGGTTTTTA GAGGCTAGGG AATACAGCGT GAGCTCTAAA 180 GTCCCTGGCC GCATTGAAAA GGTGTTTGTT AAAAAAGGCG ATCGCATTAA AAAGGGCGAT 240 TTAGTTTTTA GCATTTCTAG CCCTGAATTA GAAGCCAAGC TCGCTCAAGC TGAAGCCGGG 300 CATAAAGCCG CTAAAGCCGT TAGCGATGAA GTGAAAAGAG GCTCAAGAGA TGAAACGATC 360 AATTCTGCGA GGGACGTTTG GCAAGCGGCA AAATCCCAAG CGAATTTGGC TAAAGAGACT 420 TATAAGCGCG TTCAAGATTT GTATGACAAT GGCGTGGCGA GTTTGCAAAA GCGCGATGAA 480 GCCTATGCGG CTATGAAAGC ACCAAATACA ACGAGAGCGC GGCTTACCAA AAGTATAAAA 540 TGGCTT 546 INFORMATION FOR SEQ 
ID NO:33: SEQUENCE CHARACTERISTICS: LENGTH: 444 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...444 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:33:
ATGGAATTTT
TCGTGGATGG
CATAAAAAAG
GCTTCGCCGG
GAAATGTTTA
ATCTATCATT
A.ACGCAAGGT
ATCTTAGTGG
TGAGCGGGTA
CAGCGTTATT
AGTTTGTAGG
CGATGGGTTT
AAAGCGGGGG
TTTATTGCAA
TTTATCGTGT
TTGTCAAGCC
TTTTTTATGG
TTATTTGCCA
CGTGGTTCAA
CACGCTTATT
TTGGTTGCAT
AAAATGCATG
GTTTAATGAA
TTTT
GTTAAAGCTT
CGCCTTTTTG
ATCCA.AGAGA
ACAGGGATTT
GCTAAATTGG
CGCGAGCTGG
ATACCCACGA
TCCATGTGAT
TCTATCATGC
AAAAGCTCTA
TGATGTTGTT
CTTTAGTGGT
AAAAAGACCC
TTTTAATGAT
AGCGGTCATT
AGAGAACGCG
TTCCTTTATC
GATCGCCCCT
ATTGCTTTTA
TACAGGGAAA~
CCTTATTGTG
120 180 240 300 360 420 444 INFORMATION FOR SEQ ID NO:34: SEQUENCE CHARACTERISTICS: LENGTH: 873 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...873 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:34:
GGATTAACCA
CTAGACATTG
CCTAAAAGAT
ATTTTATCTA
AGGGAdAGTT
GAAGATTTTA
AAAAAAGGGG
TCTTTTTACA
TTGTATTCCA
CAAAAAGAAG
ATTAAGGGGG
AAAGAGCATG
GATAGCGAAG
ACCCACCATG
AAACCCCAAA
TTAGACAGGC
ATTGCGTTAT
TCGCCACTTA
TCATGCGCAA
TCCGCTACCA
GATCCCTTTT
CTAAAATCCA
AAAAGAAAGA
ATAGAGGCTC
CCCATTTTTT
TTAAAGGGAT
AACTATGGGA
CCAAAGAAAA
GCGTGATCA-A
GACCTGATTT
GTTTAACAAA
CCCTAATATT
TAGCTTGCAA
GACCCGTTGC
GCTGTGCGAT
AAAAGCTCTC
GGGCGAACAT
CCCCAACAAT
TCATTTCAAT
TGCTTATTAC
TGGCGGCTTC
GCAGATCATT
GGCTCTATTA
ACTATGGGAG
TAAAAGAATT
ACA.AGATCCA
GTTAGGCGTT
GAAGTGGGCG
TCTAA.AAC!GC
CATTACAAAC
AAAATCGCTT
TGTTTTGAAT
TATGTGATAG
CTCAAAACGA
CAATGCGTTG
AACTATAAAG
CAAGCGTTCA
AACATGCGTT
CCTGAGTTTA
TCT
CGCATTCAAG
TGCTCTCTAA
TTATTTTCCT
TTTTTTTTAT
AAAAACGCCA
TAGTGGAAAA
ATGAAGCCGA
CGAGCATGGA
ACGCTTTTTT
TGGGGGATAA
ATTTTTTAAA
AAATTAAAGA
TAGTCAATAT
AAAAAGCTTT
AACCCTTTTA
TAAAACGCTC
GACCACTCAG
CACTAGGGGC
CCAATTTGAT
ATACCCCCTA
TGATATTATC
TAAGGATATT
TAATGTGAGC
AGGGGATAAT
CGAAGACGCT
AGATTTGAGC
GCACCAGATG
TTTCCCTAA.A
120 180 240 300 360 420 480 540 600 660 720 780 840 873 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 579 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...579 (xi) SEQUENCE DESCRIPTION: SEQ ID
GGCGTTTATC
ACTTTAACCA
GTGGCTGGAA
GTCCAGCACG
GCGATTTTTA
CAACATATCG
GAAAGGAAAG
AGCGTGGCGT
GATAACGCCG
ACTTCAGCGA
ATAATAAGGA
CAACAAGCAT
GCTTAGTTAA
ATGTGGATTT
TTGAAACGAG
CGCTCAAAGA
AATTTTTAGT
TTGTGTGCCA
ATGAAAAAGC
AATTCTTAAA
AATCATTGTC GCTTATAGGA AGATTGGCAA GATTTTAGCG TTTGGCGTTC AGAAGGTGCT AGATTTAAAA ATCAATGATT TGTTAGTGGC GAGCGCGTTT GATCACCCTT TAGGGTTTAT TGGAAGTTTA AACGCTTTAG CTAAAAAGAT AGGCGTCATC GCATCAGGCG ATCAGTTTGT TAGCGAGTTT AAAGCGAGCG CGGTGGAAAT AAAATTTGGC GTGCCATGCT GCGTGCTAAG CGGTATGAGT TTTGATGAAT TTTTAGAAAA AAGCATGGTG GATGAGCTT
GGTGCATTCC
TTTTAGCGGG
TACTCAATTA
CCCCGAAAGC
CGCTAATGAG
GCATAGCAAA
GGAGGGGGCG
GAGCATTAGC
AAGCGCTCAC
INFORMATION FOR SEQ ID NO:36: SEQUENCE CHARACTERISTICS: LENGTH: 240 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...240 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: AAACTCGCCC TTTTCAAACA ACGCCAACTC AAACAAAATA AAAACGCTCA AGCTAATCCT AAAAGCCCTT TCATCACCCA TGTGGTCTTG CCTAAAGAAA CCTTATCTTC TATCGCTAAA CGCTATCAAG TCAGCATTTC CAGTATCCAA TTAGCCAATA ACCTCAAAGA TTCTAATATC TTTATCCACC AGCGCTTAAT CATCCCCACT AACAAAAAAT TACTCGCTAC AAGGGAATTT 120 180 240 WO 97/37044 PCT/US97/05223 154 INFORMATION FOR SEQ ID NO:37: SEQUENCE CHARACTERISTICS: LENGTH: 735 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION .735 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:37:
GGTCATAGAA
GCGCTGATTT
AAAACCTATA
GCGGGCGTTC
AAAGAAAAAT
CATAAAGTCC
AGCCCGGATC
GATTTAAGCC
AACACGCATC
GAAAACATCA
CCTAATCTTA
CCCCAAAAAA
GTGTTTAAAA
AAAGAATTTT CTCAAAGCCG TTTCTCAAAC CCTGCGTTTT CCCCCATTAA AAAGGGCGAT AAAGCGAGCT GTTATCGTCA TAAAACTACT AGGGCTAGAA AAAATGAAAT CACTATTTAC TCAATGAGGG GAGTTTCATT GATTGTGGGC GCTTGTTAAA AAGCGATTTT GTTCGTAGAG ACCCCATCAT AAATGCGCAA AATTGCTTTA TTACCCTAAC TGAAGATCTT GCCTAAAGAA
AAGAT
GCGCTATTAC
GATGGCTATG
AGGCTATTGA
TTGAAATTCA
AACTTTAGCA
TCTCGTTTCA
AAA.AAAGGGC
GTCAATCAAG
GGGGTTAAAG
GATAAAATGC
ATGTTCGCTC
GCGGTTTTGA
GCGCTTTAGA
TGGAAAAGCT
GCGTGTATTC
ACCAGCAAGT
TTGAAAAAAT
ACGGCGTTAT
AAGAGCTTTT
AGGATTTAGA
GCAAGCAAGC
TAGAAGCGCG
AAGTAGAAAT
GCCTAATGAA
TTATGCGAAT
CCCTGAATTA
GGGAGCGATT
CATCAGCAGC
TTTTAAAAAA
CAAAATCATA
ATTTTTAAAA
AATCACGCTT
CTTCAATGTG
CTTTCACAAA
120 180 240 300 360 420 480 540 600 660 720 735 TTAAGGGGGG GAAAGCTATC INFORMATION FOR SEQ ID NO:38: SEQUENCE CHARACTERISTICS: LENGTH: 456 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...456 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38:
AGTGGAGAAC
GCGTCGTTTT
GAAACGACTG
AGCGATTTTG
AAAGAAAAAG
AAGGCAGAAA
CAGGCTTTAG
CCCAGAAAAA
AC AC AATGAA
TGATGGCTAA
ATGCTTCAGC
ATATGATTAA
AAGCCTTGAT
AGCTTAATCA
TGGAATTTTG
GAAATGCAGG
AAAATATATC
GCCGGCTCAT
AGGCGTGTTA
GCAACGAAAT
TGAGCAAGCT
GACTCCAGAA
GGCTAAAAA
ATTTTTACAA
TTAAATTTAG
AATIGCGAATA
GCGACAGTGG
CCTAATTTTG
ATCCGCACCG
TTTAAAGCGA
CAGGCTGAAG
CGCCAA
CGTTAGTGGG
ACTCTACGCA
ATGGCAGACC
ATTTTGACA-A
CGCTTGTAGA
TGATGGAAGC
AAGTGAAAAA
CGCGTTGAGC
TAACACGAAA
CATCACCAAA
GCTTAAAGAG
AAATGAGGCT
GGTTAAAAAA
AGATCCAAAT
INFORMATION FOR SEQ ID NO:39: SEQUENCE CHARACTERISTICS: LENGTH: 552 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...552 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: AGAGCCTTAA AAACGCATGG CGCTCTTGCC TTTGTTCAAG CTTTTTTGGA GTCTTTTAAG
GGATTTTTAA
TGCGCGATTT
ACAGCAGCTT
CAAGCGATTT
GCCTT'fTCTT
TTACGTCTAG
TTTGTCAAAA
GTCTTTTCAT
GCCAAGCGAC
TGCTCCTTTT
TTTTGGGCGC
ACCCCATAGA
TGCAAGTGGG
AAGTCCTTAA
AAAGCTATGA
CTTAATTAGC
GGCTCTGCTT
GTTTTTAAGC
AACGCGCATC
AGTCAAAAAC
AAACCCTCAC
AAAAACCTTT
GTTTTAATAG
TTAAGAA-ACC
ATGCCTTTTG
TTGCACGCTA
ATTTCCAA.AT
AATTTTGTGG
AAAGAAAAGA
CGAGCGTTTT
GCTGGGCTAG
TTTTGAACGT
ATCCCTTAAG
TCAGTCTAAA
AAGAGCGAGC
TTTTGCCTGA
AA.TCCTTTTT
TTATATAACA
TTTACTCACT
TTATAGCAAC
CAAATGCGTT
TTTTAAATGG,
AGAATCCAAG
TCAAGTTTCT
TCTTTATTGA CGACTACCCT TATTCAAAAA CGGCCCCTTA TTGTTTTGTT TG INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 345 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...345 (xi) SEQUENCE DESCRIPTION: SEQ ID
GCCATGAATA
GACGAAAAGA
ATGGCTTTAA
ATCTATTTAA
AAACGCCTGC
AAATACTACA
TCAAAACCCA TTCTTCAAAT GAAAAAGAAC GCTTTGTACG AAGAATTATT CGCCGAAGCT ACAAATGAAA ATCCGCACGG TAGGGGTGTT GGTTTTTGGG GGTGCGTTTT TAGCCTTATT GCAATAATAT CTATTATATT AGCCGTAAAA TTAATACCCT TTTTAGAAGA ACAGCAAATC CTAAAAAACG AATTAGAAAA TAGAGAATAG TGAAAATATT GGCGATATTG CGTTT
CATAGAAGAG
CCTTTCTTTA
AGTGCCTAAA
AGAAGATCAA
AGAGCGTTTT
120 180 240 300 345 INFORMATION FOR SEQ ID NO:41: SEQUENCE CHARACTERISTICS: LENGTH: 714 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...714 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:41:
CTCCTTTTAA
AAGATCATTA
AAAAACCAAT
GACACTCTAT
CA.ATTTTTAG
ATCGCTATTG
GAGATTTATG
TGTTTAGAGC
GAAGAAAATT
AACGATAAAT
ATGGTGCGTT
GTGGTGGATA
TCCACTTTCT
TTCAAGGGGC
TTGTTGTTTT
ACGCTGAAGG
ACAAAGTGGG
ATCAAAAAAC
ATTATTTAAG
CTATTAGTTC
CTAAAATCAT
TAGAGAGTTT
TAGATGAAGA
GGGTGGTTAT
TATTTTAAGG
TAGGGAAAAC
TACCGGATTA
GCAAAGGCGC
TAAGCCTAAT
CACTTCTAAA
GTTGTTGCTT
TATGAGCGCG
TATTCTAGCC
GCGTTTGAAG
AATCCATTTG
CAATAGCGAA
AAACTCTTGC
AATCTCAAAA
AGCGGTTCGG
TATTTAGAGA
GTGGATAAAA
AACCCTAGAT
GCAAGGGTTG
AGCGATATCA
CCCATTATTA
GGGTATGTGA
CACAAAACCA
AATGCTTCAC
AACATAAAAC
ATATCTTTTT
GTAAATCCAC
GTTTGTCTAG
TTGAAGGCCT
CCACTGTGGG
GGGAGCAATT
TTTCTCAAAT
AAGATAAAAA
GGGCTTTTGT
AAAAACACAC
GGATCGCTAG
CATTATGGAT
AGAAATCCCT
TCTGGCGTTT
TTATGCGAGG
AACCCCAGCG
GACGATCACT
TTGCCCCACA
CTGTCATTTA
AGGTTCGTTT
TGATGGGGTG
CATTGAAGCG
CGCG
INFORMATION FOR SEQ ID NO:42: SEQUENCE CHARACTERISTICS: LENGTH: 1083 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1083 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:42:
CACCAATCTT
GGTTTTGAAA.
ATTCAAGGGC
AGCCGTTTAG
ACTTTCACCA
CAAGCCCTTA
CAACACATGA
AAAACGCTCT
AACGGCATGA
CAAAACGCGC
AAGATTTTAC
ATGGTAGATG
TTCACGCACC
GGGGCTGATA
AAATTAGAGA
AGCCATA.ACC
GTGATTTGTA
CAAAGCCCTT
TGG
TTAAAAGAGC
AAAGCATTTT
CTTTATTGAT
CGTATTTGAT
ATAAAGCGAG
TCCCCCCCTT
ATCTTTTAAA.
GCAAACAGCT
TGGATTTGAG
TCAAAAAAGA
AAGATAATGA
AGTATCAAGA
ATAATTTGTG
TTTCTAACAT
CCAACTACCG
AACACCGCCA
A.AGAATACCC
TTAAAGAAGG
GTTTGAACCA
AGACAATTTG
TTTAGCGGGA
TGGCGCTTGT
TAAAGAAATG
GCTTTGCACT
AAGGGCGTGC
CAAAATTTCA
CGTGCAAGAT
CAATTTAGTG
AAAACTCGCC
CACGAACGCC
CGTGGTGGGC
TTTAAATTTT
CTCTAGCGCT
CATTAAAACG
CACGCAAAAA
GCGAGAATTT
AGAAGAAAAG
AACGGAGCGC
GCTGGGAGCG
GGCGTGCCTA
CAAGAAAGGG
TTCCATCGTT
GATTTTTCGG
AATTTCAGGG
AGCGAATGTT
GATTTTGACG
AAAGAGACCA
CTGCAACTGG
GATGACGATC
TCCAAGCATT
GAAATCTTAG
CTTCAAAGTT
GAAGAGAGCC
AGAAAATATC
GGCGAGTTTT
AAAAAATTGC
GTAAGACTAA
GCGAAAACAC
CTTTGAAATT
TTGGTTTGCT
TOCTAGATAG
CGAGCATTTC
ACAAAGCGTA
ATTTGCTTTG
GCGAACGCTA
AATTTTTAAA
AGAGCATTTA
TTAAAGGGGC
CGTGCGCTA.A
TCAAAGGTTC
TGGATGTGGC
GCTATTTTGT
TAGAATAATG
CGCATGCCAC
GACTTTAACG
TTTAACGCTC
GTTGAAAAAC
GTTTTTAAGG
CGATGAAGTG
TCAAATCAAA
TGAGCTTTAT
TTTGAGCCTT
CCATTACATT
ACAATTGAGT
TGGGTTTAGG
TAAAATAGTG
TTCCCTGATC
GCACAAAAGC
TTATCAGATT
ATCGTTTAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1083 INFORMATION FOR SEQ ID NO:43: SEQUENCE CHARACTERISTICS: LENGTH: 456 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...456 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:43:
ATGAGAAAAA
GCGAGTAACG
ATGAAAGGCG
ATCCCCCCGT
CCAAAAGGCT
GTGGTATTGA
TTGGTAGCGC
CTGAAAGGTT
CGATTTCAGC
CTTTGATTTT
TCGCTTTCAG
ATAACATTTG
CGGTATTTGT
AAACTAAAAA
AAACTTTGGG
CGAAATTCCT
GTTGTTTTTA
ACAAACAGAT
CGTTGATTCT
GGAAGGCGCT
GAGCGTAGTG
CGGCCAGTAT
GATTGATAGC
GTCTTCCCGT
TCAGCGTGTA
TTTAGCCTAA
AATCTTAAAA
TACCGCTTGT
GATCCGGGCG
TTCGTCTCTC
CGAGCTGAAA
CCGGTG
TAGGGTTATC
AAGATGGGGC
TCTTTGATTT
ATCAGACCGC
TAGGCACTAA
CAGATAACGG
TTGATGAAAA
GTCTGTTCAT
CGTCTCGGCG
AACGCACGAA
TAGTTATTGG
TCGTAAATCG
CACGCTGACT
TTCCATCCGT
120 180 240 300 360 420 456 INFORMATION FOR SEQ ID NO:44: SEQUENCE CHARACTERISTICS: LENGTH: 729 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...729 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:44:
GACATCTTTA
GCCGCOCAAG
TTCATGAAAC
GATAAAATCA
AAGACCATGC
AAAGACTTTC
AGCTTAAAGG
GGCAAAATCG
TTTTCGCTCT
TTAAACGAAA
GGGTATATCA
AATTACAAAA
GCGCATTGG
ATAAGGAAGA
AAAGGAAAAA
TCTTTTTAGA
TCACCCAAAC
AAGAAGTCGC
AAGGCGCTTT
CTAACAACGC
CTGAAACCGA
TTTTTGATGA
ACAATGAATT
ATTTTGAATG
TCAAGGCTGA
GATCATGGCC
AGAACAGCCC
GCAATTGAAA
CGCGCAACTC
CAGTGCGATG
AAAAGACACC
TTTAAGGGAA
TGTGAGTGGG
AAAAATTGAC
GGTCAAAACG
GOACOOCACA
ATACAATTTA
ATTGATTTAG
ACCATCGCTA
AATCAAGACC
ACGCAAGTGG
AAATCCAATA
ATGGAAAACC
GTAAGCGCCC
GCGAATTTTG
GCTTCTAAAG
ATTCCTTTAA
AACGAAAAGG
GGCTCTCCAA
CAGAAGTTAC
ACGGGTTGGA
CCACCGCTCC
AAATGCAAGA
AAGAAACTAA
TGAACAAAGG
TTAATTCTGT
ACGGCA.ACAA
GAGTGCCAGC
AAGATTATAA
GCGAAAAAGT
AGCAAGCTGT
AGGAGCTAAA
TAAAAACGCT
TATGGAAACG
AGAAAACAAA
CGAATCTTTA
CATGGACGAT
GAGCATGATA
CAAGCTTTCT
GATTCAAATC
CGGGCAAAAG
TCCTAAAGGC
ATTTGCAAAC
120 180 240 300 360 420 480 540 600 660 720 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 759 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...759 (xi) SEQUENCE DESCRIPTION: SEQ ID
AACGCTCATG
AGAAAAGACA
TATGAAAACA
GAAGTTTCAC
CCCAAAGAAT
ATCGGTGGGC
GGAGGGTTTT
ATGAGCGCGG
GATAAAGCGA
GTGAACACGC
GTTTTCAAAC
TCGCTCACTT
ACTAACAAGG
CCTTTACCCA
ACACCTTGCC
ACAAGCCTGT
AAAAGCATTT
ATTTGCATGA
CTCAAGGCGA
GCCCGCATGG
CTTATGCGGC
CCGTGCAGCT
ATAACACGAG
TCACGCCAAA
CAGCTTATGG
TTGAAGAGAT
TCCATTTAGC
TTTTTTAAGG
AAGCGTTGAT
AAAAGAAGCG
CAATATCAAG
TGCGGGTTTG
AGGGGGAGCG
CCGCTATGTG
TGCTTATGCG
CAAGCATTCA
AGGCATCATT
GCATTTTGGG
TAAAGCGTTC
GCACCAGCTC
CCTGATGGCA
ACGATTGTCA
GTGATTGAAG
TTTTTTATAA
ACAGGTAGAA
TTTACCGGGA
GCTAAAAATT
ATTGGGGTGA
AGCGCGGAGT
GAAAGCTTGG
CGCGAGTTAG
TTTAAGCGT
GCTTTCGCCT AGCTCAAAAA AGTCTCAAGT GAGCGTGCGT TTTCCACCCA ACATTCCCCA AGATCGTGTA TAAGGTTTTA ACCCTACAGG AAAATTCGTC AAATCATCTG GGATACTTAT AAGACCCTTA CAAGGTGGAT TGGTAGCGAG CGGGGTTTGC TAGAGCCTGT GTCTATTTAT TGGAAAAATG CGTGAAATCG ATTTGTTAAG ACCCATTTAT AAGAATTCAC TTGGGAAAAG INFORMATION FOR SEQ ID NO:46: SEQUENCE CHARACTERISTICS: LENGTH: 195 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...195 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:46: WO 97/37044 PCT/US97/05223
GTTATTAGTT
TTTTTGCGTT
CTCAAAAATC
CCGTATGGAT
ACCATTTTAT TATTCTTAAG GATGTGTTTA TAATGAGAAT TAAGGCTTAT TATCGCACTG GTTTTTATCG TTTGGTTGGG TTTTAGCGCT TGTAAAAACT TCAAGATTCT CAAAACAATA CCACCCAACA AGATAGCCCT AAAACCTACA
CTGAA
120 180 195 INFORMATION FOR SEQ ID NO:47: SEQUENCE CHARACTERISTICS: LENGTH: 366 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...366 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:47:
AAAGGAATAA
CTTCAAGCCC
TTTAAGGCTA
TCCCCACACA
CCCCTTGATG
AGAAACACCC
CCTTCT
GCATGAAAAA ATCCCTTTGT TTGTCTTTCT TTCTGACCTT TTGTCATTGA GCTTTTAGAA GAGATTAAAA CTTCGCCGCA AAGTCCTTGA TTCTAAAGAA CCAAGACAAG TTTTAGGCGT AAAAACTCAC GCTCACTATC ACTCACATAT CCACGGCAAT AAAAACTTTC TTTAGAAACG ACCTTAAGCC CTAACCGCCC AAATCGTTTT TTCTTCAAAA GAATTGAAAG AACCGCACTC
CTCTAACCCT
TAAAGGCACT
TTATAATATC
CGTCTATCAA
TACTATCCCT
AAACCCAATA
120 180 240 300 360 366 INFORMATION FOR SEQ ID NO:48: SEQUENCE CHARACTERISTICS: LENGTH: 408 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...408 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:48:
AAAAAATTTC
AGTGGCTCGT
ATTGATTTCA
TTCATCTTCT
GTTTTGAGTG
GCGTGCGCTC
CATCAATTAA
ACACTTTAAT
TGCAAAACAT
TCAATTTTAT
TCTTCATGCT
GTTTTAGTGT
AACACGCTTT
GTTTCTTTAA
ACTTCCTGTT
TGTTCGTTCT
TTTCGTTATT
CAATCTTTTT
TGTTTTGGTG
TAACAAGGTA
ATCAGTGTAT
GAGTGGGGAA
TTTTTGCGAA
GATCAGATCA
TATATCCTTA
CGATTGGGAC
GATATCGATA
AATTTGTTTA
CTGAGATTGT CTTGGTGGTT GGATCAGCGT TTTCTTCATT TTCACTTCTT TGGTGACTTT TTAGGTTGAT TGATTTGATT AATTTTTTAG GCTCATCTTT AAATGAGTGA TTTTAGATTT
TTTTATCA
INFORMATION FOR SEQ ID NO:49: SEQUENCE CHARACTERISTICS: LENGTH: 207 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: TACTTCCTGT TGAGTGGGGA ACTGAGATTG TCTTGGTGGT TAGTGGCTCG TTGCAAAACA TTGTTCGTTC TTTTTTGCGA AGGATCAGCG TTTTCTTCAT TATTGATTTC ATCAATTTTA TTTTCGTTAT TGATCAGATC ATTCACTTCT TTGGTGACTT TTTCATCTTC TTCTTCATGC TCAATCTTTT TTATATCCTT ATTAGGT INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 729 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature WO 97/37044 PTU9/52 PCTfUS97/05223 162 LOCATION 1 729 (xi) SEQUENCE DESCRIPTION: SEQ ID
ACTCTTTTTA
CTGATTTCAT
AAGATTTGGA
AGCGTTCAAT
AAAAAGAAGT
TTAGGCACAA
CTCATGCCAG
GAAA.ATGTCC
CACCCTTATT
ACATCAGCGT
TTTGTGGAAG
AGTCAAGCGA
TAAAAACATT
TAGTTTCCGT
TAGAAGAAAC
TTAAAAAGAG
GTATAGGGAT
AGAGCGTA.AC
GCATGGGGCC
TTGGGGCTAT
TTTTTTCTTT
TGCCTAGCTG
TAAAGTTTGC
CCTAAAAATA
GGTTATTTGG
GCTGAGACGA
TAACACGATC
TTTGGGCGTG
GAATTGGGAT
AAAACTAGAC
GTATTACATG
TATCACTTCT
GCAGGATTTG
AACCTAATGA
TTATTGGAAA
GGGTTGGGCT
AATCCTAAA-A
CCTTATGCCC
TCGTTGGTGA
AAAGAAAAGT
AATGATAGTT
CAACCGCGCA
ACACTTTTTT
GTGATCACGC
GAG TAATAAT
TATTCCTAAA
GTAGTTTTTT
ATCTTATTTG
CTAA.TAGCCG
TAGGGATTGT
TTGGCGTCA.A
TTATTTTTAA
TGGCTGGGTT
GGGAATATGG
CCTTACTAGG
CGAAGGCAAG
AAAGTTTATG
AACCTTCCAA
AAACGCTA.AC
GCAACACTTT
TTGGAAATAT
GGGGTTGTAT
AAGTTGGTTT
TGAAATCTTG
TAGCTGGATG
CTTGGA.ACCG
TTTCATTTTA
CTGTTTGGCT
120 180 240 300 360 420 480 540 600 660 720 729 GGGGAAGGGT TTTACCAGCT CACGCCTATA TCCAACGCAA
CTTTATTTT
INFORMATION FOR SEQ ID NO:51: SEQUENCE CHARACTERISTICS: LENGTH: 1458 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1458 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:51: AAAA1AGGA
ATCTCCAACG
AAAGAAACTA
CAACAGATCC
GCGCTTTTTG
GGTAATAGCG
AACTACTTAT
CTGATTATCA
CTCAACAAAC
GCGACTTTTT
GCGTCAATTG
AACGATGTGA
GCTATGCCCA
TATAGACTAA
ACCGCTCCTG
AACCAATGTT
CTCTTTATGC
AAGAGACCA.A
CTGATAATAA
ATATATATGA
CTTTGAAAAA
CCATAGAAAA.
TCACAGGTTC
GCAACGCTAA
TGAGAGTGCC
ATGAAAGCAA
TCTGCAGACC
CTAACGATAG
AAGAGACAAA
AAAACAGCAA
TAGAAAACTA
TAAAGAAATA
GAAAGAGGCT
G.AAACCTCA-A
CACTTTGAAT
CAAAACTTAT
CCCTATTATC
GTTAGAACAA
GTTTGTGTTA
AAAAAGAAGC
GCTTTTTGAA.
CA.ACGATGAA
CGTTACTCA-A
TAACGCCAAT
AGAGAAGCTC
GCAACCTCTG
AGTGAAGCCG
AAACGCCTCA
TATGTCTCTG
GTGAATGACA
CTCTACGCTA
AAAACAAGAG
GTCAATGGGT
AAAATCAATG
GATCCCAACG
CAACAAAAGA.
GTGTGTTCGC
AAACCCAATA
GAAGCCCAAC
ATAGAAGAGC
CTCTTAGCAG,
TATCGCTCAT
ATAAGGTCAT
AAAAAGAAGC
TTGATGACAC
AA.AGCTTTGG
TGGATCTATT
CGATGGGAAC
ATTACAATAT
ACAAAATACC
CCCACACGCT
AAATGTATTT
CCCTAAGAGA
TCATTGCTCC
CATCACCTTA
TA-ATCGCCAA
AAAAAGAAAA
AAGCCTACTA
TAAGGCCACT
TAA.ACAGCGC
AAAAACTCAA
GGATTGGTTT
GGATTATAAC
TTATGCAGAT
TCTAAAAGCG
TTATGCCCAA
TGATAAGGGA
CAACTACGCC
TGAAATGGTA
TTATAGCCTG
TGCCACTCAA
CTCCCAACTC
ACAAGAGACT ATAGCCAATG AAGAAGAGAG GGAAAAGAALA GAATTGGCTA AATACAAGCT CAAAGATTTA GAAAATCAAA GCAGAGTTGA AAAAGAAAAA CACTAAGAAA CCTAGAGTAG AAAACAAGTG ATTCTGATGA AACAATGAGA GTTATCAAAC TTATTAGTGG ATAAAGAGAC CACOATCAAA AGAAGCTATA AATTCTTACA GCAAMAAC ACCCATCAAC CTTGAGGACT ATTA-AGAGCT ACTATGTCAA, GTCTAATGGC TTGTGTTACG GTAAAAATCA AAAACGATCC CTACAAAGAG GGAATGCTGT AAACTGCTTT CACCACTAAG GGACAAGCTC AAATACGACA TTACTGAAAG ATTTAAAG AGAAACTAAA AGCTTTAGAA TGGAAGTGCC TATTCCTCCT AAAAAGAAAA CTATAATGGG AGGGGACTTT GATCAGTGAA TGAGAAGCTT AGAAGAAGAA CTAATGGCAT TAACCTCTAT GTGG'TTATGA GAGCGTTCA.A AGCAGAAGTT ACAAAAAGCG 1020 1080 1140 1200 1260 1320 1380 1440 1458 INFORMATION FOR SEQ ID NO:52: SEQUENCE CHARACTERISTICS: LENGTH: 303 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...303 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:52:
GTAGTAGGCA
TTGGTTGTTA
AAAACTCAAA
AAAGAAATCA
AGCTATACTA
AGG
CAATGGAAGA CTTTTTGTAT AACACCTTAT TTTTTAGTTT CATAGGGTTA ATAGCGTTAT AAAAGGTTTT TAAAGATAAA GCTA.ACCAGC TTATAGATGG GCTGAAAGAA AGGGTTAAAA TTACTATCCT ATTCTTTTAT CACATCAGGG ATTTCATAGA GGATTATAAG TTTTCCTCTA CAAATTCATA CTCAAAAGAA AAAAAGCTTT CCTTTGGCTT TTGGTTGCCA TTATTTTTCT TGATTCTCTT INFORMATION FOR SEQ ID NO:53: SEQUENCE CHARACTERISTICS: LENGTH: 879 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 164 (ix) EATURE: NAME/KEY: misc feature LOCATION .879 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: AGAATGGCTC AAATTGTAAA AGGATCTGAC ATGTTTGAAG ATTTTTATCG CACTACCCTC TCTTTTTTAA GGTCTTTATT GCTTTTATTG GGTTTATTGT TGCCGTTTTC GCTTTGTATA 120 GCTGATGAAT ATATTAGCAT AAGTGATGAT TGGGATGAAA GGGCGCGAAA TCAGTGGGAT 180 GAAACTGCGC GAAATCATAA GACATATTAT TTTGAAAATG GTTTAGACCA TTTTAATCAA 240 GGCCAATACA AGCAAGCCTT TAA-AGATTTT AAATTGGCGC AAGAATACAG CATCGGGCTT 300 GGCAACGTTT ATTTAGCCAA AATGTATTTG GAGGGAAAGG GCGTGAAAGT GGATTACAAA 360 AAAGCGCAAT TCTATGCACA AAACGCTATC AAAGGGTATG GGAGCGGGTT GTTAGGGGGC 420 GCTCTTATTT TAGGACGCAT GCAAGCAGAA GGCTTAGGGA TGAAAAAGOA TTTGAAACAA 480 GCACTCA.AGA CTTACAGGCA TGTGGTTCGC ATGTTTTCTA ATAAAAGTGC AAATTTTGCT 540 AACAAATTTG GATCAAACCT TGCGGAATTT ACTAGTATGC TTATTGGATC GCGATTCATT 600 GATCTTTCAG GTTTGAGCGC GAATCCTATA AAATTTGGAA ATAALATTTGG AATACTTGTT 660 AAGAAAGCCC TTCAAATCAA AGATAATACA CTTTCTTGGG AAGACATTGC TGAAATTTCA 720 AGCAATATTA TTTTACTCAA ACAACAAATG GGGGAAATCC TTTATAGGAT TGGGATCGCT 780 TATAAAGAAG GGCTTGGCAC TAGAAAGAAA AAGGACAGGG CTAAAAAATT CTTGCAAAAA 840 TCCGCAGAAT TTGGCTATGA AAAAGCCATG GAAGCTCTG 879 INFORMATION FOR SEQ ID NO:54: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 243 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscteature 
LOCATION 1...243 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: GACTTTCACG ATTTCTGTGC GAATATCCAT CATCGTATCC GTTGGAGGCA CAGOCATATA GCCTCCTTGC TTTCCCGGTC TATGCCCAAA ATTCACGCCG TTTTCAAAGC TTTTATCCCT 120 ATTCCATTCG CCCTCTTCGC TATCCACTTC GTAGTATTGG GAATTGGAAG CGTCTTTAAT 180 CTTAATAGAA TCAAAGATAA AAAATTCATT CTCCGCGCCA AAATAAGCCA CATCGCCCAA 240 GCC 243 INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 312 base pairs TYPE: nucleic acid STRANDEDNESS: double PCT/US97/05223 WO 97/37044 TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION .312 (xi) SEQUENCE DESCRIPTION: SEQ ID AAAGCCATGC TTTCTTCTAC GAATGACATA CGGTATCAAA AAAGGCGAAA TATTAGCGAA TCTGAAATTC ATAAAAAAAC GACCCTAGCG CGTATGTCCA ATTAGGCGTG AGCAAGAGCG AAGATAGAAG AAAGAAAACG TGCCAAAGAA CAAAAAGATT AGAGAAGAGC TTTTACAACA AAAAATCGCT TTGATGGACA GAGAAGCTTT TT
ACCCTAAAAA
TGCTTGGTAT
AAAAACAAGA
TTTTAAAAGC
CCCCACAAGG
CAAGGCTTTG
TGGGTGTAAA
AATTGAAAAC
CGATAGCATC
CACGATTTGG
120 180 240 300 312 INFORMATION FOR SEQ ID NO:56: SEQUENCE CHARACTERISTICS: LENGTH: 1587 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1587 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: ATGAGAAAGG TTATCATAAT GAATGGTTAT TTGAGGCTAA AAACCCCGTA
TCGGTCGTTT
TTTTTAAAAA
CAAAGCGAAG
AAAAAAACGG
AGAGACGCTA
GGTAACGGGC
CCGTATTCCA
GATGTGATTA
TGACTTTTTG
AAGTTACAAC
AGGTGCGTAA
GGAATTTGAA
CAGGCACGGG
ATAGCAACAC
ATATTGAACT
AGGGGGGAAC
GACTTTTAAT
CACCGAGCAA
TTCCACAAGC
TATTGAAAAC
CGTGCTGCCT
CAACATGATT
GGCGATTTTC
GAGCGTCCAA
CCCCAAAGAG
TCTTTTATGA
AAATTCAGTT
TCTCGCACGG
GCCTTGCAAA
AAAATTTCGG
TTAGTCAATG
CCTGTAACTT
TACGGCCCTA
TGGGAAAATC
GCGCGAAAGA
CTAGTGCCCC
TGATTTCCAA
ATGTACCAGG
TGCGCGGTTT
GTATCCCCAT
TCCAGTCAGT
ACACTTTTGG
AAGCGGCTGA
TTTTTTAGCG
TAAGCACCAT
AATTTCATGG
CAAAGAACTC
GATTCAAATC
TGGTGGGGGC
TTATGGCGCG
GGATAGGATT
AGGCGTGGTG
AAGGATCACT
120 180 240 300 360 420 480 540 600 AATGTCATCA CTAAAGAAAT
TTTTCGGGGC
CCCCAAACTT
GGTAAATATA
AATAGCCCTA
AATACTTTTA
AGCGCGCAAG
GGGCGAGCCA
GTGGGGGGCG
TCCAACCAAT
AAGGGAGAAA
AGCCCTTGCT
AAACTCAATC
TTTTTAACTG
ALATGGGAGTG
TATGCCAGCG
TACACTTTTT
AAALACCACGA
GCTCCTCTAA
TAGGAAACCA
TAGGCATTAG
CAAAGGTGCA
A.AGCTTATTA
ATTACGCCTA
AGCGCTTTGG
ATTTTAAATT
ACCAAAGCGT
TCAGCGCAAA
GGCAATTTTT
TTATCGTCAA
ALAGATTTATA
GGTTTGATGC
ATGAAATCAA
TAAATTACGA
AAGAGCGTTA
TGGGAATTTT
AATGCTGTTT
CGCTCAAGGC
AAACTACTTG
CCAGTATTAT
TAACCGCTTC
GATCGTGTAT
CACTTATTTC
GTATATGAGC
AAACCCTAAT
TGACAATATC
TACTGGTAAA
CCGCCGATCC
AGGAACTTCA
TTTCALATAAC
AAAAAAAGAC
TAACCAA
GTCGATCCCA
AACACTTATG
AATTGGATTA
TTGGATGCGA
CAATACAACT
ATCAATGAGC
CAAAACTACT
ACGCATGACA
GGTCAAAATA
TGCOGTCTGT
CGCCGATCCG
GTCAAACAAA
ACCACCAGGA
CTCAATAATT
GGCATGCTAA
GCTCCTCCCT
AAGAAAAAGG
GCCGAACGGC
ATGGGCAAGG
TTTATAAGAT
CTTACCATCC
GCCCTGACAA
TTGGCGATCC
TGAGTAGGGA
AGATTTTACC
ATTCTTATAG
TGGTGAATGC
CTTTTAACAT
AAAACCCTAG
TCAACAATTA
CGATCACGCC
TTAAGGTGGG
CAAGCCCTTA
TGGGATGTTG
TTTCAGGCA.A
CAATGCGACC
AGGCACTTTG
TCAAGATGGA
GGATAGGAAA
TTTTGGGTTT
CTTTAAAGGC
CGACACGAAT
CTTTGAGCCA
GGGAATGCGC
CATGCCTAAT
TACCGCTGTG
GGGCTTGAGA
TCAAACCCCA
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1587 INFORMATION FOR SEQ ID NO:57: SEQUENCE CHARACTERISTICS: LENGTH: 582 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...582 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:57:
CGGGTTGGAT
CATCCAATGG
TTATTTTTTA
TTAGAAAAAA
AAAAAAACGA
GCCCTACAAA
AAAATGCCTG
GACGCTTTCC
AAAATCCCCA
CTTTATGAAG
TTCGCTCCTT
CTTTTTGCGC
TGGGTATGCT
AGCCCGCCGG
GTTTAGAAAA
AAGCCATGCA
AAGATATTTA
AAGCGAGTTG
TTCAAACCTT
AATTAGAAAT
TAGTGGTGGT
TCGCCAAAAG
TGGCGTTGGT
GATCGTTAGG
CGCTAAAAAA
AGAAAAAGGC
TTGCAAGCAA
TATCGCTATC
TAAACCTTTA
CTTGCAAAGT
CATTGTGCTG
CCTCTAAAGG
TTTTCTCAAA
OATTATTATT
GCCTATGAAT
GTAGAAAATT
ATTACCTTAG
GCTTTAAAAT
CAAGAAAAA
AAGAACGTGA
AAATTTTTAG
AAATCATGCG
CCGAGTTGGA
TATGGCGTTA
TGACTCAAAA
CAGATAAAAG
AAAGCATGTT
CAAAAATCAG
TCAAAGAGGC
GC
ATTTAACCCT
TTTTTTTATT
TTTAAAGGAT
TATTAGCGAT
TAAAAACAGC
CCCTGATGCC
AGAAACAACA
AGATTTTGAT
TTACCCCATT
120 180 240 300 360 420 480 540 582
INFORMATION FOR SEQ ID NO:58: (i) SEQUENCE CHARACTERISTICS: LENGTH: 618 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...618 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:58:
CACGCCTTGT ATTCGCCCAT TCCGTTTTTA TCGCTTCATC AATCAAATTT CTAAGGGGAT GCTATCGTGG TCTATGGGGC AATGTCTATG AAATACGAAA AAACATTCCT TACACGGCAA TTTAATATTG ATCCGCGCTC CCGTCTTTTG CCAAAAGGGT TTAGTCTTGT ATCGGCATAG GAAAAAAACT CGCCTGATAC CTTCCTGAAA GAGAACTT
CAAAATCGCT
GTATTTTATT
TGAATTGAAT
GTGGGAAAGG
CGATTTTTTC
GACAATTGTT
TGCATATATT
GCGTTTGTCG
AGTGATTTGG
CTCTTTCTTT
TTTGAAAAAG
CCAGGTAAAA
ATTCTTACCA
TATCGCAATA
AACCGCCAGA
TTTGACGATG
AACACTGAAA
CTTAAAGATC
GAAGCTACAG
TTGCGTTTGA
CCCTTAAAA CGCTAAAGAA AGATAATGAA AATCTTTAAA ATTCCCTTTC ATCTACGGAC AATTAGTGCG AATGGGCGCG TTAAAGGGCG CTTTAGCACC CCCTAACGCT TCTAGGGAGT GTGCGGTCTT GTTTGACAAC ATGCCCAACA ATCATGGCAT AAGAAGGCAT CTTAATCCAT TTAAAGAATG GTCTAAAGTC
120 180 240 300 360 420 480 540 600
INFORMATION FOR SEQ ID NO:59: SEQUENCE CHARACTERISTICS: LENGTH: 1326 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1326 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:59:
GGAAAAATTA
AACACTCAAA
TTTTCTAAAG
AAAAAACCCC
AAGGTGTGAT GATGAAATTT TTTCTTTTAA AGAAATTCAG CGAATTTTTA CGCATTTTAA CCTCAAACGC TTGAACGCGT CTAGTTTTTT ATTAGAGACT AAAAACACGC CTTTGTTGTG GATTTGAGCG CGCCTTATAT TGGTTTGTCC CAGAGAGCGT TTTAAAAAAC ACTTTAGCGT TAGATTTTTG TTTGAATAAA 120 180 240
TTCACCAAAA
ATCAAGGGCG
ATCCCTAAAA
CGTTTTAATG
GAGCATCAAG
TTATCCTATC
CCCCAAAAAG
CTGGAAGCGA
AACAGGCGTG
ATTGATAAGA
AAGAAACAAA
TTTAAGGAAA
TTTATGCCGG
TATAAGGATT
GACGCAAGAG
GTTTTTTGCC
ATTAAAATGC
GTCAAAATCA
GACACT
ACGCCAAAAT
CTAAAGATTT
AAGCCAATCT
ACAGGGTGGC
AAGAGGATTT
AGCACAAAGA
AACGCTTGAA
AAGAATTGCA
AAAATCGCGT
GCATGCCCTT
AATCGCAATT
ATCAAATCAA
TAAAAAACTC
TTAAAATCGG
CGAATGATTT
AAAAGAATAC
AAAAAGATGC
TCAAAGGAGC
TTTACAAGCA
AGCTTATAAG
CATGATTTTA
TAAAAACGAT
GGATTTTAAG
ATTGGAACAC
AGAAAAATTA
AACTCAAGCC
GATTTTAAAG
AAACGCTTTT
CTTGTATTTA
CTATGTTAGA
TAAAATCAAA
TTTAGGGAAA
GTGGATGCAT
GCCTAAAGAT
GTTTAATGGT
TCATGTCATT
AACGTCATTG
AGTGAAACTT
GATCAAGAAA
ATTTTAGGGG
GGATTGTTGG
AAAAAAAATC
GAAAAACTAG
TCATTGTTGC
GATTTTGAAG
ATCAATAPAA
GAAGAAGAGA
GACGCTGCAG
CGCCCGATGA
AACCAAAAAG
GTGAGAGATA
GAGGTCATTA
TACGAGATTG
TACTCAAAAT
ATAXACGATCG
TTATTTTGCG
AATGCGTGAT
CATTGCCTCC
ACATTTTAGA
AAATCATCAA
AAGATCCTAA
TCACTTACCA
ATAAAGAATG
AATTCACTCT
ATCTGAAAGA
AAGAAAGCGT
ACGGGTATGA
AGAATATCAA
TTCCTGGATC
TGGAATTAGC
ACTACACGCA
ACCGAACTAT
GATTTTAGAA
TTTAGAAATG
AGAGGCTTTT
TAATATTTAC
AAAAGATTTT
GCGATTAAAC
AACTTTACAG
GCATTTAATC
CATGATTGAA
CAGCAAGAAA
AAAAATCGCT
TTTAGAAATG
AGTGCTGTAT
GCTTTTACAA
GCATTTGATC
CAAAATGTTG
ACGAAAATTT
TAGTCTAAAG
300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1326
INFORMATION FOR SEQ ID NO:60: SEQUENCE CHARACTERISTICS: LENGTH: 879 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...879 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:60:
GGATTTATGA
TGGATGATGA
GGGAGCCTCT
GTAGAGATTT
GAAATGGGCT
ACGGATTTAA
TATGATACGA
TGCGGATCGC
GCCAAAAGCA
TTAAGCCTTG
ATGATCTTTG
GATTATTTGA
CTTTTCCCTA
GGCTTGGATG
TGATTTTCAT
GGCAAGCGGG
TGGAATTGTG
TAGGCGTGGA
TGAATTTGGA
AAAGCGTGGA
TTTCTCAAAC
CTTGGACTTT
AGAAAATGCT
AATTGATAGA
ACTCATGGGC
AAAAAATCTC
AAGGGATTGG
GGGGCACGCC
TGACGCATGT
GCGTTACCTT
TAAAAATAGC
TGCGGCTATT
GTTTATCCCC
AAGCCTAAA
GCGCCAAAAG
AGCGACTTAC
TTATAGCGAG
GTATTTGAGC
TAGCGCTTTA
TAAAGAGCTT
CGCTTATTTG
TTTAACTGCG
TTTAGAAAGG
AGCGAATACC
GATTTAGCCA
TTGTTTAGCG
AAAAAGGGGC
GTAGGGGTTT
CTTTCTAAAG
ATGATAGAAG
CCTGAAGTTT
CTTCAAATCC
GAAAAAGAAG
AAAAAACGCT
GATAGCATAG
GCAAAAAAAA
AAACGCCTTA
AAGAGAGCCG
CAGAAGTTAC
ATATTTTAGT
CGCATTTTTT
ATAAACAACT
AGAAAGCGCT
GCGAGGGGAG
TAAAAGCGCT
AAGCAGGGGT
CGTATTTGGA
ATGCGCACAT
ATGGGGAATT
CACACCCATT
TAAAAAAGCG
CCTACAGCCG
AGTGCCTTTG
AGAGACGATT
AAACTATGTC
AATCGGTTTT
CAAATCGTAT
TTTAGAAAAG
CAATGCAGTG
ATTCAGTTGG
CCCAGTTATC
TGATTTGTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 AATTAGCGGT AATTATTTTT TTGCAAGGAA TTTACAACCC ACCCGCCTTT ATGATAAAA
INFORMATION FOR SEQ ID NO:61: SEQUENCE CHARACTERISTICS: LENGTH: 576 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...576 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:61:
AGGGTGTTTT
GTGCTTTTAG
TATCTTAACG
CCTTTTAAAT
AATCCTCAAA
GATAAAAGCT
TCCATTCAGC
TTCAAACCCA
AACGATAATT
AGAGGGTTTA
TGACTAAAAA ATTCATGTCT TGGATGGTGG TTATCGGGGC GGGTGTTTAT CTTCTTCACT AGCATGTCGG TTAAAAAATC CTTATTTAGA GCAACGCCCA AATATCGAGG GCATGGGGAT GCGAAAGGTT TTTTAAAATC GCATGCGTTT CTAAAGAACT ACTCTCCTAT TATGGATTTT AAAAATTTAG AGATCAAGCT CTCTTACCCT TTCTATCCAT TCTCAAATCC AATCCCCTAT AAAAAATCAG CCAAATCCCC TTAAAAAACT TGAATGCCTT CGCGCTTGAA TTGCTCTTTA ACATTCAACG CTTTAGATGA TAAAATGCGA TTTGACTAAT GCAGAGAATC TCCTGCTACA ATGGAGGCGC AAGAAAATCT CTCCCT
TTTAATTTGC
TTTAACCGCT
TATAGGAGTT
TCGTTTTTTA
CCATTCTTTA
TTTAGAACAA
ATTGGAAAAA
AAAAACCTTA
CTTTTTTTCA
INFORMATION FOR SEQ ID NO:62: SEQUENCE CHARACTERISTICS: LENGTH: 1110 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1110 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:62:
CAGGATCAAC
TATTGTATCG
CATGCCGTTG
GCTCAJ\AACA
ACCTTTAACT
CAAGCCGAAA
AGCGGGGGCG
ATTACTAACC
TCTCAAAACA
AACGCGCTTG
AGCGTAGGGT
TTCTATGACT
AAAATGAATA
GCGCAAAAAC
GTAGGGAGTG
GATTATCGGG
GTGAATGTGG
TCCTTTTATG
ATGTTTAACG
CCATGCAATT
CTGAAGAAAA
AACACAATAA
AAATCTATAA
ACATCAACAA
CCTACTACCT
TTGCGTCTAA
CTTTAGAATT
GCATGCTTTC
ATCCCAGCTC
ATAAGCATTT
ATGGTTACAC
ACCACCTCTA
ATTCGAGCGT
GTTTAGGCAT
CTAAAATGCA
ATAGGCACAA
AAACGCATGG
TGAGTTATGT
TCAAAAAACC
TGGGGCGTAT
CCCTTTTTTG
ACTCAATCAA
CCTTTAAAA
GCAATCTACC
CCCTAAATTA
GGTAGAAAAC
TTCTTTGTCT
TTATTCTAAA
CTTTACCAAG
TAATTTTGGT
TGGGCTTGGC
GGGGTTTTAT
GTGGGTGAGT
CACGAGTTTT
TGGTTTTGAA
CAAGGGGTTA
TTATAGTTTT
TTACTTTCTT
GCGAGCGTGG
AATCAAGAAC
GTCAAAAATG
AACAATGCTA
CTTCAAAACA
GCCCAAGCGT
TTAAAAAATT
TCTCAGATCG
AACGTTTCAA
AAAAAAAATC
TTTGTGGGTA
ATAGATTATC
GTAGGCTTTG
CAAATGGATT
TTCCAAATCC
ATGGGCTTAA
AACGCTTCCC
TATCTTTATT
GTTTTGA-TA
GCATCCAAAC
AALATCACAAA
AATTAACCCC
TTGAAAAAT
TAGAAAAAAT
TAGAATTACA
CTCAAATTTC
GCATGTATGG
AAGGGTTTCG
ATGGCTTTGA
TTTTCAATTT
CTTTAGCGGG
TCATCAACAA
CTTTGAATTT
AGATCCCTTT
TCTTTTTCAA
ATTTTTATCT
TTCCATCAGT
GATTTCTAAC
CATGCCCAAC
TACTGAAAG
AGTCATGCTT
GCAAGAACCC
ATTCAGCCAA
AAATTCTTTG
GGTAGGTTTG
TTATTACTTA
TGGTTTAGGC
CATTGATAAT
GAGTTCGTGG
CTATTTGACG
TGGGGTTCGT
AGCGGTCAAT
ACGCCTTGTC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1110
INFORMATION FOR SEQ ID NO:63: (i) SEQUENCE CHARACTERISTICS: LENGTH: 729 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...729 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:63:
GAACCCATGC
GGCATTGGCG
CTCTATTTTG
TTGGGCTTAG
CCCCACCTTT
GATGGTAGGG
ATGGGGGCTG
CTCTTTAGAA
ATCGCTATGT
AATCCTAAAG
TGTGTGGCGT
AAACCTTATA
CTATGGGCAA
CTAAAAACAG
ACCCTAGGAT
GCGGGGTGGA
TAAGTTCCCC
TTACGCATTT
ATCCTAAAGT
AAGCTTATGG
TTGAAGCCAC
CGCTCAGTAA
GTCATAATGG
AATTCGCTAA
AGCGCTCAAA
CGAAATCATG
TTCCACTTCC
TCTAATCCCA
AACGGTGTAT
AAACGAACAA
CGTGGTAGAA
CTCTAAAGTC
GCTCATTACC
AGCCGAAAAA
CATCAATCTT
TGTGGGCGAT
GAATACCAGA
ACCTCCGCTC
TATCTTGTGT
AGCGCTATAG
AACTCTGTGT
GCGCAAGGGC
AAAATCAATT
AAAATTGATT
CCAAGCCGTT
GAAGGGCTGG
GGGGGGACGA
TTCAAGGGCG
TCAAAAAAAC
AAGTGGAATT
CTTCCAACAC
GCTCTCAATG
TTAACGATGT
CAATCCAATC
CCATACCAGG
TTAAATTGAT
ACGATGATTT
ATTTATTCAT
TGCAGCCTTT
ATAAAAACGG
CAGAGATGTG
AGGCAAAATG
ATGCCATAAT
GAAGAAAAAC
GCAGTTTTGG
TTCTTTTGAA
CTATGTCAAG
CGCTGATAGT
TTTAAGGGGC
TTCTAAAGGC
TGGGGTGGTC
GCTTGTGAAA
120 180 240 300 360 420 480 540 600 660 720
GTGCCTACT
INFORMATION FOR SEQ ID NO:64: SEQUENCE CHARACTERISTICS: LENGTH: 702 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...702 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:64:
AACGATTTTT
TCATCAGCGC
CCAGAAATCA
CCTTTGACTG
ATCGTGTTTA
TTGCCTCGCA
GTGGCTTTGC
TCGTTTTCTC
GATATTAGCG
AGCTTGGAAA
GCGATACAAT
AAAAAAGCGG
GGAGAAAGGA
TCGCCCTATT
GAACTTATGA
AAGTGAGGCT
AAGCGCAAGA
ACATGTTAAA
CCCCTTATGG
TTTTAGAAAA
TGAAAGGCGA
ATAAAACGAC
CCCTCCAACA
TTGAAGCGCA
AATAACCATG AGAGCTACGG CGATAAAAAT GCTTCAGGGT TGCTTGAGCA TTAATTTAAA TTTAAATGCG AGTTCTTTTG AAATGACGCA CATTATCATT TTGAGCGCGG ATTTATTCAA CGGGCAGATT ACGCATGGGA AGCACCAAAA AACCATGTTC ATGCAAGAAG CGCAAAAAGC CGCGGGCGCG CCCACTTATG CGGTTCGTTT AGAAAATTCT ACTTACAGGG CGGAATTCGC TTCGCATTCT GGAGTGATCA TTAAGCATGA CAAAACGAGT AAAAATGGCA GTCAAGATTT TGTCAGCACG CAAGCGATGC AAGAAGCGAT AAGCGTAAGC CCGTTAAAAA AA
CTTTTCATTC
ACAAATGCTG
ATGCCCTAAA
CACTAAAGAG
ATGGATAGAC
ATGCTTAGGC
TACGATTTTA
GCTAGGCTAT
AAATATTTCT
CCAAGAAAGC
TTCTTTGATT
INFORMATION FOR SEQ ID NO:65: SEQUENCE CHARACTERISTICS: LENGTH: 849 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...849 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:65:
AGGCAATTTA
TTAATCGGCG
GGCCATTTGG
TTGGGAGGCA
ATCAAAGTGG
TCCAGCGTTA
TTAAAATTTT
GGGGAGCGGA
CAAGTGGTGC
AACGTGGAGA
TCAAGAAAGA
AATAATGTGG
ATGTTCACTC
GAAAAAGGCT
GAAAGGAAA
TTCAATTCAA
GGCTCTTCTT
GATTAGATGA
TTGCGACCAA
GTTTTGCAAA
AGATCCGTAA
TAGCCTTAGA
TTTTAATCTT
AAGAAGTGAT
AATTCAAGCA
CTCAATTTGA
CTTTGGATGT
CTTTAATCAT
CTGCTTTGAT
CACGAGGAGA
TTTATGCTTA
TGGGAAGTAT
CTCGCCCATT
GGATAAAGTG
AGATTCCAAA
GCAALAGCCAC
TAAAGAAGGG
GAAAGCGATC
CATTCTCGCT
TTCCTTGATC
GGATAAACGC
GCAAGCGCAG
AGATAAATTT
GAAATTTTGG
GTGTGCATGG
TATGALATATG
AATTACAAAG
GGGGTGGTCC
GTGGCGGTTT
AATGALAGAAT
CTTATGGATC
AGGAATGTGA
TCAGTGGATG
AACAACGCTA
GTCAAACAGG
TTGAGCTTAA
GACGCTAACC
AAAGGCATGT GAATTACACT
TAGGCTTTAT
TGGTCTATAC
GGATTCAAGT
GTTTGGATTT
CTTCTAGAGG
TTTATGGTAG
GCTTGAGCGG
ATAGGATTTT
ATCTCATCGC
ACAACCTTGT
GGCAATACGA
GAAACATTGA
CCTATAAAAC
TTTATGGCTA
GGACAAAGAC
GGGTAATGTC
GATGATCAAA
GTTTATGGGG
CGGCGATAAA
TGATGCTAAT
AGACGATGAA
TAACTTGGAT
TTCTAATGTC
CTTTAAGGCG
TAATTTTGTG
GATTTTTGGA
120 180 240 300 360 420 480 540 600 660 720 780 840 849
INFORMATION FOR SEQ ID NO:66: SEQUENCE CHARACTERISTICS: LENGTH: 453 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...453 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:66:
AGCAGCTCAC
GCTTTGGGGG
AAGCTTTTAG
TTTAGTTTCC
TTGTTATGCT
TTTTGGATTT
GTTTTAGGAG
GGGTTTTTGA
AATCTATAGC
TGTGGTTTTT
TGGCGCGCCC
CTAGCGGGCA
ATTCTAACGC
TTTTAATGGC
GGTTTTTATT
AACGCCCTTA
CTTACTCATT
CTTTAGCATC
ACGGCCTGTA
TGCTTTAGCT
AAACAATCGC
GTATGATAGG
AGGGATTGCT
TAAAGCCGCT
GGCTTGTGGT
TTATTAGGTG
ACCAATGGCG
TCAGCGCTTT
ATTAAAACTA
GTTTATTTAG
TGGTCGTGCT
TAA
TTGGGTTTCA
AATTCACCTT
AATTGGTTTT
TTTACGGCTC
TTGGTGCTAT
GGGTGCATTA
GCTCTTTAGC
AAAACGCATC
AAAATCCCTT
TGCGCATGGC
TTTGGCGTTG
TATTTTACTT
CCCTAGCGAT
GCTTTATTTG
INFORMATION FOR SEQ ID NO:67: SEQUENCE CHARACTERISTICS: LENGTH: 219 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...219 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:67:
ATGATGGATT TAGAAAGTTT GAGAGGTTTT GCGTATGCGT TTTTTACCAT TCTTTTTACG
CTCTTTTTGT ATGCTTATAT TTTTAGCATG TATAGAAAGC AAAAAAAGGG CATCGTGGAT 120
TATGAGCGAT ACGGGTATTT AGCGTTAAAT GATGCTTTAG AAGACGAGTT GATTGAACCA 180
CGCCATAAAG AAGTTCATGA TAAAGGCATA AAGGAAAGT 219
INFORMATION FOR SEQ ID NO:68: SEQUENCE CHARACTERISTICS: LENGTH: 300 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:68:
GGAAATTTAA TGGGACAAAC TAAAGAAATT ATAACGACGC TTTTACCCCT TTTGGTGTTG
TTTCTTATTT TTTATTTTTT GATCGTTCGC CCGCAACGCC AACAGCAAAA AAAGCATAAA 120
GAAATGATAG AGGGCTTGAC TAAGGGCGAT AAAATTGTCA CTCAAGGAGG GCTGATCGTT 180
GAAGTGCTTA AAGCGGAAGC GAATTTTTTT AGCGTGAAAC TCAATGATGA CACCACCGCT 240
AAACTTTCTA AAAACTATGT AGCGTTCAAA TTAGACGAAG AAACAACACC CAACAACAAC 300
INFORMATION FOR SEQ ID NO:69: SEQUENCE CHARACTERISTICS: LENGTH: 477 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...477 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:69:
ATGAAACTTT
TTTTCTGTGC
AGGGGGGGGT
TTAAGCCTAG
ATTAAATCCA
TTAGACGCGC
GAGTTTTATA
TTGCAAGTGA
TTAACCCTCG
CTTCTTTGCT
TGAACATGCT
CGTCTGCTTT
GTTTAGAGGG
TTTTATTGGA
GCGTGAAACT
TAGGGATCAT
TTTAATCGTT
AGAAACTAAA
TTTAGGGGTG
AGAATACAAC
GATCAGTTTT
ATTGCAAGGT
CACCCCTTTA
TCGTATCCGC
TTTATATTCG CGCTTCTTTT GGCCCTAAAA TCACTTTAGG CAAACCGATG AGGCTTTAAA GCTAAAAAGC AAAATATCTT GAGCTTTTAG ATGAAGATGA CATAGCCAGT TTGAAATCAA GAGCAAGAGG AATTGCGTAA TTGGATCATT TGGCTTGGCA
AGGGGTAGGG
TTTGGATTTA
AAACAAGTAT
GCTTAAAGAC
GGCGAAAAAA
AAAAGAAGCG
AAACACGATC
GAGCCTG
INFORMATION FOR SEQ ID NO:70: SEQUENCE CHARACTERISTICS: LENGTH: 264 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:70:
GGGAATGTTA AAGTGAATAA AAAGCACCGC TTGGCTTTTT TAGGGCTAAT TGTTGGGGTT CTATTCTTCT TTAGCGCGTG CCAACACCGC CTGCACATGG GGTATTATTC AGAAGTTACA GGGGATTATT TGTTCAATTA TAATTCCACT ATCGTGGTGG CTTATGACAG AAGCGATGCG ATGACTTCTT ATTATATCAA TGTGATTGTT TATGAATTGC AAAAATTAGG CTTTTATAAT GTCTTCACGC AAGCGAATTC CCGC
120 180 240 264
INFORMATION FOR SEQ ID NO:71: SEQUENCE CHARACTERISTICS: LENGTH: 495 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...495 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:71:
AAAGCGTTGA
CCTTTGTTGG
TTTTACGTGA
CGCCAAAACA
CAAAACCCCA
AAACAAGAGA
AAACAAGAGA
AACAGTGTCT
AAAACCTCTA
TGGTAGGAAT
TGGTTTTAGC
AAAAAGACAG
GCACGTTTTC
CCAAAGACAC
TTAAACAAGA
TTAAACAAGA
CGCCCGTTCA
GAGTA
GAAAACTGGA
GTTCATGTTG
CGCTCCAATG
GCCTAAAGAA
AGTGCCACCT
GATTAAACAA
AACTAAACAA
AAACGATCAA
GCATGGACCG
TTGTATGCTT
AGTCCAAATG
GAAGCCAACG
TTAGACACAG
GAGATTAAAC
GAGCAAGAAA
AAAACCCCCA
GTTTAAAACT
TAGCGCATGC
TAGAAAAAAG
CAACCACAAC
CCACACAAAA
AAGAGATTAA
AAGAAAATAA
CAACCCCCTT
TTTTGCACAG
TGTGCTTGGT
CGAGACAGAG
CGCCACAGAA
ACAAGAGATT
ACAAGAGATT
GCCTAAACAA
AATGGGTAAA
INFORMATION FOR SEQ ID NO:72: SEQUENCE CHARACTERISTICS: LENGTH: 291 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...291 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:72:
ATGCTTTATG CATCAAAAGC GCGTTTATTT TTACAAATCA AAGGAAAGTT TATGTTAAGA
ATTTTAATCC CCTTACTCAT TATTGTGTGG GTTTTATGGC GTTTGTTTTT GAGGCAAAAA
CCCCACAAAG ATGACCACAG AGACAACCAC TCTTACACGC AACAAACCCC CAAAGAATTA 180
GAAGATCACA TGATTGTATG CTCTAAATGC CAAACTTATG TCTCTAGTAA AGACGCCATT 240
TATAGTGGGG CGGTAGCCTA TTGCAGTGAA ACTTGTTTGA AGGATAAGGG G 291
INFORMATION FOR SEQ ID NO:73: SEQUENCE CHARACTERISTICS: LENGTH: 252 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:73:
AAAACAATCA ATAAATTAAG GATTATAATG AAACCAACGA ACGAACCTAA AAAACCTTTT
TTTCAAAGTC CCATTGTTCT TGCGGTTCTT GGAGGGATTT TACTCATTTT TTTCCTACGC 120
TCTTTCAATT CTGATGGCAG TTTTTCGGAC AATTTCTTAG CTTCTAGCAC TAAAAATGTG 180
AGCTACCATG AAATCAAACA ACTCATCAGC AATAATGAAG TGGAAAATGT AAGTATCGGT 240
CAAACTTTAA TC 252
INFORMATION FOR SEQ ID NO:74: SEQUENCE CHARACTERISTICS: LENGTH: 690 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...690 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:74:
TTCAAAAATA ATAAGACACC CCTAACTTTT GAGCGTGATA AAGGGTCATT GGATGGTTTG
AAAAAAGAAA
ATGAGTTTTT
ATAGGCACGG
CTTCAAGGCT
ATAGATTATG
TATGGGGTGG
TTTTTCTTTG
TTGGTGGTGA
GCTATCGGGA
GAAATTGGCA
TTGTTTTTTG
GACAAGGATT
TAAATATTTT
TGTTTATGCG
ATGAAGTGAA
GTAATGTGCT
GAGGAGATTT
GTTTGCAACT
ATACTTGGGA
AATTTGGGGT
TGAAAATCTT
TTTCGCACTC
TTACAAGCAA GTGCATTATT AAATGCTGAA AATTTGAGTT CCCCTTAAAC ACCAACAAAC TCCTAAAAAC GACTGGGCTT TTTCAATAAC GATTCCACTT TATGGTCGCT TACGCTAAAA AGCGGCTAAC ACATGGATAC TTCATTGAAA
GATTTCAATT
GCAGTTTCGC ACGATCGTTT TTTAACGCCT
GAAAGGCGCA
GTGGCATTTT
TAAGAATTTT
ACATGTCTTC
TTTTACAAGG
ATTCTAGGTA
TGCAAGCGIA
ACCCTATCAA
TCAATAATA
TTCATAACAC
TGTATCATAA
GCTTGTTTGA
AATACTGAGT
TTCTTATCAA
GGCTTCAATC
TTATTTCTTT
CATGTTCACT
CCGCTGGGCT
AGTCAAAGAT
TTATTTTAGG
GGTGGATGTG
AAGGAGTTTT
120 180 240 300 360 420 480 540 600 660 690
INFORMATION FOR SEQ ID NO:75: SEQUENCE CHARACTERISTICS: LENGTH: 627 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...627 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:75:
GGAATTTGTT
TTGCCCCCTA
AGCGTACGCA
CTTAAAGAAA
AGCAATCGTT
AAGCTAGAGC
TTGCAA:AGCG
CCCTTTTTCC
TTGAGCTTGA
AAGGGGGCTA
TTTAAAGACG
TGAAGTTTCA
AGGGCCATCA
CTTATTGGCG
ACCCCAAAAT
TCATGCTTTT
CTGGGTTTTA
CTCCTGGCTA
TAGCCTTTGA
TTAAAACCCC
ATACCCCATG
CATGGGAGTT
AATTGTGAGT
TTCTGGTTTG
CAAAGTGGAT
AAAGCTCAAA
ATGGAAAC
TTACTTGGAT
TTCATACACC
AGTCAAACCT
TAGAGGCTTT
GATTGAGGGG
GGAACGA
TTGTTATTGG
GTGAATATGT
AGAGGAGTTG
GACCCTAAGG
CGCTATACTT
TCTTTTAGCG
AAAAATGGCT
GACAGCAAGA
TTAGGGGTGT
AGCTTGAATT
CTTTTTTATT
ATATTGCGCA
TCGCCAAACA
GGCCTTTGTT
TAGCTAAAGT
TGGAAACTCA
ACGATTTTAA
TCGTTCTCCC
TTTTATTTGA
TAAAGCTTAA
AGCCTCTTGT
TCAAGGCCAG
CAATGAAACG
CATGCTAGGG
CCAATCGTTC
AAAAGGCATC
GAACAACCGC
TAGCGTGGAA
CAACAACGAA
AAACGCTTCT
120 180 240 300 360 420 480 540 600 627
INFORMATION FOR SEQ ID NO:76: SEQUENCE CHARACTERISTICS: LENGTH: 510 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...510 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:76:
AAACCCTTTG
TATGTGGGGG
GGTATTGGGG
ATCAGTCTAG
GATGTTTCTG
TTTAATACCG
GCTTTCAAGC
GGGCATCAAA
CTTATGATCC
GCTATCTCGG
CTCCAGCGCG
CTATTGGGGC
GGAATTTTTT
ATGCGTATCT
ATTTTGTGGA
AAAATTCCAT
TCAATAAAGA
TGTGTCTAAA
GCTATTGTAT
TCTTGGGGTG
TTGGAATATT
TGGTAACCCT
TCAATACGCT
TTTTGATTGG
GCGTTATTTT
GCAAGGGAAT
CGCTTGTATG
AAGCAAGGGA
GATTTTTCTT
TATAACAAAC
AACAACGTTA
AACCAGCGTT
ATAGGGGGCA
GGGATTTTTA
CGGATCGCCA
CGCAAAAAA
ATAGCAATGG
AGCGTTTGGC
AGCCTTATTT
TTAAAATCGC
ATATTCAAGG
TGGATAGCAT
CTTTCCTAAA
CCCTCACAGC
GTGGTCGTTT
TAACCTTTAC
GAGCGCCGGC
TTTAGGGCGT
GGTCTCTGTG
GCTTTATAAT
CGCTCTAGCT
120 180 240 300 360 420 480 510
INFORMATION FOR SEQ ID NO:77: SEQUENCE CHARACTERISTICS: LENGTH: 585 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...585 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:77:
TTAGGATCAC
TCTTTATTCT
GAATATTCCA
CAAACGATTT
ACAAACATC
ACCCCCACTG
AAAATAGTCA
AAAATGCAAG
TTACAATTCA
ATTTCAAATT
CCATGCAATT
TATCTTTTAG
TTAGTCATGC
CTAACGCTCA
AAAACACCTT
AAATGCAAGC
TGCTTAGCGG
AACCCATTAC
GCCAATCTCA
CTTTGAACGC
TCAAAAAACC
TATCGCTGAA
CGTTGAGCAC
AAACCAAATC
TAACTACACC
CGAACAATAC
TOOCOTTOCO
TAACCCTTTA
AAACAGCATG
GCTTGATCCC
TTATCTTCCT
GAAAATGOGGG
AATAATCCCT
TATAAACTCA
AACAACGCTT
TACCTCCAAT
TCTAACCCTA
GAATTGGTAG
CTTTCTTCTT
AGCTCTTATT
TATCTTTATT
CGTATGCGAG
TTTTAAATCA
ATCAAATTGA
TGAAAAACAA
CCACCCTTCA
AACTAGTCCA
AAAACTTAAA
TGTCTTCTCA
CTAAA
TTTATCTTTA
CGTGGGTTTT
AGAACGCATC
AAATGAAATC
TGCTAAATTA
AAACATTGAA
AGCGTTAGAA
AAATTTAGAA
GATCGCTCAA
120 180 240 300 360 420 480 540 585
INFORMATION FOR SEQ ID NO:78: SEQUENCE CHARACTERISTICS: LENGTH: 216 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...216 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:78:
AAAGGCAAGG TCATGAATAG TTCTAACCTC AAAAATTGGC TATTCCCCAC TATTTGCTTT
TTTTTATTTT GTTATATTTT AATTTTTTTG ATATTTTTTA TGTTTAAAAA CTTGCAATCG 120
CAATCTTTTG GCTCTGTGGC AGAAACCGGA AAAAAACCCA TCACCACCAC CAAGAAATTT 180
GGTAAGGAAT TGCAAAAACA GATTTCAAAA ATCCAT 216
INFORMATION FOR SEQ ID NO:79: SEQUENCE CHARACTERISTICS: LENGTH: 834 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...834 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:79:
TTGATGCGTA AGGTTTTATA CGCTCTCATG GGCTTTTTGT TGGTTTTTAG TGCTTTAAAA
GCCGATGATT TTTTAGAAGA AGCGAACGAA ACAGCCCCAG CGAATTTAAA CCACCCCATG 120
CAGGATTTAA ACGCCATTCA AGGGAGCTTT TTTGACAAAA ACCGCTCCAA AATGTCCAAC 180
ACTTTGAACA TTGATTACTT TCAAGGGCAA ACTTATAAAA TCCGCTTGCG TTATGCGATG 240
GCGACTTTAT TGTTTTTTTC AAAACCCATT AGCGATTTTG TTTTGGGGGA TAAGGTGGGT 300
TTTGATGCGA AAATTTTAGA AAGCAATGAT CGCATTTTAC TCATCAAACC CTTACAAATC 360
GGCGTGGATT CTAATATCAG CGTGATTGAT AGCGAGGGTA AGATTTTTTC TTTCTATGTG 420
TTTTCTACCA CTTTCACCAG CTCCAAACAC CCTAATTTAC AGGTTTTTAT AGAAGATAAA 480
AACTATTACA CTAACGCTTT TATCAAGCCC CAAAAAGAAA ATCAAGAAAA TATGTCTGAA 540
AATGCCCCTA AAGATGCCCA AAAAAATAAT AAGCCCCTAA AAGAAGAAAA AGAAGAAACT 600
AAAGAAAAAG AAGAAGAGAC TATAATTATT GGCGATAACA CCAACGCGAT GAAAATTATT 660
AAAAAAGACA TCCAAAAAGG CTATAAGGCT TTAAAAAGCT CTCAAAGGAA ATGGTATTGT 720
TTATGGGCTT GTTCTAAAAA ATCCAAACTC TCCTTAATGC CTAAAGAAAT CTTTAACGAC 780
AAGCAATTCA CTTATTTCAA ATTTGACAAA AGATTAGCAC TCTCTAAATC CCGG 834
INFORMATION FOR SEQ ID NO:80: SEQUENCE CHARACTERISTICS: LENGTH: 180 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...180 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:80:
TATCGTGAGA TGGATTATGC GGTGTTTAGC ATGACAATCC ACCCTGATGT GAGCGCCCGT
CCGCAAGTGT TGCTCATGCA TGAAAAAATC ATTGAGCATA TCAACCAGCA CGAGGGCGTG 120
GCTTGGGTAA CGTTCAATGA AATCGCTGAT GATTTCTTAA AACGAAACCC TAGGAAAAAA 180
INFORMATION FOR SEQ ID NO:81: SEQUENCE CHARACTERISTICS: LENGTH: 237 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:81:
CTTGTGTTAC AAGGGGCTGA GGTTTTAATC TATCCTAGTG CGTTTGGCAA AGCTAGGGCT
TATAATTGGG ATTTGTTGAG CAAAGCTAGA GCGTTAGAAA ATGGCTGTTT TGTGTGCGCT 120
TGCAATCATA GTGGGGAAGA AACTAACGCT AAATTAAAAC AAACGCTAGA ATTTGCCGGT 180
GATTCAAGGA ATCATCGCAC CCAATGGGAA AATCATCGCC CAAGCCACCA AGCTTAA 237
INFORMATION FOR SEQ ID NO:82: SEQUENCE CHARACTERISTICS: LENGTH: 183 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...183 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:82:
AATTTGCCGG TGATTCAAGG AATCATCGCA CCCAATGGGA AAATCATCGC CCAAGCCACC
AAGCTTAATG AAGTCATTAT CGCCGAAATG GATTTAAACG AAGTGGCACT GCAACGCCAA 120
AAAATCCCTT ATTTACAAGA TTTTGACACC AAACTCACCA AAAAGGGGTT TGGAAAACTC 180
ACT 183
INFORMATION FOR SEQ ID NO:83: SEQUENCE CHARACTERISTICS: LENGTH: 486 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...486 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:83:
ATCATTATGT
TTTTTAGGAC
GTTAAAAAAA
ATCAAAAA-AG
GGCGTTAAGA
ATTCAGGATT
TTAAATGAAG
CCTAAAGAAG
CATGTT
TTGGCATGGG
CAGAAATT
CGCTCAATGA
AAACCTTAGA
TTGAAGAATT
TGATGCAAGA
AAGTTTCCAA
TCCAATTAAC
CTTTTTTGAA
CCCTCAAGCT
CGCTAAGGAC
GTATCAAAAA
AGAAGACGCT
TTATAAACGC
TGAAGAAGCT
AACCGATAAC
ATCCTTGTGG
GTCGTGGATA
ACTTTAGATA
CTCTTTGAAA
AAAGTA.ACTG
AGCTTAGAAA
TTAAATAAAG
AACGCCAAAG
TGTTGATTGT
TAGTGAAATT
AAGAAATCAA
ACAAAGTGGA
CAGAAAATGA
CCAACACGAT
AAGTTTCAAG
AACACGACAA
AGCGATTATT
TTTTCGCGCG
TATTGAAGA
GAGTCTTAAA
GATTAA.AGC
TCCTAACCAT
CGATGAATCT
AGAAAAAGAG
INFORMATION FOR SEQ ID NO:84: SEQUENCE CHARACTERISTICS: LENGTH: 555 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...555 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:84:
CGCTATCGGT CCTTTAGCCG CAATCAGTCA AGTACTTTAA ACTCTGTGCC GTTGTGGATT GGCTTGAGCT TGTATGGGCC AAAGCTCATT GACAAAATGC AAGCTTTTTG CATCGCGCTT CAATTAGGCT TACCGGTAAG CTCTACGCAT TTTTTAAGGG AGCGTTTGAG GGAGCAATCC ATTGTAGCGG CGCACTTTGG GGAAGATTTA GATAAAGCCA ATTTGAAAGA AAAATCGCTC ACCGCTATCG CTTTGGAATT GAAAAAGAAA GAAGAAGTGA TCAAA
AACTTTAGAA
ATGGTAGTAG
AAAACGGTGG
TCAGCGGTCA
ATTGTGGTGG
AGAAGGCGTT
GAAGAAATTG
ATGCTAGAGA
GACAAAAAGT
GATGCGAGCA
GGGCAGCTGG
GGTCTGAAAT
TCACCGTGCT
GCGCGGTGTT
TTGCTAGAAT
AAGGCTTTTT
GCTTGAAAAA
CGCTTAAAAA
GCCCTATGGG
GATCGCTTTA
CACAGAATTA
TTTAGCCTCT
TGGGGTGGGC
CAGAGACAAC
AGAGCGCTTT
AAGCAAGAAC
AGTGTATAAA
120 180 240 300 360 420 480 540 555
INFORMATION FOR SEQ ID NO:85: SEQUENCE CHARACTERISTICS: LENGTH: 315 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:85:
GCGCAAATCT
TATGAAATCA
GCGATAGGGG
CAAAAGGTGG
GAATGGCTGT
GGGAAAAATG
TTGTGCGTTT CAATTATGTT TTAGGCGCGA TCGGTTTTGT GGTGTTACTT TTTCGTTTAT CTACTATAAA AGATCGTTAG TGTATTTGAT CCTTGGCGTG CGTTGTGTTT GCTCTTTGTT TTTTATTACA CGCCTTATAT TTTAAACGCT GTGAAGTTGC GCTTCAAAGT GCTGAATTTG CCCGCTCGCA CGCTCAAAGC TTAAGGAATT GTTTGTGCTG GTGTGTGCTT TGTTTTTTTG GCGTTTGTTT
CGCTT
120 180 240 300 315
INFORMATION FOR SEQ ID NO:86: SEQUENCE CHARACTERISTICS: LENGTH: 348 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...348 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:86:
AAGGCTTGCA
GTTTTGTCTT
GTAGTTTCTA
AACGATAAAA
GAAATCTTTT
TCCCCTGAAG
AGATGAAGTG
TCATCACGAT
AAGAAAATAT
TCCTTGATCT
TTGATTCTAA
CCAAACGGCA
CTCAAGCTTT ACCTCTAATA GCGTCTTAAA CTTTTTTGTA AGGGCTAGTG TTTTTCTTTT TGCGTTCCCA ACCAACTAGC CCCTAAAATT GAATTAGAAA ATTTTAAAGC GTTTCAAATC GTCCATAGAG GGCAAAAAAG CCCTACAATA CGATGATCAT AATCAAGCGC TATGATGAAG ACACCATTGA AAGCGTTGAG GCAGGATTTG TATTTCTTCC CTAACGGG
INFORMATION FOR SEQ ID NO:87: SEQUENCE CHARACTERISTICS: LENGTH: 315 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:87:
AATCAAACAA
CGTTGGTGGT
TTAGAGAATA
GTAGCGAACG
AAAGACCGGT
CGCTATGAAG
AGCCAGAAAG AAAAGAAAAA ATTCCCCACT GTTTTTTGGT GTGTTGTTTG AGCGTTTTA.A AAGGCTTGAA AAAAGAAAGA GAGCTTTTAG ACAAAACCAA AACCCCGTT ATTCAAGGCA TGTTTGCGGA TAAAGTGAGC GTGTTTTTAA
CAGCA
TTCAAAGGAG
GCGTGATGGA
AAATTACCGG
ATGTGCAGAT
ACGATAkACG
GGTTTTAATG
CGCTAAAjA
CAACCAATTT
CAAAAAAGGT
AAAGCCAGAA
120 180 240 300 315
INFORMATION FOR SEQ ID NO:88: SEQUENCE CHARACTERISTICS: LENGTH: 957 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...957 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:88:
AAAGAAAGTA
ATTTTAAGGC
A.&AGAAATTT
GACAAAGCCT
AATGCTAATT
AACCACTTTG
AAAAAGTTTG
TTAAAAAACT
AAAACGAGTT
GTTGAAGATA
GATAAAACTG
TCTCCTGATA
TTTGGAA.ACA
ATTCTATGGA
TATTGCCTTA
GGCTCTATAA
TTATTCTGGG
ACCCCACCCT
CTATTAGTGC
ATTTAAAAGC
TAATGATTGA
TTTTAAACGC
GCGAAGAATT
AAAGCTCTAA
GGCTTATGGT
CTGGGCATAA
CTTAGACAAA
TTTAGATAGC
GCTTAATGGA
GTTTGGGGAA
TAATACTTCC
GGATAATGAA
TTTCAAGCTT
TGGCAAAAAC
TCTAATTGAG
AGATTTAAGA
ATTTACATAT
AGGCGAGATT
AGGCATGGTT
CTCAAAGATT
GGTATTACAG
GTAAGAGAA
CAATTAGCGA
ATTCCTACTT
TTGAGTTTGA
TCTAGTATCT
TTACTCATTA
TTTATCCCCA
GCGTTTGAA
GAAAACGCCT
GACACACGAA
TCTACTTTAC
ATAGGGCTTT
AGCTAATTAT
AAGTCTTTGA
GTTTTAGAGA
CTAGATACAG
ATATTAGAGT
GTCAATACGA
GTGGAGGCAC
AACACACACG
ATCATAAATC
TGAATATGGC
ATTCCATGCT
ATGCAGATAG
AAGAAACGCT
GAATAAAGAA
TGAAAATTTA
TTTGTTTTTT
AGTGAGCATG
GCCAAGTGAT
TTATGAGTAT
AGGCAGTGGA
AATAGTAAGC
CTTGTTAGTG
AATGAGAATG
CTTTTTAAGA
TGTGCATGGG
120 180 240 300 360 420 480 540 600 660 720 780
GTTATAGAGG CGATTGCTTT AAATTTACAA ATGAATAAAA GTGGTTTAGA TGTCAATGTA 840
GCAAAAAAAT TCTTTAAAAG CAGTGTGGAT GTTGTCGTTC AAATTGTATT AGATAAAGCC 900
ACTAACACCA GATACATTCA AGAAATCTTA CCAGCAAAAG ATTTAAGAGA TAGTCTA 957
INFORMATION FOR SEQ ID NO:89: SEQUENCE CHARACTERISTICS: LENGTH: 261 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...261 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:89:
GGAGTTAAAG CAATGAATTA CGATGTTTTA ATGGGATTTT TAGCACTAAT CTTGCTAATT
CTTTGGTATG CCTATGGATT AAGGCAATAT CTTAAATTAA AAGATAAGAA TAAGAGATTA 120
AAAGAGAAAT TACAACGCTG TAATTGTAAT ATTAAAATTC CTAGTATTCT TGAAATGGCG 180
CATAAACCTA TCATTATGGA TATTAAGGGG GAATTGCTAC CACATCTTAC AGAGAGTTAT 240
AGAAAATCAA AATTTAAGGA G 261
INFORMATION FOR SEQ ID NO:90: SEQUENCE CHARACTERISTICS: LENGTH: 285 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...285 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:90:
CTCTCAAGGA ATGGCGTGGT CTCTAGCGTG AGTTCTGATG GCTCTAAAAT TTTAATGTCT
TTAGCCCCTG ATGGCCAACC GGATGTTTAT TTGTATGACA CGCATAAA AACTAAAACC AAAATAkACGC GCTATCCGGG GATAGATGTT TCAGGCGTGT TTTTAGAAGA TGACAAGTCT ATGGCTTTTG TTTCAGATAG ATCCGGTTAT CCTAACATCT ATATGAAGAA
ATTGGGGTTA
AAAGAGAGGC GGAGCAACTC CTTTATGAAG GAAGAAGCAA TGAAT INFORMATION FOR SEQ ID NO:91: SEQUENCE CHARACTERISTICS: LENGTH: 438 base pairs TYPE- nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genoric) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .438 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:91: 120 180 240 285
GGGGATAAAA
GAGGGCGAAC
GTTTTGACTG
CAAATACGCC
GATTATTTTG
CAATCCTTAG
TTTGAAATAT
TTTATCAATG
TGCTAAAA
ATTATGAAAT
CGATCAATAA
TTCAAAACGA
AAAAGGATTT
AAAAAACTTC
TAGAGGGAGC
GCGCGAAA
GTTATTGCTC
CATCGCTGAA
AACCTATAAA
TTTTTTAGAG
TAAAAGCGTA
CAACAAGCTC
GATAACGCAA
ATTTCACTAT
CTTTCCAAGG
ACCTGTATTG
AATTTGTCTC
GGGGTTTTAA
GTATGCGTTG
ATCATAGGAT
TTTTAGGGTT
CTTTTTTGAA
AAACCGGGCA
AAACAGAGCA
AAACCTTGCT
CCCCAAAAAA
TAGAAGAGCA
TTTAAGAGCA
AGCCAAAGAG
TGATCGCACT
ACAATTTGAT
TAAAGACATC
CGCTAAAAAT
GATGAATCAA
120 180 240 300 360 420 438
INFORMATION FOR SEQ ID NO:92: SEQUENCE CHARACTERISTICS: LENGTH: 345 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...345 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:92:
ATCCAAAGGC
GTCGCTACCA
GGTGAAAGGA
CGCCAAAGTA
GGGGGCTTAA
ACGATAGATG
TAAAAGAAAT
GTTTGCAAGC
CTTTTGAATA
ACCAAACCCG
TGGCGCAAAA
GCGTCGCTCA
GCTTAGAAAT
TCAAGAAT
CAACCATCAA
TGATATTTTT
AATCTATGCC
AAATGGTAAC
CAATTTCGTA
ACCCCCACTT
ATGTATACTG
AGGACTAGGG
AGGGGGATTG
ATTTTCCACC
TCGTGTTTGT
TGGGTAAGGT
ACAGGAAAGA
CGGATGTGAA
AGAGCCGTCT
ATGAC
CACCTGCATT
AACCACTAAG
GCTCCAACAA
TGCGGCCAGT
CTTAAGGGTA
INFORMATION FOR SEQ ID NO:93: SEQUENCE CHARACTERISTICS: LENGTH: 792 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...792 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:93:
GCGGTAAGTT
TTATTGTCAT
GATCCAACGC
GGGACTTATA
GAAACGCTCT
AGTGCTAAAG
GGGCTTTCGC
AGCTATTATC
ACCAATCAAA
TTTTCCATAC
TCGCTTGATG
GAGGTAGGGG
ATTATCCCTA
CCAAACATTC
GGATAATGGC GTGTTGGCAT AAAAGATTGG
CAGTTGGTTG
GCGTAATGAG
TCCCTGCATG
AAGAAGGCTA
ATTCTAATGG
AGCTGATTTG
TCTTAAAAAC
ATGCAAGGAT
AAACACCCAC
AGACAACGGA
CTTCTTTTAA
AGTTTATGGG
AAGAAAAGAT
TGCTAATAT
GATTTATTCT
TGCGACTTTG
CTTGTATCCT
CATAGCCAGC
CGCCGATTTT
TTATGCCAAA
TTCCGATCTC
CATTTCTGTG
AAAGGGGAGC
AATGACTAGC
TGCGCGATTT
GTTAGTATCG
GTTGCGTTAT
CTAGAAAACG
AAAATGATTT
TTGCACCTTG
AGAGACGATT
TACGCTCAAA
TACTACCCGG
GCTGAACTTT
GTGTTGTGGG
AGCACTTTAG
TTAAGCGCTC
TTAGAGACGA
TGAAAGTGTA
GGCGTTATAT
TAGCCAAAAT
AAGCGATGGA
ATTGTCATAA
CTTTCCATTC
CGTTAAATGA
TGAAATCTAA
GAGGGAGACC
AAAACCAAGA
TAAAAAATCA
TTGTATCGTT
TCCGCCCCTT
TTTTAGCGAT
CGCTTCTTCT
GCAAGACAGC
TAGGGATCA
AAGAGAAGAG
AAATCCTTAT
GGGGAATTCT
AAAGTTTCTT
TTATTTTAGC
AAGTTTGGTG
AAATATTTTT
120 180 240 300 360 420 480 540 600 660 720 780 792
INFORMATION FOR SEQ ID NO:94: SEQUENCE CHARACTERISTICS: LENGTH: 441 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...441 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:94:
CTTGGAGGAC
AACACGCAAA
AAAACCCATG
CGCTCTAAAG
CTTGAAAAAA
TCTTTAGATC
TTAGAGGATT
GTGATTTTGC
TAGACATGTA
AAGTGAGCGA
AAGAAAAAAA
AGCTTCAAAA
ATCGTAACGA
ATTTGCAAAA
TTCTTTTCTC
TAAGTGGCGT
TAAATTAGGA
TATTGCTAAA
CCAACTGAAC
GGTTGAGATT
AAGTTTGGTG
ACAACGATCO
ACAAGCCCTA
ATATTTTTAT
GACATCCAAC
AGTCGTTTGA
GAACGCCAAA
CAAGAAAAAG
TTTTTACAAA
AAGGGGCAGA
TAGCCACCTT ACTATCAGCT ATAAAGAAAC CCTTTTGAAA GTTCTTTAGG CGAAGCGATC TGGTCGCCTT AAAAAAGAGT TCCTAACCAA CTACCGCAAG AGAGAGTGTT TGATACGCTT ATTTAGCCTC TTCTAATGAT
120 180 240 300 360 420 441
INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 1569 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1569 (xi) SEQUENCE DESCRIPTION: SEQ ID
TATATTCCAT
CTCGCGCTTC
AATAAGATCA
GTTAAAACCT
TCCACGCTCA
CACATAGACA
GCTATTGTTA
TCCAATGTCG
CACTTAAGTT
CCCGCTTATG
GAAGGGCATT
TCAATTTATT
TTTTAATCGG
TCGCTTCGTA
TTAAATTGAG
TTCTTAAGGG
TTAAAAATTT
CTTCTGGGAA
CTCAATCCCA
TGAACGCAAA
CGAACGCAAA
TGATCCTAAC
TCAAGGACAA
CCTTTTAACG
TATAGAGAAA
ATTCAACTCT
GGATTTCTCG
ACGCTCTTTT
TATCAAAGGG
CACTGCCTAC
AGACGCCAAT
AGTGTCCTTA
AGCTAATAAC
ACAAACATGA
GCCTATCTTA
AAAATCAATC
TTGGATTTCA
CTTTTAACGC
AAGGATTTGA
CATAGAAAAG
AACGCCCTTT
TTAGAAGATT
CAAGCGGATT
GCTTTAATCA
AAAAACTTCT
TCCTTTTTAC
CGAACGAGCG
AAGCTCAAGC
AAAGCGTAAA
TACCCTACCC
CCCTTGTAGT
TAGATGATTT
TGCTTTATTT
TTAACTCTCT
ATAACGCCCT
TTATACCATG
AGAATGGGGG
TTATTTGAGC
CAACGATGAT
TTTGGATTAC
CTTAAGAGGG
TCAAGGCGTC
CAAGCTTTCT
AATCAACCGC
AAAGCCCTTA
AATCAATCAA
120 180 240 300 360 420 480 540 600 660
ATCTTTCATT
AAAGGAAACA
CTA,'AAGCG
CCCCATCTAG
AAAGGCGATA
GACGGCGCGC
TCCACTTCAA
GCTAACTTGG
GCAAGATTCC
ACTAA.GAA
TCTGATTTGA
TTAAACACCA
ATGAAACTTC
ATTCAGCAAA
GGTTTAGATC
GGGCTTTTT
TAAACCTTAA AGACACGCTC GTTTTCAACC
AAGCCATCAG
AATATTCTTT
CCAAACTCCA
TAGAGCAAAG
TAGACTTCAC
AAGCCTTAGA
ATTATGACCT
TCAAAAATGC
TTTATCACGA
GTTTAAAAAG
AGCAAATGGA
AAGGCAGCAT
ACCTGCAACA
ATTTGCTTAA
CGATACCACC
CCCTGCTTTA
AAACATTACC
CCCTAAACTC
GCTTTTAAAT
TTTATTCCAT
TATCTCTAAG
ATTCAGCGAT
TGCCAATCTA
CCCCAAAACC
CATACTCATG
GCACCAGCCA
AGGCTTGAAA
AGATGATAAA
CTAACTAGCC
AAACTCAACG
AACCACCCCT
TTAAAAGTCA
AAAGATTTGA
TACCCTAAAT
CAAGGCGTGT
TTCCTCTATT
GTAAGCCAAA
CAATTGAAAA
GATGCGGAAA
AAATTTTCCC
GAAATCCTTA
CTCAAAGAAA
TCTCGCACTC
CTTTAGTTAA
CCCCCTACAC
TAA_2AGGGAG
GTGGCCATTC
AAGCCCGTTT
TTTTCCAATC
TGAAAGCCJA
CCATTTCTCG
TCAACCAGCA
TCCATAACGG
TCTTAAAATT
TCATTTTAAA
AAAACGACAC
AGCTTGAAAA
AAGCGATTTT
TTTCACAGCC
TTTAGAAATA
CTTGACTTTA
AAATTTACTA
TTCCAATATT
CATTGCAGAC
TCTCAAGAAC
TTTTGATATT
ACGCCTGCTC
CTTGTTGGAT
CATCTTTAAA
CGAAAAAGCC
CCTTAAAAAA
AGGGCTTAAG
720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1569
INFORMATION FOR SEQ ID NO:96: SEQUENCE CHARACTERISTICS: LENGTH: 666 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...666 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:96:
GCATTTCTTT
CTAACAATTC
AATAAGCTAA
ATACAAAAGG
TTATGGCTAG
CCTACTAAAA
ACTAATGTCA
GTTTTAAAAG
ATTGCGTGGA
GGCATTTACC
TTTTTGCGTA
TGGCCC
ATATTCATTC
ATCTGAATGC
ATCATGCTAG
TGAGCGTTTT
GCACAGCCTT
CAGAGCCTAA
TGATGACCAA
CCGCTTATCA
AAGAATCATG
ATGCGTATAT
ACGTGATGGG
AATAGCGCTT
AAAATCTCAA
AATTATACTT
GAAACATTTA
AAGCATGTTC
GCCCGCTAAA
TTGCGATAAC
ATTCGGATCT
CGCAGGGACT
CCCTAGCGTT
GAGAATTGCT
GCGCGGGTAA
TCCTCCCTAA
GAATTTATTC
GCCCCACTCA
TTAAGTTTGA
GGGGTTAAAA
CTTAAAGACT
ACAGAAAATT
TATAAAATCA
TTAAAAAGCT
ACATTAAAGA
TTCTTCAACT
AAAACAAAAT
CTAGTTTAAT
TTCATATCCC
ATTTGAACGC
ACAAGCCA
TTAACGCTAA
TAGGCTATGA
ATTTTTCGAA
ATGGGCATAA
CGATGCGTTT
AAAACCTCTA
CACTCTTAAG
TTATTTCTTA
TTTTAAAGCC
TGAAGAAAAC
ATCGCCTGTT
GCAAAAAGAA
AATGGCGGGC
TCCGAGTGCG
TAATAGCCCC
GCTTCTGAAG
120 180 240 300 360 420 480 540 600 660 666
INFORMATION FOR SEQ ID NO:97: SEQUENCE CHARACTERISTICS: LENGTH: 897 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...897 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:97:
TTAGTTATAA
AAAAAATTTA
GAAAATGACG
CAAAAAAATG
GTCAAGTCCA
GCAAAGATTC
TTGTTCAACC
TTTCATTTTG
AAGGAAAAAG
CAAAAATTAG
GGGAGCGTGG
GATGTGAITA
TTAAAACGCC
ATGTGGGGCT
ATCCAAGATG
TATCGCTACT
TTCTATCTTC
GCTCTAAACC
AAGCTCAAAA
TTTCTTATGT
GCGTGGGCGA
AAGGGTATTT
ATGAAAAAGC
ACGGCTTAAA
AGCATGCTAA
TGGAGGTGCG
GGOGGACAG
GTGTGATTGA
TGAATGACGG
TGTATATGCG
AACAACACTC
TCTTGTTTTC
AAACGATTTG
CGAAACCTCT
CGGGCTTTCT
TATGGTGGAT
TAAAGACGTT
CAGGATTGCC
ATCCCAAATG
AACGGCTTTA
CACAGAAAAG
TATTTATATC
ATCTTTGAGC
GAAATTGCGC
TAGGGGTTAC
AAGCTAAAAT
GCATGTATCA
GCTTCTCCAA
CAATCCAATC
TACATGTCTG
TCTAAAAAAA
TATGCCACTT
GGGGTAGAAA
GGGATCAAAA
AAAACGGCTT
GTCAGTGAGG
AAACAATCCA
GCGAACAAGC
TTAGATCAAT
TTAGACGCtC
CCATTAAGGA
ATACCOGGCGT
AAGAAACCCC
AAACGCCTAA
ACATGCTCGC
TAGACACCGC
TTGAAAACGG
TCAAGGGTTA
AGGGCGACAC
TAGAGGGGCA
GAGCGTTATT
TTTATGAGGG
AGCGCGATTT
TAGAATACGA
AATCAGTATT
TGAAGCTTTA
AAAAGAGGCT
AGAAATGAAA
TAATGAAATT
TGTTTTAGCT
CATTTTAGAG
TGGGACTGAA
CTTTGATGAG
GGGCTATTAT
GATCGTGTTT
AAGCGATAAA
CATGGGCTGG
TTCTTTGCGT
120 180 240 300 360 420 480 540 600 660 720 780 840 ATATTTCTTC GCCTTTT
INFORMATION FOR SEQ ID NO:98: SEQUENCE CHARACTERISTICS: LENGTH: 864 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...864 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:98:
GAGTTTGGTG
GAAACGGCTT
TCCACAGGGA
AAAGAGCGCG
TCCGCTTTTG
TACTATGAGC
AAGCATGTGC
AAAGGCTCAG
AAATACTTGA
ATCGTTCTTA
AGGAGCAGTG
ACACAGAATT
ACAGCTTCAA
CGCCTTTCAA
AGCGACTCTA
TGAAGCGAAT
CTGCTACTAT
ATGTGAAAAA
TGAAGAATTT
GAGCTTTCTT
ATAGGAATOG
TTAAAGACTT
ATTTTGGGGC
TCAGCATGGT
AAAAAATGGA
GGGAGCTTGA
CTTCTAACCC
AAGAGCCAGA
AGAAAGAAAA
GTAAGTCTGA
TTTATTTTTT
TAACACTACA
AGACCGCAAG
TAACCAGTAT
TAAGGGGAGT
CAAGGTTTCT
AGGGACAGAG
TTTGCATGAG
TAGGGTGAGC
AGAACAAGCT
AAGCCATACT
AGATCTAGAC
GACTTCTTCA
GCAACAACAG
AAAA
TTAGCGGCTA
GTTGATCCCA
AGGGTTTTAA
TCTGAAACCA
TTGGAAGATT
TTTGTGGTGA
CTTTCACTCC
CAGTTTGGGG
CAAAAAGAAA
GAGAAAGACA
GATAGCCCTG
CCTATGACTA
AAAAAGGAAA
GCCTTACAAC
CGACTTTTTT
ATGTTATGTT
AGAGCATGGT
AGATGAGTAA
GCGTGGAGCA
ATGACAGAGA
CCTTGTTCAA
ACATGTATGA
AGGCTAGAAA
CTAAGGCAGC
AATTTATAAG
ACGCTAACAC
AAAAGCCCAA
AAGAGTTTGA
GTTGAGAGCA
TTCTGAAAGC
TGATTTAGAA
GGGCGATTTA
AAAGATTTGT
AAAGTTTTAT
CTGGCTTTAC
TGGGTATATC
AGTGGATGCA
ATTCCAAAAG
CTCTTCTAAG
GCTCAAAGAA
GAAAAAACGA
AAAGCAAATT
INFORMATION FOR SEQ ID NO:99: SEQUENCE CHARACTERISTICS: LENGTH: 1221 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1221 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:99:
AGTCGCACCC
CAAACACACC
AGCGCCATAC
GTTGGGGGTA
CTCAAACAAG
CAAGCAGGAA
TTATCCAGTA
AAAGGCGGGC
CTATCAGGGC
TTGCGTTTGG
ACCACCAGAG
AATCGTGTGG
TCAGAAGGGA
AATTTGGCTT
GTGGGAGCGT
GTGGATTTTA
AGCAATAAGA
TTTGTGCAAA
GCAAAATCAA
CGCAAGAGAG
TCGCCACAGG
CCGAAGAAGC
AAGGCTTTAA
AGATTGATGG
AATGGAACAA
TTAGAAACTT
GCCAATTCAA
TGAATTTCAA
GTTCTGGAGC
TCACTAGCAG
CAAACAGCGT
ATTTAGCCCC
ACCATCTCAC
CTCATATTGG
AATCGTTTTA
TCGCCCTTTA
TCATGCCGCC
CACTGCTGTA
GAATAAAACC
TGAATTTCCT
AGGTTGGGAC
GCTTGAAGTG
TACTGGTGGG
TGGCAATTCT
CGCTAAAAAT
CGGGAGAAAA
TAAAAATGCG
TAAATTAAAT
TTCATACAGC
TGTGGGGGAT
CACACTGGAT
CAAAAAGAAA
GTTTCTCTCG
TTTTTCACGA
GGAACGGTCT
CCAGATAAAC
AACAAGGAAT
TGGGGGAACG
GATATGAAAG
GATTTAGACG
TTCACAAGCT
ATTTCAATTG
GCCAGCTCTA
GAAATTTCTC
GGTAATGTGT
ACGATCAACA
CAAAACGCCG
TTGTGGCAAA
GGAAGAAAAT
TTTTAGCAGG
CCGTGATCAT
CAGGGCTTCT
CCGATAAAGT
ACGACTTATA
CCGCTAGGCA
ACGCTGTAGG
TGAATATGCA
ATAAGGATAG
ATAATTTTGT
CGGTTTTGAC
TTTATGATGG
GGATGGGCCG
CTTCAAAAGT
CTCAAGCGGG
GCGCCGGGTT
GGAAATACAA
AGCGTTGATT
TCCAGCCATT
TAGTTGGGGA
TTGGCGCATT
CAAATCCCTT
TTATTGGGTC
GACTTATAAA
AAAAGCCACT
CGCTGATCGC
AGAAATCAAT
TTTGCAAGCT
CGCCACGCTC
TTTGCAATAC
TCAAGGGGAA
CATTATCGCT
AAATATCATT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020
GCCCCTCCAG AAGGTGGCTA CAAGGATAAA CCTAATAGTA CCACTTCTCA AAGTGGCACT 1080
AAAAACGACA AGAAAGAGAT CAGTCAAAAT AACAATAGCA ACACAGAGGT CATTAACCCA 1140
CCCAATAACA CGCAAAAAAC AGAAACTGAA CCCACCAAGT CATTGATGGG CCTTTTGCTG 1200
AAGGCAAAGA CTCGGTTGTC A 1221
INFORMATION FOR SEQ ID NO:100: SEQUENCE CHARACTERISTICS: LENGTH: 489 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...489 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:100:
CAAATCTTTT
AAATCGCCCT
TCTAAATCAA
CTTTCAGAAA
GCTCTCAACA
TATATCCATA
TCTTCTATTT
CTTTCAAAGG
CAAAAACTT
GCTCCACGCA
TACTCATCTT
CCATGCTCTT
ACATAACATT
AAAAAGTCGT
ACCGCAACAA
GTAAAAACCC
GTTTTAATCC
ATCTTCCAAA
GGTTTCAGAA
TAAAACCCTC
GGGATCAACT
AGCCGCTAAA
AAACAGACGC
ACCTAGAATT
TGTGGTGGTG
CTCCCCTTAA
TACTGGTTAA
TTGCGGTCTT
GTAGTGTTAA
AAAAATAAAA
TATTCTTCAT
CAAGCTATTT
ATACTTAGGA
AGAAAGCTCC
AATTCTTCAC
TTTTCACATT
TAGTAGCAGA
TTCGCTTCAC
TAAGCCCTCT
TGATTATCTT
TCCAAAAAAT
AAAAGCGGAT
GCGCTCTTTT
CCCTGTGGAG
AGCCGTTTCT
ACCAAACTCC
TTTAAAATCT
AAAATACAGA
GCGGACTCTC
120 180 240 300 360 420 480 489
INFORMATION FOR SEQ ID NO:101: SEQUENCE CHARACTERISTICS: LENGTH: 603 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...603 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:101:
GGTTTAATCA
CTGTCTTTCT
ACCGAAACCA
AATACCATCA
ACGATTTCTT
TATCTCAAAG
CATCTTTTTA
GCCGATAAAC
TTCAACACCC
ACCCAAGATT
TAT
CAATGGATAA
TGTTAATCGC
CAAAGCAAGA
CCCAAGATTT
TTGAACATGC
ATAAAAAGTA
GCTCCAAAGA
TCAAGCCTTT
CTTATAGCGC
TAGGCACTCT
AAACAACAAT
TCTTAATAGC
AACAACCAAC
TAGCGTTACC
CAAAATTGAA
TCTAACCCCT
AAACTCGCAA
AGAAGTGCGT
CTCAAAAACC
TAGCATCATT
AATAATCTCC
TATTTTTTCC
AACCACACAG
CAAACCATCC
ATTGATTCTT
AAAGAAAAGG
CCCTCCCTAA
TTTTTAGACC
ACCCTTGGGC
AAAACCCTGA
GCTTGATTTT
AAGAACCAA,
CAACAAGCCC
CTCAAGAGAG
TAGGGCGCAT
GCTTTTTAGA
AAGAGCTCCC
CCACGCTCAA
CTAACGAACA
CTTTTTATGA
AGCGATCGCT
CAAAACAACA
CACCGCGTCC
TTTGTTAAGC
CAAACAGGTT
GCATGTGAGC
CCTTTTAGCA
TAATAAAGCG
GCTCGTTTTA
TGATTTGCAT
120 180 240 300 360 420 480 540 600
INFORMATION FOR SEQ ID NO:102: SEQUENCE CHARACTERISTICS: LENGTH: 204 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...204 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:102:
GGAGTTTCTA TCATTATGGT ATTGAAAACA AAATTAAAk.A TTATAAGCTC GGTGATTTTG
AGTACTTTAT TGTGGGTGGG TTGCTCAAGC GAAATGGCGA CTTATCAAAA TGTGAATGAC 120
GCCACTAAAA ATACGACTGC AAGTATTAAT AGCACGGATT TATTGCTAAC CGCTAACGCG 180
ATGTTAGATT CCATGTTAGG GACC 204
INFORMATION FOR SEQ ID NO:103: SEQUENCE CHARACTERISTICS: LENGTH: 528 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...528 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:103:
GCGATGAAAA
GTAGGTTGTA
GCGACTAAAG
AAATATTCAG
TATTCTACGA
ACTTTACAAA
TCCATAAGCG
TCTAAAATGC
AAGCAAATTG
ATCAAGTTAA AAAAATTTTA GGGATGAGTG GCCATGCCCC AAAATCAGGT ATCAGTAAAA GCGCTCCTGA TTGGGTTGTA GGGGATTTGG GGGTCTTTTT AGGAAGGGCT GAAGATTTGA ATCAGGCTAC AGCCAAAGCT AGGGCTAATT AAGATTTGGA AAATGAAAAA ACCAGAACGG GCACTGATAC TGAAAAAATT TCTCAATTAG TTGCCCGCTA TGTCGGTAAA GATAGGGTTT TGGATAAAGT GCGCGAAGAG TTAGGCATGG
TGGTAGCAGC
GCAATAAGGC
AAAAAGTGGC
TCACTAATAA
TAGCGGCGAA
TAGATGCTTC
TGGATAAGGA
TTGTTTTAGT
TTAAAAAG
GATGGTGATC
ATACAAAGAA
GAAGTATGAA
TGATGTGGAT
TCTAAAGTCC
TGGTAAAAGG
ATTGATCACT
GGGCTTGGAT
120 180 240 300 360 420 480 528
INFORMATION FOR SEQ ID NO:104: SEQUENCE CHARACTERISTICS: LENGTH: 300 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:104:
CCCAAACCTT
GGTAGGGTTT
CAAGCTTTTG
AATCTACCAT
AACTCTTCAT
ACGCCGTCAG TATTCTATGT ATTTGCGTTA TGTTTATACT TTTTAAGTTT TGGGTAAGGC TTATAGCTTA TACTTATATA TATATGAAAG CTTGATTTGT GGTTGTCATT GAGTTGCAAT AACTCTATGC TGTTTTCTAC TTTTTTGATA TACCACACAA TGAGTCCTTA TGCTGTTGTA GGGATATTTT AGCGTATTCT CGCTAAAGAC ATATTCATTG GAGTCAAATT TTTCTTTCAA TTCTTTATTT
INFORMATION FOR SEQ ID NO:105: SEQUENCE CHARACTERISTICS: LENGTH: 300 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:105:
ACAGCAAGAA TGAATCGCAT GAAACCAATT TTTTTGTTAA CTTATTGCTA GAGAAAAGGA CGCTTCTTCA AACCTTTTTG AACAGAGAAC AAGAATTGAA AGAGCAGGAG CAAAAAACGC CCTTTAGTGG CGTTAGAAAT TGTCCCCCAA GAAACGCCCT AGGGAGTCGT ATTATTTGAA AGTAGAGCGC TGTAGTGGAG TCTTTTTGTT GTTAGCGAGC ATTTGATCGA TAAGGGGATT GCTTGAAACT GGCTCAAAGC ATTTAGAATG GCAAGGGGCT AGCGTGGTTA TTTTAAAGAT
INFORMATION FOR SEQ ID NO:106: SEQUENCE CHARACTERISTICS: LENGTH: 693 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...693 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:106:
GTTGCTCGCG CTTTTATCGC GCCTTAAAAA CCGAATTTAT CCGGTTGTAG TGATCCCCCC TCCAAGCCGG CTTCGCAAAA
AGTGTTTTAG AGCAAATTGA GAAAACGCCA CTTCAGACGC AAAATCATTC AAAAACTCCC ACGCCTTTAA ATAAAACCCG AGGGTGATGA AAGTCCTTAT TACGGCTCTA CCAACCCTAT CGTGTGGAAA TCTTTTTTTC GATGAAGAAT TCAATCCCCA
TCTTTATGCC
TAAGATTTTT
TGATTCAGGG
TACCGAAACA
TCAAGGCTCT
TATCAATCAA
TAAAAGGGTG
TTTTAAAAGC
ACAATACGGC
TGCGCCCAAT
AACCGATGCG
TAAACAGCAA
ATTTCAGCGG
AATTACGCCC
AAAGAAGAAG
AAAGCCACTA
GTTTTAAAGC
GACATGATGC
CATATTAATG
CACTATGAAT
GTGGATCCTA
GATTCCCTAG
AACGATTTGA
GAA
TCAATAAATC CAAAGTGGAA CAAAGCCAGA GGCGATGCAG AGCAAATGGC GAGCGAAAGC TCGCTCGCAA GGGCGAAGGC TCCCCTCTAG TTTGCTGTTT TTTATATTGA ACGGATCGCT TGAGAGGTTT TACGGATAAC TAGCCGCCAA TCGCGCTTAT ACCAATTGTC TTTTTCTTCT AAAACAGAAT GAAAAACAAT GCAAGATCCA TTCTATTTTA
120 180 240 300 360 420 480 540 600 660 693
INFORMATION FOR SEQ ID NO:107: SEQUENCE CHARACTERISTICS: LENGTH: 294 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...294 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:107:
CAGCGTCTAT
GTGAATGTGG
ATCCCCACGC
GGCCCACTAG
ACCCAAACCT
ACCTAGGAAA ATTAAACGCT AAAGTGAATC GCATTAGAAC CAATATTTTT
GAACACCATG
TCCCTAATTA CTTTTTCAAA
GGTTCTACTA
AGAATGGGAA CCCGACTACT ATCACCGGAG TACGCCGTCA GTATTCTATG
TATTTGCGTT
ACACGATTTT
CCAATTTTTA
GCATTGAGTT TGGTATCAAA CTATAAGAGC GAAAAAACAA CAGAAACCAA
TTTCAGCTTA
ATGTTTATAC TTTT 120 180 240 294
INFORMATION FOR SEQ ID NO:108: SEQUENCE CHARACTERISTICS: LENGTH: 1071 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1071 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:108:
TTTCATTTTT
GAAAGGATTG
GTTCTTCTTG
AAGGATTATT
AGCTTTGCAG
ACTGCTTGTA TTTCCTTAAT GGTGGTTATA ATCGGCTTCA TAAATCATAC TCATGCTAAT CGCTCGCTTT AAAAAAGCTT TAATTTCCTA TTCTTTAGGG TTTCATCGTT ATTAGGCGTG GCTAACGCTT CTAATCAAGA GATCCAAGTC TTGGGGAGCA AACCATAAAG CTTCCTGTTT CTAAAATAAT CTACTTGGGC AAGTGCCTGC CATGTTCAAT ACTTGGGATA GGGTCGTGGG CATTTCGGAT 120 180 240 300
TACGCTTTTA
ATGAGCAGTG
CTTGTGGTAA
TCATTCCTTT
GCTAAGGCCT
TTTATCA:A AG GCCAAT, AAA
GATAATTTTG
GTTAAAGAAA
GTGTTGAACA
CTCCCCACAA
AAAGCCCACC
AAAGTGGTCT
AATCTGATAT
ATCATGTGGC
CCTTTGTGGG
CTTTCCAAGA
TAGAAGTTGA
AGCGTTTGAA
TCAGCGGCCA
GCTTGAAATA
ACCCTGAAAT
ACCCTAAATT
TGGATATTGG
CTGAAGCGTT
TTGATTTGAA
TGTTAAAGCC
GGCGTTGAAT
TAACCCTAAA
AAAAACCATT
CGCTTCTAAA
AAATGTCAAA
TCAAGCCCTT
CGTTAAGTTT
CATTTTCATT
TTCCACTATC
CGGACCTAGA
TAAGGGCGTG
TGACGCGGAA
ACTCTCAAAG
GTGGAATTAT
GCGGTAGAGC
GTAGAGGTCA
AAATTGGCTA
AAGAAAAAGG
GATTCAGACA
GGACGCGCTG
TGGTGGGTAA
AAAGCCATTA
GCCCCACTCA
GATATTAATG
GTTGAACCCT
ATCTTGAACG
TAAAAAAGCT
ATGCGAAAAA
TGGAAGATAT
AAATGCAAGA
GGGTGGAGCT
TTTTAGAAJ\
ACATTAGTGT
GCCCACTCAC
AAAACAAGCA
TTAGTCTTTT
CGATAATCAA
TTTTGTGGCA
CATTAAACCC
TAGCCCTAAT
ATTTGGGATT
TGACGCTCAA
AACTTTGGAT
TTTCCATAAA
AGGGGGTATA
GGAAAAAATC
TCCTGAAGAC
AGTCTATAAG
TATCGCTTTA
AGATTACTAT
C
360 420 480 540 600 660 720 780 840 900 960 1020 1071
INFORMATION FOR SEQ ID NO:109: SEQUENCE CHARACTERISTICS: LENGTH: 183 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...183 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:109:
GATCATCACA AGGAGTTTCG TATGAAAAAG CAAATCTTGA CAGGTGTTTT GTTATCAGTT
TTGGCAGTGA GTTCTGCATA CGCTCACAAA GATAAAAAAG ACGCTAAAAA ACCTGAGTTA
AGCTCTCAAT TAGTGGCTCA CAAAGATAAA AAAGACGCCA AAAAACCTAA AAACTCAGTG
0CC
INFORMATION FOR SEQ ID SEQUENCE CHARACTERISTICS: LENGTH: 600 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...600 (xi) SEQUENCE DESCRIPTION: SEQ ID
AGAATAATAA
GTAGCGTGTT
AGAGGGCAAG
AAATCTTTCG
CAATATGAAA
AAGCATTTAG
GAAAAAAAAG
AAACAGCGCG
ATGGCTTATA
AAGGAAACAT
TGATCACCGC
CCATTTTAAA
TGGAAGCTGA
ATAAGAATTT
AAGCGCAGCA
AGCTTGAAAA
CCATTTGTAA
CCAAAGATGA
TATGAGCAGC
TCTAATCATG
AGGCGCTTCA
AGAAATGCGC
GCAACTCCAA
CAAAGAATTT
AGAACGCCAA
AGAAGCTCAA
AATTAAAAGC
GATCAGGCGT
GGGTTAATTT
TATTATGTGA
GCCAAAGCTA
ATGAAAAGCC
ACCCATTTTG
GTAAGAGATG
ATTTTAGAAC
GCCAAAGCGC
ATGATCTTAG
TATGAAGAAG
ACATTTCATT
TGAAAAAGAT
AATTAATGGA
AAGAATGCAA
ATAAAAAAGA
AAAAACGCTA
AAGAGAGGGA
TAGATGCGAT
AGCAATTAGA
AAGCCTTTAT
AGAAGTCTTG
CTATTACGCT
ATTTCAAGCG
GTTGCAACAG
AGCGCATTTG
TTTGGAAAAG
AAATTTTAAA
GCTCAATTAC
ACAGGZAfCTA
TATGTGTTTA
120 180 240 300 360 420 480 540 600 GAAGCGCAAA AGAGCGCCTT
INFORMATION FOR SEQ ID NO:111: SEQUENCE CHARACTERISTICS: LENGTH: 501 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...501 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:111:
GGCCGAGCTT
AAAAGGCTTG
TTAAAAATAT
TTTTCTTCAC
AACGCTACGA
TATATAGAAA
TTAGATATTG
CCTCACCCTA
CTTTTTAAAA
TTATGATGCG
ATTTTTCTAA
TAAAAAGTTG
CCATTTATAT
TTATCCTTAA
GGCGTTTTGG
ATATTATCGC
AATGGAGTGA
GAGAAGAGTG
AGAGATCCTT
CAGAGTGGTT
TTTTTTATAT
CAATCCGCCT
AACATCTTTA
GCGTGCAAGG
TTTCAATCAA
AAGAGACTCG
ACTAACCGCT
TTAGGGCTGG
TTTAAAAATC
TTTGGTTACA
GGTTTGCGCC
AAGCGCGATT
GTCATTTTGA
GTGTTAGTGC
TTTTCCCAAG
GATCTAATCT
ATAGTAAAAT
CTAATCAACC
ATTTTTTTGC
TTAkAGATGC
GGCAGAATGA
CTTTAACTTT
CCTTTTTAAA
TAAAAACCCT
CGGGAAAATT
TAACTTTTAT
TCTAGTGTTT
TCCAAGAACT
TTTGACTTTA
ACAACAAATC
120 180 240 300 360 420 480 501
INFORMATION FOR SEQ ID NO:112: SEQUENCE CHARACTERISTICS: LENGTH: 513 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...513 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:112:
ATAATAGAAA TTTTAGTGAT TCAAGGGCCT AATTTAAACA AGACTTTATG GCATGGTAAC CTTAGACCAA ATCCATGAAA CAAGGCAATT TAGATGTGGA ATTAGAGTTT TTTCAAACCA GATAAAATCC AAGAGAGCGT GGGCAGCGAT TATGAGGGGA TTTTCGCACA CTTCTATTGC GATTGCGGAT GCGATCATGC GAAGTGCATC TCACTAACAT TCAAGCCAGA GAAGAATTCA GCGGCTTGTG GAGGCGTGAT CATGGGATTT GGCCCGCTTG GCGATGGTCA ATATTTTAGC CGAAATGAAA GCGTTCCAAG AATAACCCCA ATAACCCGAT CAACAATCAA AAA
TGTTAGGACA
TCATGCAAAC
ATTTTGAGGG
TTATCATTAA
TAGCGGGCAA
GGAAAAATTC
GCTACAACAT
AAGCCCAAAA
CAGAGACCCA
TTTCGTGAAG
CGAAATCATT
CCCTGGAGCG
ACCTGTCATT
TTACACCGGA
GGCTTTAATG
AAACAACCCT
120 180 240 300 360 420 480 513 INFORMATION FOR SEQ ID NO:113: SEQUENCE CHARACTERISTICS: LENGTH: 198 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:113:
CCGCGAATCA
GGACTCGCAC
AAGGATAAAG
CAGGCATTAG
TAGCATTCTT TTGGATATGG GGATCAAATA CGAACGCTAT TGCTCTGATA GGCTTTTTTT GAACCCTAAA GACTTTGTGT TCAAAAGGGA GCAGAGTTTC AGCGTCAAAA GATTTATGAC ATTGTGAAAG AAGCGCAAGA AAAGGCTATT
AGCGGGGA
INFORMATION FOR SEQ ID NO:114: SEQUENCE CHARACTERISTICS: LENGTH: 210 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...210 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:114:
CGATTATGGC TATGGGCAGT ATTCACTCAC AGCACCGGGC ATGGCATTGA CTTAGACATT
CATGAGCTTC CCTATATTTC ATCGCGCAGT GAAACCATTT TAGAAGAGGG CATGGTGTTT 120
TCTGTAGAGC CTGGGATTTA TATCCCTGGG TTTTTTGGGG TGCGCATTGA GGATTTAGTG 180
GTGATTAAAA ATTCTAGGGC CGAGCTTTTA 210
INFORMATION FOR SEQ ID NO:115: SEQUENCE CHARACTERISTICS: LENGTH: 297 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...297 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:115:
TACCAATACA TCCAACAAGG CGGGGGCTTT GGGGTGAATG TCGGGCGCAC GCTGGGTAAT
AGAACCCATG TGAGCTTAGG GTATAACCTG AATGTTACCA AACTCCTTGG TTTCAGCAGC 120
CCCTTATACA ACCGCTACTA TTCCTCTGTT AATGAAGTGG CCTCTCCAAG GCAATGTTCC 180
ACACCCGCAT CGGTGATTAT CAACCGCTTA TCAGGCGGTA GAACTCCATT GGTTCCTGAA 240
AGCTGTTCTA GTCCTGGAGC GATCACCATC TTCACCAGAA ATAAAAGGTA TTTGGGA 297
INFORMATION FOR SEQ ID NO:116: SEQUENCE CHARACTERISTICS: LENGTH: 276 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:116:
ATGGCGTGTG
TTGAGCTTAG
CTTAAAAAAC
GCGAGCTTTC
GCCAAAGACC
AATTTTTGAA AAAGCCAAAG TATTACAAGT GGCTTTCTAT GGTGGTGGCG ATCCTTATGG TCACTCACAT TTCGTGGCTT TTTTGGCTTG TCAATGTCTA TAAGGCTTAT AAAAACATGC CTAAATACAC GCAAAATAAA ACAAAA TTATAGAGGG GGCGAATTAT GCGTGGCTAT AGGCTATGGG GGGTTATTTG GGGCGTGTTA AAAAAGACTA TGAAGAACTA
120 180 240 276
INFORMATION FOR SEQ ID NO:117: SEQUENCE CHARACTERISTICS: LENGTH: 267 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...267 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:117:
AGGAACTTGA
AAACTAGGAC
TTTTCTGATG
GGATCTTATA
TTTATAGCTT
TGAAAACGAC AGAAAACACA GATGAAACTC ACTTAAGGGA AACTAAAAAT GCAAACCAAA AGCAGACGCT AATAAAAAAA CTCGCGCGGT AAGCTTGTAT AGCAATACCA AAAACTAGAG AAAATGGCTA ACGAAGAAGA AGAAAGCGTG TCAAACGCTA TATTTTGAAG GCTTTAAGAA AAATAGAACA AGGTGGTGGC TTAACTTGTT TTTAATT
INFORMATION FOR SEQ ID NO:118: SEQUENCE CHARACTERISTICS: LENGTH: 309 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...309 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:118:
ATTTTGAAGG GGGGGTTTTT AGGGTTTTTT ATCGTGGCGT TGTCTTCGTA TTACGGCGTT
AAAAAGCGTT TGGATTTGAG GAAACAAGAT TCAAAAGAAA AAGAAGAAAA GCAAAAATTC 120
CAAAAATTCG CCCTAGGTTT GGAAATGTCT TTCAATGTGT GGCGTTTAGG GGGGTATGGG 180
GTTTTATTAG GTATTTTAGG AGTGCTTTTA TTCTTGCATC TTTTTAACGG ATTGCCGTTT 240
CTTATCGGCG TGTTTGTGAG CTCGCTCTCT AGCGCGTTAT TACGATTCTT AAACAATAAT 300
GGTAAGTTT 309
INFORMATION FOR SEQ ID NO:119: SEQUENCE CHARACTERISTICS: LENGTH: 771 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...771 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:119:
GCCCTTACAC CAAATCCCAT GAAACTCCCG GTCGTTGAGA GCTTTTTTTC CTTACAGGGT
GAAGGAAAAA GGATAGGCAA GCCCAGTCTT TTTTTGCGCT TAGGGGGGTG TAACCTTTCA 120
TGCAAGGGCT TTAATTGTAA AACCTTATTC AATGATGAAA TCCTAACAGG TTGCGATAGT 180
TTGTATGCGG TGCATCCTAA ATTCAAAACA TCTTGGGATT ATTACAATGA GCCTAAGCCC 240
TTGATTGAAC
GGCGGGGAGC
TACCATAAkA
CCTATTTTAA
GAAAGCAAGC
CATTTTAAAT
CTTTTGAAAC
AACAACGAGC
AGGCTGAGCG
GATTAGTTAA
CAAGCTTGTA
AAATCCCTTT
AAGAATTGCA
GGATCAACCT
TCGTATTGGA
AGCTCTCCTT
TAGACAAAAA
ATAGGCTTCA
TTTAGCCCCT
TTTTAATAAC
ATTTGTAGAG
TTTCACCCTA
TAAAGCCTTA
GAGTCAAAAC
AAAAAATAAT
TCTAAAAACC
TATCCGCTTG
AATTATAAGG
CCTATTTTAT
AGTAATGGCT
AGCOTCAAAC
CAAAATATCT
GCCGCTCATT
GAAATCTTTT
CTAGCCCCCC
TGGGACAATA
ATTTTGATTT
TGAGCGTTTT
CTATTTTTTT
TCTCTTTTTC
TAAATAACGC
CTATCGCAGA
TAATGCCCCT
TAGOCATAGA
AGAAAGGGTT
CATTCTTACA
AGAGCATTTT
TGAATTTAGC
TTTGGAGCAA
TAAAAGCGTG
AATTCAAAGC
AGGCACAACT
GCATGGTTTT
T
300 360 420 480 540 600 660 720 771
INFORMATION FOR SEQ ID NO:120: SEQUENCE CHARACTERISTICS: LENGTH: 1158 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1158 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:120:
TTCAAGGATC
AGCATTGATT
TATGAATTTA
ACTTTAGAAA
TATGCGGGCT
TACGCCCCTT
TCTTTGAATT
ATGCGTTTCA
ATCGCdCAGT
AACCATAAAG
AACGCTTTTT
TTAGAAATCA
CAAAAAAATT
TTTAAAATCT
TTGGAATTAG
CTGGACGGTA
CAGCATAACC
TTTGTGGAAA
TACTTGCCTA
GAAAATAATG
AAGGCATGAA
TTCATTTCAA
GCAACACGGG
TGCTTCAAAT
TGAAAGATAA
TATTAGAAAA.
ACCATCACAA
AAAAAATGAC
TTGGCATGCC
AGGGGTTAAA
TAATTTCAAG
GTAAAATCAT
TAAGCGTCCA
TAGAAGGCGA
AAAAAGAGAG
AAAAAGCTCT
TTTTAAGTGG
ATATAACTTC
AAGGGAGTTA
ACGAATTT
TTTAAACTTT
TTCTAGCGCT
CGAACATGCC
TTTTTCTCAA
AAACGCGCTG
AAATACGAGC
TAAAATCAAA
CCCCCTAAAC
TAATTATTTT
AATCTTGCAA
CTATCAAAGT
TAGCGCTTTT
TTCAAACGCC
TGTGATGCGC
CGAAAGGTTT
TTATGCAAAA
CCATGCTAAA
TCAATACATC
TGCGAGCGCG
ATGCCCCTAT
AGGGATTTTT
GTTATTCAAG
ATTTTAGGGG
ACGACTCAAT
AACTTTCAAG
TTAGGGCATT
GCTCAAAAAA
GGCTCGCAAC
AATGAAACGA
TATTTGTTTA
AGCGTCAAAG
CTAAAAGCCC
CATTACCCTT
TTGAACAAAG
AATTTGAGTT
ACGCTAGGAT
AAAGAAAAAG
CTGCTCAAAG
TGCATGCTTA
GCGTGCATGA
TGAGAAAAAG
TAAAAATCGC
TCATCTCACT
AAAGAAACCT
TAAAGGGGAA
CGGAGCAAGT
GCTTCGGGAA
AATTCGCCCA
ATTCGCTTTT
AAAGTTTAGA
TTAAAAACCA
ATGGGAAGTT
AAGCTGTGCC
TAGAAATTGA
CCAGGCGGTT
CGCAATTTGA
AAATCAAGCA
TAACCATGTA
AGTGCCTTTG
CGGTTTAAGC
TGAATTGGGT
CCCTAAAAAA
TAAAATCCTG
TCGCTTTTTT
TTTAGAACAA
GTTCAATGAC
CCAAAAATTA
AAGCAAGCGA
ATTTTTTAAA
AGCCCACCCC
TTTTGACGCT
TACGGGGTTA
AAAAGGATTC
TTTTTGGGTG
ATTGGAATTT
TGAGAAAGGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1158
INFORMATION FOR SEQ ID NO:121: SEQUENCE CHARACTERISTICS: LENGTH: 189 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...189 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:121:
AAGATTAAAG GAAAAGAGAT GAAATTTTTA AACGGATTAG CAGGGAATTT ACTGATTGTG
GTTATCTTAT TGTGTGTGGT CGTTTTTTTC GCGCTCAAAG CGATCCATAT CCAAAAAGAG 120
CAAGCCACCA ATTATTACCG CTATAAGGAT ATTAACGCTT TAGAGGCAAA AAACACCCAA 180
AACCACGCT 189
INFORMATION FOR SEQ ID NO:122: SEQUENCE CHARACTERISTICS: LENGTH: 300 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:122:
ATGCTAGAAA TTGAATTAAA AAAGAAATTC ACTAAGGATT TAAAAAAGCA TATTTTAAAT
CAAAAAATTG AGTTAGAAGT TTTTGACTTA GTGGTTGAAA ATTTAAGAAA TCAAATTCCA 120
TTGGACAAAA GATTTAAAGA CCATGCTTTA AGTGGAACAT ACAAAGGCTG TAGAGAGCGC 180
CACATTAAGC CTGATGTTTT GCTTGTGTAT AGAGTGAAAG GCAATGTTTT AACTTTGGTT 240
AGGCTTGGCA GTCATAGCGA GCTGTTTTGT AAACCGCCCA CACCACTCAT AACGCTTAAA 300
INFORMATION FOR SEQ ID NO:123: SEQUENCE CHARACTERISTICS: LENGTH: 1938 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1938 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:123:
TCGAATTACA
GACGCAAGGG
CCCGCCTATC
AGCTATGCCT
ACCTTTAATA
CCAGGACATG
ATCATTCAAA
ACAAAACTTG
TTAGTATACC
GTACCAACAA
AGCATCATTA
TATTGGGCAG
GCTATCCAAG
GAAAACACGC
GCTAGTTTTG
GCCGAACAAG
GGGGTGTGTT
TCTAACACTT
AGCATCGCCC
ACTCTGGTGA
ACTGCGCTCT
AACCCCTATA
CAAATCCAAA
AGTTCTCAAA
TTCTTTGGCC
GCGTTCATTA
GCGGACGCTC
AAGCTTTCTG
GAGTATGTGA
TTCCAATTCT
AGCGATCATG
AACTACTATT
AATTACGTGT
ACAACCTAAA
ATAATCTAGG
AAGCGGTGCT
TTACCGCTTG
ATGTGCCAGG
GCGGGCCAAT
AGGCTTTGAC
ATTTCACTAT
CATGGAGTCA
CAGAAAATAT
TCACTACCCT
GGATAAGTGG
GCATGATCGC
AAAATCAAAA
CTGAAAGCAT
TGGTGAAAAA
ATGAAGTGCA
GGGGGGCAGG
ATTTTGGCAC
ATTTCAAATC
CTAATATCCC
GCCCGCAAGG
CCATCAACCA
CCAATAATGG
AAAAAAGAAA
AATCCAGCTT
TTTATAATTT
TGGGGCTTTT
ATTTAGCCAC
TATTCAACAT
CGGCTCAGCA
CCTTTATGGG
TCGCTTAC
CACTCTTGTA
CTCAAGCGCT
TTTAGCATTG
TGGTCCTGGT
ACAAAACACG
ATCCACTGAA
AGCCAATGGA
CAATGGAGAC
TGGGAAAGCT
CAATACAACC
GAATAGTGCA
CAATGGGACA
TAACGCGCAA
CAGCCTAGAC
GCTCAAAAAC
CTTTGAAAAA
AGGAGGTGAG
CTGTGCGTAT
TCAAGAGCAG
TAGATACAGC
TAACGCGCAA
CATAGACACC
AGAACTCGGG
CGCGATGAAT
ATGGGGCGCT
CTTCAACTCG
CATCAACGAT
TGGGGGTATT
CATGAATAAC
GGGAGTGAGG
TGGGATTGAG
GGCTGAACTC
TCGCTGTCCT
AGGAATTTGC
AACGCTGCAG
AGCAATGAGA
ACGACCATCA
AATTATGCGA
GAAGGGATCC
AAAAGAACGG
ATTTCAACCT
AATAGCGCTC
TGCCCAAACT
ATGTGTGGGA
GAAGCTGTCG
GCTGGAAAAC
GCGCAAGCCC
ATCCCTACAG
CGTCGTGGCA
GTAGGACAAA
CAGATACAGC
GAATTGGGCA
AGCTTGCAAA
AATTACTACC
CGTAACCCCT
GGGATCGGTA
AGGTATTACO
GCTTCTGACG
AAAGCCACCA
GCATTAGCCG
GTCTATAACG
ATGAATTTAG
TTAGGGCTTA
AAATACCGAA
CCGATCCGAG
TAGATGTCAA
TGGGCTTGTG
ACGCGAATGG
CTTGTAATTC
TCATCAACAA
CAGTTTTAAG
GTGGCGAACC
CGTGGAATGC
AAGAGCTTTT
TCCAAAATGG
TGTTTAAGAA
CGCAAGCCAA
CATTCAACCC
AAGCGGAGAT
CCTTTGTAAA
CCAATCCGGG
CGATAACAAA
AAGCCGAAAA
ACACTTATAA
ATGCGGTGAG
TCAATCAAAA
TTAGGAAAGT
TTCAGGTGGG
GCTTTTTTGA
TGTGGACTTA
ATTTCTTAGG
GGACTTCATG
CTAAAATGAA
CCAGGCCTAA
AAATCCCCAC
GGCTCTATAG
TGCTGTCAAC
AGCCAATTCC
GCAAGTTACA
AGGTATCCAA
GTATTATGAG
GGCTTATCAA
CAACACCACT
AAATAAAAAA
AACCATAACA
AAAACAAGCG
TGGTAGCGGT
TGAAATCAGC
AATCGTTAGT
CTACACAGAC
TTTAAACCAA
TGACTCTTTA
TCAGACGACT
TCTTAAAAAC
CATCGCTGAC
CAGCATCACC
TAAAAAGAAT
CTCTTACAAC
GGGGATTGTT
TTACAAACAA
TTACAACCAT
TGGCTTTGGA
CAAAAACAAC
GCTTAATTCT
CGTGGCGAAT
GAAAAAAGAC
CATCAACACG
CGTGTATTTG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1938
INFORMATION FOR SEQ ID NO:124:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 279 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...279
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:124:
GATTTAAAAA TGCTGACGAT TGAAACCAGT AAAAAATTTG ATAAGGATCT TAAAATTCTT 60
GTTAAAAATG GGTTTGATTT AAAGCTTTTG TATAAAGTGG TTGGAAATTT AGCCACAGAG 120
CAACCCCTAG AACCCAAATA CAAAGACCAC CCACTCAAAG GCGCTTTAAA AGACTTTAGG 180
GAATGCCACC TAAAACCGGA TTTATTGCTT GTCTATCAAA TTAAAAAACA AGAAAACACT 240
CTTTTTTTAG TAAGGCTAGG CAGTCATAGC GAGCTGTTT 279
INFORMATION FOR SEQ ID NO:125:
SEQUENCE CHARACTERISTICS:
LENGTH: 243 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...243
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:125:
CGCGCATGGA GAATGAAAAG CATGCGTTTT AGTTACATTG AGCCAAGAGC GAAATACTTA 60
ATCAGCAAGC TTTCTAAAAT TTGGGTTTTT TACATTTTTT TATCTTTTGT GCTAATAGGG 120
GGGTTAGTGT GGTTTATGCA CAACGCCATT AAACGCGCTC AAGACAACGC GTCTAGTCTG 180
ACGATCCAAG AAGAGCTTTA CCGCTGTTAC ATCACCCGCT TGTCTGTTAA GATGATTATA 240
CTC 243
INFORMATION FOR SEQ ID NO:126:
SEQUENCE CHARACTERISTICS:
LENGTH: 447 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...447
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:126:
GTTAAATTTC
GTTAGGACAC
TTGTATTCAA
AAACGGACAT
TTGCTTTTGT
TTAGAAAACC
CAAAAATATG
TACATGGCGA
CTTATCTGTT
TAATAAGTTT
AACCTTTGGT
TCTATAAACG
TTGTTAAATT
GCCACCACCA
TCCATATCAA
CTATGGTGAT
AAACATACGG
AGGGATTTTG
CTATTCGGCT
AGCGTTCGCT
GAAGCATTCT
TTCTTTCGCT
ATTGCCTGAA
GCGTTTT
ATAATGTTAT
TTAAGCGTTT
GGAAGTGGGA
TTCACGATGA
GCGTTGATGA
AAAAATTATG
GGGCCTCCTA
ATCTAAGGAA AGAAAATGGG TGAATGGCGA TGATCTGAGA TTATTGGGAT TGATATTGAC AATCGTTGTT TGGTGAAAAC GCAAACACAT GAAAGGGCCT AAAAAGCGGT TAATGGTTGT GCAACTTCCA ATCAGGCTCA
120 180 240 300 360 420 447
INFORMATION FOR SEQ ID NO:127:
SEQUENCE CHARACTERISTICS:
LENGTH: 477 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...477
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:127:
GTCATCACCA ACCAACAAAA CGAAGTCAAA ACTTTCACGC CCATTGAAAC CAAAAAGATC 60
ACTTCCAAAG AACAAGCTTT TTTAACCCTT TCAGCGCTGA TGGATGCGGT AGAAAACGGC 120
ACAGGGAGTT TGGCTCGCAT TAAAGGTTTA GAAATTGCCG GTAAAACCGG AACTTCTAAC 180
AACAATATTG ACGCTTGGTT CATTGGCTTT ACCCCCACCT TACAAAGCGT GATCTGGTTT 240
GGGAGAGACG ATAACACGCC TATTGGCAAA GGAGCGACAG GAGGCGTTGT GAGCGCGCCT 300
GTGTATTCGT ATTTCATGCG CAATATCTTA GCGATTGAAC CTTCTTTAAA AAGAAAGTTT 360
GATGTCCCCA AAGGCTTGCG TAAAGAAATC GTGGATAAAA TCCCCTACTA CTCAAGCCCT 420
AATTCCATCA CCCCCACCCC CAAAAAAACA GACGATAGCG AAGAACGCTT GTTGTTC 477
INFORMATION FOR SEQ ID NO:128:
SEQUENCE CHARACTERISTICS:
LENGTH: 240 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...240
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:128:
TTATTGTTAA ATCTTAAGAT TTATGTTACA CTCTGTAAAA TCCAAGGGGA TAGCGTGTTA 60
GAAAAATCTT TTTTAAAAAG CAAGCAATTG GTTTTATGCG GGTTGGGTGT TTTTATGTTG 120
CAGGCTTTGC ACTTCGCCCA AACACTTCAC AAAGAAATTC TTTTTTACAA GATGTGCCTT 180
ATTGGATGTT GCAAAATCGC AGCCAGTATC TCACGCAAGG GGTGGATAGC TCGCACATTG 240
INFORMATION FOR SEQ ID NO:129:
SEQUENCE CHARACTERISTICS:
LENGTH: 1032 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1032
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:129:
GTCAAGTGTG GCAAAAAGGG GATTTTAGTT TCAACGCGCA AGGCAATGTT TTTGTGCAAA 60
ATTCCACTTT CTCTAACGCT
AATGGAGGCA
TTTTTGCCGG
CTAATCAAGT
GCCACTAATA
CAACAAAGCG
GCTTCTTCTA
TTGGTGGTTA
GATTTTTCAA
CAATTCCA3AG
TTTGTAACGA
GAGGTGCTTC
TTAGAAGTAG
CAAACGCCTG
ACGCCTTTTA
CGTTATATTG
TTAATCAATA
CAACGGGTTT
AAACAACCAC
TTCTAACATT
ACAATGTTTC
ATCCA-ACGAC
GTAATGCGTC
CGGCGAATGG
A-AATTAAAGG
CCAACAACCT
ATAACTTAAA
AAAATTTAGT
GGGGGGCATT
TAAATCCGCT
TGAATGTCAG
ATTACAATAT
TCAACGGAAA
TA
ATCGCTTTCA
AACGTCACCA
CGTGTCTCA
AGCTAGCGCC
AAACAACGCG
TTTCAATTTT
CTCTGCAAAC
CACGATTTCC
CATTCAAGGA
GATCGCTTCA
GAATAATTTG
CATTCAAGTA
CGTGGCTAAT
CAACCCTAAC
CCACATAGAG
CGCTCAGTTT
CTAACCATTC
TGCTTAACGC
GGCAATCTGT
ACAAACCCTT
CCAATCGCCT
TCAGGCAATA
GTTAAAA.ACC
AACCAAGCGG
GCGTTTAACA
AACGCTTCTT
GGAGCGATCC
GGGGGGATCA
GGCGGAACTT
AGCTTGCAAT
GAAAAAAACG
TAACGCAGGA AATTCGCTCA
TOGALACGCTC
AGCAACGGCC
TTATCAACGC
OCACCACOGO
TAAATAATA-A
TTTACGCTALA
TGTATCTTTA
TATTAGAGAA
ACAACGCCAC
TAAGCACCGG
ATTTTAATTT
TTAATCTCAA
ACACTTTATT
CGTATTTGAA
GCGTATTGAC
AATTTGTTGT
TA.AGATTAAC
TAGCTGCGTG
TCAAA.ATAAC
CGATGAAAGC
CGGGGTGGTT
CAATAACGCT
AAACGCTAGC
GCAAA3AAATA
GATTTATGGG
AGAAAATTCT
CACCACCCAA
AAAAAGCAGC
GCTCTATACC
TTATTTGGGC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1032
INFORMATION FOR SEQ ID NO:130:
SEQUENCE CHARACTERISTICS:
LENGTH: 798 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...798
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:130:
TTGTTGGCGC OCACCATACT TGATTTTAGA GGCAGCCTTA GTAATTTAAA CAACACTTAT 60
AACAGCATCA CCACGACCGC TTCAAACACG CCTAATTCCC CATTCCTTAA AAATTTGATA 120
AGCCAATCCA CTAACCCTAA TAACCCCGGG GGCTTACAGG CCGTTTATCA AGTCAACCAA 180
AGCGCTTATT CGCAATTATT AAGCGCCACG CAAGAATTAG GGCATAACCC TTTCAGACGC 240
GTTGGATTAA TCAGCTCTCA AACCAACAAT GGTGCCATGA ATGGGATCGG TGTGCAAGTG 300
GGCTACAAAC AATTTTTTGG TGAAAAGAGA AGGTGGGGGT TAAGGTATTA CGGCTTTTTT 360
GACTATAACC ATGCTTATAT CAAATCTAGC TTTTTCAATT CGGCTTCTGA TGTGTTCACT 420
TATGGGGTAG GGACAGATGT CCTCTATAAC TTTATCAATG ATAAAACCAC CAAAAACAGC 480
AAGATTTCTT TTGGGGTGTT TGGGGGGATT GCGTTAGCTG GCACTTCATG GCTGAATTCC 540
CAGTATGTGA ATTTAGCGAC CTTCAATAAT TTCTATAGCG CTAAAATGAA TGTGGCGAAT 600
TTCCAATTCT TGTTCAATTT AGGCTTGAGA ATGAACCTCG CTAAGAATA. GAAAAAAGAC 660
AGCGATCATG CGGCTCAGCA TGGCGTGGAA TTGGGCGTGA AAATCCCTAC CATTAACACC 720
AATTATTATT CTTTTCTAGG CACTAAGCTA GAATACAGAA GACTCTATAG CGTGTATCTC 780
AATTATGTGT TTGCGTAT 798
INFORMATION FOR SEQ ID NO:131:
SEQUENCE CHARACTERISTICS:
LENGTH: 285 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...285
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:131:
CGCAACGTGG AAGCGCGTTA TTATTATGGG GACACTTCAT ACTTTTATTT GCATGCGGGA 60
GTTTTACAAG AGTTCGCTCA CTTTGGATCG AATGATGTGG CGTCTTTAAA CACCTTTAAA 120
ATCAATGCCG CTCGCAGTCC TTTAAGCACC TATGCAAGAG CGATGATGGG TGGGGAATTG 180
CAATTGGCTA AAGAAGTGTT TTTGAATTTG GGCGTGGTTT ATTTGCACAA TTTGATTTCC 240
AACGCAAGCC ATTTCGCTTC CAATTTAGGA ATGAGGTATA GTTTC 285
INFORMATION FOR SEQ ID NO:132:
SEQUENCE CHARACTERISTICS:
LENGTH: 465 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...465
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:132:
AGACTTCCTA
ATCAAAGAAA
ACGCATAAAG
GACGCGTCTT
AACGTTAAGG
ATAGGGAGCG
AAAAAGAAGA TTACGCTAAG GCGATGGTGT GCCAAATCGT TTTATTGATC ACTTCAGCGT AAGAAAAAGA AGAAACGACA AAAAGCGCTA TAGAAAATAT AGAAATCCGC AATATCAGCA TGCGTATCGG TCAAAAAAAG ATGATTTTAA TGGTGGAGTT GGATCAATTG GTGAATGACC
TTTCTTTTAA
TTGAGGGCCA
CTGAAGAGAC
TGCTTTTAGA
AAGATGTGGT
CTTTAGAAAT
AATGGAAGCC
ATTTGAAAAA
TAAAACCCAC
CGTGAAATTG
CTCTATGGAT
CCTCGTAGAT
120 180 240 300 360
GACAAGGTGA TCGCTAAGGG CGAAGTGGTG ATCGTGGATG GGAATTTTGG CATTCAAATC 420
ACGGATATTG GCACTAAAAA GGAACGATTA GAGCAACTGA AAAAT 465
INFORMATION FOR SEQ ID NO:133:
SEQUENCE CHARACTERISTICS:
LENGTH: 633 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...633
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:133:
AAACCTACCA ATGTTTGGGC TAACGCTATT GGGGGAGCGA GCTTGAATAG CGGCTCTAAC 60
GCTTCATTGT ATGGCACAAG CGCCGGCGTA GACGCTTTCC TTAACGGGAA TGTGGAAGCC 120
ATTGTGGGCG GTTTTGGAAG CTATGGTTAT AGCTCCTTTA GCAATCAAGC GAACTCTCTT 180
AACTCTGGGG CCAATAACGC TAATTTTGGC GTGTATAGCC GTTTTTTTGC CAACCACCCT 240
GAATTTGACT TTGAAGCTCA AGGGGCGCTA GGGAGCGATC AATCAAGCTT GAATTTCAAA 300
AGCACTCTAT TACAAGATTT GAATCAAAGC TATAATTACT TAGCCTATAG CGCCACAGCA 360
AGAGCGAGTT ATGGTTATGA CTTCGCGTTT TTTAGGAACG CTTTAGTGTT AAAACCAAGC 420
GTGGGCGTGA GCTATAACCA TTTAGGTTCA ACCAACTTTA AAAGCAATAG CCAATCACAA 480
GTGGCTTTAA AAAATGGCGC GAGCAGTCAG CATTTATTCA ACGCTAACGC AACGTGGAAG 540
CGCGTTATTA TTATCGGGAC ACTTCATACT TTTATTTGCA TGCGGGAGTT TTACAAGAGT 600
TCGCTCACTT TGGATCGAAT GATGTGGCGT CTT 633
INFORMATION FOR SEQ ID NO:134:
SEQUENCE CHARACTERISTICS:
LENGTH: 1029 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1029
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:134:
CATGCTAGGG
CTATGGTTTT
AAAAAAGCGA
TTGTATTATT
GGGCTGTATT
AAAGACGCCC
TTATTGGTAG
AATGGCGTTA
TTTATAGAAA
ATTGGGGTGC
TTCCCTAAA.A
CACTCGTTAG
GTGAAAATCA
GGGGGGTTTT
TTTATCATCA
GATAGGATTT
GCGTTAAGCA
AAAGTCCGT
AATTTTGCCA
TTTTGAATGG
GCCTTCAAAA
CTAAAACCTA
TGTTGCAAAG
TTATAAGACC
GGCAAAAAGG
TCAGCAATAT
CGAAATTCAT
GTTTAGAAGA
ACCCTTTTTT
CGGAGTTTGA
AACGAAACCA
TGCTCAAAGA
CTAAAATAGG
TATGGGTGAA
CGCCTAAAAT
AGTTGTGATG
CTTAGGGGCT
AGGGGGCGTG
CCCCAAACAC
CGCGCCAAGC
CATGGCCAGT
CTATGACCGC
TTGTTATCAA
CAAGCGCTTT
ACGTCATAAG
AAAAAACGAT
ATGGGTGGTG
TCAAATCAAA
CACTTTTTTA
CGCTCATTTG
TCATAGAAGC
TGAATTATTA
TTTCACAAAG
TATGATTTCA
GCTTTAAAGG
GCTAAAGTCA
GAGTATGTTT
ATAGGGGCTA
TACGCTCAAA
ATGCTAGGGC
TTAAACCAAC
CGTTTGGTGG
GAAATCCTAG
AGCAATCTTT
GAAGTAACGC
GAGCGCTATG
CCCAAAGGCT
GTTTCGTTCA
GTCTGGCGTC
CCCTTATTAC
AGCATTGTCA
A.ATTGCCTAA
TCAAATCCGA
ATACATTAAG
ATCAAGCCAC
TTTCACAAAA
TAGGGGTAGG
AAGAGCCTTA
TAGCGCAATT
CGATCAATGA
CATACCAAAG
TCAAAGTCAA
GCATCGCTTT
TGGATTTTTT
ACCCGAAAGC
AAGGCTTTGA
CTTTATCGTT
GGCTTTTTTT
GGGCGTGTAT
TCCTTTTATA
GGATTTAGAC
AGA-AGCGCGA
AACCCAAAAA
GGGGAACGGC
TTATGGGGAT
TGATCCCTTT
TTACAAGATC
CCTCGCCAAA
TAAACGCTAT
AGACGAGCGT
AAAGCTTGGG
TTTAAGAGAA
ATTTTACATT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1029
INFORMATION FOR SEQ ID NO:135:
SEQUENCE CHARACTERISTICS:
LENGTH: 495 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...495
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:135:
ATTAAAATTA ACATGGTAGA ATGCCAAAAT TTGCTGTCGT AACAAGGAAT TGGGAATGAG AAGGAGTTTG GCTTTTTGCC CAGGTTTTAG GTGCTAGAGA CTTTTCGCAA CTCAAAAACG GGAACTCTGC CTTCTAATGA AGCGATTGAT TATCGCATGG GCCTTAAGTG CTGAAGACGC TAAGAATTTC CGCGCGAATT AATCTTTCTA AAATGAGCGA AGAGGATTTC AAAAAAATGC TTAGAAGAAA AAACCAAAGG TCTAAGCGCT GAAGAAATCA AGCGTTTGCA GCGGCGATAC GAGAAAAGTT TGGTGTAGGG CATTGCTCTC CTAAG
CATGCGGCAA
TTTTAGCTTT
AAGAACTTTT
AAGTGTCTAA
TCAGCCGGAT
GTGAAGAAGT
AGGCAAAAGG
CTGTTAAGA.A
AAATTTAGAC
GCTTGGATTA
AAAATTAGCA
ACGCCTTATA
CGCTAGGAAG
GCGTAAAGAA
ACTTAATGTG
AAAAGACGAA
120 180 240 300 360 420 480 495
INFORMATION FOR SEQ ID NO:136:
SEQUENCE CHARACTERISTICS:
LENGTH: 471 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...471
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:136:
TTTAAAAGGA
GTGGCTTTAA
GGCGTTGTCG
AAGATGCCTG
AACACAGACA
CTTAAAAAAG
GGGAAAAAGC
CATGACGATA
AAAACATGAA AAAAGCGTTG ACGCCAAAGA TTTCAGCAAA CTCCGCAGGA TATTGTGGAT AAGACAAGAG AAAGGCGTTC AAATGACCGT GGCGGATTTT GCAATATGGA AGACATGGAT ACAAACACGA TAAGCATGGC AAGACCATGA CCACCATGAT
AAAATACTTT
ACAAGCGATG
TACACAAAAG
CATAAACAAT
GAAGCCCGCC
GATGATTTTG
AAGAAGCATG
GAAGATCACA
CTGTTGGCGC
AAGATTTGGC
AGTTGAAAAA
TGCATGAATA
AAAAAGCCAT
GGTTGAGATC
GCAAAAAACA
GCGATAAGCA
GTTGCTATTT
TAAAATGGCT
GCGCATGGAA
CGCGACTAAA
TAAAGAAGCG
ATGCAAGCAT
TGACAAAGAT
C
120 180 240 300 360 420 471
INFORMATION FOR SEQ ID NO:137:
SEQUENCE CHARACTERISTICS:
LENGTH: 2031 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2031
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:137:
ATCCAAAGAT ATTGGAATGG AGCGTTAATG AAAAATCAAC ACAAAAATCC CCTAACAAAA 60
GCTTTAATGA AAACTTATCC ATATAACCAT TTTTTATTTT TCTGCTTTAT TCTAGGAGCG 120
TTTTTATTAG GTTTGCTCAG TCCAGCTTAT GCTTTAAGTA TTATCACCAC TAAAGAAATT 180
GACGCTA.ATT
AAAGTAGAAG
ATCACCAGTC
TTAAAAATAC
AACGCTGAAkG
AACATCGCTC
GGTTCTGATA
GGCAAAGATG
GGGATCAGTG
GGTATTGATA
GGCGGCTCTG
ACAGGAGGGT
GGGCTTTTCC
ACCACTAGCC
AACAATTACT
CTAGATGTTA
AAATGCGCTT
TATGTAGATA
CAATACGCCA
AAAGGTTATG
GACAATTTAA
GTTAGGTATG
AATGATCCTA
AGCTCGCA.AT
GCCAAA3AGAC
ACGCCCTGTA
GGCGTGCCTA.
AGGAAAAATT
CCGCAATATT
CCACCCAAAC
CCTATAATCT
TGCTTAATGG
CTCATGGGTT
TTTTAAGACA
CTCCTAACGC
AAGTGGCTAA
AAACCAAAGC
GTTCCTTTTA
GCGCAAATGG
GGAGTAATGG
CAGATGGCGT
TAGGGGGTTA
ATGACAATTT
CTATTCCTTT
CAACTAATGG
CCAGCCAGTA
TCGCTAAAGA.
ATAGATACGA
GGGAAAATAA
TGCAACTTTA
GTATGGGGAA
TCCATGTCGC
AAAAAAATGA
ACA.ACCCTAA
ATCALAGAATT
CTACALATGCT
ATTTTGAAGC
GTTCTAAAGT
ATGGTGAGAG
CCATACTGAT
CACTCAACCT
TATCACCACT
AGCGATAGAA
TTATTTTAGA
CAATCAATCG
CAAGATTAAA
GATTTTAGGC
GGCTAATGAC
CGATAACAAT
GAGTAACGGC
TGCAAATGGG
GTTAGGGGTG
TGAGAATAAT
TAATA.ATGGC
TGGTAATGGA
CAGTAGTTCT
TTGTAAAGTG
TGGCTCTTCT
TTTTGAAGCC
AACGCAAAAT
CAAAGATGAC
AACGCAAACC
TGTGCCTTGC
TAAAACCCAA
CAAGCAAGAG
TGCATGCGGT
AAAAAGCTAT
AGGGATTAAA
CTTAAGCGAT
AGAACAATG C
CCTAGTATCA
TATTTACGCC
CAAACCACCA
AGCAGGGTGG
AACAATGCGA
TTTCCTTTGA
AAATCCACTA
GTTAGCAAAG
CCTATGTATG
CCTALATAGCC
TATGGGGCAA
AGTCATTCAA
GATGGGGTGA
TTCACTAATC
AGCTCAAGTG
GACACAAACA
ALATAACGCCA
CCAGAGTTAA
ATTTCTATGA
GGTAAAGCCA
ATCGGTGGTT
AGCAAATGCG
TTTCAAACTG
AGCGATTATG
ACCTTAACGC
ATTTTAAATC
CAATGGGAAT
AACAAGCTTA
AGCGGTGCGG
ATTACTACAA
CAAAAACTTT
CCGATTGGAG
CAGCCCAAGA
TCAACAGGAC
TGTTAGGCp.A
CTAACAGCAT
CTAGCAGTGC
TCCTTGTTTT
AAGAATACCA
CTAACACGCC
CTAGCAATAA
ATGGCAATGA
ATAATAATGC
ATGGCTCTAG
ATGGCTCTAC
GTGGGAGTTT
ATTCCAATAA
CTAATCCTAG
GCCCCAACAA
ACGCTTTAAG
TCAAGCAAAC
GTGTGGATTT
CCTTACAAAC
AAATCGTGTT
CAAGGGTGCA
CTATAGTGGA
GTGGGATTGC
ACAATGACGC
ATGGAGAATG
TTGTTAGCCC
GCCATTATTT
ATGGAGTCAA
CGCCACTTAC
TAATGAAAAA
TCAAAGCGTT
GAGGGTGTTT
AGATATAGAA
TAAA.ACCAGT
GAAAGGCGAG
AAAGCTAGAA
TTTTAGTAAT
CGCTATCAAT
TGGGGTAAAT
AATAGGCAGT
TTCTTCAAGT
TAACAATAAC
AGGGAATGGG
TTCCACTAAC
TTCGCAAGAA
CACGATGAAA
AGATGACACT
GCAATACTAC
ACAAGGCGCT
CACGAGCGAT
TCGTGGGATG
AGACAGGATT
TCAGTATTAT
CACCCAATTA
TAAATTAGAA
GGTAGAAGTT
TTATGTGATG
TAGGATAGAA
TCGTTGCCAA
AAAACCACTA
CACCCCACAA
T
240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2031
INFORMATION FOR SEQ ID NO:138:
SEQUENCE CHARACTERISTICS:
LENGTH: 339 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...339
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:138:
ATGGCTAAAA TGAACGCTCC AGATGGGGTT GCCGTTTGGG TGAATGAAGA CAGGTGTAAG 60
GGTTGTGATA TTTGCGTGTC GGTATGCCCG GCTGGAGTTC TTGGCATGGG GATTGAAAM.. 120
GAAAGGGTGC TTGGAJA.AGT GGCCAAAGTA GCCTATCCAG AGAGCTGTAT TOGTTGCGTG 180
CAATGCGAGT TGCACTGCCC GGATTTTGCG ATTTACGTGG CTGACAGGAA GGATTTCAAA 240
TTCGCTAG TTTCTAGA AGCCCAAOPJ\ AGAAGCGAAA AOGTTAAGGC CAATAAATAC 300
ATGCTCTTAG AAGAGACTAT TTTAGAAGGG AGAGGCAAA 339
INFORMATION FOR SEQ ID NO:139:
SEQUENCE CHARACTERISTICS:
LENGTH: 987 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...987
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:139:
ATGACGACTA
GATACATGTT
AACAAAGTGG
CAAGGCGTGG,
AAAGGCTATA
ATCAAAGCGA
TTCACGCAAA
GAAAGCATCG
ACTTATCATT
TCCATGAAAA
TGGTTTGAAA,
ATGCCCCTGA
GATGGGGCTT
ACCOA PAACA
AGCGTGAGCT
TTTGTGAAAA
AACATTAATA
AAAGAGTGAA
TTCTTTTCTT
TGGTTGTCCC
ATATTAACGC
TTGACATGGG
AAACGGCGCA
TTTTGAGCGA
CTCCACGATT
TACCTTTAGG
AACACGAAGC
AAATCATTCT
TTGCGAGCGT
TGAATTATCA
CCCCCTACAA
TAGAAGCGGT
TGCCGGATAA
TTTCTAATAA.
TACTGCCACA
CATCAGTATT
GCAAGGTTCG
TTTAGATTTG
CGATGGGGCT
AAAAAGCGTT
GACTTACCAA
AAACGGCGCT
GGAGGACGCT
CTTAAGCA.AA
CGCTTCCATT
GATTTTTAAC
GGAATTTTCA
TACCTATAAA
TAGAGCCGTG
AAAACATGCT
TCATTTT
AACAAGATAA
CTTTTTTACC
CTCAAM.AAG
CTTTTGTTGC
TTAAGGAAGG
ACTTTAATCC
CTAGAAACAA
GTGATAGAAG
TTTAAAATCA
CAATGGCTTG
GTGCAAAAAG
CGCTTGAAAA
CACGCTAAAG
TTTAAGGGCT
GTTTTCCCTA
TTCAGCGCAA
TGACATTAAA
TA.AGTATACC
TGTTTTTTTC
GCCTAATGGG
GGGATTTTTT
CTGGAGAAAC
GCGATCTCAA
ATGGGGTGAT
TGCAAACTTT
GATACTACCA
AAGCCGCTAA
AAGGCATGCC
TAACCAAAGA
TGCCTAAAA
AAAAAACGGA
CTTATAAGGA
CACTTTCTTG
AATTTATCCT
TTTGAAAGAG
CATGCCTAAA
AGTCCGTTTG
CCGCTATTTT
TGAAGCTTAT
ATGGCCAGAC
GATTGGTCAA
TAAAGAAGAG
TGTTGAAGAA
TTTACAAATG
GCGCATTAAA
TCCTGTAGGG
TTTCTrTGTAT
GCATTTAAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 987
INFORMATION FOR SEQ ID NO:140:
SEQUENCE CHARACTERISTICS:
LENGTH: 444 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...444
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:140:
AAATTCATGA GATTGTTGTT AAAATACCTT TAAGCGATGA GAAGTCCAAC CCGATTCAAA CTCACTTTTG AAACGCCTTT GTCATCGTGT TAAAAAACGC TCTTTGAATG CCTTAGAAAT GCTAAAGCGG GATTAGAAAT TTTTTTTACC CAAAGGTTTT
CTTGTTACTG
CGCCCCCATT
CGCTCCGGCA
TAATAAAACG
TCAATTGGAT
GTTTTCCTAC
CCAAGCTTCA
CATT
AGCGCTACTT
AAATTGGTTC
ACGCCACCTA
CCTAAAATCA
TCTAAAAAAA
CAAAATGACA
AGCAGCAAGG
TGATGTTACT
ATTGGCAAAA
TAAAGGCCGT
TGGAGGTTGA
CGATGGATTT
TCTACCTCTT
ATAAAAAGCA
GGCTGAAGAA
TGCGCTAAAA
GCAAACCACG
AGGGCAAAAA
TAAAGAAGCC
GTCTAAAAAA
ACTCGCTTTC
120 180 240 300 360 420 444
INFORMATION FOR SEQ ID NO:141:
SEQUENCE CHARACTERISTICS:
LENGTH: 783 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...783
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:141:
GAATTGATTT TAAAGAAAAA GAAAGAAAGG AATTTAATGA GTTTTAGGAT CGTTACTAGC AAGTGGGACG TTTTATACGG ATGAAGCAAC AGCACAATAA CATGGGTGAG TCTGTGGAGT AAAGGCAAAC AAGAGCCTAA AAATAACCAT TTAGTTGTCT GCTAATAAAG TTATCCCTGA AAATTATCAA AAAGAATTTG CTGAGTAATT TTTTAGAGAG AAAGGGCTAT AGCGTTTCGC ATCCCTCAAG ACATCAAAGA AAAAGCGTTG CTCGTTTTAC ATCTTGGAAG ATATTGTAGA AGAGAGCGAT GCGCTCAGTG TCTTCAGGGT ATTTGAATCT GAATTTTGTT GAGCCAAAAA TTTGGTATTG ATGTTTCAAA GATTAAGGCT GTGATTGAAA AATTCTGGAG GTTTTGTCCC CAAAACTTTT GTGCATAGGA AGAGCCATTA AAAAGATCAT GAATCAAGCC TATCACAAAG GAATTAAGCA AAAAACACAT GGAACGTTAT GAAAAAGTTT
AAAAAGGTAG
CTCTAGCTGA
TGCATTTCCA
TAATCGATCC
AGAAGTCTTT
AATTTAAAGA
GCATGGATGG
AAGAAAAAGT
GTGAAGACAT
GGGTGGAATT
TCAAGGAAAC
TGATGGCGCA
CTAGTGAAAT
TTTAGCAATC
TGGAATGCCT
CTATCCTATT
TAAGATAGAG
GTTCCTCCAA
TGTTAGCGAA
GAATGTGGCT
GATAGACATG
TATCCATAGT
GCGGCGTACC
CGACCATGAC
TATCACCAAA
GAAAAAGCGA
120 180 240 300 360 420 480 540 600 660 720 780
AAG
INFORMATION FOR SEQ ID NO:142:
SEQUENCE CHARACTERISTICS:
LENGTH: 804 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...804
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:142:
GACCAAAAAT
GAAAACACTT
TCACACATCC
ATTGGACTAA
AACGGGAAGA
ACTAAAGATG
GACGCTGATG
GCCCCCTTTT
AAAAGGGAAT
GTAGGCACGG
ACCGAAGAAG
TTTTATGAAA
CACGCTTTTT
AAAGAATACA
TCAAGGGGAT
TGCGCTACTT
ATTTAGAAGT
AAAGCTATAT
AGTTTTTAGA
ACAGCTTGGT
AGTTCCCTGA
TGGTGGATGC
TAGCCGGTGT
ATACCAAGCG
ACATCTCTTG
ATTTCAGTTT
TCACCAAGCT
CTTCTCTCTC
TATTTTAGCT
GCAAGCTTTT
CATTAAAGAA
TTCTACGCAA
TATTATCTCA
GATCAAACAA
ATTCCCTGTT
GTTTAAAG
TTTAATGCAA
GCTCTCTTAC
CATTTTGCCT
TAAAAGCGAC
CATTGATGGG
ACTT
ATGAAAATCA
TTGGATAPAA
AAGCTCTTTT
TCTACCGATA
TGTTTGAAAG
AATAAAAGCT
ATTGATCCAA
ATCGCTCCTG
TTCAATCAAA
ACGCAGTTAG
AAAAGGGCTT
GGCATGTTAG
AATTACCCTG
GTGTTAGTAA.
AGGACGCTTC
TAAAAGCCAG
AAGAGGGCGT
ACTCTAATAT
CTTTCAAACT
AAGTGAGTTT
TGATTGAGCA
AACACCAAAC
AAAAAATCTC
TATTGGAAAT
CGGTGGTTGA
ATTATCAAAA
AAACGATTTA
CTCTATCGCT
CGATTCAGAT
AGGCACGATC
TGTTTTAGAG
CCCCATGTTT
AGAAATCAAT
AACTAGCCAT
CCTTTCAGTG
TATCCATTCC
CCTTAAACTT
AAACGAAACG
AATCCTCCCT
120 180 240 300 360 420 480 540 600 660 720 780 804
INFORMATION FOR SEQ ID NO:143:
SEQUENCE CHARACTERISTICS:
LENGTH: 1248 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1248
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:143:
AACCCATTGA
TTTCTTGGOA
AOAATAGAAA
TCTA.ACTTTO
ATTTTAGCAA
AAAATGGCTG
ACTTTTATCA
TATGTTTTAA
CTTTTGCCCA
AAACAAAAAG
CAAAAAGAAC
TATTTCGCTA
AATCTTTTTT
TTATATTATC
AAAGAATCGC
GATTTAGAAA
AAATATTTAT
AGCGGGACAA
TTTTATTTGT
AAAAACAAGG
ATCAAA.AGAA
TTCAAAAGAA
AAATTATCTT
AAGAA.AGCAT
AATATGAAGA
AAGCTGTGTT
AAGTTAAAAT
CCAAACAAGC
GCTACGCTAA
TTTATGCTAA
GACAAGCTCA
ATTTTAATTT
AAGATTTATC
TAGAACCCTT
AAAATAGACT
AAGATAATTG
A.ATTAGGCTT
TATTATGCTC
CAGCGCAAGC
GCCAAAAAGA
GGTATAAAAA
GTGAATACGA
CAAGAGCTTA
GCATAAAGTT
CCAAACAATC
CGCTCATGCT
AAAACAAAGT
CATTGCCAAT
CACAAGATCT
AAATAAAGCG
AGCATTAAA.A
AGCCCAACTT
TTTAAAAAAT
CACGCCTTCA
AAAAAGCAGA
TATTTTTAAG
TTTGAGCGTG
AAAGGGGCTT
CACCCCTAAA
GGTTATAGAA
AGAAAAAATT
CACGATTTCA
AATTTTAAAA
TCAATCTTTT
TTTATCATGG
TATATAGACC
GACTATGAAA
GGTTGTATTT
GAAATTTTTG
AACGCTAAAC
TTCGCTCCTG
GATTTAATGC
ATTTTAAAAG
TATAATTTAG
AACCCGCGCA
GGGTGGAGCA
AATAATCGCC
TTAGATTTTT
TTTAAA.ACGC
GATTCTATTA
GTTAATAAGG
AAAAATAACC
GACATCATGC
ACAAAATCTA
TA.ATCTCAAA
A.AGCTTTGGA
CCCCTTATAA
AATGGATTGA
TTATTTCTAT
GAACGCGCAA
ACATCAATAT
GTTTTAAAAT
GAACGATTAA
AACAAATCAA
TGGATGAkAA
GTGTAGCGAT
GCGATGAAAA
CTTATGAAAA
ATAGCCGACA
CAAAACCTGT
TTTTAGATTT
ATTATTATTT
CGCAAGCTGT
TGTTGCGTTT
TTTTATTT
TTCGGTCGTT
GTCTTTGAAA
CACCAAGAGT
AGAACATTTG
GGACGATAAT
TTTTTTAGGC
TACTCATGAA
CTTACGAACG
GAATGTTTTT
AGAGTTATCC
AGGTGAGATC
ACAAGAAATC
GCTTA.AGGAG
ATATTACCTA
AGGCACAAAA
AGCATTGATT
TTTTGCAGGC
GAATTGGTCT
TAGCATTTTA
AGAAAAGATC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1248
INFORMATION FOR SEQ ID NO:144:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 228 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...228
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:144:
AATTTTAGCG GTGAATCCAIA AAAGCGCCTA AAGCACCCAG CCCCATTCCC AAGGGAATTA 60
CCCAGGCGTT GCATCCAATT GTTTTCTTTT TTGGAAGACA CGATTTTTGA TCCCTTTAGC 120
GGCTCTGGCA CGACTATTTT AGAGGCCAAC GCTTTAGGGC GTTTTAGCGT GGGTTTAGAG 180
ATTGAAAAAG AATATTGCGA GTTGTTCAAA AAGCGTATTT TAGAGAGT 228
INFORMATION FOR SEQ ID NO:145:
SEQUENCE CHARACTERISTICS:
LENGTH: 651 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...651
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:145:
AGGAGATGTT GTAGGAGGGA AGACACGAAG AGATATGTGA TAGAAGTGGT AGCACAATGA CAAGTTTATT AGCCCTACCA AAAAAGGTGA ATGCTCAGCA CTACCGCTCA GCTTACGGCT CTAATACGCC CATGGCGCTC TAGGGACTCG GGTTTTGGGT TTGTTGTAGG CGTTATTACG GATTCTTTGA TATGGCATGA GAGACGCTCG ACAGATGTGT TGTTTAACCC CTTGGGCGTT GCGATTGGTG
GTCAGCAGAG
AAGAAATGCT
AAATTGTTCC
TAACATGCCT
TGGGTATAAA
GTATAAGCAT
TTTTGCAAGC
CAAGGGTTCT
AGCTATTTTC
GCACCTCTTG
ACCCCTAAGC
GCTTTTATAG
CATGGGAATT
ACAGCGTCAA
GGCTTAAGCA
TTTTTTAAGA
TCTTATTATA
CAAAGTTTCA
AATCGTGAGA
GGGTCCAACA
AAGAAAAGGC
GGATTGATTA
GCAATGGTAA
ACCCAACAGG
ACCAACAATA
AAGCCCCACA
AGTACTACAC
TGTTTGGCTA
AACTTGCATT
AACTATTATT
TATATCTTTA
TATTAAGACT
CCAGTTGGGT
TCAAAGTGGG
AGGCCTTACT
CGCTATCAAT
ATTTGGAATG
TTATAATGAT
TGGGGCTGGC
TGGGGTTTTT
T
120 180 240 300 360 420 480 540 600 651
INFORMATION FOR SEQ ID NO:146:
SEQUENCE CHARACTERISTICS:
LENGTH: 321 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...321
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:146:
GGGGGCGTGT
CCCCTAGACT
ATGGTCAGCT
GAAAATTGCT
CCGATCGGCA
GCGCGTTTGA
GCGGCGTATT ATCGTTACAA GCGCATTTGA GCTCACTATT CGCTACAACT TGCACCCCGC CTCAAGGGGG CGTGTGCGGC GTATTATCGT CTAAGCCCCT AGACTGCTCA CTATTCGCCA GTTACATGGT CAGCTCTCAA GGGGGCGTGC GTTTTAAAAA T
GCTTTGAAAA
AAAACcCGAT
TACAAGCGCA
CAACTTGCAC
GCGGCGTATT
TTGCTCTA.AG
CGGCAGTTGC
TTTGAGCTTT
CCCGCAAAAC
ATCGTTACAA
INFORMATION FOR SEQ ID NO:147:
SEQUENCE CHARACTERISTICS:
LENGTH: 1152 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1152
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:147:
AACTATAGTA
TTTTCAA.ACG
TCCAACGAAC
AACGTGCTTT
ACTAGCCAAA
TTGAAACAAA
AGCATGGAGC
CCTTATTTGA
AAAAATAACG
GATTTGATGA
CAACAATCCC
TATAAA:GCCA
GAAGACGCCC
GATTGTTGCG
AACGCCCTAA
GCTCTAATCT
GAATTGAGTG
GCCCTTTTAG
GGGAAAGCCC
TTTAAGCAA.G
ATAAAGATAA
CGCAGATAAT
AGCTTTCTGA
CTAACAACCA
GGCGTTTTTT
GCGCTTTGGA
ATGAACGCTT
GCGGCGTTAA
CCCTTTTCTT
GCGCTTTAAA
ATAACAAAAC
TGCGTTTGA.A
TAAAAACCTT
ACAAAGAAAA
TAGAGCGAGA
TAGCCAATTA
AGCA.AATGGC
CCCAACAACA
TTATTAAAAA
AG
CCCCATGAGA
GATGACTTTT
CATGCTTTAC
GGATCAACTT
TAACGCTAGC
ATTAGAAAAA
AATCAAAGAA
GAATTTAGAA
ACTCAAAGAG
CGCCTTATGC
CCTAGAATAC
AAAAATTAA-A
TTTACCCCTA
TCTCAAATCA
TAAGGAATTA
CGAACACACT
GTTTTTGA.AT
GACTAAAAAG
TATCCGCTTA
TTTTTTTGCT
GATTCCCAAA
AAACTCAATG
AAAGAAATCA
CAGATCCGCC
TTACAAGCTT
TCCCAAGCGC
GAGGCATCAA
CCTAAACTCT
GATCAGGTTT
A.ACGCCCTTA.
AACAAGCTTC
GAAAAACGCC
TGCGCTAACC
AAAAACGCTA
TTAAAAACCT
GAGACCATGG
CCCTTCAATG
GATCCGCATG
TCTTTTTATT
CTAACGCCAA
AAAGTTTAAG
AAAAGGCTAA
TTATGGACAC
TAGAAAAACA
TTTTTTTGCA
ACGCTTTAGA
TTCATTTGCT
TAGAAAATGA
TGA.ACCATGA
AAAGCCAAAT
CAGAGACTCT
AATTACAGCA
AAA.ACAACAA
TGAATATAGA
CGTTAAACGC
TGAGCGATGG
GATTCCCTAG
TTTTCTAACC
ACTTTCGCGC
AATCTATCAA
CAGCACCCTA
TGATGCACTA
CTTAAAAGAG
AGAGCATTGC
AGTCCAAGAA
CTCACGATTG
AATACAGGAT
TTTTCAAGCC
CCAAGCTAAA
AGGCTCG.AGC
ACGCTACCAA
AGAAAAGCAT
ATTTTTAAGC
CCAAGTTTTA
TTTGAGCGGT
CTTTAAAAAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1152
INFORMATION FOR SEQ ID NO:148:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 894 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...894
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:148:
TTAGTATTCA
TTCGTGAAAA
GGGTTTGTAA
CTTTTTTTGG
TTGCGTAAGA
CCATGCATAG
TTGCAATATT
GATCCTTCTA
AAATACGGAA
CGATCGCAAA
CTGGATAATA
GGGCATAAAT
GTCATGCTTG
GTGGTGCAAC
TTAAATCGCG
AAAAGCCCTT
TAACCATAAT
AAATTCCCAC
CGGCGATGGC
TTGTTTTTGA
AAGTCAAACC
TGTTGATGCC
CGCCTAACTT
AACCCATTCC
ACCATTTTCA
ATCTAAAAAA
ATTTGGCGCG
CTAAAGAAGT
AGAGCGATAA
CTTCAGCCGA
TTTTAAAAAT
GATTAAAGAT
CAAAAAACAA
TATCATTGTT
GAAATGTTTG
CGACGCCGGC
AACGACTCAC
TTTTTACTTG
TGATTATGCG
CATCCATATT
TATCAACAGC
TCGGGTAACA
GCCTAACGCG
CTCCTTTGTC
AGAGATTCAA
AGGTTACTAA
TTTAACCACT
GGAGCAAAAA
GTGAGTTTAA
CCTAATTATG
TATGTGGTTT
ATTAGTGGCA
TCATGGCAAG
ATCTCTTTGA
TCTTGCATTA
CGTTGGTCGC
GAGAGCGAAT
CACAAACGCA
TTGTTAGCGA
GATCATGAAT
ACGTTACTAA TAAGTTAAAA ATTGTAGAAA AATAACGAGA AGATGAAAAA AGCGGGTTTT ACGCCAAAGA TCCGAATGTG AGAAAAATCA AAATCCTTCA TAAAAGATAT TAACGGTCCG TTGAAAACCC TTTGTTGCTT CGCGCGATTT TATGAGTAAA CGATCAATTC TAAAAAAGGG GCCTTGATGT GCGCAAACAG CATTATCAGG TGGCTTGAAC TAGCGCAAAA AAGCCCGTTT TGGGAGACTA TGGCTTGGCG CACAATTTAA CCCATTGACT GCGCGATTTT GCGT
120 180 240 300 360 420 480 540 600 660 720 780 840 894
INFORMATION FOR SEQ ID NO:149:
SEQUENCE CHARACTERISTICS:
LENGTH: 537 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...537
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:149:
CTAAGGCGTC TAGCTTTGAA TACAATGAAT AGCGTGTTAG AATGTAAAGA ATTAGCGCTC 60
TATGGAGGGA
GAATTATTGC
CCATGTTTTT
CCTAGGGTGC
ACCGTTCTTC
GATTGTTTGA
TTACTGGTTT
TTAAAAGGCA
GTTTTGATCC
CATTCGTTCA
TGGACGCAAA
TGTTGAGCGA
ATTTCCAAAA
GGCATCTTTC
TTGAAAGGAT
TTGACGCGCC
TTTGCACAAG
GCTGATTGTC
AACCCGTTTT
TTTTGAAATC
ACTCTACCGC
TTCTTGGACT
TGGCTATGAA
GATTTCTTCT
GCTCATTTAG
TTGCCCGCTT
AAGGAATTAG
AAGCAAGAAA
CCTAAAACGC
AACGCTAAAG
AAAATTCAAT
AGCGCGATTA
CCATTATTGA
ATCAAAACCC
AAAGAGCTTT
GGGCTGTGCC
TTTATTTAGT
AGCTTTTAAA
TTAAGGGGCG
GGGCTAGTTT
GCAAACTTTA
TTTCAAAAAG
AAAGGGAATG
TACGATAGAA
CATAGGAC
AAGGGTGGAA
TTATCACCCT
AGGGGTT
INFORMATION FOR SEQ ID NO:150: SEQUENCE CHARACTERISTICS: LENGTH: 807 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...807 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:150:
TCGTCAATAT
CTAGGAAGCG
GGGTTAATGG
AAACAAGATT
TTTGGGTGTG
AAAAAAGCCA
CTACTAGGAA
CAATATTACT
CTACACCATT
AAAGCTTGCG
GCAAAGAATT
AGGGGGTGCT
AAGCAAGCGG
GCTCTCAAGG
TTGATTATAA
TTAAAAAAAC
CAGAGCCAGA
TCGCTCAAGC
TTTTTTTAGG
TTCAATTTTA
ATTTATACTA
CTAAATCTTG
ATGGCGTAGG
ATTTAAAAGA
TTAAGGAGGC
ATAATTTAGG
TAGAAAACTT
AATTAAAAAT
TTATAAGGGA
CCTTTTTGGG
CGCTAAAGAA
TAAAGCGCAT
GGCGTTTTAT
CACTAAAGGT
TAACGGACAA
TGAATTGAAC
CACGCCTAAG
CAGCCCTGGG
TATCGTTCGT
GGTTATGCAA
TAAAAAAGGC
AGAGCTT
TTACATTTCT
GTCTTGTGTT
CTTGTTAGTT
TTTGAAAAAG
GAAGAAGGGA
TGTGAATTAA
GGCGTTTCTA
CATGCTGAAG
GATTTAAGAA
TGTATTAATG
TATTCTAAAG
TACAACGCCC
TGCAAATCAA
CCAATAAGAG
TGGGCGCGTT
TAGGTATAGA
CGTGTGAGTT
AAGGAGTGGG
ATGATGGTTA
AAGACGCCAA
GGTGTACGGT
AGGCTCTTGA
CAGGATACAT
CATGCGAATT
AAGGCACAGC
GCGTTAAAGA
ATATATCATG
GTGTTTGAGA
GAGCGTGAAG
AAAAGAAGGT
AAAAGACTTG
TGGGTGTCGC
AAAAGCCTCA
CTTAGGAAGC
TTTGTATGAA
GTATGGTGTG
AAAAGATGGT
AAAGGACGAA
AGCATGCGAC
120 180 240 300 360 420 480 540 600 660 720 780 807 INFORMATION FOR SEQ ID NO:151: SEQUENCE CHARACTERISTICS: LENGTH: 258 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...258 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:151:
CATTTTAACA
TATAACGCGC
AAATCTAATG
GTGTTGGGGA
AAAATCGTAA
TCTTTACAGA AGATAATCGT GAAATCAGCG GGAGCACTGA CAAGCTCAGT TGAATGGGGA ATACAAATTA TTGCAAAATG CGGTGGTTAG AGAAGTGGGA TCATCACCGG CGATGAAATC ATTTTAAACA AAACCAAGGG TTATGCTGAT GCGCGAAGCG CCCGCTAAAT TTGTGTTTGA TATGGAAGAC ATTAATGAAG
GGCTAAAT
120 180 240 258 INFORMATION FOR SEQ ID NO:152: SEQUENCE CHARACTERISTICS: LENGTH: 276 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:152:
CCCAAATCCC
GAAAACACCA
ATCATGCTGA
TTGAGTAAAG
AGGATAAAGA
CTAAAAAGAT CGTAAAAAGA TACGAAAAAA GCGCTTGGAA AATGCCAAAA CCCACGAAGA CGCTTTAGCG AAATGCTTTG CGCCTGCAAT TTTTAAAGCC ATAAATTGAT ATTGGCGGTT GTGGCTGCTC CCATTATAAA GATGCTTAAA AGATTTGGTG GAAAAAAATG GGCAAAAAAC ATGCGATAGA TTTTAAAATG TATTGGCTTC TTTAGCCCCT AGCTTT INFORMATION FOR SEQ ID NO:153: SEQUENCE CHARACTERISTICS: LENGTH: 555 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...555 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:153:
AATAAGGATC
GGCTGTTTAG
GACGCTCTCA
ATTTATAGTT
ATTAAGGTGC
GGCTATTTAG
AACGGGAATT
TTTTTAGGCT
AAACGGGATG
ATTTGATGTT
GGGTTTTGCA
ATTCTTTAAT
TCACGCACAG
AAATCATTTA
ACAAATACCC
ATTACGGCAT
CAGCGAATTG
ACACAGAGAC
AAACAAGTTT
AGCTAAAAAC
TTCTGGGATT
AGATATTGCA
TGATTATGAA
TAACACGAAA
CATGCACCAA
GAGCAAAAAC
GATCTTCAAA
AAAAAAATCG
AGCTTATTTG
AGCAGCGCTA
AGAGCGATTA
AGCAATCACA
GTGTGCTTAT
AAAGTAGCGA
GCTTTTGAAA
GCCAAGAGTT
TTGGCGTGGG
TCTTACCTTA
GAGAGAACGT
AAAGCGTAGC
ATAACAAGCA
TAAAGGGGCT
TCATTGATGA
ACAATTATGA
ATTATCAAAA
TGTGTTAGTG
TGAGCAAAGA
GAAAATCGCT
GAGTAGGGGG
ATCCACTATT
TAAGGCTAAA
TAAAATCGTG
AGTGTTTTTA
GATGCTAGAA
120 180 240 300 360 420 480 540 555 GGTTGCGTTG GGTTT INFORMATION FOR SEQ ID NO:154: SEQUENCE CHARACTERISTICS: LENGTH: 645 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...645 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:154:
GGAAGAACAA
GATAAAACGC
TCCATAGATA
AAGACTAGCC
GAACTCAAGG
AATAAAATTT
GACTACCCCA
GTGAATGATT
TGAAATATTT
TAGATATTAT
ACGATGCCAA
AGCATTTTGA
ATAAAAAAGT
CACGATTAAA
TTGTAAGTTT
ATTTAAAAGC
ATGGCTTTTT
TAAAACCATT
TTACGCTTTA
TGTTTCTCAA
CCATCTTGTA
ACTTTATGAT
AGATCTATAC
CCCTTCTATC
TTAATATACG
CAAAAACTTC
AAATTGCATG
AACAAAGAGC
GCGCTTGTGA
GTGGATACAG
CCTTTTGCAG
GCTTGGATGA
GCACTAGCGG
CTATAGGGCT
CTAAGATTGA
AAGTCTTAGC
AAGGTGCTAT
GCGTGGCGGT
GAACGCTCAA
CGCACAACAT
AGCGCCTGAT
ATTATACGAT
TTTTGCAACA
AGTGCGCTAC
GAACGATTTA
CAATTACGCA
AGAAAACGGC
AAAGACTTTT
GGCCATTGTG
TGTTTTTTCT
GCGTTATCAA
120 180 240 300 360 420 480 540 AAATACATTG GACCAGGAAT CACAAACATC AAAGAAATCA TCAAAAACAA CCGACTCAAT ATTTTCCCCA AATGGGCGAA CGCTGAGCAA ACGGAGTTTT ATTACACGCA GATGGCGGAA AAACGCCCAT GGTTT INFORMATION FOR SEQ ID NO:155: SEQUENCE CHARACTERISTICS: LENGTH: 528 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...528 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:155:
GCAGCTATGA
AAGATTAACA
ATCATTGGTT
GAGCAATTCA
GTGCGAAATA
AACAACCCTG
AAAACGGCTA
AATGGTGGCA
TACGCTCTCG
TGTTTAATAA
GCGCTCAAGA
ATGGTAATGT
AAGAGCGCCT
CTGATGACAT
ACAATTACAA
ACGGCTCTAA
ATACCACCAA
TGAAGAACGC
TGATATAGAC
TCTCATTAAA
TTCTACAGGT
AGCCCTTTAT
TAAAGCATGC
GTATCTTATC
AATTTCGGTG
CTTACCCACA
TCCTTTCGCT
AGCGCGACCG
AATACAGAGC
ACCAATGGCA
AACAATAATA
GGTATGGCTA
GGTAAGGCAT
TATTATTTAG
AACACCACTA
CACAACACCA
GATTTTACAA
ATGTTTTATT
TTAGTAATGT
ACCGCATGGA
TCGGCAATCA
GGAGAAATAT
GCAATTCTAC
ATAATGCGCA
CTCCTAAT
ACCGCTCATC
GAAAGCGAAA
TAATCTAGAA
TACTTGTGTG
AAGCATGGTG
AGGCATCAGT
GCCTACTGAG
TTCTGCTAAC
120 180 240 300 360 420 480 528 INFORMATION FOR SEQ ID NO:156: SEQUENCE CHARACTERISTICS: LENGTH: 564 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...564 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:156: ATTTTTAGAG CTTTTAGAAG AGGATTTTTT AGAAAAACAT TTGCATATCG CTTTACAGCA CACCATGATT TCATGCTAGA GAGAATGAAT CGAAGAAACC GCACTAAAAG CGATAGGGAA 120 TTATTAGAGA TAATCGCTTC TAAGAATTTT GCTATCGGCA CGGATTTTAT TGTGGGGCAT 180 CCGGGCGAGA GTGAAAGCGT TTTTGAAAAG GCGTTTAAAA ATTTAGAAAG CTTGCCTTTA 240 ACGCATATCC ACCCTTTTAT TTACAGCAAG CGAAAAGACA CCCCCTCTAG TTTGATGCGT 300 GATAGCGTGA GTTTGGAAGA TTCTAAAAAG CGTTTGAATG CGATTAAAGA TTTGATTTTT 360 CATAAAAATA AGGCGTTCAG GCAATTGCAA CTCAAACTCA ACACGCCCCT AAAAGCCCTA 420 GTGGAAGCGC AAAAAGACGG CGAATTTAAA GCCTTAGATC AATTTTTCAA CCCCATTAAA 480 ATCAAAAGCG ATAAGCCTTT AAGGGCTAGT TTTTTAGAAA TCAAAGAGTA TGAAATTAAG 540 GAGAGGGAAA ATCATGCCGT TTTC 564 INFORMATION FOR SEQ ID NO:157: SEQUENCE CHARACTERISTICS: LENGTH: 237 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:157: AAAGCACTGC CACAAGGTCG AAACAGAATG CGGTCGGATG TTGAAGTTTT ATCGCCTTTG CATAAAATAG ATGAAAAATA CCTTTTCCAT TTAAAGATTG CGGGGGAATT GGCGAGCATG 120 GGTAAGATTT TAAGTGTATA TTTAGCCCAC AAGCACAGCG CGTATTTCAT TTTAAACGCT 180 TTGAGTTACG GCTTTAGCCA CCAGGATAGG GCGATCATTT GCTTATTGGG CGCAATT 237 INFORMATION FOR SEQ ID NO:158: SEQUENCE CHARACTERISTICS: LENGTH: 705 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...705 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:158:
AAGGGAACAA
GACACGCAAG
GCCGATATTA
AGTTACGCTA
AAAACCCAAG
AATAAAGAAA
TTAGAAAACA
TTTATCAAGA
AGAGGGAGGG
TCTCAAGGGG
AGAGGAAGCA
ATTAGGATTG
AAATGAAAAA
TGATGGGCGA
TTATTATCAA
GAAAAATGGC
GCAAAGAGCT
AGATTAACGC
AACATTTAOA
TCCAAGAGGG
CTAGGAGTTT
TTCAAGAAGT
ATATCGCGCG
GGAGTTTAGA
AGTCTATTTT
AAATTTAAAG
TTCTTGCACC
ACGATTGGAT
TTTTGAAAAA
GCTTTTACAA
CACCACGATG
CTGTGATTTT
TGAAAAGAGG
GGTTTTAACC
ATTGATTAAA
CCTAATCAAA
AAAACTTTTO
GATTTTAGCG
GTAACCAATG
AAAGAAGTGC
GGGCTTTTAA
GAAAAAAAGC
GTGAGCGAGT
GATTGCAATT
AAAATTTTAO
GGCACTAACG
AAATTAAGCC
TCAACGATGA
GOTGTAGGAC
CGACTTTAGA
GGACCGATAG
TATTTACTGG
AGGGCGTTTT
GTTTTTTCAT
TTGTGGGAAA
ATTGCATTAT
AGCAGGTGGG
TGGGGAGCTA
AGATTACTGG
ATTTT
GAATCTTTTT
AGAACAAGAA
CGCGGTAAGG
TTGCGGGGTG
TGGGCATGAC
AGATGACAAT
AACCAGGGCG
CCCAAGCGTG
CCTTTTATGC
TGGGAAAGAT
ATTGAAACGC
120 180 240 300 360 420 480 540 600 660 705 INFORMATION FOR SEQ ID NO:159: SEQUENCE CHARACTERISTICS: LENGTH: 303 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...303 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:159:
GTTACGGCTT
AAAAAAATCC
ACCTTGCAGT
CATCATTTAA
TACTTGGCTA
OCT
TAGCCACCAG
TTAAAGACTA
GGCTGAGTTT
AATACACGCT
AAGAAATGCT
GATAGGGCGA
CGTTATCGCC
TATCCTTTCT
AGAAAAAAAC
CCCCAAACTC
TCATTTGCTT
CACATGAGTG
TTAGCCGAAA
AAGCTTGTGA
ATTAAGCCCA
ATTGGGCGCA
CGATGATGCC
ATTTGTGCCT
TCCATTCTAA
TTCCTTGGAC
ATTTAGCCAT
AAGCCTTTTG
GACAGACAGC
TGATGCGCTT
GATAGAGTTT
INFORMATION FOR SEQ ID NO:160: SEQUENCE CHARACTERISTICS: LENGTH: 1227 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1227 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:160:
AGCGATGCGT
CAAACCCATT
TTAGAAAACA
TTAGCCAATA
TTAGGCAAGG
CATATTGGCT
TTGTTGCTCT
AATCCGGAAA
GCCCCCTTTG
ATGATGCGCG
TACATTCAGG
ACCGAAAGCA
GAAGACCACC
AAACACCGAA
CTGTCTAAAA
ATGTCTTCAG
TTTGACCCTA
ACCTTTAACC
AATTCTTTAA
CGCTATTTGT
CCTCAAATTT
TTTTAATCAT
TCAAACGCAT
ATGAAGAAAA
AATTCATTTC
GCGAGATCAT
CTATCAGGCA
CCACTAAATC
TTTTAAACGC
GTAAGAGCAT
ATGTGTATCA
TTTTACACCC
TTGAAGTGAA
AAAAAAAAAT
GAGCCTTGCA
CCACTCAAAG
TGATTTTTGA
ACAAGCACTA
GCAAGATTGA
GAAATCCCAT
GGTTTTTATC
TTATCCCTGT
ACAAGATTTT
GCGTGTGGTT
TAAAGATGAA
TCGTTTGCCT
GGAGATTGAT
AAAAGAATAC
TTTAGTCATC
GGTGTACCAT
TTATTTATAC
AGCCTTGTTT
CACTAGCCCT
TTTTGATTTT
GGGTTTGATT
CAAAACAGCC
CATTGTGGTT
TGTGTCTATG
TAAAAACGAG
GATTTTTCAA
TTTGCATTTC
CACTAAAGTG
AGCGGAG
AAAGAGCAGC
TTGAGCGTGA
AAGCTCATTT
AATATCCCTA
GTGCCTTTTG
AGGATTGTAG
CAACCACGAG
CAAGTCAAAA
ATTGATATGC
TTGCACAAGC
AAGTTTTATC
TATGGGAAGA
GTGGTAGGCA
ACACCGGTTT
TTGAATGAAA
CAAATGGATT
ATTGTCAATC
ACCGATATTA
ATGCCTTTTG
GAAAAATTAG
GCATCATCCA
AAAGAGATAG
TGATTGATGA
GCACCCCTAG
GGAGCATTTT
GGCTTTATCG
ACATTCTTTT
GCAATGTGGG
GATTGCAGAG
GTTTAAAGAG
ATAAATTTTT
GTTTTATCCA
GAGAGCTTTT
ATAAAACCAA
GTTTAAGCAT
TGGGCTTGTT
ATTACGAAAA
AAAACCCTAT
AAGAGTGCAT
CGTTTTTGAA
TAAAATCATT
CGAAAAAACT
ATTTGAAGTT
AGAGTTTGGG
TGCTTATAGG
CAACGATGTT
AGTGGCAGGC
GCAGTTCCCA
CCGAAAAGCG
CTACAAGCTC
ATCGCTAGAA
AAAACTCCAT
TTTCTTGAAA
CACCTCTGGC
TAATGAAGAC
GCTTTATGAT
TTTAGCCAAC
CATGTATCTC
CACGCAAACG
CGATGATCAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1227 INFORMATION FOR SEQ ID NO:161: SEQUENCE CHARACTERISTICS: LENGTH: 192 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...192 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:161: CCAAATGAAG CGAGCGCGGC TGTTAATGTG GGCTATAAGA TCAGTAAGAG TTTGACAGCG AGCGTGAAAT TAGAATATTT GGGCGTGATG ACGCATTCAG GCTTTACGGT AGGGAGTTAC 120 AGACCCACGC CCGGCTCTAA AGCACTTTAT TCAGACAGGA GCCATCTAAT GACAACCCTT 180 AGCGCCAAAG TC 192 INFORMATION FOR SEQ ID NO:162: SEQUENCE CHARACTERISTICS: LENGTH: 255 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...255 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:162: TCCATAGTAT ACGGTAAAAA TAAATTATTC TCAGATGAAT TTTATGAAAA AATCGAGGAT ATTTTAACAA ATAACAACCC AAGATACAAA CAAGTGTGCA TTATTTTTGA TGCAGACATA 120 AAAAAAGAGA ACCAAGAAAG CGATGCAGGT TTTGATAACA AGCTTAAGCA TATTCGTGAA 180 AAATTCAAAG AAAAAGGGAC TGATTTTCCC AAAGAACAAA TCTTTTATTC CCTAACAATC 240 AAGATGATGG CGATT 255 INFORMATION FOR SEQ ID NO:163: SEQUENCE CHARACTERISTICS: LENGTH: 438 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...438 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:163:
GTGCAAGCGT
AATAAAAAAA
GTGGGTCAGG
GTTTTAGAGG
AATCAAGGCG
AAAAGGGCCT
GTGATTGATT
ATGAATACCC
TTGATTATAA
AGATTGACAT
GCAATATTTA
GAAAAGTCGG
GATCGGTTAT
TGCTTGATGG
CTATAGCGTG
CCTTCTCA
AATTGAAGTT
AGCTAGGGGG
TGCGGATTTT
CGGCACAATC
TTATAATTAT
CACGAGTATC
CGGGAACGCT
TTGGCAGAGT
ATTTATCCTA
TTATCCAAAA
CGTGGGATTG
ATCGGTTATT
CATGAGTGCG
AGAGCCAATA
CCTTTTCTAA
CAGAGACTTT
GCCTTAAAGA
CTTATGACAG
GGGATGGCTA
CGCTTGGCTC
AAATCCGCCG
AGTTGGCTTT
TGTAACCGCT
TCAAGGGCAT
CACGAAATTC
TTTAGGGGGT
TGATGGCAAG
TAATTACTTG
INFORMATION FOR SEQ ID NO:164: (i) SEQUENCE CHARACTERISTICS: LENGTH: 741 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...741 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:164: TTCCGCTATA AGGATATATT TGCGGCTAAA GGGGGGCGTT ATCAATCCAA TGCTCCTTAT
ATGAGCTCTT
AGCCATAAAT
TATGATTTTT
CATTTAGTGG
TCGCCCGGGA
AATGGCGTGG
CTTAAAAGGG
ATTAGGCAAC
AAAAACGCAA
AATAGCGTTT
ATACGCAAGG
TATGGTGGTT
ACTCTCCAAG
ATTACACTTA
CTTATTATAG
GCTTTAGATC
ATACTTATCG
GATTTGATTA
ACGCTTACAT
ATGATATAGG
CTTTGAAATC
TAGCTCATGG
AACCGTGATT
CGAAAGGAAA
CCCTGGGGTG
CGAAACGAAG
TTACGCTGTG
CAATGAATTT
AGGCACGACA
GCAAGCCTTA
CGTGCATAAA
A
AGCGCTAAAA
GGTAGGGCGT
AAAAACGGGC
GGGGTTAGCG
GCTGTAGGCT
GCTTATATTT
AAAGCTGGCA
AACTTTGGGG
GGAAACCCTC
AGCCATGTGG
AAATGGCTGT
TCAAGGATAA
TCGCTTATGG
GCACTCTAAA
TCAGCCCCTT
ATGATAGTAA
TGCTCCCTGT
CCGCTGGGCA
GAGCGTTTTA
TAGGCATTGA
TAACCGCTGA
GGGGGACTTT
AAATGAGGGG
GGAGTGGATT
TTATGGTATC
TTTCCAGTTT
CCCTAATTTT
CCATGCCCCC
AAGCCTGCTC
TAAAGTATGG
TTTTTGGACC
TGCCGTCTCT
GTGGCGTTGG
120 180 240 300 360 420 480 540 600 660 720 741 GGTTGGGTTT TTGGTGGAGG ACTAGCGGCG CTTTAGCCAA INFORMATION FOR SEQ ID NO:165: (i) SEQUENCE CHARACTERISTICS: LENGTH: 390 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...390 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:165: TTTTCCCAAA GAACAAATCT TTTATTCCCT AACAATCAAG CTATTATTAG AAATTGCTAA ACACGATGAA TTTCTTAAAT TGTATTAAAA GTAAAGAACA TTATAAACCG ATTAAAAACA GCCTATTTAG AAGCGCTTGG ATTAGAGAAT TTGACCAAGA AGTAAAGGTA AAATAAAGAG TAGATATGAA GAAAACTACA ATTGATTTCA GCTCGAATTC TCTCATTCCC CTTAAAAATT AATAAACAAA AAACAAATCC TAAAATTTTC ATGATGGCGA TTTAGAAACC GCTTTGAAGG ATACTTAGAA TAAGAAAAAA TATGCTGTAT CCAATATAGA TGTTTTTGAT AAAAATTAAC AGAAGAAGTT TTCTAGGGCA ATTCGCAGAA 120 180 240 300 360 390 INFORMATION FOR SEQ ID NO:166: SEQUENCE CHARACTERISTICS: LENGTH: 213 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:166:
ATTTACTTTA
TCTAGCTTGA
GCTAAAGAGC
CCTTTGAATT
GCGCCAAGCG ATTCAACACC AAAAACACGC ATGGCACGGG TTGTACTTTG TTGTGGGCTT ACTCGCTCAA GGGCTGGATT TAAAAAACGC TATCACAAAG TTTTAACTAT CATCATTCAA AACCCCTTAA ACATTGGGCA TGGGCATGGG TGTGGAGCAT CAAAAAGCAT GTT INFORMATION FOR SEQ ID NO:167: SEQUENCE CHARACTERISTICS: LENGTH: 699 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...699 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:167: CGCCCTTCAT TCCAAAAACG CTGCGCCCAT GTGCTAGAAA AACAAGATAA AGACCTTTTT ACCAAAAGCG AACAGGAATT AAAAATCAAT CAGCTGGGCG ACATTGTGCT TAAAAATTTA GACGATTTGC CCGTTATTGA AATGAAGAAA ACCCCCCTAA AAGAATACAA AGAACTTTTG CAAGAGTTTT TGAATACAGC CAACATGCTT GTGGTTGAAA ACGAACGCAA AAAAGCTTTT GATTTTTACG CTCTGTACCC TAACTTGGGC AAAGAGAAAC AGCTTTTTTT AAAAACCACC GCTCAAAACA CCCTTTTAGG CTTTTCTTTT CTTTTTGCCT TTGAAGATCG CTTATTGGAC
TTTAACGCTA
CTTTTGCAAG
CAGCCTTATT
GTGCCTGCCC
TTAAAAGAAA
AGCGAAAAAT
TTAAGCTTGC
TTACAGATGA
GAAATTGGCG
CTCCAAAGGA
GTGGAAATTG
ACGCTAGGG
CTTTACCCAT
TGGGTAACAG
TTGCGACTAT
CAAAAATCTT
TATTGAGCGC
TAATCCATGC
AATCGCAGGT
AAGCCAAAAA
GCGTTATTTA
CCAAAGAGGT
TGTGCGAAAA
TCTTTTAAAA
GATTATCCCC
GCAAAGAAAT
AGACGCATTG
CAAAGACAAC
TAAAAGCTCG
TTTAAGCTTT
ACAAAGCGTT
TTTAAAAGAA
TTTAAAAGAG
AACCCCCATG
120 180 240 300 360 420 480 540 600 660 699 INFORMATION FOR SEQ ID NO:168: SEQUENCE CHARACTERISTICS: LENGTH: 462 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...462 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:168: CTATTAGAGA TGATTTTTCG ATCAAGGCGT TCAAAATGGG GGCCGTTATG CAACGCTCAA ATCATTGAAT GCGTAGCGAA CGCTTTAGAA ACTTGTGATT TTGGGTTGTG CGTTTTAGAT CCGGTGATGG TGGCAAAGAA TGGGGCTTTG CTTTTAGAAG AGGAGGCCAT TTTAAGCTTA 120 180 AAAAAACGCC TTTTACCTA
AACCAATTTA
CTCACAGGCG TTCAAGCGCG
AGACGATAAA
GATTTAGGCG TTAAAAACGC
TGTGATTAA
TTTAGCAACG ATTGGGTGTT
TTTAGAAGAC
ACACCAAA CACGCATGGC
ACGGGTTGTA
CTAACCCCTA
AGCGCTTCAA
GGGGGGCATA
GCTGAATTTA
CTTTGTCTAG
ACCTCCCTGA
AGTCTATGCG
AAGCGATGGG
TGTTTTAAGG
CAGAACATTT
TCAAGGGGAJ\
CTTTAGCGCC
AAGCGATTCA
CT
240 300 360 420 462 INFORMATION FOR SEQ ID NO:169: SEQUENCE CHARACTERISTICS: LENGTH: 969 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...969 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:169:
TGGCTTCAAT
GGTAAAAATT
CATACTTTAG
GAAGGAAACG
GAAGAGCTAG
GTTTGCTCTC
ATCTATACCA
CAACAGCTCA
AGCCATTTAA
GCGCAAAATT
AGCCCTATTT
GGGTTAGCCT
CAGCAATTAA
AATAAAAAGC
GGGTTAAGGG
TTAGATATAG
GCGTTCCTT
CAAGGTGTTT
TGAATAAAGT
GCTATACCCA
GGCCAGAGGC
AATTCAGTAA
GTTTTGGCGG
ATTGCGCTCA
TCAATCAAAA
ACGCCGGGTT
TTTCCACCAC
TAGGGGTTA
ATTACGGCAT
GCTATGGTGG
AAGACCATCC
GCTTATACAA
TAACCGGGTT
TAGGGATAGC
GGTCATTTTG
TAACGGGAC
ACGCTGTGAG
TGGGCATGAA
CAAAAATCAG
AGTCCCCGCT
CGCCCTGCCC
GAATGCGCAA
TTCCACTACC
TGTTAAAATA
TATCCAATAC
GGGAATGGAT
AACTAAAAAG
TAGCTACTAT
TAATTACCGC
GCCTTTTGCC
GCATTAAGAG
ATGACTTATC
GGCGGGCATG
GTGCGAGACC
CCCGCTTTCC
GGGCTTATAG
ATTAATTTCG
AATTTTGCAA
ACTTACCGCT
GGCTACCAAC
AACTACGCTC
GTGCTGTTTG
GTTTTTGCTT
GTCTTCAATC
TACAAGCATT
TTAGTTCCGA
ACATTATCCA
AAAGGGTGCG
AAGTGGAGAA
ATGATGGTTA
CTAGCAATTA
GOTTACTAC
CTAACCTAAA
CCTCTATGGT
CTTCAAGTAA
ATTATTTCAA
AAGCTAACGA
ATTTCATCAC
CCTCTTTTGG
AAGTCAAAGG
CTAAATATTC
AAAATTGTTC
TGAATATGGG
TTTATGCCAA
AAACGGCAAA
CACCTATGAT
CCCCAATTCC
CGCTGTTTGG
TAGCCAAACC
CAGCGCGATC
GAATTTTAGA
TGACTACATC
TGAAAAAATC
CACTTACACC
GGTGTTTGGG
AAGCGGTAAT
CATAGCGTTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 INFORMATION FOR SEQ ID NO:170: SEQUENCE CHARACTERISTICS: LENGTH: 426 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...426 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:170:
TATGATTTCC
ATTTCGTACC
AAACCTATCG
ATCACTCGCA
CTTTTAGAAA
CAAGTCGCTA
CTTAACCTCA
TATTTT
TAGGGGTTTC
GCTTGAAAAT
TTAGTCGCAT
ACTTGCCCGA
TAGCGTTAAA
TCACGCATTA
AAGCCTTAAG
CTTGCATGCC
CGTTGATTCT
TAAA.ACAGCC
TTTGAAAAAC
AAAAGAAAA.A
TGATCACAGC
CGTTAAAGCG
CTTAGCCCCC
AGAGTGATGG
CCCTATGTTT
CCCTTGCTCC
GAGCGAGTCA
TATAAAAACG
AGTTTAGATG
TAGAAGA.ACA
GTGAAGAGTA
TAGACTATCA.
AAATAAAGCT
TTGACTGCCT
GCACCACTAC
GGAGATATGC
AGAATTTTTA
TTCTGTTTCT
TTGCTCTATC
AGAACGCTTC
TTTAAAxJAcC
CACAAGCATT
TGTCTTTAGA
120 180 240 300 360 420 426 INFORMATION FOR SEQ ID NO:171: SEQUENCE CHARACTERISTICS: LENGTH: 1032 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1032 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:171:
GAGTGGTTAA
GGGTTGTTGG
ATTAAAGATT
GCTAATATTT
TTTGTTGAAA
TTAGACGCTG
GGTAGCACTC
ACCAGAAAAC
GAAATTTTAG
ACCGCAAGTT
ATGTTAGTCG
TCACTCTCTA
TGCTAAAAJA.
CCGTTCTTGT
ATCGCCCCAG
ATGATAAGGA
GCCTTCTAGC
TCATGCGCGC
TAACCCAACA
TCAAAGAAGC
AGCGTTATTT
TAGGGTATTT
CCTTACCTAG
GGGCTAATGA
GATTTTTTAT
CGCTCAAGTT
TGTCGCTTCA
ATTTCGTTTT
GGTAGAAGAC
TATGATTAAA
ACTCGTTAAA
TATCATCTCC
GAACCAAACT
TAAAAAACCC
GGCTCCAAGT
TATTTTAAGG
GGTTTTATCG
TGGGTAACTA
CAGATTTTAG
TATGCGCGTT
ACCCTCTTTT
AACGCTAAAA
AACATGGTGC
ATACGCATTG
TTTTTTGGGC
CTTGACAAAC
TTTTATGACC
CGGTTGTATT
TTTTATTTTT
GATTATCGTA
CGGATAAGGA
TATTGCTAAA
ACAGAAAAGG
GCGTTTGATC
TTGAAGAAAT
CCCCCCACGA
TTGAGCATGG
GGGGATCAAT
GTGGTCGTTA
CACTGAAGGG
TCACACGGGA
AAAAACCCTA
AAAAAGTCTT
AAGCAAAGAA
ATGGGTATTA
TGGCGTGAAA
TCACGCTTAA
AGAAATCACC
CTACCAAAA
TTTAGAI\TTT
CTTTAGGCTG GATTTCTTCT 120 180 240 300 360 420 480 540 600 660 720 AACGAGCTCA AATCCGCTCT CAATGAAGTG CCAATCGTCT ATAACCAAAC TTCCACGCAA 780 AATATCGCTC CCTATGTCGT GGATGAAGTG TTGAAGCAAT TGGATCAATT AGACGGGTTA 840 AAAACTCAAG GCTATACCAT AAAACTCACG ATAGATTTGG ATTACCAACG CTTAGCGTTG 900 GAGTCTTTGC GTTTTGGGCA TCAAAAAATC TTAGAAAAAA TCGCTAAAGA GAAGCCAAAA 960 ACTAACGCTT CTAATGATAA AGATGAAGAC AACTTAAACG CCCAGCATGA TAGTTACAGA 1020 AACGAGCACC GG 1032 INFORMATION FOR SEQ ID NO:172: SEQUENCE CHARACTERISTICS: LENGTH: 351 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...351 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:172: ACGCCCAGCA TGATAGTTAC AGAAACGAGC ACCGGTAAGA TTTTAGCCTT AGTGGGGGGG ATTGATTATA AAAAAAGCGC TTTCAATCGC GCCACGCAAG CCAAACGGCA GTTTGGGAGC 120 GCGATCAAGC CTTTTGTGTA TCAAATCGCT TTTGATAATG GCTATTCCAC CACTTCCAAA 180 ATCCCTGATA CCGCGCGAAA TTTTGAAAAT GGCAATTATA GTAAAAACAG CGTGCAAAAC 240 CACGCATGGC ACCCTAGCAA TTATACTCGC AAATTTTTAG GGCTTGTAAC CTTGCAAGAA 300 GCCTTGAGCC ATTCGTTAAA TCTGGCTACG ATTAATTTAG CGATCGCTTG G 351 INFORMATION FOR SEQ ID NO:173: SEQUENCE CHARACTERISTICS: LENGTH: 624 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...624 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:173:
CCCAGAGACC
AATGGCGTTA
GGCTTTTATA
GGGAATAACA
ATCGGCTATG
GCGAACACTT
GTTTTTGGGC
GAATATTCCT
AGGATCACTT
CCCAAATTCA
ATGACCAACC
ACACTTATAG
CTTTAAATAT
ACACCTTTGG
CTTCCTATAT
ATTTTTGGGA
TCACTTTTTA
GCGTCTCTCA
TGCAATTCAA
ATTATGGGGC
ATAACCCGGA
TCACGCTGAA
CTTAGAAGAC
CCGCCAGGTT
CAATTCAGAC
CAGTAATGAA
TAATACGGCT
CACTTCGGTT
TGCGAATAAA
TGCGAGCTAT
TAGGATCAAT
TGGCGATTTT
GTTT
AGCACGCCTC
TTTTGGTGGG
GCTTTTTTAG
ATCTCAGTAA
TATGATGGGC
GGAGGGATCC
AATGCGTTAG
GCGTCCACTG
AAAGGGTATC
AGCGCTAATT
ATGGCTCTTT
ATAATTTCAA
GCTCTCACAC
CGACTAGGCA
TGGCTGATGC
ATAAGCGTTT
GGCAAGTGGG
AATCGGTTCT
AAGCAGGCTA
GTTAGGGAGG
CTGGTCCATT
CATGCCAAGG
TGCCGGAATG
GATCACTAAC
TGCATGGCAT
GAGGGCTAAT
CCTTAACTTT
TTTTGGAGCG
120 180 240 300 360 420 480 540 600 624 ACCAAGACAG AAGTTACATG INFORMATION FOR SEQ ID NO:174: SEQUENCE CHARACTERISTICS: LENGTH: 396 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...396 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:174:
CCAAGAATAC
CCTACCCTAT
AACGTCTTAA
TATTCAAGCG
AACCCTAAAG
AACATGCTCT
GAAATGCTCC
ACGATATGGG
TGATTTTAAG
AAAAAACTTT
ATGCAATCAA
ATACCGCTCT
TATATGACAA
AATTTGATAT
CGATTTAGAT
CGCTTTAGAT
TAAAGATCGT
AGGCTTTATT
TTTTGATCAT
ACACCAATTG
TTCCAATTTA
TCTCTAAACA
AATTCTTTAA
TTGAGGGTGC
GCGCCTTCTC
TTAAACCATG
ATTAAAATGT
AAGGAC
TCAGCCCGGA
AAGATTACGC
TTATTTTACT
AAACTGATTT
ACGCTTTAAA
ATCAAGGGAT
TCCTAACACC
CCCCACCTTT
CAATCAACCC
GATGATTTTA
CCATTCTTTT
CGTGCCAGCA
120 180 240 300 360 396 INFORMATION FOR SEQ ID NO:175: SEQUENCE CHARACTERISTICS: LENGTH: 366 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...366 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:175:
AGAAGAAGCC
CGTTTTGCGG
GATTATGTGG
AGTGGGGTGG
ATTTATCGGC
CAGCCTAACA
GCTTTC
TTTATTATGT
GCGATTATGC
GTCGTGTGAT
ATATAGAATT
GTGAAGTGGC
GGATTGAAGA
GTTTATAAAA
GACAGAGAAT
AGGCAAAGAC
TAGCGAAGAC
GAGCGAGACG
GGTTTATCCT
TCGTATGCCA
TTAACAAGCC
GGGAAAAACA
AGTAGCGAAT
ATTAAAATTT
AAAATTCACC
TTTTAGCGGA
GTATCGCTTT
TTGAAGCGTT
TGTGCTTGTC
TAATAGAAGA
CGCAACATGG
AGCGACAGCC
GCCTTGCTCA
TAAAAAGATT
CAGTTTCAAT
CGGCCGTATC
AAAAAGAATT
120 180 240 300 360 366 INFORMATION FOR SEQ ID NO:176: SEQUENCE CHARACTERISTICS: LENGTH: 750 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...750 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:176:
AAGACGGCCG
ATGGAAAAAG
ATGGAAGATG
AACGCTTTAC
GGGGGGGATA
ACTCAAGAGC
GAAGACCCGG
GTGGAGTGCG
AGAAAGAGCG
TTTGATGGGG
AAATCCAACC
ATAGAAGAGA
TTCAAAACGA
TATCCAGCCT
AATTGCTTTC
AGCTTAAAAT
AGCATTCTAA
AAAAACTCGC
TTGGGAGAGA
TTGTGATCA
CGAGCGTGTG
ATGAAGAATA
TGGAAAAGGC
AAGTCAGGGA
GCACTCAGTA
CCGCTACGCT
AACAGGATTG
TGAAGGAGAG
TTTAATAGGC
AGAGATCGCT
CAGAAGAGCC
CCATGTGAAT
TGCGATTTAT
CGCGGCTGAT
CGCTAAACGC
GTATGCGATG
CAATCAAGTG
TGTGGGCGAA
AAGCAATGAT
AAGAGGTTTA
AGCGTGGTGC
AAAATGCGTT
CTTTTATCCG
GGTATTTTGC
TTAGGGGTTG
GCCCACCATG
GCGCTTTCTG
ATGCAAGCTT
GAGAGCGGGC
CCTATTATTG
GTGGGCGTGC
TCCTAAAATT
TAGAATTAGA
ATCGCTTTAG
GCTTGATTGC
ATGATATTGG
AGGTGTGCAA
GGCATGAAGA
CAGGGCGTCC
TAGAAGAGAT
GAGAACTAAG
CTAGAAAGAT
AAGTGGTGCG
CACCCGCAAC
GCTTGGAACT
TTTTGGGCAA
AGAACAGCTT
TAAAGCGCTC
GCGCCATAAA
AATCATGAGC
GGGCGCTAGG
CGCGTTAGA
AGTGATTGTC
CGCTAAAAGG
AGAAAATCGT
120 180 240 300 360 420 480 540 600 660 720 750 INFORMATION FOR SEQ ID NO:177: SEQUENCE CHARACTERISTICS: LENGTH: 2412 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2412 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:177: AGTTTGTGGC TTAAGTCAAA ATTCTTTCTT
TTAATGGGCG
CTCAACGCTT1
TTAACCTTAC
GGGATTGAAG
TCTATTCCTA
ATGCGCCAGC
TTTTCTTTTG
GCCAAGATGT
AACGCTCAlAA
CAAGAGCTGG
TATTTAGA.AG
AGCCGCGCGT
ATTATCGCAT
TGGATTAAAA
GCTTTAGATG
GAATACAAA
GAAGGCTCTG
GACAAAGAGA
AACTTTAATA
TCTATGCACA
GAAAGTGCGA
GAGCAAGCGC
CACCTTTACA
AGAGCACGCG
CACTACGATA
TTGAAAGCCC
AATTTGCCCA
TTAGAGGATC
TTCAACCCCC
AAAGCGCAAA
ACATGGCTTT
CTGGCTTCTA
ATTGCTTTTG
TTGTATGCGT
TAAGCCTCAC
GAAACGATAA
CGTCTTTATC
AAGAGGGCTT
AACAATTCAT
ATAGGGATTA
GGCAAATCAT
AAGGCTTGAA
ATGTGAATAA
CTAAAIAkACA
TTAAAAATTA
TAGGTAAATT
ATTACCCGAC
AAAACAACAA
ACTCCCGCTA
ATTTGAGCAA
GCGCGAGTGA
ACGCTAAATA
GCGAATTAGC
TTGAGATCGC
TCTATGATTT
ATCTGCAATA
ATGAAAkAAGC
AAATCATTCA
AACTCTTGTT
AAGATTCTCC
ATCGTTGCA
AAGAAGAAAT
TTATTGCCCT
ATCGTTTGGG
AAGACGCCTT
TGTTATTTTC
TTTTAGAA
GCTCACGCAJz GGCGTTTTC1
TATTATACAC
TACCCCTTTP
TTTACACATC
TAAAAAJAGCC
AGGCTATGAT
TTTCCCCATT
CAAACCCCTA
AATGGATTCG
CCCTCAAACG
AGGCATTAA
TGATCCTAAT
TTACAAACAA
CGCTCCCTTA
CGCTAATATG
AATCGCGCTT
TCTCATTGAT
CCTAGAATTG
TCACTTGCTC
AGGGGCGTTG
TTTGCAAGAC
CCTTTTTTCA
AAATTTCCCT
TGACAATAAG
CTTGATCCAA
AiGAAGCTTTA
CCAAGCCTTT
~AACGCTTTA
GCGCAATTAC
A.ATCCTCGCT
AGATTACATG
GCATTTCAAA
iGGCAAAGAAC:
TGCTCTTATG
GCTAAACGCC
LGAAAACGCTT
AAACCCAAAG
ATCCCCCTTT
CAAAAAATCC
ATCATTAAAG
CTCACCACAA
CAAGCCTATT
ATTTTTAAAA
AAATCCTTAC
ATCCCTGAAG
GCCGTGCGCT
GCCCAAATGC
CTTTTTAAAG
AATTGGGCTG
AAGGTGGTTC
CTCAAGTTAT
CTCAATCAAG
TATGCAAGGA
CATGCGGAAT
ATGGAAGGGA
AATTCTAATG
CACTATGCTG
AAAACGCTCA
AAATACTTAT
GATTGCTTGT
AAAGCGGCTA
TACCGCTTAG
CAAAGCTTAA
CAAAACAATG
GACGATAAAC
TGGGTTTGCI
AAGGAGAAGP
CTAATGAAAA
CCATAGAATG
TTTTCAATAT
TGATGCGAAG
TTGTGGAAA
CTTTTTTGAG
ACGCTCAAAC
AGGGCTATGA
TTGACGCCTT
AAGATCTGTA
TCATAGATAT
CGCTATACTA
ATTACAAACG
GTTTAGCCAT
AAGCTTTTTC
AAGCAGAGAT
AATCCAACCC
TGAAAAA
ATGATGACTT
TCAAGGATTT
TGGAGCGGGC
ACACGCAAGA
AAGCCCAAAA
AAGTTTTAGG
ATATCCTTGC
CCCAGATCAC
ATTTCGCATC
AAACCCCTAG
GGGATTTTAA
ATAAAAAAGA
AAAAAGGATT
GCATGGCGTT
CTCCCATTCA
TTTTTCGGTT
*ACCGCCGAGC
*CGTGATAGAT
CACCTATTCC
GCTCACCCTT
CGACGCTA
CGAAAAAGAC
CCCTATCATT
TTTAAACGCC
ACGCACGATC
TTTATTAGAA
TGGCACCCA
TGTCGCCAA
AATCCTTTTA
TGAAGCGGCT
TAACGCCAAA
AAACTATCAA
TGATTATCTT
CCAAATGAAT
GAAAGCTAAJ\
TAAAAACGCC
TTCTGTCGTT
AAAAATCGCC
GGCTTTAGAG
CATGCAAAAA
TAAAACCCCA
CGCCTTTGCA
GCTCALAGGAA
CGAGAAATTA
AAATTCCACT
GTTTTATGAC
GGCCCTCAAT
GGTTTATTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040
AAATTATTAG
AAACTCCAAG
GACGCTTACA
AACCGCAGGC
GATTTAACCA
CAAAAAGATC
AACAAGGAGT
AGAATGAAA
ACGCTTATAA
GAACCACCAA
TTTCTTTAGA
ATCAAAAAGC
AAACAAACGC
CA
AGACCCTAAA
GGATTATTCT
AGATTATTCA
AGATCACCAA
AAAATCTAAA
ATOGCAAA.AT
AGCGTTAAAA
TACACGCCCT
AAAGCGTTAG
AAAGCCTTAT
GCCAGTTTAG
TTATGCGAAC
TTTATGCCAC
TTGGCGAATT
AAACGCTAGA
ACTTGCAATC
AAAAATGCGT
AGGGTTTAAjA
AAGCTTACTC
TGCCCTTATT
CAAACTCTTA
CAGCTTATTG
TCAATTAAAG
TTTATTCAAA
2100 2160 2220 2280 2340 2400 2412 INFORMATION FOR SEQ ID NO:178: SEQUENCE CHARACTERISTICS: LENGTH: 1146 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1146 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:178:
AAACACCCAA
TTCTTAAACG
CAAAACTTTT
CAGCCTTTAG
GACGCTAATC
CCACCCACTG
GAAAGCAACG
GAATTCGCAT
ATTTTAAAAC
AGCACCGCTG
AAAACAGAGC
CAAGCCAGAA
AGGCAATGCT
ACCACGCAAT
ACTTTTTATG
AATGAATTGA
TTAAACGATA
TTTAAAGAAA
CCTTTGAGTA
GAAGTT
ATCGTTTTTT
CGCCCATGCA
CTTACCCAGA
TAACTCCTAG
CCCCTTTAAA
AAAAAACGCT
AAAATAGGGA
GCGGGAAGTG
GCGTTGATAA
AAAATAAAAG
CTTTAGAAGA
GCTCTACAGA
ATCTGATAGA
TAGTGAAAGC
AAACCAGCGA
ATTTGAATGA
TAATCAAAGA
GGGTGTGCAT
TTGAAAACTC
CTTCAAAAGA
AAAACCACAA
GTCCAAACTA
CAAAGTAAGC
GCATTCTTCA
CCCTAACAAC
TAATGTGGAA
GGTTTATGAT
AGACAAAGAG
CGGTAAAATC
CCCTCAAACT
AAAGTGTAAA
AGAGCCTTTA
CATATATGAA
ATTGGCTTAT
AAAATTCATG
GAGCAGCGAA
GGCCTTAAAA
TCGTGTTGTG
ATTGAAAGAA
AATAAACCCA
GGCTCTAAAA
CCCACTAACG
CAAGATCAAG
ACCTCTAGTG
AAACAAGCGA
GATGAAAATT
ATTACAACAG
ATTACCCCCT
TTTGAAGCTA
AGGGCTAGAG
AAACAAGCGT
CGCCCCAAAC
TCTTCCACAC
GAATTTGTGG
TACAAAGAAT
ATAAAAAAGC
TGTGTCAAAA
CCGCACTCAA
GCTCATCGCA
ACTCTAAAAA
AAGTTAAAAC
AAAACAACCT
CTGATGCGAG
TTAGAGATCC
TACAAGCCTA
ACATTACCCC
ATACTAAAAT
AAAATAATTT
CGAGAAAAGA
GGGAGAGCGA
AAGACGATCA
GAAAAAGCGA
AAGTGTATGA
GGGTTAAAAA
AACCACGAGC
AGGGGAATTA
ACCCAATACC
ACAATCTCCT
CAGCCTTTTA
GCCAACAAAC
CTTTGTAGCG
TGAAAACAAT
TAATATCAA
TCGCCCAAGC
TTGCGATTAC
CTCTGTTCAT
CGCCATTCTC
CGGCACGACC
GTATGAAATC
AGTAGAGCCG
AATAACGCGT
GGGTCATTAT
CCATGTGCGC
CAAAAGCACG
TTTATTCAAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1146
INFORMATION FOR SEQ ID NO:179: SEQUENCE CHARACTERISTICS: LENGTH: 315 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:179:
ACCAAAAAGT
AAAATCATTT
ATGCCTATTT
TATAGAAGCA
GGGGGGGTTG
ATGCCACGCT
TAAACAATAC TCTATTCAAT AAAGGATTAA TTATTTTTAA AATGTTTAAA TTTTGTGCGT TTTTTTGATA GGGGGATTTG TCATTCCACC CCTTGAAGCC TGCGCAATAA AACCCCCAAA AAAAATTACC AAGAAGCCCA TGAAAAGCTC TCATTAACCG CCAAAAGCTC ACGCGTAAAA AAAGCGGGTG GTATTTTTTA GCGCTGTAGA AGCCATTAAG GACTATCAAG GCAAGGAAAT GAAAGATTGG
CAATT
INFORMATION FOR SEQ ID NO:180: SEQUENCE CHARACTERISTICS: LENGTH: 207 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:180:
AGTAAAACTT
TTTCCATTAT
CTCGTAAAAC
CACCATTCAA
CCTATTGCAA TTCTTACAAG CGTAAGCTCC TCCCCTACCA CTTTCTACTT GGGCGTGTTT TCACAATGTA AGCAGACCAT TTTCATTGAG CCATTTATTT GCCCTTACAG TCGCCTTTAT TTACATAATC AGAGCAACCA TAAAATGAGA TCCTCTTTGT ATCATTT
INFORMATION FOR SEQ ID NO:181: SEQUENCE CHARACTERISTICS: LENGTH: 1374 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1374 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:181:
CAAGATAAGG
AACATTTTAA
ATGGACTTTA
CTCAATCAAA
ATGGAGACTA
GAAATCTTAG
TTTAGAAATA
TTGACAAAAC
TTAGAGCTTA
TCTCAACTCA
ATTTCTGGGG
AGCGTGATCC
CATTCTTTGG
GAATTCACTT
AATTCCTTGC
AATGGGAATT
GTGGGCTTGA
GCTTACCAAC
ATGGGGTTAG
TTGGGTAGGG
AACACTTTAT
GGCGAAATGC
TTGCAATACC
GGTTATTATT
GCCTTTCTGT
CCCCTCCCAC
TTGAAGCTGT
AAGAAAACCC
GCGTAACAAA
ACGCTACCAG
TCTCTGATTT
AAAACAAGCG
GCAAACACCC
GCAATGGCAC
TTGGGGGTTA
CTAATAATGT
TGAGCGCGAA
TCTCTGTGTT
ACGGCTATGA
GCTATCATTT
AATTCGTCAT
AGAGCCGTAA
ATCTTTTGAT
TGTACCGCAA
ATTTGTGGCG
AAGATCTTAA
GAGTGTAGCA
CCTTCACAAC
CTTACAGGAT
TGGGGGGAAT
GCTTTTTGCG
AGATCTTCAA
CCTTTTAGAA
TAGGGCTAGA
TTTTAGCGAT
CAATAACCTT
GCTTTATGGC
TGTGGCTTAT
GGATGTGGGG
TGAAACTTAT
GAACCAACGC
TTTCATGTTC
CATAGGCTTG
GCATTCAAAC
ATATTTTGGT
CAAAGCTAAA
GGGGGAAATT
TTTGATGTAT
TATCACTGGG
CTACCTAACT
CAGATTAAAA
TACATTGTGG
AACGCTATCA
CCGATTTATT
AACACCGCAA
ATGGCGAGTT
GAGGGAGAGT
CCTAACCCTA
TGGATTCAAG
TTGAATGTGG
GGCTATAGCG
ATGTATGCGA
GGAGGCAATG
TACAACTACA
AAACAAAAAA
AGCGGGATGA
CCTTCTAACG
AAAAATTCCT
GGCGACAATG
TTTAACACTT
GTGAATGCGG
AATGTGGGCA
CAAACAACGC
TGTCTTATGG
GCATTCAAGG
AGTGGCTTTC
TAGAAAACCA
GCTTGATTTC
ACACCCAACA
CCAATTTTTC
GTGAGGTTTT
GGGTGGGAGG
GCTATGACCG
GTTTTAACGG
GGGCTTTTTT
CGAGTCATAT
ACACCTGGAC
GCGTGGTGCT
AAGGTAAAAT
AATCGGTTTT
ATTATTTTGT
TGGTGCGTTT
TTGCGAGCGT
GGGTGGGGCT
TGCGAGTGGC
CTCTCAAAAC
TAATAAAGTG
ACAAAGCGCA
AACATTGATG
CTCTTTAAAT
TAACCCTAAT
AACCAGCCGT
AGAGCGCTTG
TGTCAAATAC
AGCGAGCTTT
ATTGGTTAAA
GAACATCATG
GAAAAGAAAC
CAATTCTTCT
AACGAGCGTG
AAAACCTCAA
GCAAAATCCA
AACGCTCAIAC
ALACGGCGAGG
TGTGGGTGAA
GATCACAGGA
TAAAATGGGC
GTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1374
INFORMATION FOR SEQ ID NO:182: (i) SEQUENCE CHARACTERISTICS: LENGTH: 558 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...558 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:182:
CCCTTTGAAG
AGGAAAATTT
GGCCGTGTTT
TTGCCCACTA
ATTTCCTCAG
ATCCCGGACG
GAAGATTGTT
TCTAAAGAAT
ATAGACACCA
CACGAAAGAA
CCTTGAATTT
GCGGGAACAI\
TAAGGGACTC
ACATGCCTGA
CGAAGGCTTT
CATTCATTGT
TCAAAGAAGA
GGGGGAATAT
TGTTAAGCCT
TGCGACGC
TTACTCCAAA
GCCTTTGGGG
TTGTGAGGCG
ATTCGCTAAG
AAAAATCCTT
GGAAGGGCCT
ATTCCAATTA
CCCTATCATC
TGGAGCGAGT
AAAGCGTTGA
GCGAATATTT
GGGGCGAATA
GATTTTAGCG
TGTAAAAGAT
TTGAGTGGGG
GAAAACTTAG
GCCGCGGGGG
GGGGTGCAA
ATGAGATTTT
TATACGCTAT
TTATCATTAC
ATGTGGCGCT
OGAGCGATCG
GGCATCAOGG
TGCCTAAAGT
GAATTTGGGA
TGGCGATCTC
TGCAAACGCT
CAATGACTAT
AGGGGCTGGT
CATCCCTATC
CTATAAAAGA
CTTTAAATAC
CGTGGAJAGCT
TAGGAAAGAT
GTTTTTTACG
120 180 240 300 360 420 480 540 558
INFORMATION FOR SEQ ID NO:183: SEQUENCE CHARACTERISTICS: LENGTH: 264 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:183:
GCTATCCGGC
CCTAAAATCG
AAGGTGGGCT
GGGCTTTATT
TTGATTAAAG
TTAGGCTATC AATACGGGGG TTGATCAAAC GCATTGAAGA
AGGTAACGCG
CATGCGTGAG CAATTGTGTA ACGCCTTGTA ACAGGGGTGA
AGAAACTAAA
ATTGTATCGC TGATGGTTTG GGGCGCAGTT ATTTAGGAAA
CAGAGAAGAG
TTACCGGGGC TAATGGCTAT AGAGTGGATA AGATTATCAG
CGTGCATGAA
AGCTTACAGA
GGGT
120 180 240 264
INFORMATION FOR SEQ ID NO:184: SEQUENCE CHARACTERISTICS: LENGTH: 852 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...852 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:184:
AATTTCAATC
AAAGGGCTTT
GCGCTTAATG
CAGAGAGCTT
AAAAAGGTGG
TTTCGCCTAT
GGGTATCAT
TAGGGAGCAT
AATAAGAGTC
AGATTATTTT
TCTAATGGTC
TTGGCGTTCA
TGGTTTGGCT
TCAGGTGGGG
GAATCGCAGT
CTATCATCAT
TATAATGGGG
ATAAGTTTTT
GACAGATTGA
GCGATGCGTT
GTGAATTTAA
ACGGCTTCCA
AACCAATTTG
GCTTTGGGAT
AATAACATTA
GCAATAATAG
ATCAACAGCT
TGTTGTATCA
ACTTATAGTT
TT
TTTGAAACGA
TAAAGATGTT
CAAACGCTAT
TCAGCACAAC
CAGCGATAGT
AGTGGGCTAT
GCTGTTTTAT
TTCCACTTAC
TGCCGGGTTC
ATTGTATCA
GTTTTTGGTG
TAAAATCCCT
TGAAAATGTC
AGTTGATTTC
GCGTTATGGT
TTGACTAAAG
GCGGATAGAT
TCTAGCAACT
CTAAGCCCTG
AAGTGGGTGG
GATTTGAGCG
GGCACTTATA
AATCTGGGGA
ACTCTTCTTC
GATTTAGGGG
ACTTATTATT
TTAAAAGTTT
AGGCGCAATT
TAATTTTAGG
GGGATTACAC
CGGCGTTTTA
TGAATTTATC
TTTTTAAAAA
GTAAGCATGA
CTTCTCTTTA
TGGATTTATT
TCGCTTTTGC
AAAACACTTT
TTCGTTTAGG
TTAACCATTA
TACGATTTTT
ACTCGGTTTA
GCTTTTTTAC
TTTTTTTAAT
TCTGGGGCTT
CCAACGATTT
TTCGTATGTG
AGAGACGAAA
TGGCTCTCAA
GCTTAACGCT
TGGAGTGTAT
TGGCGGGAAA
GAATGAACAC
TTATTCCATG
AGAATACGGG
TTTCAACTAC
120 180 240 300 360 420 480 540 600 660 720 780 840
INFORMATION FOR SEQ ID NO:185: SEQUENCE CHARACTERISTICS: LENGTH: 198 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:185:
AAAATGCACA
AGGCACAATG
ACGCATGGCA
CGAGTTTTTT CCAAATCCCT TTGAATTTTG GGGTTCGTGT
GAATGTGGAT
GCTTTGAAAT GGGCTTAAAG ATCCCTTTAG CGGTCAATTC
CTTTTATGAA
AGGGGTTAAA CGCTTCCCTC TTTTTCAAAC GCCTTGTCAT
GTTTAACGTG
120 180
AGTTATGTTT ATAGTTTT
INFORMATION FOR SEQ ID NO:186: SEQUENCE CHARACTERISTICS: LENGTH: 414 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...414 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:186:
ATCAAACGCA TCATTAAATC ACCCCAAGTC ATCTTTTTTT CATTCAACCA TTCAACCATT CAATCATTCA ATCATTCAAC TTTTACAAAA ACCTATTTAA TTTATTAACC CTTTTATTAA TTTATTAACC CTTTTATTAA AAACGCATCA TTAAATCAAT TAAATACCAC TAGATACAAT CAAAAAAGGG GTAGGAATGG CAACCATTCA ACCATTCAAC CAACCATTCA ACCATTCAAC CATTCAATCA TTCAATCATT CATTCAAGCA ACGCTACCTT ATTTTTATAA CTATTTATCT AAATCCCCTA TTTTTTATTA TTCCCCCTTT TATTAACCCT CCCTTTTATT AACCCTTTTA TTAACCCTTT TATTAACCCT CCCTTTTATT AACCCTTTTA TTAACCCTTT CATA
INFORMATION FOR SEQ ID NO:187: SEQUENCE CHARACTERISTICS: LENGTH: 579 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...579 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:187:
CGCTACCACC ACTTCTCCGG CCTATCAAGC TGTGGCTTTA GCGCTCAATG CGGCTGTGGG
CATGTGGCAA
CATTTAGAAA
GGTAGCGGpjA
CTATCTAGCA
AACCAAAACC
ATCAATCA,
AATTATTATT
GCTGAAAACC
CATGTGAATG
GTCATAGCCT
ATGGGGGCGT
CGACCACCAC
GCGAATACCA
AAGGAGGCGG
CTTTCAC.%AA
CAGGCGGTTC
TTTTGCAACA
GTGGTGGTGG
TTGGCATCAG
TCGATCGTTT
CACTTGTAAT
GGTTCTCAAT
GATGCCTGCC
AAACCCTACC
ATCAATCCCA
AGCCGCTACT
GGTCATGGGG
CTGTGGCCCT
GACAACACGC
GGAGCCAGTA
ACCGCTTATC
TTGAATAGCT
ACAGAATACA
ATCCAGCTAA
ATCATCAATG
GTTTGGCGG
GGCCCAATCT
CAAACTACAG
ATGTAGGGCC
AAACTATCCA
CCAAAAATAT
CTTACCCCGA
AGATTAGTAG
TCCTTACCAC
TGGCCCAGAA
CTACAACACC
CAATGGTATC
AACCGCTTTA
GGTAGTCAAT
TGGGAATGGC
CGTCAATGAC
CCAAAACCCG
120 180 240 300 360 420 480 540 579
INFORMATION FOR SEQ ID NO:188: SEQUENCE CHARACTERISTICS: LENGTH: 1254 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1254 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:188:
ATGGTGGTGG
GGCGATAGTT
ACCCAACAGC
TACACTTCTA
GAGATTTTGA
CAACAAGATC
GGTTCAGGTT
TATGGCAACC
GCCCTTAGCA
AACGCTAAGT
TTGCTCACTT
TTAGGCAAAA
ATGAACGGCA
GGGTTAAGGT
AACTCGGCTT
AACGATAAAA
GGCTTTGCGT
AATGGCATTT
TTGAGAATGA
ATGGAATTGG
GAACTCAAAT
TGGGGTCATG
TTAACGCTAT
TTAACGCTAA
AAGACACGCA
GCTTAGCCCA
TAGAAGAATG
GCGCGTTTGT
AGGTCAATCA
CTTTAGGGAA
CCCTTCAAAA
ATTCTTTGGA
ACCCCTTTAG
TCGGCGTGCA
ATTATGGTTT
CTGATGTGTG
ACACCAACTT
TAGCCGGGAC
ATAACGCTAA
ACCTCGCTAG
GCGTGAAAAT
ACAGAAGGCT
GGGGTTTGGC
TAACGAAATG
TGAAAACACC
GTTCGCTCAA
ACAAGTAGCG
CACCGCAGGA
GAAAGAGACT
GGATAGGGCT
CGACTCAAAA
CATGACGCAT
TACCAGCAAA
GCGCATCGGC
AGCGGGCTAT
CTTTGATTAT
GACTTATGGG
TTTAGGCAAA
TTCGTGGCTT
TGTCAGCGCT
GCCCAAGAAA
CCCCACCATT
CTATAGCGTG
GGTAAGACCG GGAATGTGAT
ATCAAAAACG
CAAATCACGC
GAAATGCTCA
GACAATTTCC
TCAGCCGGTG
CTCAATTCCT
TTGTCTCAAA
GCGATCAATA
GCCACTCAAA
TACAACCAGC
GTGATTAACT
AAGCAATTCT
AACCATGCTT
GTGGGTATGG
AATAACAAGC
AATTCCCAAC
TCTAACTTCC
AAAGACAGCG
AACACGGATT
TATCTCAATT
CTCAAGCCGT
AACCAGACAA
ATAGAGCTAA
ACAGCATTCA
TGATTAACGA
TAGAGCAACA
CCATTTTGAA
GCGGTATCTC
ACCCTAATTC
TCCAAACTGT
ATCAAAACAA
TTGGCAAAAA
ATATCAAATC
ACGCGCTTTA
TTTCTGTGGG
AAGTGAATTT
AATTTTTGTT
ATCATGCCGC
ATTATTCTTT
ATGTGTTTGC
GGATATTTTT
TTTAGAAAAA
TTTCAACCCC
CGCTCAAGCA
AGGGCCTATC
CAACACTTAT
CACCGCTTAT
TTTTAAAGAA
TAACTTGCCT
CCCAGAAGGT
TGCGCAAGAA
TAACGGGGCG
AAGGAATTGG
TAATTTTTTT
TAACTTCATC
GCTTTTTGGT
GACCATGATG
TGATTTAGGC
TCAGCATGGC
CATGGGGGCT
TTAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1254
INFORMATION FOR SEQ ID NO:189: SEQUENCE CHARACTERISTICS: LENGTH: 489 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...489 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:189:
GCTTTATCGC GCTGTCTTAG CGATCGTTTT AATCAGTATC CTGCCCTACA AAAAACCGAA
CACCATTTCG TGGATTTTTT AAACCAGGAC AAGCATTACG CCATTATCCA AAGAGCGGAT 120
AAAAGCATTT CCAGTAATGA AGCGTTGGCT CGTTCGCTCA TTGGGGCGTA TGTGTTAAAC 180
CGAGAGAGCA TTAATCGCAT TGACGATAAA TCCCGCTATG AACTGGTGCG CTTGCAAAGC 240
AGCTCTAAAG TGTGGCAACG CTTTGAAGAT TTGATTAAAG CCCAAAACAG CATCTATGTG 300
CAAAGCCATT TAGAAAGAGA AGTCCATATC GTCAATATTG CGATCTATCA GCAAGACAAT 360
AACCCCATTG CGAGCGTGTC CATTGCGGCT AAACTTTTGA ATGAAAACAA GCTCGTGTAT 420
GAAAAGCGTT ATAAAATCGT ATTGAGCTAT TTGTTTGACA CCCCGGATTT TGATTACGCT 480
TCCATGCCC 489
INFORMATION FOR SEQ ID NO:190: SEQUENCE CHARACTERISTICS: LENGTH: 582 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...582 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:190:
TCCTTTAAAA AAAATGAAAG GTTATTGTTG CTTGTTTCTC GCTTTTTAAA CGCCATTGAT
CCTTTTAATT TAGGGGTGTT GTTGAGCCGT TTCCAAATTA AAAATGGTTG TATTTATGGG 120
GTGTGTTCTT ACAAGGTTTC AAAATTTACC CCTGGATATG AAGAAAGCAA AGCGCGAGTT 180
TTAAACGCTC TCAATATTTT AAGCAAGCAT CAAATTTGGC ACAAAAGTCA AAGGAACTTT TGTTTTCATT
TTAGAAAATG
TCTTTTTATA AGAAACTTTT AAATCTGATT
ATAGACAATG
TTAGTAACTC CGAGCAATGG CACCAACTCC
CACCCTGAAT
AGAGAAGCCG CTAGGATACA AAGTTTTAGC
GATGATTATA
AGCGTTTGCA AGCAAATCGG TAACGCCGTG CCTCCTCTTC GCGATCTTA-. AAAGCGCAAG AAATGATACA AATCCATCAC
AATCCAATCA
ACCTGCATTT
ACTTTTTTAA
TGCACCGCTC
TCTTTTATGG
TAGCCCTAGC
C
AGAAAGCGTT
AGACGAAAAC
CCGCTCTCAT
TATCACGCCC
CAATAAAACG
CTTAGGCAAA
240 300 360 420 480 540 582
INFORMATION FOR SEQ ID NO:191: SEQUENCE CHARACTERISTICS: LENGTH: 495 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...495 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:191:
ATGCAGGATA
GAAATCAAAC
ACTAAATACA
AATATCTTTT
ATCCAAGCCC
ATACCGGCTG
CTTTACATTG
GATCACTTAA
CTAAAAAAGC
ATTTAGTGAG
CTTTAAAATC
ATATCCCGCT
TTAGCAATAA
TTAACGCTAC
ATTATGCGAT
TTTCTGATCC
AAGAAAACAC
GGCTT
CGTGATTGAA
CAGCCAGGAT
TGTAGTGAGT
AAGCGATGAT
CCAGCAAAAC
AGAGTTGCCC
CATGTGCCCG
TGTGAGAATG
AAACAAACCA
TTAAAAATGG
AAGGATGGTA
GTGAAATTAG
AGCGCGAAAT
TCTACTAACG
CATTGCCAAA
GTTGTAGTGG
ATAAAAAGGT GCGTATGTTA TCGTTATTGA AGATCCGGAC ATTCAGTCAT AGGGCTTAGC TTGCAGAAAC CAATCAAAAA TGAACGCTAT TTTTAATGAA CTGAAAATAA
GGATAAAATC
AAGAGCTCAC
TAAACTCAGG
GGTGGCTTGA AGCAATTCGG
120 180 240 300 360 420 480 495
INFORMATION FOR SEQ ID NO:192: SEQUENCE CHARACTERISTICS: LENGTH: 345 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...345 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:192:
ACGGGAGCAA
TTGGTGTTGG
ATGAGGCGTT
CGTTTGAGAG
GACAGCTCAT
TCCATGGATA
TCATGTTATC
TGTGCTTGGT
TAGAAAAAAC
AATTAGAGGG
TAAAAACGAC
AAGAGCGCGA
TTCTAATGAT
GGGGTATTTG
TTTAGATGAA
GCGTTTGGAA
TCTTTCGCAC
TTACTTAGAA
TTGTTTATGG
TATCTTAAAG
TCCTATCAAG
GGCCTTTCTT
CTTTATAACC
GAAAAAATCA
TCGTTTTAGG
AAAAAGAGTT
AAAATTATCT
TAGAAAAAAG
AGTTGCAAGA
TTACT
GGCGATTTTA
TTACCATAAA
CTATTCTAAG
CGCTAAAGAG
AATCCAAAAA
INFORMATION FOR SEQ ID NO:193: SEQUENCE CHARACTERISTICS: LENGTH: 972 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...972 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:193:
GAAAGGAAAA
TTGCAAGCTA
TCTCAAGAAC
AAAAAGCAAC
GAAGTTAAAG
TTTTTCTTAG
ATTTCTAATG
AATATGGGAC
AAAATCTCAA
TACAAATCTA
TTGTTTGATT
TTTGGCTGGC
GAGTTTATCA
CTACATTATT
AACTATCAGT
GTCTTGTTTT
GCTTATTTAT
AAATGAACAC
ATGAGGGCTT
AGAAATTAGA
AATCAGCTCC
AGCAAAAAAA
GCTCAGAAAT
CTACAGATGA
TATTAATAGG
GCCATTTGTA
AAGTTAGCAC
TTTTAAATGT
GTATGCAATA
CTAATAAACC
ACTTAAACCA
CAGTTGTGCC
CTAATTTTCT
TT
TAATTTAAAG
TTTAGCCATT
ACACAAAATG
TTTACAAAGT
GTCTCAAAAA
TGCTATTACA
TTTACTCACT
TTATCAGCAT
TGGAGGCACA
CAATTACACG
CATTAAAAAG
TTATTTTGCA
TAAAGATATT
TCATCAATTT
TAGCAAAATT
TACCAAAGCG
AAAGCCTTGT
ATGCCTGATG
GAGCAAAAAA
GCTGAGTTGT
CACCTAGTAG
GGCACAAGTT
TATAGCAAGA
TATTTTGGAG
CCTAGTGATG
CCTTTAAATT
AACCGCCATA
AAAATAGACC
TTTAATCATG
GAAGTGAATT
GTCGTTCAAA
ACTAATTCTA
TTTTAACTAC
AAAATAAACA
TTGCTCCACA
TACAAAGTCA
CTATGCAAAC
TAAAACAAAG
CTTCTAAAAA
CTAGTAAAAG
TAAGCACAGA
TTGGCATAGA
CACTAGGATT
CCATAAAAAC
GTTTTTATCC
ATCGCTTTGG
ACGGGAATTT
GCTACATAGC
TCTTGGTTAT
AGAAGACATT
AAATACCCTT
AGTTAAAAGA
TAAAAATGGC
TTTAAGCCTT
CGCTAATTTT
GCATGGGATT
CATAAAAATT
TGTGAAGTAT
AAGTGTGGGC
CAGCCATAGT
CACTATTGGA
AGGAGCAATT
TAAGGGCGAG
GTTCAACTAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 972
INFORMATION FOR SEQ ID NO:194: SEQUENCE CHARACTERISTICS: LENGTH: 411 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...411 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:194:
GCTAACGAAA
GATTTACAAC
TTTAAAGAGC
GAATTAACCA
ACCTTTTTAG
CCTATTTTTT
CTTTCTGTGC
AAATGGAGTT
AACATTCAAA
AATTTGAGAT
AAAAAACCAA
TTAGAGATTT
GCGGTGATGT
TTGAGTTGGA
AATTAAGAAA
CGAGCTTTTT
TATGTTTAAA
ATTTGATGGC
TTTTAATGGG
TAAATGCGAA
AGAAACGATA
TTGGAAAAAG
AAAATGTTGA
GCATGGGTTG
GAAATGATTG
ATTTTTAAAT
GATTTTAATG
AATCCTAATA
AAAGCGAAGT
TTATTGATAA
AAATCGTAAA
GCTACACAGA
CCAAAGTAAT
CCCTAAGAAG
AAATCCCATT
TTTAAAGAAA
TGAAGATTTG
AATGATGTTT
AGAACTTTTA
ACCTAAAATG
TTTAGTTTAT
T
120 180 240 300 360 411
INFORMATION FOR SEQ ID NO:195: SEQUENCE CHARACTERISTICS: LENGTH: 681 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...681 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:195:
AATAAGGAAA
ACGCTTTGTT
GATACAGATA
CCCACTTTTA
TCTAAGTATG
ATAACATAC CATTTGCGTT GCTAATGAAA AGGGTGGCAG TAAACTTAGC CGTGCAATTA CTCAAAGACA ATAAAGAAGT GCCAAAAGTC TATGGAAACT TTTGCTACAA TTCGTGCTGA
GCTTATTTAA TCGTAGTAGT GGCTTTAGCG ATACTTTAA AAAATATCCT TATTGATACT AAGGGGGAAT ACAGCAAAGA
TGGTAAAAGC
GGTTGTTTTA
AAAAGAACGC
GCAAATGGTA
AACCCAAAAA
GCTATGCTTT TAAGTAATAT TGTGCTAGTG CCAACAACTC CTAGCCAATT AGACACTGAA 360
GTCTTAGCTA ATATGCTAGA AAGAATTGAG CAACTCCAAG AGCTTAATGA AAATCTAAGA 420
GCCTTAATTG TCATCAATAG AATGCCTACT ATTCCTACCC TTAAAGAAAG ACAAGCCTTA 480
ATAGAGTTTA TTAAAGAAAA TAACCCTAGC GATAGGATTA CACTTTTAGA AAGCTCTTTG 540
AGTGAGCGCA TTGTTTATAA GCGCAGTGTA AGCGAAGGCT TAGGGGTCAT AGAATACAGC 600
GATAAAAAGG CTATCAATGA GTGGGTTAAT TTTTATAACG AATTAAAAAG CCATTTAGAA 660
AAAGAGAAAA TACATGCGTT T 681
INFORMATION FOR SEQ ID NO:196: SEQUENCE CHARACTERISTICS: LENGTH: 315 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:196:
CGTTCAACTA CGCTTATTTA TTTTAATTTT AAGGAGCCAA TGATGTCAGA TGAAATCACG
CAAGAAAACG AGTTAGAAAT TAACTCCAAT AATCAAAACC AAGAGCCAAA GGAAGTAGAA 120
AAAATGCCAT TGAATAATAT TCAAAAAGCT AAGAAATTAA AAAACCACGC CAATTTAATT 180
GTTCGACGCA CTGATGAGTT AGATAAGGTT ATCAATAAGC GTGAAAGCTT GCAAAGAGAG 240
TTTAAAAGAC GCATTAAGCA CTTGGATAAC AAGATTGAAA CGCTTAGTAA CAATATTGAA 300
GAATTAAAAA GAAAG 315
INFORMATION FOR SEQ ID NO:197: SEQUENCE CHARACTERISTICS: LENGTH: 519 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...519 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:197:
TCTATGAAAA
AATGAATGTA
TGTATAGAAA
TTTTATGA
GCTAATGGTA
AAGCCCTTTT
CTTAATAAAA
AAAGAACAAA
GAAAAACAAT
AATATCAAAA
AGGACATTTT
GCCTTTTATT
GTTTGTTTAA
ATAACATTAA
ATTTGCAGAA
ATGTAGAAGA
AAAGCCAAAC
CTACTACTGA
CAAACGAAAG
AGAAAATTAT
AGCCCTAGAC
AGTCATCAGT
CCAAATAGCC
TTTAGCATTC
ATGTAAAAAA
AGAATTTAI\C
GCCAAAAAAG
CTOCGTAAGC
TTAAAAGTCA
CCTAGCAAAG
CTTAACCAAC
AAGAACACCA
TTAGALAGAGA
ATCTCTTTGG
GCCATTTCAA
GACAACGCA
GTATCAATTG
AAAAAATTAG
AAACACTAAG
ATGTCTATAA
ACATTGCCAT
TTTTTAAGGT
AATTATTACT
AAATTCTCAA
CTTTATTTCA
CGTTACTGAG
AGCAAGGGAT
AOAACTCATG
TAAGTATAAC
TATGGATACA
TCAAATTTAC
CGCTCTTAAT
120 180 240 300 360 420 480 519
INFORMATION FOR SEQ ID NO:198: SEQUENCE CHARACTERISTICS: LENGTH: 327 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...327 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:198:
AAATCACCTT ATCGCCATAG TATATCTTCA CAGGTTTCTG TAGAAAACGA
TGAGGAAATT
ATTGAAGTTG AAAAGGGCGA AAATCAAGGG GCGTTTTCTT ACTTTTTAGG
CGGCCCCACT
TGTTTGGCGG GGGATTTTAT GGGGAGTTTT AGCTTTGAAA CCCCTTTAAA
AAGGGGCGAT
AAAATCGTGT TTCAAGACAT GCTCCATTAT ACGATCGTGA AAAACAACTC
GTTTAATGGC
GTGCCACTGC CAAGCCTGGC TAAAATAGAT TCGCAAGGTT TTAAGATCCT
CAAAAGCTTT
TCTTATGAAG ATTATAAA
CAGGAAT
120 180 240 300 327
INFORMATION FOR SEQ ID NO:199: (i) SEQUENCE CHARACTERISTICS: LENGTH: 399 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...399 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:199:
GCTTTAGAGA AGAATGGGAC TGCTACTGCT AATAGCACTA GGTTCAGATG GTCAAACTTA CTCTCAACAA GCTATTCAAT ATCTTAAATA ACGCAGCGAA CTTGCTCAAG CAAGATGAAT TCTGCCGTAG CTGCTAACAT TGGGAATAAG GAATTCAATT GTGCAAGGCA TTATTGATCA ATCTCAATTG GTTTATAACG AGCGGGAGCG CGGTTAATAA CGCTGGGATA AACTCCAACC GTGCTAGTTA GCTCCCTAAC GCTCTTTACA ACGTGCAAG GTAGCACTAG CGGTGCGACT ACCTTCAAGG
CCAACAAAAT
TGCTCCTAGA AGCTTTCAAC CAGCCGCTTT TACAGGTTTG AGCTCACTAA AAACACCATT AAGCTAACGC GTGCAAGGGC
120 180 240 300 360 399
INFORMATION FOR SEQ ID NO:200: SEQUENCE CHARACTERISTICS: LENGTH: 252 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:200:
CTTATAAAAG AAGCGATGAT TCCAGTTTTT GGGAGTGAAA CAGGGATTTA TAACCATAAG GAGCAAAATT TTAAAGGCAA GGGCCGTTTC ATTCTCACTT CAAAGGACAG CAAGGTTGAA GGGCTTGACA TTTCTTATTC GCATGCATTA GCTATTATTG AAGCTCAAAG CATTCAAGCC AATTTGTTTT TAGATGAAAT CAAACAAAGC CAGAAAGAAA AGAAAAAATT CCCCACTTTC AAAGGAGGGT TT
120 180 240 252
INFORMATION FOR SEQ ID NO:201: SEQUENCE CHARACTERISTICS: LENGTH: 483 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...483 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:201:
CCCTTTCTAA AAAAGAATAA ATTTTTACAA ATTTGTCAAT ATTTTAGCGC GCATTTCAAG
CAAGTGTTAA AAAATGAAAA GCCCTTAGTT TATTATGGGG TTTTAAAGGC TAAAGCCCCT 120
AATTGGGCTT TATGGGTTTA TGAAAAACCC TTAAAAAAAG AAATTTACAT GAACGATAAG 180
GAAGTGGTGG TTTATGAGCC TAATTTATTC CAAGCGACTA TCACGCCCTT AAAAGACAAG 240
ACGGATTTTT TCACCATTCT CAAGCAATTG AAAAAGCAGA CTGACGGCTC TTTTAAAACG 300
ACTATCAACA AGACCACTTA TCGTTTGGTT TTTAAAGACG GCAAGCCTTT TTCGTTGGAA 360
TTTAAAGATG ACATGAATAA TCTTGTTACG ATCACTTTTT CTCAAGCAGA AATCAACCCT 420
AAAATTCCTA ATGAAATCTT TGTTTTTAAC CCTAAAGATG AAAATATTGA TATTGTACGC 480
CAG 483
INFORMATION FOR SEQ ID NO:202: SEQUENCE CHARACTERISTICS: LENGTH: 213 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:202:
CCTAGCTTTT TGAAAGAAAA ATTTGATTTT TTTAAAGGCA AAAATTTTAA AATTGTCTAT
TGTATTGGCG AAGATTTAAC GACCAGAGAA AAGGGTTTTA GGGCTGTAAA GGAATTTTTA 120
AGCGAGCAAT TAGAAAATAT TGATCTTAAT TATTCCAATT TAATTGTGGC TTATGAGCCT 180
ATTTGGGCGA TTGGCACTAA AAAAGCGCGC TTT 213
INFORMATION FOR SEQ ID NO:203: SEQUENCE CHARACTERISTICS: LENGTH: 465 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...465 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:203:
CGCGATGTTA GATTCCATGT GAAGTTTCAG ATGTGATTAA ACCGAAATCG CGCGGCAGTT AGTGGAGGGA GTGGIATTGA GAGAGCGAAG AGTATAATCA TTATCTTTAA GTGGTAAAGT TTAGACTACG ACTTCACCCT GATGTTAAGC CTATTGTGAA
TAGGGACCTA
CGACACCACT
GCGATTGCGA
AGCAGACAGC
AGACACCACT
ATCCAGTATC
AAGCCTTACC
AAATGCTAGC
GTTCGGCAAC
CAGCCCAATT
TCTAATGGGA
AGAATGGTGA
GTAGAAAAAG
GCAGCCTCTA
AATAGGAAAA
AATAAGCGTA
TCAAGGGCAA GCATTTGATT TGGACATGAA TCTTTTGACG GGTTCAATAT CACAAGGGCG AACAGCGCGA AAAAGAACGA GCACTTTAAA AGCCGCTGAT TTAGTAGTTC TAGGCAACGC CGGGTGAAGA GGTATGGAGC
TGTTT
120 180 240 300 360 420 465
INFORMATION FOR SEQ ID NO:204: SEQUENCE CHARACTERISTICS: LENGTH: 657 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...657 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:204:
ATTAAATCTG TTTTAGGTTT TATATGTTAG TTACAAAACT GAGGTTGATG AACACTTTGA TTTTGGCCAA AAGATTTTAC GTGAAAGACT TCCACGAAAA GTGCATTTCG CATGGAAAAA CCTATGGTGG CTGATATCAC GCGATCGCTT TGAGAGGTGC
AATTTTGATC
TGCCCCCGAT
GCTTTCTAAA
TTTTGTATGC
AGGCTTTAAT
CACCCCTGTG
TAAGAGCATT
TTTTTTGATT
CAACAAAATT
TTTAAAGCGC
AATTTAGGCA
CCTACAGAAA
GTGATTGGCG
GAAAAAGGCG
TCTAGAGACT
GATAAAAACA
TTAAAACACT TAAGGAGTTG CTGCCGTTTT AGGAAACAAT AAAATGGTGT GATCCTTTTC TCATTGCGTT TGACAAAAGA TGTCTATTGA CAGCGAACAA GTATCGGTCA AGTGTCTTTC ATGATGTGCT GTTTGAAGAA TGAAAGTAAG ACACGCAGTG
120 180 240 300 360 420 480
ATCAATGACT TGCCATTAGG TAGGAATGCA GATGAAATGC TTCGCATGGT AGACGCTCTC 540
TTACACTTTG AAGAACATGG TGAAGTATGC CCAGCAGGTT GGAGAAAAGG CGATAAAGGC 600
ATGAAAGCAA CTCACCAAGG CGTTGCAGAG TATCTTAAAG AAAATTCCAT TAAGCTT 657
INFORMATION FOR SEQ ID NO:205: SEQUENCE CHARACTERISTICS: LENGTH: 339 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...339 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:205:
GGAGTAATAA TGGAACACCA TAAAGCGCAC ACAACCATTC AGGCTTTACA AGCCAAACGC
AAAAGGTTGC TAACCGAATT AGCCGAGTTA GAAGCAGAAA TAAAGGTGAG CAGCGAACGA 120
AGGAGCAGTT TTAACGTTTC GCTCTCGCCG AGTTTGTTAG CCGAAATAGA AGAGATAGAA 180
TACGAAGAAA AAATGAGCAA AGAGCGAAGA ATCCATCACA ATCTTTTGCT TTCGCCCAGT 240
TTCATGGCTA AAGTGGATGA ATACATGAAA GAGAAAGGTT TTCCTAATCG ATCGCTTCTC 300
TTTGAAAAAG CGTTGGAGTT TTATATGCTA AAACACCCA 339
INFORMATION FOR SEQ ID NO:206: SEQUENCE CHARACTERISTICS: LENGTH: 561 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...561 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:206:
GGTTTTGTTT
ATCAAGGGTT
GCGGCCAAAG
GAAATTAACG
AACCACGACC
ATTGAATACA
TTAGAAGATA
TTTGACTATG
AGAGTCAAALA
CAAAGGGGGG
GCGGTAGCGT
TTGTTAAAGA
ACACCCATGA
AAATGCTAGA
ATCAAGAAAG
ACAAGAGTAT
GGGTCATCCA
ATAAAAAAAG
ACGAAATCGT
GTCATTGGTT
TTTATTTCAA
GAGAATGCCT
AAACGCCCAT
GGCTAAAAAA
CGTGTTTTTA
TTATAAAAAT
TCCCACTTTA
TCAAAACCCT
TACTTTAGAA
TTAGGAGTTT
TTTGTAATGA
GATAGGGACG
CTTCATTTTC
GCTATCGCCT
AGTTCTCTTA
ACTTTACCCA
AAAAGCCCTC
GAGAGTTTGT
TTATGGGGTT
GATGCGTTCG
CTTATTGTGG
TTCAAGAAAA
CTTTAAATAA
ATTATAATTA
ATCCCACGCA
TAAACCCATG
TTTCTATGCT
AAAGAATAAA
TGGTTTAAAA
AATCAATCGT
AGCTTTATTC
TGAAAGTTTC
TGGGGGGCAT
TTCGGGTTAT
GGCTTTTATT
TCCTGCCGTT
INFORMATION FOR SEQ ID NO:207: SEQUENCE CHARACTERISTICS: LENGTH: 690 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...690 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:207:
AAGAGAGTTT
TGCGATGATG
CCTATCAGCT
TATCATTACT
GATTGCGATC
ATTAAAGAGG
GTTCGTAACG
AACGAftTGCG
CTTAAAGATA
AAAAAGAGGC
CGCCATGCCG
TTAGAGGTAT
GTTTTCTATG
GCTCTAAAGA
ATCCTTATGA
GTAATTATAC
ATATCTATGA
TTGTGATGTA
ATGGGGATTT
AGCCTTTTGA
AACACCATAT
GAAACGCTAT
ATTTGATAGG
ATCAAAAATT
CTTCCTGCCG
AGTGGTTTTA
AGTCATACTA
GCTTTCTTTT
CGCTAAAAAG
TTCGCGCATT
TGGGTTTTTA
AATTTGGCGC
TAAAGATAAG
TGTTTATGAC
CACAAGGATT
CCGCTTGCCT
TTCAAAGGGG
GAGTTTTGCA
AAGGATTGTC
ATCCCTAAAA
CTTTATAAAA
AATTTTGTGG
GACGCATGGG
TACAATGATG
GAAATGGTGC
GATTTGATCC
GAAGAAAGCA
GGGTCATTGG
AAAAGTTCCC
CGAGTTTGTG
ATGAGTGGGT
GTTTTTATAT
TGCGAGATTT
GGGATCATTG
AAAGTTATGA
AATGGCACTT
CTTTAGAAGA
TGCTAGATGA
TTTTAATGAT
CTCATTTATC
GCACCAACTC
AGTAAAAATT
TCCTAAGAAT
TGAAGTGTTT
GTTATTGTAT
AGTCCTAAAG
CCCTTTGGCT
ATTCAAAAAA
AAAGAGGATT
120 180 240 300 360 420 480 540 600 660 690
INFORMATION FOR SEQ ID NO:208: (i) SEQUENCE CHARACTERISTICS: LENGTH: 258 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...258 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:208:
CTAAGTTTAA ATAAAAAAAT GATGTTTGAA TTAACCAAAA AAACCAAATT TGATGGCGAA ATGATTGGCT ACACAGAAGA ACTTTTAACC TTTTTAGTTA GAGATTTTTT TAATGGGATT TTTAAATCCA AAGTAATACC TAAAATGCCT ATTTTTTGCG GTGATGTTAA ATGCGAAGAT TTTAATGCCC TAAGAAGTTT AGTTTATCTT TCTGTGCTTG AGTTGGAAGA AACGATAAAT CCTAATAAAA TCCCATTT
120 180 240 258
INFORMATION FOR SEQ ID NO:209: SEQUENCE CHARACTERISTICS: LENGTH: 492 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...492 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:209:
TTTTCAAGAA AGACAGACAA CAAAGGCGTT TAAGAGAGCA AAAGGTTTGC AACAAAATTT ACTTTAGATT TGCAAATCCC TACGCTCAAA TCTATCAAAT TCAGTGAGCG TGCTGATTAT TACTCTGATT TTAAGGATTA GTGGATTTCC CCCCATATCC AAGGAAGAAC AA
AAACGCTCAA
ACAACGCTTG
GAATCAATTC
TAAACAAGAT
TTTATATAAA
GATCACTAAA
TAACAAGAGC
TGGAGGAAAC
AAAGACGAGC
AAACAAAACC
ACGCAAAAAT
GGGGTTGATG
GGCTGGAGGG
GATGGAGAGT
GTGATGACCC
ATGATTTCTA
AAAAAAATGA ACAAGAAGAA AAGAAAACCA AGAGATGTTG TAGAAAGCGT TAAAAACAAG AAAAGGCTTA TCAAGAGTGG GCGTTTTTTA TCACAAGGCT TTGATTATAC CATTCTTAGC TTTTAGATGA TTTAAAGAAA TTAAAGTTAA TTTTACGACT
120 180 240 300 360 420 480 492
INFORMATION FOR SEQ ID NO:210: SEQUENCE CHARACTERISTICS: LENGTH: 204 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...204 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:210:
AGAGCGCGAT TACTTAGAAG AAAAAATCAT TACTTAGAAA ACAAATTTAA AGACATGGGG
CATTATGCCG CTAGCGATGA AGTCAACGAA AAACAGGTTT TGAAAATGTA TCAAGAAGGC 120
TATAGCGTGG ATTCTATTTC TAAAGAATTT AAAGTGAGTA AGGGCGAGGT GGAATTTATA 180
TTGAACATGG CAGGGTTAAA ATGG 204
INFORMATION FOR SEQ ID NO:211: SEQUENCE CHARACTERISTICS: LENGTH: 291 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...291 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:211:
ACGAAATTGG ATACAATCAG CTTAAAAAGG ATACAAATGG AAAAATTACC TAAAAAACGA
GTTTCTAAAA CCAAATCACA AAAACTTATC CATAGCCTAA CCACCCAAAA AAACAGAGCC 120
TTTCTCAAAA AAATCAGCGC TAATGAAATG CTTTTAGAGT TAGAAAAAGG GGCGTTTAAA 180
AAAAATGAAG CCTATTTTAT TTCTGATGAA GAAGATAAAA ATTACGTTTT AGTGCCGGAC 240
AATGTGATCT CTCTTTTGGC AGAAAACGCC AGAAAGGCTT TTGAAGCCAG G 291
INFORMATION FOR SEQ ID NO:212: SEQUENCE CHARACTERISTICS: LENGTH: 816 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...816 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:212:
AGGAGTATTA
GATGTCTTGA
GACTTCATAG
GATTTTTCTA
GATGATGTCA
CACTTTGTCA
AACGATGGGC
AATGATGCAA
TACTTCATGC
ATCAATAACA
AATTTTTTAG
ACGCATTTGA
GCTAAAAAGA
ATGCTCGCTC
AAATGAAACA
CGATTTTGGA
AAGAAGAAAT
ATTTCTTAGA
AAACAGAACA
AGTCATTTCT
AAGATCCCCA
GCTCGTTTAA
TAGAAACTAT
ACATAAGCGA
AATACTATGT
TCATGGACTC
AAGAGCTTTT
AAGAAAATCA
AAGTTTGCGC
TAGTTTTTCT
GGAAGGTGAA
AGACCATGTC
CCTTTATTCA
CAGTAATCAG
AAAAACATTA
AGCCAATAAT
CAGGCAGTCG
GGACATGCAG
TCAGAACkAA
CATTATTCCT
TGAAAAATAT
AAACAAAAAC
GAACAAAAAT
AATTACCTTT
ATCACTGAAC
AATGTGTTTT
GGTCTCATAG
GATTTAGACT
TCAAGATTGA
TCTTTTGTTT
TATAGGATTC
AGCGATATAG
ATATACCCAA
AATTGGATTC
TTTCAAAACA
AGCGAT
TATTGAAAAT
TTGAACTGAG
AAAACCTTAC
ATGAGAATGT
ATAGTCTTAA
TCCGCTTTTT
TTCCTCTTCA
CATTAGTTTA
TCAGATTACT
AGAATTTTTT
CCAATCATGC
AGATTGATAT
TTGATGAAGT
TTTGGAAAAT
AGAAGAATTG
CGCTCTTTAT
TTTGAATATA
CGCTAATCTT
TAAAGAAATA
AAGTGGGAAG
TGTTTATGCT
AGAAAAACCT
TGTTCAAGCA
CTATGACTTC
GAGCGTTGAA
AACAAACAAA
120 180 240 300 360 420 480 540 600 660 720 780 816
INFORMATION FOR SEQ ID NO:213: SEQUENCE CHARACTERISTICS: LENGTH: 687 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...687 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:213:
GTGAAAGGCG AGAAAAACGC TTGGTATCTG GGGATTAGCT ATCAAGTCGG TCAGGCTTCA
CAAAGTGTTA
AAAACCGACT
TTCGGGGAGA
GTGTTTGGAG
TGCGCGACCA
ATTGACACTT
GCTCAAATCG
CCTTATAAGC
GGGATTCGCA
ATCAATGTTT
CTTTATGTAG
AAAACCCCCC
ATCTGGCCGT
AAAGATGGTT
CGAACGCTTT
AAGTAGGGAC
TATACAATGT
CGGGTAACTC
ACACCTCCTA
CCCATATTGG
ATTATTTTAA
GGTATCGTTA
CAAAAGTAGT
TATGCAAGGC
TGGTGCGCGC
GACATCAGAT
TATGGGCAAT
CATTAATAAG
TTGGGGTAAT
TAGCCTTGAT
TCAGCATCAA
CCATGGGAAC
CAATTTC
GAATTTAACT
TTAGGGCTTA
TATTACGGCT
AATGGTGGGG
CTGTCTGACA
GAAGACGCGA
ACGACAGGGG
CCGGCGATTT
GAATTTGACT
TTGAGCTTCA
ATCCTAAGTT
CTGTGGGTTA
TTATGGATTA
TGTGCAAACT
TGTTCACTTA
GTTTTGGTTT
CCTTTTTGGA
TCCAGTTCCT
TTGGCGTGAA
CTTACCGCCG
CCCTGTGGGT
TAAGCAGTTT
TGGGCATGCC
CAATGAGCCA
TGGTGTGGGT
CTTTTTTGGG
AACTAAAAGC
TTTTAATTTA
GATTCCTACT
TCAATACAGC
120 180 240 300 360 420 480 540 600 660 687
INFORMATION FOR SEQ ID NO:214: (i) SEQUENCE CHARACTERISTICS: LENGTH: 195 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...195 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:214:
AGCGCTGTAG
TCGCTCTACC
ATTTTATGCG
CCTTTCAGTT
TGGAGAGCGT GGTTATTTTA AAGATTGACA TCAATCAGGG GCGTTCTTGC CCACGCCTAA AAGCGTTTCT TTAGTGAGGA ATCAAAGCGT AGCCTATGAA AAAACCAGCC GCTATGGATA GAAGTTAGCA CCAATTTAGG GCAAACGCTA
TAATT
INFORMATION FOR SEQ ID NO:215: (i) SEQUENCE CHARACTERISTICS: LENGTH: 1095 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1095 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:215:
CGCTTTAAAC
AAACCCACC-
AACAGCAATA
TTAAGCACTA
CAATTCATCC
CAGCAAGTCC
AATGCGATCA
GCCTATCAAA
TTTAOCGAGC
ACCAACGATC
TCTAGCGATG
AGTTTTGACA
ATCGGCGTGA
ACTTCAGAAG
GGGACAAGCC
CAAAATATTT
TCCAAACTAA
AATCAAAGCA
ACCGTATTGC
ACTTTAGGGG
GACAAGTCGT
ACCAACAACA
ACAATCAAAA
AAAATTTAAC
AAGCCATCGC
ACAACACCAC
GCACGATAGA
CTAAAAACTT
AAGGGCAAAA
CTTCTAGTGG
ATTCTTTAGT
ATTCTTTTAA.
AAAATTTGCA
CATGCAATAG
TAAGCCCTAC
AAGCGATGGT
GTGGGCCAAC
AAAAT
TGGGGGTTTT
ATTCTATCCA
ATACAACAAC
TAACCCCAAT
CCCTTTAGCC
TCAAAAACTT
CTATAATTTA
ACAATACAAT
GCTCAAAAAC
TATCAGCGCC
GATTTCATGC
CGCTACCTCC
TCTTGTCTCT
AAAAAACGCC
CTCTTCAGGG
TAATGGGACT
AATGGTGAAT
CACACAATCT
CCCCACCACA ACCTCAACGC ACTAATTCCC TTTTAGGCTC ACCCTTTTAA TGAACACCTT GGTTGCGCCA ATCAAATCCA GCAACCCCCA CTTCAACTAA CAAAGCGTTG CTATCAACGC AACAACTTGC ACAACGCTTT AACGCTTTAA AGCAAATTTC ACTTCCAATA ACTACCAAAT TATGATTGCA CAAGCGCTAC TCAGCCACAA GCTCCACAAG AAAGTCCAAA CCATCAACGG CAAGTGTGGA GCGTTTATAA AAAATATTAT GCA.ACAATGG GGTTTGAGCA TCAGCGGGAA ACCACTAATA CTCAAGCTA-A AATGAAGAAG AAGCCAAAAC TCTAACAGCA CGGTGATGGG ATGTGGTOC7
TACTTCTTCA
ACAAGGGGAA
GTGTTTAGAG
CCAGGCCAAC
TTTAGACA.AC
GAATTTCCA-A
GTGGATTAGT
CGGCACGGTT
CGGAAGCCTT
CAACACAAAT
CAAAGAGCAG
CTCTTTAAAA
ATCGCAATCT
CGCCCAATTG
AAGCAACGCT
GACCAATTTC
AGCTTTAAAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1095
INFORMATION FOR SEQ ID NO:216: SEQUENCE CHARACTERISTICS: LENGTH: 405 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...405 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:216:
AACCCATTCT TCACAATCAC GGCATCGCTC TAAGCTCTAA TCCCAAGAAG TCAAAGCCCA CAAATTGAAC ACAGCATCAC AATCTGACCG ATGCGGTGAA AACGCTTTAA ACACTTTAGG CTAAACCCAC CGGACAAGTC
AGCCAAACTC
TATCATTAGT
GCTCCAAAAC
TAAAACCACT
TGCATCTAGC
GGTGGGGGTT
GTATTCTATC
CTTTCCAACA
GCAGTCAATA
ACCGCGCAAT
AGCACCACTT
AATAATACCA
TTCCCCACCA
CAACTAATTC
CCCAAAGCGC
GCCTAAACCC
CCATGACAGA
ACGCACAATC
CTTATGTGAG
CAACCTCAAC
CCTTT
GTTTGATCAG
TAGCAACAAC
ATTGTTGCAA
CTTACTCTCC
CGCTCTTGTT
GCATGTGGTG
120 180 240 300 360 405
INFORMATION FOR SEQ ID NO:217: SEQUENCE CHARACTERISTICS: LENGTH: 273 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...273 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:217:
GGAAGAGGAA TGAAATTAAA AAAACGAAAA GTTGCGGCTA CATTGCTAAA GCGTTTGACC
TTGCCACTAT TGTTCACTAC GGGTTCATTA GGGGCGGTTA CTTATGAAGT GCATGGGGAT 120
TTTATCAACT TCTCCAAAGT GGGTTTTAAC CGTTCGCCTA TTAACCCTGT TAAAGGTATC 180
TATCCTACAG AAACTTTTGT TAACCTTACG GGTAAGCTAG AGGGGTCTGT GCATTTAGGT 240
AGGGGATGGA CCGTGAATGT AGGCGGTGTT TTG 273
INFORMATION FOR SEQ ID NO:218: SEQUENCE CHARACTERISTICS: LENGTH: 2163 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2163 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:218:
ATGGAGCAGC CAGTCATTAA AGAAGGGACT TTAGCTTTAA TTGATACCTT TGCGTATTTG
TTTAGAAGCT ATTACATGAG CGCTAAAAAT AAGCCTTTAA CCAACGATAA GGGCTTTCCT 120
ACAGGGCTTT TAACGGGGCT TGTGGGCATG GTTAAAAAAT TTTATAAAGA CAGAAAAAAC 180
ATGCCTTTTA TCGTGTTCGC CCTAGAAAGC CAGACTAAAA CTAAAAGAGC TGAAAAACTG 240
GGCGAATACA AACAAAATCG TAAAGACGCC CCTAAAGAGA TGCTTTTACA AATCCCTATC 300
GCCTTAGAAT
GATGATGTGA
GATAAGGATT
TTTTTGGCGA
CAGGCCATTG
AACGCTAAGG
TTGGCGAAAAJ
GCCTTTTTA.A
TTAAGTTGCG
TATGGTTTCA
CCCATATTAA
AAATCGCGCA
GAAAACCCTA
GCCCTAGCGT
TCGCCCTTTT
ATCATTGGGC
TTGGAAAATA
GTGGGGTTTG
ATCAAAGATT
AACGCCTTAA
ACTTTAGCTA
GGCTTTAAGA
AATGGTGCTG
ATCAAGGCGA
CTATCCACTC
GACATCTACG
AATCTCTTTA
GTGCCACATT
GAAATTAATG
AGCGTGGCTA.
AACGATGCTA
ATT
GGTTGCAAAA
TCGCALAGTTT
TTAACCAGCT
AAGATTGCGT
TGGGGGATAG
AATTGTTGCA
ATTTACTCAG
GCAAAGAATT
CTTTTCCGAG
TTTCTACCTT
ACAGCACGCC
TGATTGTTTT
ACGCTAGGGT
TTTTATTACA
CTTTGGAATT
ATGATTTAAA
TCCGCATCCA
ATGAGGTTTT
TTAAGACA-AA
AGCGTTTGTG
GAGACATTGA
TTGATGCCGC
GCGCTATCAC
TGCTACCTAT
AGTTGCAAGC
CTTTAGCTCA
ATTCCATTCC
TGGGTAAAAC
CGGTTCAAGA
AAGATGTTTA
AGAATTTGGG
AATGGGTTTT
AGCCACGCTA
TTTGAGCGAT
GGAAAAATAC
CAGCGATAAT
GCGATTAGGG
CCCTAAA.ATG
AGCCACTTTA
C GAAAAC CC C
AAGGGATTTA
TATATTAGAC
AGAAAGCGCT
TTTTATGCGT
AGATCAAGGC
TTTACAAAAC
GCCTTTATTA
AGACACTCAG
AAAGGAATAT
AAGTAAGGCG
CGAATACTTT
AACGCCGTTT
CTTGGAAAAT
ATCCACTGGT
CTTGCAACAA.
TCGAGCTATG
AAACCAAAAG
TAAAGACCAA
CCCTACTAAC
CAATGTAGCT
TAACCTAAAA
CCCAGGGGTT
ACTTGCGTGG
AGCCCTTATA
AAAATCGCGC
GGGATTTTGC
TACAAGGGGG
AGCTTGGAAA
TATCAAGCCT
GAAAGAGGAT
TTGTTGAAAA
GAAAATTCCC
AATACCCCCG
GAGCCTTTGA
TTGGTTTTGG
TATTTTTTAC
GCTTTTTCTC
AGCTTTTTAA
ATTTTAGCGT
TTAAAAGAAG
GAAAAATCAG
GAAAAAGGGG
GTGAAAGTTT
ATCGTCATCG
CATGTAACCG
GCGCTTACGC
GGATCTCAAA
CAAATCCTTT
CTTAAGTATT
CCTTACAGAC
ALATTATGGTA
TCCAATCAAA
TCTAAACTTC
AGGTGGGAGG
AAACCCGCAT
TTTTTGATGG
CGAGTCAATT
TTAAAGGCAT
AAATCTATGA
TGATACAAGA
GCATTAAAGA
TCAAAGATGA
CTTTCATTGT
CCTTAGACAA
GCATGTTTTT
ATAAAGACAA
CTTTAGAAGA
AAATGTTACA
AAGCCAAATA
TTTTAAAGAA
ATTTAATCCC
AACTTTTAAG
GGCTAGAAGA
TAATGGGCAT
CAGGTGTTAC
ACTATGCGGT
TTTCTCAAAG
CAAATCGTGA
CTAACGCTTC
TGGAGAACGC
AGAATGTGAA
ATCGTTTGGA
CAGAGATCGT
CCTGTCGTTG
GTTTGAAGCC
TTATTCTAAA
CAAAACGGAG
CACGGATTAT
TGGGAGCAAG
AAATTTAGAC
CAAAGGAAGC
ATTTGATTTT
ATTGAAAGAA
AGAAAACGTG
CGCCCCTAAA
AGAAAAATTA
AAAAATTCTA
GGCGTTGTTT
GCATGCGTGT
TCAGGTGCCT
TCCGGAAAAA
GCATGAAAAG
CATGGAATTA
GGATTTGCTC
GGAATTTCAA
AAACAAACCC
GTTTAACAAC
TAACCACACC
ATTCGCTAAA
AAGTATCTTC
TTACTTGAAA
TTTGAATAAA
TTCGGCTTTA
AACCACTTAT
TTTGGAGAAG
360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2163
INFORMATION FOR SEQ ID NO:219: SEQUENCE CHARACTERISTICS: LENGTH: 228 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...228 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:219:
ACTTCCCTGT CGTTGTTTGG AGAAGATTTA GCCAAAGAAA AACGATCCAT CGCTAAAAGC
ATTAATTTTG GGCTGGTGTA TGGCATGGGG AGTAAGAAAT TGAGCGAAAC TTTAAGCATC
CCTTTAAGTG AGGCTAAAG TTACATAGAA GCGTATTTCA AGCGATTCCC TAGCATTAAA
GATTATTTGA ATGGCATGCG AGAAGAGATT TTAAAAACTT CTAAAGCT
INFORMATION FOR SEQ ID NO:220: SEQUENCE CHARACTERISTICS: LENGTH: 975 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...975 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:220:
AAAGAAAAGG
AGCCTACTCC
GGGGTGAGTT
AGTCAAGACG
CCTTTAGCCT
TGCCCTGATC
AATAACGGCG
CAATCTCTAG
AACCCTGAAA
GTTATTGCAT
AACGCTTTAA
AGTAACACTT
GCCACAGCAA
GAAGCTAAAT
ATTTTAGGGG
GGCGGCGATT
CAACACCACG
ATAACCCCTT
CCTTATTTTT
ATCAAACTTC
CATCCACTTA
ATTATTTAGA
CTTCTAAkAG
ATACAGGCAA
TCAATAATTT
ATCTTCCTAA
TGCCTGAGGG
CCACGCTCTG
CTGTGAATTT
ATAATAATCA
CTATCGCTCA
GCTTAGCCAA
CCCAACAAGG
CAAGC
GCAAAACTTT
TCTTAACCCT
TTTGGCCGTT
TATCCGCCAA
AGCGATGGGC
ATGTTTGCTC
CAACCCCCCA
AAACAAGCTC
CTCCAAAGTC
TCTAGCCAAT
GTATAACCAA
TAGCCCCCAA
AACCATTTGC
AAACGCCCAA
TGAAAAGCAA
CTATTCAAAG
GTTTTTAATA.
TTAATGGCAG
CAAAGGGTGG
AACGCTATCG
CAACAAACGA
TATGCAGGGG
AGAGGCAATG
ACCCAACTCA
TTTAACGTCA
ACCATGGACG
ACCTTAACGA
GTCTTGCAAC
AGCACTCAAA
AACATCTTCC
TTTGGCTTCA
CTTTGCGGCC
AAAAATGGCT
AAGACGATGG
ATAACTCAGG
CTCTAGAATC
GAGTCTTAAT
GCTATCAAAA
TCAATGCCAC
TCGGCGAAAC
AATTTGGCAA
CTTTAAACAA
ATAAATCTTT
ACCTTTTACA
ACCAATGCAC
AGGCTTTAAT
CTTACAACAA
CGTATTAC
CATCTATTCT
GTTTTTTATG
GCTTAACGCC
TGCGGCGGTG
GCAAATGCTC
CGGACAAAAT
CTTTGATATG
TTTAATCCGT
TCAAAGCACT
TGACATCACC
TAGCACCCCT
AGACGGCTTA
CGCCACTAAT
GCAAGCAGGG
AGCCCCCAAT
ACCAAAACGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 975
INFORMATION FOR SEQ ID NO:221: SEQUENCE CHARACTERISTICS: LENGTH: 840 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...840 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:221:
TATAACCAGA
TATGGGGATT
AATTTCTTGC
ACCCCACAAC
AACGCCCCCA
AAAACTTTAA
GAAATGCAAA
CATACACTCT
TTTTTCGGCA
GCTCAATTTG
ACAGACTTTC
TTTTTTGCCG
GATGGCAACC
ATGGACCAAA
TGCAAGACGT
TCATCACCAA
CTAATTATGG
AAGCCCAACA
CCCCCGCGCA
TGGCTTATGG
CTAAAGTGAA
ACATTCTAAT
AAAAGCGCAT
GCACAGAATC
TTTATAATGT
GTATCCAACT
ATCTTAAACC
TTTTTCCAAA
GATCAATTAC
TTGGGTCGCC
GGGGCAATTG
AGAACAAAA.A
AATAAACAGG
GCTCTATCGC
TCAAGTCTAT
AACACTGAAC
CTTTGGGCTT
TTCTTTAGTG
TTTTACCCGA
TGCAGGGCAA
TAAGGACACT
ATCGCTCATC
GGGGAAAGCT
CCCTATTTGG
AATGGCGCTA.
GTGATCATGA
ATTTTAGCCA
TCTAAAGCAG
CAAATGGGCT
GGCTTTGGCG
AGGTATTATG
AAAGCCACCC
AAAAGAGGGA
ACCTGGAAAA
TCTTTCCAAT
TCAAAAAAGA
TGCTTAGTAA
ATTTAA-ACAA
ATAATCAAAC
ACCAATTAGA
ACCCCTATTC
TGATTGGCGG
TTGCTAGGAA
TGAAAATGGG
GTTTTTATGA
TCTCTAGCTA
CTGAAGCGAT
CGAATTTTTT
TCCTTTTTGA
ATCTCGTTTT
CACCGTAGCG
TAAAGGTTTG
CCCACAATTA
GCAAGCCACA
CCCCACGGCA
AGTGATTGAT
TTTTTTGGAG
CTATAAGCAA
TTTTGGTTAC
TGGAGCGGGC
AGATATAGOT
AGATCAAGTG
TTTGGGCATA
TCTCAAGGGA
120 180 240 300 360 420 480 600 660 720 780 840
INFORMATION FOR SEQ ID NO:222: SEQUENCE CHARACTERISTICS: LENGTH: 405 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...405 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:222:
CAACCACACA
GTATTGGCTT
AATGATGGAT
CAACAATGTT
CAAAAAGCTC
TTTCCTGTTT
GGAGCTAATA
AACAAAGAAA
TTTGGAGTCT
CTAATTCTCC
TTATGAGCAA
AAGGCAATCT
TGGATAGTGC
AAAGTGAAAC
CATCGCCCAT
TTATGCAGGG
TTTAGGAAGA
AGAAACTTAT
CTGTGCCTTA.
AGGAAAACAA
TACTACTACT
CTACAACACG
AACGCTCTCA
ATCCATAGAG
GATAAAATGA
TCAGAATGCT
GTA.ACTATAA
ACTACTACTA
CGCAAGCTGT
GTTTTCATGT
ATGGGAACTG
AGACACTTGC
CTAGCAA~TCA
CAATAACAAC
CTAAT
TATCACTTCA
GACCGGTTTG
CACAGGATTA
CGAAAACCTC
AGATAGCGGA
GCAAACTAAT
INFORMATION FOR SEQ ID NO:223: SEQUENCE CHARACTERISTICS: LENGTH: 372 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...372 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:223:
GGACTATCAA
CAAAGTTTTT
GGAAAAGTGA
GTGGGTTTGG
TTTGGGGGAG
AAACATTCTG
ACACCGCACT
GGCAAGGAAA TGAAAGATTG GATGCCACGC TTAAAAAATA CATCGGGATT AGGGGGGTTT ATTATCAAAG CTATAAAGAT CCTACCAACT ATGTGATCAT GGAATTTCCT CTAGGGAGTT CTAGGGGGGC TTTAGTTGTT TATACAGACA TGGTTTCAGG GGGATTAGCG ATTAGTGGAG
TC
TCAATTTAAA
TTGCATGGGA
CTTTTTTTAC
ATAAGCATTA
AGCAAAATTT
GTTGGTTGAT
AACCGGCGTG
TCTTGGGTCA
CATGCTTGCG
TTTAGGGGCG
CAAGTTTTTT
ATTGTTGCGG
120 180 240 300 360 372
INFORMATION FOR SEQ ID NO:224: SEQUENCE CHARACTERISTICS: LENGTH: 309 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...309 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:224:
CCGGTTGGGC TCTACCCCCT TGAATCCCCT TTAATTTATG AAGAAAACCA TTTGTTACCC
ATGGGGTTTA TCCATTTAGC CTTTAGAGGG GGTGGGAGCT TGAGCGATAA AAACCAGTTG
GGTTTGGCGA AATTGTTCGC GCAAGTTTTA AACGAAGGCA CTAAAGAGCT TGGCGCGGTG
GGGTTTGCGC AACTTTTAGA GCAAAAAGCG ATCAGTTTGA ATGTGGATAC CAGCGCAGAA 240
GATTTGCAAA TCACTTTAGA ATTTTTAAAA GAATACGAAG ATGAAGCCAT AATGCGCTTA 300
AAAGAGCTT 309
INFORMATION FOR SEQ ID NO:225: SEQUENCE CHARACTERISTICS: LENGTH: 258 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...258 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:225:
TATAGCGTGT ATATCCGCTC CAACTTTTCT AAAGTGGCGC ATTTTGCGAG CGGGTATTTG
CAAACCAAGC TCAGCACTCA AGCTAAAAGC GTTGCCTTAG CCAAAAAAAC AACTAAAGAA 120
TTTATAGAAA AAGGCATGAC GCAACAAGAA TTAGACGACG CTAAAAAGTT TTTACTAGGC 180
TCTGAGCCTT TAAGGAATGA AACGATCTCA AGCCGCTTGA ACACCACTTA CAATTATTTT 240
ATTTGGGTTT GCCTTTAG 258
INFORMATION FOR SEQ ID NO:226: SEQUENCE CHARACTERISTICS: LENGTH: 1029 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1029 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:226:
AGACGCGCAT GGATTCTGTT GGGTTTGAAG AATCTCAAAG GGGTTTGGGT GCTTAAGGGG
TTAAAAAAAG
CACAATCTTT
TTTAAAACTT
AAAATCGCCA
GCTAAAGAGG
TATCAGGCTT
AAGGATTTCA
ATTTATGATT
CGTTCAAGGT
TTTTTAGAAG
GATAATGAAG
GAAGCGATAG
AGTTTGATTG
CGTGGCGTCT
TGTCAAATCC
AGGCTCGATT
CATGCGTTT
CGTTTAAGGA
TATCCGCTCA
TTGAGACTAA
GAACTTATAG
TTTTATGCGA
GTGAAGTCAA
AGCGTTTTAA.
TTGTGCGCGA
TTTATTTTTT
AGCAACCTGA
CTGTGGATTT
GCCTAGACAG
AAGGCATGAC
ATCATTCTAG
ACCACA.AGGC
TTTCTTTAGT
GAGGTTTTGC
AGTTTTAAGG
AGTGGAAACT
CCAA.AAATAC
AAAGCAGGCG
TCAAAACTAT
AATCCAGGAT
TAATTTAGAA
GATTGCGGAT
AGAGTTTATA
GTTTTTGAGT
CAGCAAGGAT
GAATATCCCC
AGAGATAGAC
GTTAATGCAT
GGAGCGGTTG
TCTCAAGTGT
GTTAAAAACC
AAAAATGGTG
CCCTACACCT
TTTGAACAAA
TGCCTTTATG
GTGGATTTTT
AACAAGCCGT
AAAAAAGAGA
GAGAGCAAAG
GAAATCCAAG
AATAGCGAAA
TTGATTGCAG
TTTGTAGAAA
TTGCAAGAAA
AATGTTTTA.G
ATATCTCTTT
ACCGCATCA-A
AAGTCCCCAT
ATTTTAGCGC
TCAAACAAGA
TGGAATCTAA
TGTTTTCGCC
TGTTGTATCT
TTTTTTTAGC
AAGAAGATTC
AAGATATTGA.
AAATAACAGA
ATGTTTTGCA
AACTGGTTGT
CTTTGATGAT
CGCGCATGGA
TA-ATGTGGAT
AGAGAAATTT
TCAAGCCTTA
GATGAGTAAA
AAATCAAGAT
GGATTTTCTA
CTTTAGCCTT
GCTTTTAGAG
CAAATCCGTG
TATGGGAATG
CAGCCTTGAA
GGACGCTTAT
AGAGGGATTG
TTTAGACAGC
AGAACTGGAT
GAATGAAAAG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1029
INFORMATION FOR SEQ ID NO:227: SEQUENCE CHARACTERISTICS: LENGTH: 1260 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1260 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:227:
ATGATTTTTG
AACGAGCTTA
GGGTA.TCTTT
CCTTTTTTGT
GAGCATGCGT
TTTA.AGGCCG
GATTTATTGT
CAAAACACGC
CCGGAATTGT
ATGAAAGGCA
TTGCAAA.ATG
GATTTGAGCC
AGTTTGCCGA
AGCTTGTTTG
ATCAAAACCA
GCGATAGGCA
GGGATTTTAA
AAAACGCCCT
TGTATGAAGC
ATTTTGAACA
TTTACCCTAA
TTAAAGAGCA
TAAACACTAA
CTTTTAAGGC
TTTTTGAATT
CGATCGCTCG
ATGACAAAA.A
GCTTGGCCTT
GCGTGTATCA
AGATTTTTAA
TGCAAATCAT
TGGTTGGAGG
ATATCAAAAA
GGATTTTATC
GCGCTTAGCG
ATTTTTAGAA
AATCCACAGC
TCTCAAAAAC
GGCCAAACCA
TTTTATAGAA
AGAGTTTTTA
CTCAAACAAC
TAGAAGCGAA
AAAAAATAGC
AATGATAAGC
GGCGTTGTTC
TGAAAGTTTA
AAAAAAAGCC
AGCGTTAAAA
TCTCAAAATA
TTTTTAGATG
AGAAAAAAAT
TCTTTAGATC
GGCGATACCT
AAGCGCGTTT
AATGAGTTTG
GACACAGCGA
CCCTTGATAG
AATGTGATGA
GTGAAAGTCA
GAGATTGAAG
CCTTGCGGCT
GAAAAkACGCC
CTTTTTAGCG
AACTCACAGC
GGGGGAAGGG
AAAATTTTCA
ACCTTC-TAGA
AAAAAACTTA
ATCAAGTGAA
TTAAGGAAGT
GGAGCGTTTT
TTAAGATTAT
ATGAAAAAAA
TCGTGGATTT
ATCAATTGTT
CTCAATTGCC
CTGTGACCGG
CTAGGGGGGT
TGCCTATCCG
CACTAACCTT
GTATTTTGTG
A.AGCCAAACC
GCATCTAAAA
TTTCAAGCAG,
TTTGACA.ATG
GATACACAAC
AAGCTTTTCG
CACAAAACCC
CCGATTGTTT
ATTGCGTAAC
TGAAATCATC
CCTAAAAACA
ATGCCCTAAA
GTATTGTGGG
CACTTTAGAA
AAAAGAGCGC ATGAAGATTT TTTGCATTTA GGGGTAGGGA AAAGCTTCAA AGGAATATGA AGAGAGCTTT TTAAAATCCT GAATTTGAGA TCGTGGAAAC GATGAGAGTT ATCAAAAGGG AATAAAAACG CCCATAAAGA ACGCTTAATG CATAGCGCCC GATGAAAATC TTTTAGACTT TGAATTAGAA GGAGAAGGGG GTGGGGTAAC TTATAAAAGT TTTTTGTGAT GCCCAAAATA ATCAAAAATT AGAGATTAAC AATATTTTAA CTTTAATACA TTTTAAGGGT TTTAATTCA.A 1020 1080 1140 1200 1260
INFORMATION FOR SEQ ID NO:228: SEQUENCE CHARACTERISTICS: LENGTH: 855 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...855 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:228:
AAACAAATAA
TTAAGCGTGT
CTAGGGTACC
TTTACCGCGT
TATGGCACGA
TCAAAATACC
TTTCCTAAAA
TTTTTTCTTA
AAAAGCTATA
GTGGGGGCTG
GTGGGAGAAA
AATACTTACC
TCAAGGTATA
TATTTTAAAA
TCAACTTATG
GAAAAGAACG
ATTTAAGGGC
AGCGGTTTTT
CTAAAAAGCC
GCGTGGTGCA
GAAACATTCC
AAGAGCACTA
AAAATAAAGG
GAGAAAAATT
ACTTTTTATA
CCTGGTTTTA.
GCCCCAACAT
AGAATTGGGC
CCCCTTTATA
ATTTT
CTTGAACAAA
TGATGATCTG
AGCCAAGAAG
CAAGAGGAAA
AATGTCATGG
TTTTGCTGAA
CGGCTTCCGT
CGCATTAGGC
ACAAAGAAAA
CAAACGCGCT
TGAAACAAAG
GTTTCAAGTG
GATAGAATTT
CACCCTTCAT
CTGCTTAAAA
GTTACTTATA
TGTTTAAGGG
CCTTTTAATA
CTACAGAGTA
GTCAGTTTGA
TTTTATGTCT
GATAGTTTGA
GAGACTTTTA
TTTGGGACGC
ATTTTCAAAC
ATGTTGAATG
GGCGCCCA
TTCAAACGCA
AGGGGTTTTT
CGATCGCCAA
GTAAAACCCA
TAGATAAAAG
GGGAAAAATT
TTTATGGCTA
CTTTGGATTA
GGGCGAGTTC
TTAACGCTAT
TGATTTTAGG
A.ATGGGCTAA
TGGGGTATCG
TCCCCTTTTT
ATATTTCTGT
AGCGTTCTTT
AGAAGAAGAT
CCCCCCATGT
CTCCCATTAT
TGAAAACCAT
TA.AACAATTT
CGCTTATGGC
GCAGATCCCT
TTTTTATGGC
GGTGAATTTC
AGACTCTCTA
CTACCGCTTT
AATTAATGAT
CTATCTCACT
INFORMATION FOR SEQ ID NO:229: SEQUENCE CHARACTERISTICS: LENGTH: 693 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...693 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:229:
AATACTCTTT
TTAAAGGCTT
GCTTTGAAAT
GTTTTGAAAT
AATCTTAAAA
GGGTTTTATA
TTTCAAAGTT
GGCATTGGCA
GTGGTGGATA
GATGAATTGC
CTTTATGAAA
GAATTTTCCA
TTTATTTCTT
TAAAGAGCTT
TTGAAGCTCT
CTTTAGAAAA
AAATCGCTTA
ACCAAAAAGC
TTGAAAATTT
AAGAAAGCGC
AATACAGCTA
AGCATTTTTT
ACACCATTTC
AACAAAAATT
TTTATTACAG
GGATTTATTG
ACTAGGAGCG
TCTAAAAAAC
TATAGAGTTT
CAAACGACTG
TAAACAAGAA
GGATGCGATT
TCTTTTTTTA
TGAAAAAGGC
TTTAGCGCAA
GGAATTGAAG
GCGGTTATTG
AAAAACGCCC
GTTTTAACGC
GCTTTCATTT
TCAAAGCTTG
ATTGATTTGA
GTAACCAAAG
TTATGCTATG
AAAAA-ATTAG
GTTCAAGAGA
CTTTATGCGA
CTT
TGTTAGATAG
CTGCTTGGTG
AAAATACTAA
TAGAAAATGA
CAGAGTGTGT
GCAAGAATAT
AGTGGCTTTT
TGTGCGCCAA.
GCATAGAGAT
ATTTAAATTC
GATTCCATGG
TTTTGAGATT
GTGGCCTAAC
ATTTGA.AGCC
TGATGAGATC
CCGCCCTAGC
TTTAA.A.AGAC
AGACCAAAAG
AGAAGTGATG
AGAAGATTAT
CGCCTTAGCG
AAAGATCGTG
120 180 240 300 360 420 480 540 600 660 693
INFORMATION FOR SEQ ID NO:230: (i) SEQUENCE CHARACTERISTICS: LENGTH: 693 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...693 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:230:
ATCAGCTTCA ATGCGGACTA CTTTAGGATT TGGGCGCGCG ATTTTGCCAC CGGGCAGTAT
TCAGTCTATA
GGCGTGGAGC
AACTACATTG
CTAAA.AGGGA
GACGCTCGCT
GCTTATAGCG
GGGGGGAACA
ACCCAACATG
TTCTGGGAAA
CGAGCGGGCC
TGGAATTGTA
ACACTCGTGT
CTAGCTATAA
ACA.ATTGGCG
GGATTAGCAA
ACTATGAAAG
AAGGGCTCTT
ACGGAAGACA
CATGAAAGGT
TTACAGGCCC
AACTAGCCAT
CAAGCATTTC
TAAAACCACC
CAGCGCAGCA
CGTTCTTAAT
GCCTTGGTAT
CAGAGTTACA
AATGTGCGCC
ATTAGAGGGT
GGCCCTTTAA
CCTTTTGTAA
ATTGGTATTT
GGAGGCTATT
AGCGGTTATC
TGGGTGTGGA.
GGAAGCTTAC
CCATTAATGG
TGCAATTCCA
CCGACTTGAA
GCCCTTTCCA
CTAGCTATTT
ATGGGATGCA
AATGCGAAGC
ATATCCAAGT
AAATCAATAA
CTATTCTCAA
TGCCGCTTTC
CGGGGATGTG
ATTCATTTTT
TTATAGCCGT
ATACTATAGT
TTGGTGTATG
GAGCCAAATT
CATCTTCAAC
WO 97/37044 PCTIUS97/05223 271 ATGAAGTATT ATTTTACAGG GATTGGCTCT AGCCCTGCAG GCTTGCAACC TGCGCCTGGA AGATCGGTTA CAGCGTATTT GAACTACACT TTC INFORMATION FOR SEQ ID NO:231: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 804 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION .804 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:231: 660 693 AGCGGTGTTG TGAAAATGAA GGGGAAAAAT ACACCGTGTG TGTTTCACTC AAAAAGGCGG GCCCTACTCA TTAAAGAGGC ATGGGAATGG GCGAGCCTTT AATACCGGCA TGCAAATTTC AAAATCCCCA TTTTAGCGGG GTAGATGATA AAACGCGCTC GTTTTGAATG AAGTGAGGAA CTTTTAATCA AAGATTTAAA AACGGCATTA AATCCAAAGT GAGCGCCCTA GCTTAGAGAG TTATGCACCA TTAGAGAATC GAGAAAAAGC TCCAGCAGAA
AGACAAAAAG
CGTGTCTTGT
TTTTGTAAGG
CAATAACCTC
GAACAATTTA
ACCTAAAAGA
CAAAAACTTA
GTCTTTAATG
ATGGCCTTTA
CGATAGCCTG
GAATTTGATC
CGCCAGAATG
CAAAGCCTTG
AATT
ATTGATGAAG
CAAATCGGCT
GATCTCAAAG
CCCATTGAAA
GATGAGGTGT
GTCACCATTT
GGCGTGCAAT
CCCTTGAATA
GAGCAGCGAA
GATTGCGCTA
TTATTCAACC
TTTGCGAATT
GATATTGAAG
AAACGAACGC CGTTTTAGAG GTCAAGTGGG TTGCTCGTTT CGAGCGAGAT TATCCAGCAA AAGCGCTCAA CATTGTTTTT GTAAAGCGGT TGAGATTTTT CCACGAGCGG CGTGGCCGAT TAGCCATATC CTTACACGCC AAAAATACAA CATTGAATGC AAAGAGTGAT GTTTGAATAC AAAAACTTTT AAAACTTTTA CGCATGAAGG CTCTAAGTTT TTTTAAACGC CAAAGGCTTA CGGCTTGCGG GCAATTGAGG 120 180 240 300 360 420 480 540 600 660 720 780 804 INFORMATION FOR SEQ ID NO:232: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 708 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 WO 9737044PCTIUS97/05223 272 (ix) FEATURE: NAME/KEY: misc feature CE) LOCATION .708 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:232: AGATTGTCTG AACCCATAGA TAGATTCACA CGCATAAGGT
GAAAAIAATCC
CTAGATTCTT
GATGTTACCA
GTGTTGCAAG
TTAAATTCAT
ATTAAAACAA
ATGGGGAGCG
TATGGCGATA
GATTTTAAGG
GCGGTTACGG
AATGAAAGGC
GCCAACAAAG
TGTATCGTGT
ATCAAAACCG
ATCTCTATAA
TTAATTTTAG
GCCTAGCGAT
CGAAGCGTTT
AATTCGGGCG
TGGTTTTTAG
CGAGTTTTGG
AATGGAATGA
GGTTTTAATC
GGGGATAGGG
CCAGATTGGA
AGGCATCCAA
AGATTATGAT
AAAATGTCAA
GAATCCTAAA
TAAATTTAGG
CCCTGAAGTC
TTTGCAAATA.
AAGATTACGA
TGTGGCGTGG
CAAATCACTA
TCAGAAAGGA
GCTTTAAATT
TACATTTTAG
AATTTCGCTT
CACATCCAAG
GATTTTTTAA
CCGCATTGCA
GCGAGTGAAG
AGACGAATTG
GGTTGTTTAA
GGGGCGTTGG
TCATTGATAA
TAGGAGAATC
GTCGCATAGA
ATTGCATGGA
ACGGGAAGTT
TGGGGAGCGT
AAAAACGCCG
TAGAGCTTGG
TCGTGCAAGA
GAAGATTT
AAACGATTTT
AGGCTTTGCA
AGACATGTTT
TAAAGTGTTG
TGAAGCGTTT
CGATTTGCCT
TATCAGCTCT
GTGGGAAAGC
TTTTAAGGGG
AAGCTTTAAT
TATTATCAAA
120 180 240 300 360 420 480 540 600 660 708
INFORMATION FOR SEQ ID NO:233: (i) SEQUENCE CHARACTERISTICS: LENGTH: 999 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...999 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:233:
GCGAAAGCTA
TATTATGAGG
AATATGGAAC
GACAAATACG
TTTTACGCTT
ATCAGTCAAA
CATTTCACTT
GATGTGCCTA
GAAAAAGGGG
TTTGACACTT
TACATTATCA
AACTATTATT
TTTGCAGACA
AACAAAGCCT
GGGTGCAATT
AAGTGCAAAA
GCTCTAGGGC
AAGCTTATCA
TAA.ATGGGGA
GCCCTTTCGT
GGATCATCAT
TTTTTATGAT
AAGCGTTTTA
TAGCCTCAAT
CTAGGGCTAT
TAGGGTTTGT
CCAGAAACAC
TTGAAAGTTA
CAACCGCGCG
AGCCATTAAA
AGAAGTGAGC
AGGCTTGCTT
TAAGAATAAG
GGCTCAAGAC
TGAAGATGGT
TGATTCGGTT
TCAAAATTTC
AGATGCGGTG
TTTATCGGCC
TGGCGGGTTA
GAGCATTTTT
TGAAGTTCGA
AATGAACGCC
GAAATCGATT
GAGATTTTAA
AACCCGGCGG
GGATTAGGCT
TTGGTTTTTT
AAAGAGCCGC
TATAACGTGA
ACCCTCAAGG
GTCGCTAGCG
ACTTTTAAAG
GTAACTTCGT
GCCCATAAAA
GCCGATTCCA
AACGCAGGGC
CTAGCAAAAA
ACAACACCTA
TTTCGTATCT
ATCTTAATGA
TCAAAAACCC
AAAAAAGCGA
GCGTGGCCTT
ACGGAGAAAA
AATTCAGGAA
TGGGCATGCA
TGTATTCAGG
TCTACCTCAT
TTGACGCTTT
TAAAGAATTT
GCACAATATT
TTCTAATTTA
TTCAGGGTTG
AGCCTATGGG
TAACAGGAGC
ATTTAAAATT
GCCCAAGCTA
AGTAACACCC
GCAATTGCCC
AGCGGTGGCG
TGTGAGCACC
GCGCATCAAA
TTCGTTTTCA
120 180 240 300 360 420 480 540 600 660 720 780 840
TTAAAGCCTT GTAAAAGATC GCTTGAAAGC CCTAAAATCA TTGACGCTAG GGAATTGCTT 900
TCAGGGTTTG TAACAGCCCC ACAAGTCTTT TGCTCTAACC GCCATAATAT TTTATATGTG 960
CGCAGCTTTA AAAACGGGTT TGTTTTGAGT CATTTAAAA 999
INFORMATION FOR SEQ ID NO:234: SEQUENCE CHARACTERISTICS: LENGTH: 273 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...273 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:234:
AGGAATAAAG ATAAAAGAGA GTATTTTGAA AACGCCACCG CGCAAATTGG GGCTAAACTC
TTGGTGTGCG ATATTAGAGA GCAGTTCTTT AACGATGTGT TGTTCAAGCC CAAATACGGC 120
TATGGGAAGT ATTTCAACCC TTGCATTGAT TGCCATGCCA ACATGTTTAG GAACGCTTTT 180
TATAAAATGC TTGAATTGGA TGCGGATTTT GTTTTGAGCG GGGAAGTGCT AGGCAACGCC 240
CTAAATCCCA AAGGAAAGAA GCGCTCAATC AGG 273
INFORMATION FOR SEQ ID NO:235: SEQUENCE CHARACTERISTICS: LENGTH: 906 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...906 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:235:
GGGGAAGTCA TGGCTGATAG TTTAGCGGGC ATTGATCAAG TTACGAGTTT ACATAAAAAT
AACGAGTTGC
GTCTTTAAGA
AACA.ATTCGC
ATGMAAT
ATACAAAAAG
ATAGGGGTTA
CAAAGCGCTG
TTTGACGGGC
TGGATTGAAG
TGCGTTTTGC
AAGCTGGGCG
AACCCAACAA
GCGAGCGGTT
ATTGTGGTCA
GGCCAA
A.ATTGTTC-TG
TCCGTGAAGT
TCGTTGAGGG
C-GTTTTATTA
AAAAAAGCGA
GGATTTATGA
GGCTACGGGG
GCTTGGTGCA
ATGAAAAACA
TCGCTGATGA
TCAAGCACAT
CCGATGTGAG
TTGAAGTGAT
ATTCTTCTAT
TTTCAGGCTG
GGTGAAATAC
CTTGATCATT
TGACAGCCA.A
AGACGATATT
AGCGGATAGG
ATCTGCTGGG
AGTGGTGGAT
CAACGATGTA
TTCCCCAAGC
TGACTTTATC
TAATATTGAC
CAACCAGGTT
GAGCGGGAGC
GGTAAAAACA
CATGGCAATC
ATAAGAGAGC
AACAAGAGCA
GTCATGATTT
ATTTCGAGCA
AATAACAAAC
ATTGAAAAAA
GAGACGCTTT
GTCTTAAAAA
ALATGGTAAAA
CTGATTATTA
AAAAACACTC
TCTTATGA.AA
AGGATTTGTA
TCACTATCAT
TCACCATTCC
AGGATTTACG
GTGAGTTTTC
AGAALATGGAC
TCGTGAGCCG
TGCTTATAGA
C TAAAAT CCA
CCATGCAAAT
CCTTATTAGA
CCGATTTAGA
CTTTGACTTC
AAAATGGCAA
TGCGGTCAAT
TAGCCACGAA
TTTGATTGAC
CCCTTACAGG
TCGCTGGACC
TGAGATGGAG
CACGCGCTAT
CGTGTTCCCT
TTCTA.ACCAA
GATTTTAGAC
GCATTTGTTC
AATGCCAGAA
AACA.ATCCCC
GAATTTGAAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 906
INFORMATION FOR SEQ ID NO:236: SEQUENCE CHARACTERISTICS: LENGTH: 378 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...378 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:236:
GCAGTAGCGT
TATAGCGCTA
GAACTGAATG
AAGGTTTATT
CCCGGGTTTG
GGCGTGCATT
AGGGAGTTAA
TTTGCTTACC
ATGAACGCAT
AGCAAAGTTT
TGAAGAACGT
ATAGCCCCAT
TTGTCATTCT
AAACCTTT
GTATTCAAGC
AGAGGCTTTT
TGAGGCGATT
AGCCTTAAAA
TTCAAGCCAC
CACAAGCGTA
CCAAAGACTT
TCAAGCAATG
AAAGAGAATG
GATAGTGCGC
ACCCATGCCA
GAAGAGGGTA
CTTTAGCCAC
ACGAAAAAAC
CGGCGAAGTA
CTTTAGTGTT
TTTTGGAAT.A
ACCTCACTAA
TGAATTGCAC
AGAGAGTTTT
TTCATACCTT
TGTGGATATG
TTTGGAAAGG
ACGCATGGTT
INFORMATION FOR SEQ ID NO:237: SEQUENCE CHARACTERISTICS: LENGTH: 975 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...975 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:237:
CGGCACCCGA
GGCGCGGAAA
AATTTAGTCA
CAAAGGAACA
TTAGACATTT
CAAAACAAGA
GACACGCCCT
TGGAGCATGG
ATTTTAGAAA
AATTCTATCG
AGTGCGGACA
GCTTCCATTA
GTGGAGCGCT
GTGGTGCTGA
TTTGGTGGCT
AAGCTCTCTA
AGTGTAGCGA
TTGAAA.ATTC
TGTTTGTGGG
TTAACGCCAT
AAAAACTAGC
CACAA.ATCAC
CTTTAAAGCC
TAGATCTTAT
GGGATAAA.AT
CCTATTGGCT
TGCATGCTTT
TGCAATTACC
AACCCTTAGA
ACACTTTGTG
ATGCGAGCAA
TTATCCAAAT
GTTTAGATGA
GAGTG
GATCTGCGAT
GTTAGACGGC
TGTGGGCGTA
CCTAGCGALAT
GCCCGTTGAT
TAAATCTTTA
CGCTATTCAA
CACCATTGAT
TTTTGGTGCG
GGTGGAGTTT
CATAAGCTAT
TTTATACGCT
GCGTTATAAA
TGAAGTGGCG
CATCTCTCAA
AGTGCTAGCG
CTTAGCGAGT
ATTGATGCGA
CCCGGATTGA
AAAGAAAGTT
AGCGAGCATT
ATCATTAGCG
AATGCGCAA-A
TCAGCGAGCA
TCTTTAAAGA
GAAGACAACT
GCCATTAACC
TTAAGCGCGA
GATTTGTTGT
ATGAAGAAGT
GCTTTAGAAT
TTAGATAAGG
TGAATAATTT
TGATAGAGGA
AAGCGAGCTT
TAGTGAGCGC
TTGGTTTGTG
CGAGTGGGGG
ACGCGCTCAA
TGGTCA.ATAA
TTGACGCGCT
CTGTCATTGC
CTAAATTAGC
TTAAATTCGA
TGGAAAACCC
TCTTGA.ATCA
TGTATGCTA-A
AAGTTAGGGA
AAAGCCTTTG
GTGCGTTTCC
TAAAAGCTTA
GGGGCATTTA
GGCGTTGTTG
GGCTTTTAGG
GCACCCTAAT
ACTTTTTGAA
GATTGAA-AGA
GCATTTAGCG
TTCTTTGAGT
ACCCATTAGC
AAAGCTTGGC
AGAAATCGCT
AAAATCTTTC
GCGTTTTGGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 975
INFORMATION FOR SEQ ID NO:238: SEQUENCE CHARACTERISTICS: LENGTH: 396 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...396 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:238:
GAGACGCTCG CAAGGGTTCT CAAAGTTTCA TGTTTGGCTA TGGGGCTOGC ACAGATGTGT
TGTTTAACCC AGCTATTTTC AATCGTGAGA AACTTGCATT TGGGGTTTTT CTTGGGCGTT 120
GCGATTGGTG GCACCTCTTG GGGTCCAACA AACTATTATT TTAAGGACTT AGCTGAAGAG 180
TATAGAGGGA GTTTCCACCC ATCAAATTTC CAGGTCTTAG TTAATGGCGG GATCCGCTTA 240
GGCACTAAAC ACCAAGGTTT TGAAATTGGC TTGAAAATCC AAACCATTCG CAATAATTAC 300
TATACCGCTA GCGCGGATAA TGTGCCTGAA GGGACTACTT ATAGATTCAC TTTCCACCGC 360
CCTTACGCCT TTTATTGGCG TTACATTGTA AGCTTT 396
INFORMATION FOR SEQ ID NO:239: SEQUENCE CHARACTERISTICS: LENGTH: 180 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...180 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:239:
GTCTTAAAGA ATGAAATAAG CTATAATAAG CCCATTTTGA ACACCAACGA TTATTTGAAT
AAAGACAATA CGATGAAAGA TAGTTTTCTT TTCACTTCCG AATCAGTAAC CGAAGGGCAT 120
CCTGATAAAA TGGCCGATCA GATCAGCGAT GCGGTTTTAG ATTACATGAT TGAGCGGGAT 180
INFORMATION FOR SEQ ID NO:240: SEQUENCE CHARACTERISTICS: LENGTH: 333 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...333 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:240:
AAAACCAAAG TCGCATGCAA GACTTTAGTT TCTAATGGTT TATGCATGAT CACTGGCGAG
TTAGAAACTT CTGTTTATGC ACCGATGCTA GAGATCGCAA GACATATGGT TAGAAAAATT 120
GGCTATACAA
GGCGAGCAAA
GGGGATCAAG
CCCATCCATT
ACGCTCTCTA TTGCTTTGAT TACAGAAGCG CGGCGGTTTT AAATGGCATT GTCCTGATAT TAATCAAGGC GTGGATAGAG AAGATGGCGA GATTGGGGCG GGCTTATGTT TGGTTATGCA TGCAAAGAGA CTGAAACGCT CATGCCTTTA TAGCGCACCA GCTCGCTTTC GCC 180 240 300 333
INFORMATION FOR SEQ ID NO:241: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 507 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...507 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:241:
TCCTTATCAA
CTCAACGCAG
GAGCGCTATA
GGGGATTATG
GTGTTTTTAA
GTTAAAATTT
GGAAAATTAC
AAAGGCTATA
GCAAAAAGTC
CACTTGCGGT
CCAAAGACAA
AAGATGAAAT
ACAAGATTGA
GCGAGCATTA
CTGAGGGCTG
AAAGCAGGGA
AAGATATGAC
GGCTTGATCC
TTTATTGAAA
AAAAGAGGGA
CAAAGAATTG
TATTTTAATC
TAACGCGCGC
TAACCAAAAA
ATTAGACTCC
TTTTATCGCT
AGCTCAT
GCGCTAAACA
GCGATTTTAA
ATCCCTGAAG
GCTAAAAAAC
ATTATCACGG
TGCTCTTTTT
ATTTTAAAAG
CAAGATTCTA
AGAGAGTATT
TTGCGAGCGG
TGGATATTTT
AAAACCAATT
GATCGAGCGT
GCGCTATCCC
AAGTAGAAGA
GCTCATTTTT
CCAAACCATT
GTGCTTGAGC
TACCGGCGTG
CAGCGAGCAG
GCATGCTTAC
TAGCTTTAAG
TCTAGCCCTT
ATCCGATAGG
120 180 240 300 360 420 480 507
INFORMATION FOR SEQ ID NO:242: (i) SEQUENCE CHARACTERISTICS: LENGTH: 474 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...474 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:242:
AATTTAAGGA
GCTAAACAA.
GGCATGACTT
AATGCGATTT
ACGATCAAAG
TCTTTGGAAG
GCGAGTTTGT
CCTGAATGGA
CTATAATGCA TTATTCTTAT GAAGCCTTTT TGGAGCGCCT TTGCGGTATC CCAGAAGCCC TAGTGCATTT TTTGAGTTTO CACTGGAATT CTTATGACAC CACCAAGCGA CAAAACGCCC ATCATTTAAA AACCATTTTA GTGGTAGATO CGGTACTTAA AGTGTTGCAA GACAAACACC TCCAAAAAAC AAGCGCGAAA TACAAAGCCG TTGATTTCTT TTGGGAAGTG GATTTGAAAA TAAAAGACAG CCTOGAGTTA TTGTOTGCOT GATGCGAGGO TAAGGGAAGT TTATOGCATC TAAAAATTGA AAATACCCCC AAATCGTAGA TAGCGGTAAT CTGATAAAA GTTTTATAGC ATGCGTTTTT AAAAGACGCT ACTTGAAAAG CCAC 120 180 240 300 360 420 474
INFORMATION FOR SEQ ID NO:243: SEQUENCE CHARACTERISTICS: LENGTH: 681 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...681 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:243:
TCCAGCTCAT TAGAGCGATT GCCAGCAAGC CCCTCTAGCA CCACTTTAGA GCTGATTGGC TATTTTTACA TGCCCATTCA GCATATCAGC TCCAGCCAAG CACACCATTT AAAGCTTTTA ATAAGAAGCA CGATCATTGT AGGGCATCCA AGCGCGTTTT TAGACGAGTT CCGGTTTGAC GAAAACACGC ATGCCTATTC TTTAGAGAAA AAAGCCTTGA ATAAAATCGC TTTAAAACAC AAGCCCATTA AGGCGTTAGT GGAACATAAA TTGAGATGGG CGCCTGAAGT GGATGGGGAA CCTTTAAAAC CCGGGCATTA CACGATTGTG GCTAAGGTTT TAAGTCCTTT T
CTTAAAGACG
GCGATTGAAG
GACTCCATGC
AACGCCATGA
GAAGAAAATG
AGATTGAATA
GTGCCTAAAA
CAAAACCATT
GAGGGCGAAT
ATTTTAATCA
CCTAGCGCCT
CGCGTATTTT ATACCTCTAC ACTCGCCTAT TTTTCAAAAT TTAAAAAAAT GCGGCGCAAC AGCAGGTTAA AGAAAGCTTT AGAGCGAATT TGAAGAATTG TTTTTGCTTT CAGTGCTGAA AAATCATCAA CGCCCGCATT CTTTTAAGGC TTTGTTGAAT ATTTTTACAA AGCAAGGGAT ATGACAGCGA ACTAACCACC TTAAAGACAA TATTTTACTC 120 180 240 300 360 420 480 540 600 660 681
INFORMATION FOR SEQ ID NO:244: SEQUENCE CHARACTERISTICS: LENGTH: 324 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:244:
ATAACGAATA CAACTTTCCA ACTCCAAGGC
GTTTTATTGG
GCTATCTCTT ATACCCCTAA AAAAAGCGTT
TTTAATOAGC
AATTTTAGCG CTTCTTTATA CGCTGATTTG
AAAGACAAGA
AACGATGATT TAGAACGCAT GATAGCTTTT
AGAGAGCAAG
GATTGGATGG GTGCATATTC TTATGATGCA
ACCCTGAGGT
TATCTAGACC GGTTGGGGTG
GGAA
AATACGATTT
TGAACGGCCT
GCTTGAAGGA
TTTGAGGGAJA
TCCATCATA
CGCCTTGTCT
AATTTGAAAA
GAGTTTAGAG
GTTTTAGTAG CGAGCGTATC 120 180 240 300 324 INFORMATION FOR SEQ ID NO:245: SEQUENCE CHARACTERISTICS: LENGTH: 1092 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1092 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:245:
AAAAAGGTGG
CTAAAAATGA
AGCTTAAAAG
AAAAAGCATG
TTAGAGGTGT
GCCTATGAAG
GGGCTAGCTT
GCGAGGGTCA
GATCAGAGCT
TTAGGGGATT
GGCACTTTAG
GACACTTTAA
TTATGAATGG
AAATAGCGGT
AGCAAGGGCA
ATTTATACAT
TGGATTTTCA
AAGGGCAAAC
TAGATCACGC
AAGAAATTGA
ATTTTTTATA
TGCTAAAAAA
AAACTTATAA
AAAAGCATGT
TTTTTGCGCT
ATTACTCAGT
TGAATTAGTG
TAAAAACGCT
AAAGGATTTT
CCCTAACCCA
TTTAAAATTA
CAAGGTAAGT
CGCTTTAGAG
GGATATTAAG
GGAATCTCAA
TGAAGTGGAA
AGACTACGAG
GGGGGGGTGG
GGGATTTATT
CAAAAAGCGT
AAAAOCGCGG
TGCGCGTTGT
GGGTGTGAAA
TATATTCAAG
CATGAAGTTA
CCTTTAGCCT
GAAATCTGCT
AAAGAGGGCG
CCATAACATT
ATAGCTCTTA
TAAAACTCCA
GCGAGTTTTT
TTTATGATGA
GCAACCCTTT
AGATCGCTAC
AGGCTTTGGA
TCGCTAAATT
TGAATGCOAT
TTGTGGAAAAJ
TGGTGAAA
TAATGAAAGA
TAGCGCTTAT
TGCGAGTGAA
AGGCATTCCT
ATTTATCAAC
AATGAAGTTT
CGGGCATTAT
TAAAACTAAA
GGTGTTCCCT
GCCCTTTTTA
AAGCTACATT
CCTACAAGGC
120 180 240 300 360 420 480 540 600 660 720
GAAGTCATTG
AGTGTTAAAG
TTGATTGTGG
TTA.ACGA-kAG
ACTAAAGCGT
TACGGCGTGG
GGCGTGATTG
GCACGCATAA AGGCTATATG CAATACACGA GCGCGTTAGA GCCGCATTTT GTGGTGGGGA GCAAAAAAGA AGATCTCGCC ACGCATTCGC ATTTTAAAGA TGGCGAATAC TTCATCAAGG TTGTGAGCTT AAAAGATGGA ATGATTGAAG CTAAAGGGCA AGCCTTGGTC GTTTATAAAG
TT
TTGGCAAACG CAAAGGCTTT TTGACGCTAA AAAGAACGAG TTAAGGCTAA AAACAAATCT CCCGTTACAG GAGCGTGCCT TGGAATTTAA AGAGCCTTTT ATGATATTTT GCTTGGCGGG 780 840 900 960 1020 1080 1092
INFORMATION FOR SEQ ID NO:246: SEQUENCE CHARACTERISTICS: LENGTH: 399 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...399 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:246:
TTGTTTAAAA
AGTCAATCGC
GGTGTCAAAT
AATTATTTTG
ACCCCAGAAA
AATGCGGTCA
ATCAAGCGAT
CCAATGGGAT AAGTGTGGGG GAATACACTC GTATCAATAC CGTGCGTTTG GAAACTGGCA TTAAAAGCGG TGAAAAACTA GTTATCAATG ACGCTAGGAA TGTTAAAAAT GTTGAAATCA ACCCTTGGGG CACATCAAAG CTCATGTTTA TGGACTATAG TCAATTTTCA AATTTAACCA ATCATTTGAC GAAGGGTTTT TTTACCCCC
ATTTTAGCGA
CTAGGTCAAT
ATTTTTACTA
CCAGAAAATT
ATAATCTAAC
TTCAGGGGAT
AGATATAGGC
CTTTTCTGGG
TAGCCCTTGG
CGCTTCTTCA
CTTGGGTCAA
TTTATCTACA
120 180 240 300 360 399
INFORMATION FOR SEQ ID NO:247: (i) SEQUENCE CHARACTERISTICS: LENGTH: 246 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...246 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:247:
ATGAAAAAAA CCCGTATCCG GAAGAAGTCA GCTTTAATGA ATGGCTGTTG GGCGGAATTT
GATAGCGTTT TTAGCGCCAT TGTGCCTTTA GAAGATTTAA ATAAAACCGC ATGCGCTCAT 120
CATGCCCTAA AGGCTTTACA AGCTACGCTA AAGACAACGA TTTGGGCTTT GATGCGACAG 180
AGTTGGAACA GATCGCAAAA GGATTCATCC CTAGGGGGTA TTTGTGGCAT TTTGACGCGA 240
ATGTTT 246
INFORMATION FOR SEQ ID NO:248: SEQUENCE CHARACTERISTICS: LENGTH: 315 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...315 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:248:
TGGACTCTAG CCAAAATTCG TTGCGTTAAA AACTTTAAGG AAGCTATAGA AGGTTTTACA
GAAAAAATCA AAGAATCCCC TAACGATTCA AACGCTATCA ATGAAGCCTT TGATAATTTA 120
GAAACAGAGT TAGAGCGCGC TACAGAAAAT TTGAGTCAAA AAATCGATCC CGTTTTAGAA 180
CGGAATGAAA ATTATACGCA AAAAGCGTTG GAGTATAGGG AGTTTTTAGA AAGCCGTAAA 240
GAGAGCTTTA TTGTAGATGA AAAAAACCCG TATCCGGAAG AAGTCAGCTT TAATGAATGG 300
CTGTTGGGCG GAATT 315
INFORMATION FOR SEQ ID NO:249: SEQUENCE CHARACTERISTICS: LENGTH: 852 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...852 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:249:
AGACCTTTGG CACTCATCAT GTGGTGGCGA AATATAGACA ATCGTATCTC GTTGTAAACA AGCCCCCAAG TTAGATGAAG AGGGTTTGAA CTCAACCCTA AAAACGAATA GAGGCTTTTG AAATTTCAGT TTAAAAAGCT
ATGAAAA.AGA TTAAGTTTAG TGAAAGAGCA AATGAAAGCG CACTGGATGT GTGCATATGA GTAAGGGCTT TTTTTCCCGC ATAGGGGGTT TACGTGGCGA TCACTAGGGC TATTTTGGGA GGAAAATTTC CAACA-AGACA AGCCTCCTAA TTGATCAAAC ATAAGATTTT GCGGTTTGTG CT
AGGCCTTGGC
TTTAGAAGAA
CGCTTTAAAG
AGAGAAGTTT
AGACAATTAC
TTTTAAAACT
CCATAATACA
AGAGTTTAAG
CAATCAAGAA
TAAAGAAGAA
TTGCTCGCCC
ACAAAATCAC
TGGCACCGGG
AAGATCACTC
GCGCTAAAAA
AAATTCACCG
TGCGAGCGGT
GAAGAAAGAG
AACCCCACGC
GAAAACGCGC
CATGTGTTTG
AGCGATTTGG
TTACAACTTT
TCTGTGTTTT
CAAAAAGATA
GAGGGTTTTA
AAGAATGGAT
TTGGGGCGTT
CTATGATAGG
TTTTAGAAGA
AGGGCTTTGT
ATTCTTTACT
AAAAAGTGAG
TGATCGGGTT
AAGAAGAAAG
CTTATGTCAA
TAGAAGAAGC
CGCCCATTAA
GGCGTAGAAA
TATTAAGCGC
TTTTTCTCTT
TAAAGACAAA
GCGTTTGAGG
GACTAACCTT
TAAAGAGCTT
GGATTTTTTG
TTGCATGAGC
AGAAGAGGGG
GCGCTTGGCT
AGAGCGTTCG
CCAGTTGCTC
AGTGGGGGAT
AAGGCTTGCA
120 180 240 300 360 420 480 540 600 660 720 780 840 852
INFORMATION FOR SEQ ID NO:250: SEQUENCE CHARACTERISTICS: LENGTH: 726 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...726 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:250:
GATGGGTCAA
GCTTTAGTGA
AACAATAACG
AATTTAGAAA
AGCTTACTAG
AAAAAAGAGC
TTAATTGACA
CTAGATCAAA
AAATTCCTCC
CAGCCCATGG
AAAGCGCGCA
AAAAGATGAA
TCGCTATAGG
CCCGCTCCAG
AAATAGGTTC
ACATTATTTA
TGGGCGAAAT
GAGAGGATAA
ACCGCCCTTT
CCTTTAAGCA
TGTTAATCAA
CCCGTTTGGT
GTCAAATAAA
CTTAGCCGTT
CCGTAGGGCT
CTTTGACACG
TTTAGAAGCC
CCCTTTTTAT
AAAGGGGATA
AGTGATTTTT
AGGGGCTAAA
TTCCATTAAA
CATGCTAGAA
AAGTCCAATC
ATCATTGTTT
TGTTCGTGTT
GGCGCTAAAC
TACCACCCTA
GGGCATGCTC
GTGAGCCTTT
CCTGAAGGCA
ATCATCGCTG
ATCTTTAATT
AGCTATACGC
ATTTAAGAGC
TCAATTATTT
TTTTTCCCCC
TCATTGTCTT
GTAATATTTG
TAACGGATAC
TGAAAGCATG
CTAGAGGCAA
AAAAATTCCA
CCAAGCCCCT
CTGATTTTAG
GATTTATAGG
TAACCGCAAA
CACCGGGGTT
AAACCACCAA
CTGGATCGCT
CGGAATGATC
TAAAGAAAAG
AGGAGGAGAA
GCTCAAAATC
AGAAGCTTAT
CTCGCCCACC
120 180 240 300 360 420 480 540 600 660 TGGTATGAAG AATTACAAGA ACGCATGCAA AAAGAATATT TAAAACACTA
CCACGAACTA
AACGCA
INFORMATION FOR SEQ ID NO:251: SEQUENCE CHARACTERISTICS: LENGTH: 831 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...831 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:251:
GTCATGAAA
ATTGAGAGTT
GAGCGGTTGG
AGTTACAAAG
TTTGGTTATA
GTAGCGGATA
GTGAGCCAAA
ATGCAATATT
TATGAAACAG
GGGAATTTAG
TTTGACGCTA
AGCTCTTTTC
GCTGAAACGC
GACGAGCTAG
CTTCTAACAC
TAGAAAATCT
ATTTTTATTT
AACCTGGTTT
AAATTTTAAC
TTTTACAAAT
CTAACGCCAT
CCGTTCTAAA
CGTTAAAAJAA
TGGTGGATAT
CCCATAGCGT
CCCCTATTTT
ATATTGATCC
AACACTTAGT
AAAAACCCCT
AAGAAGTATC
TAAAGCGAAT
AGAAAAAGGC
CGATGTGCAT
CCCGGCGTTT
TGTCAATATC
AGCCCTTAAA
TGGCGTGTGG
GCGCTCTTTA
GCAAATGCCA
ACCCAGAGCG
TAAAAACGCC
AACCAACATG
AAACCCGTTT
GCTATTAAAT
TTTGATAATG
TTAGAAATGT
GAGAGCTATC
TTGTGCCGCC
AAAAAAGGGC
ACAAGGGATA
CTGTGTGAAA
AAAATCATGC
GGGGGAGCGA
GCGGCGGCGG
CTAAGCGATG
TTAAAAATCC
TAATCGCCGG
TACAACCCCT
CTAACCGCAC
TACAAACTAT
AAGCAAGCGT
AAACGGATCT
AATTCATGAA
GTAGCATTCA
GGGGGAGCAG
GAGAATTTGC
ACGGGAAAAG
TGGGGATTGA
GAGCAAACAT
AAAATTTATT
GCCATGCGTC
AGCCAACAAC
GAGTTTGGAA
CAAAGATGAJA
GGCAGCCAAG
GATTGTAGAA
CCCAAAAGAC
AAGCCCCACT
CTTTGGGTAT
CCCTGTGATT
TTCAGGAGAC
TGGGCTGTTC
GCTAAAACCT
T
120 180 240 300 360 420 480 540 600 660 720 780 831
INFORMATION FOR SEQ ID NO:252: SEQUENCE CHARACTERISTICS: LENGTH: 939 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...939 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:252:
CAAAGAGATC
AGATACATCA
TTGTGTTTAG
AAGAGCTACA
TTGAAAGAAA
GAAAAGAATT
AATGGGTGTC
AACAAAGCCT
AGCTTAGGGG
GAATATTTCA
TTGTATGATG
AAAGCTTGCG
GGCGAAGGTG
TTGGAAAATG
ACAAGGAATG
GGGGCATGCG
ATGGCGTTTT
TGTTAGAAAA
GGGGGCTAAT
AAGAGCAGGA
ATAGCGGGTG
TGAAAAAAGC
ATTTGCTAGG
GGCAATACTA
GGATCTATCA
CTAAAGCTTG
CAGGCAGAGG
ACTTAAAAGA
CGGCGALAGAA
GCGGAGGGTG
AAAAGCAAGC
ATATTCTCAA
GTATTTGCTT
TGTCAAAAAA
AGCAGAGCAA
TTTCACTCAG
TTTTAATTTA
CGCTTCATTT
GAATTTATAT
CTCTAAAGCG
TGATGGCAAA
CGATTTGAAC
CACGCCTAAA
CAGCCCAGGG
TTTTAAAGAG
TTTCAATTTA
AATAACACTA
TCCCTTTTTA
GACCCTAAAG
GCTAAGAAAT
GGGGTGCTTT
TACTCTAAAG
TATAGCGGGC
TGCGATTTGA
GTGTTTACTA
GATGGCGATG
GATTTGAAAA
TGCTTTAACG
GCCCTCGCTC
GGGGCTATGC
TAATAAAATT
GGGTTTTGTG
AGCTTGTCAA
ATTTTGAAAA
ATTATCAAGG
CTTGCGATTT
AAGGCGTGTC
AATACGCTGA
GGGATTTTAA
GTTGCACGAT
AAGCGCTCGC
CAGGGAACAT
GTTATTCTAA
AATACAATGG
GCTGTAAATT
TTTAATAAGG
CTTGGGCGCG
TTTGGGAACA
AGCTTGCGAT
GCATGGGGTG
AAATTACAGC
CCAAAACACC
AGGGTGTGCG
AAAAGCGGTG
ATTAGGGAGC
TTCGTATGAT
GTATCATCAT
GGCATGCGAG
CGAAGGCGCA
AGGCGCTAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 939 CATAGAAAAC TTTAAAAAAG GCAGCTCAAA ATCAAAGTT
INFORMATION FOR SEQ ID NO:253: SEQUENCE CHARACTERISTICS: LENGTH: 1590 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1590 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:253:
ATGAGCAGCG
CTAATCATGT
GGCGCTTCAG
GAAATGCGCA
CAACTCCAAA
AAAGAATTTG
GAACGCCAAA
GAAGCTCAAG
ATTAAAAGCA
ATCAGGCGTT
GGTTAATTTA
ATTATGTGAT
CCAAAGCTAA
TGAAAAGCCA
CCCATTTTGA
TAAGAGATGA
TTTTAGAACA
CCAAAGCGCT
TGATCTTAGA
ATGAAAAAGA
CATTTCATTA
GAAAAAGATC
ATTAATGGAA
AGAATGCAAG
TAAAAAAGAA
AAAACGCTAT
AGAGAGGGAA
AGATGCGATG
GCAACTAGAA
AGCCAAAGAA
GAAGTCTTGG
TATTACGCTA
TTTCAAGCGA
TTGCAACAGC
GCGCATTTGA
TTGGAAAAGG
AATTTTAAAA
CTCAATTACA
GAGGAACTAG
GAGGGTAAGA
TAGCGTGTTT
GAGGGCAAGC
AATCTTTCGT
AATATGAAAA
AGCATTTAGA
AAAAAAAAGA
AACAGCGCGC
TGGCTTATAC
AAGCGCAAAA
AAAAATCGTA
GATCACCGCT
CATTTTAAAA
GGAAGCTGAA
TAAGAATTTG
AGCGCAGCAC
GCTTGAAAAA
CATTTGTAAA
CAAAGATGAA
AAGCGCCTTG
TGCCATTTTA
GCGGAAGCGA
GCTTTGCCTT
GCGTTTAA
TTGTCCAGTT
GAAGACGGCC
ATGGAAAAAG
ATGGAAGATG
AACGCTTTAC
GCGGGGGATA
ACTCAAGAGC
GAAGACCCGG
GTGGAGTGCG
AGAAAGAGCG
TTTGATGGGG
AAATCCAACC
ATAGAAGAGA
TTCAAAACGA
CAGCCCGTTT
GCTCAGATTA
AGATTAGTGG
TCAATATTTA
GTATCCAGCC
AATTGCTTTC
AGCTTAAAAT
AGCATTCTAA
AAAAACTCGC
TTGGGAGAGA
TTGTGATCAA
CGAGCGTGTG
ATGAAGAATA
TGGAAAAGGC
AAGTCAGGGA
GCACTCAGTA
CCGCTACGCT
TGCGGGCGAT
TGTGGGTCGT
GGTGGATATA
TCGGCGTGAA
TAACAGGATT
TGAAGGAGAG
TTTAATAGGC
AGAAGTCGCT
CAGAAGAGCC
CCATGTGAAT
TGCGATTTAT
CGCGGCTGAT
CGCTAAACGC
GTATGCGATG
CAATCAAGTG
TGTGGGCGAA
CAAGCAATGA
TATGCGACAG
GTGATAGGCA
GAATTTAGCG
GTGGCGAGCG
GAAGAGGTTT
AGCGTGGTGC
AAAATGCGTT
CTTTTAGCCG
GGTATTTTGC
TTAGGGGTTG
GCCCACCATG
GCGCTTTCTG
ATGCAAGCTT
GAGAGCGGGC
CCTATTATTG
GTGGGCGTGC
AGAATTTAAC
AAGACGGGAA
AAGACAGTAG
AGACGATTAA
ATCATAGAGT
TAGAATTAGA
ATCGCTTTAG
GCTTGATTGC
ATGATATTGG
AGGTGTGCAA
GGCATGAAGA
CAGGGCGTCC
TAGAAGAGAT
GAGAACTAAG
CTAGAAAGAT
AAGTGGTGCG
AAGCCGTATC
AAACATTGAA
CGAATTGTGC
AATTTTAATA
CGCGCGCAAC
GCTTGGA7ACT
TTTTGGGCAA
AGAACAGCTT
TAAAGCGCTC
GCGCCATAAA
AATCATGAGC
GGGCGCTAGG
CGCGTTAGAA
AGTGATTGTC
CGCTAAAAGG
AGAAAATCGT
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1590
INFORMATION FOR SEQ ID NO:254: (i) SEQUENCE CHARACTERISTICS: LENGTH: 750 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...750 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:254:
ATGAAAAAPAG
ACGGCTCTAG
GAGTTGCATT
GTCTTAATCG
TTTGAGAAGT
TCGCAATTTA
TTACGCATGG
AGTGAAGAAA
AAAAGTGAAG
GAAAGGGTGG
AGGATCAAGG
AAAGTGATGG
GTTTCTAGTG
GTAGTTTAGC
CTGATGGAAT
TCCACTATCC
ATCCTAAGAT
CTTTGTTCCT
AAGATGTTAG
ATGGGAATGT
AAGTGATAGA
ACATTATCCA
AATTGCGGCG
AAACCGACCA
CGCATATCAC
AAATGAAAAA
AATCGTTTTA
GCCTATGAAG
TATTAAAGGC
AGAGGCTAAT
CCAACTGAGT
CGAAATCCCT
GGCTATCTTG
CATGTCTTCA
TAGTTTTGGT
TACCAATTCT
TGACAGAGCC
CAAAGAATTA
GCGAAAGTAG
GGATCGTTAC
CAACAGCACA
AAACAAGAGC
AAAGTTATCC
AATTTTTTAG
CAAGACATCA
GAAGATATTG
GGGTATTTGA
ATTGATGTTT
GGAGGTTTTG
ATTAAAAAGA
AGCAAAAAAC
TAGCAAGTGG
ATAACATGGG
CTAAAAATAA
CTGAAAATTA
AGAGAAAGGG
AAGAAAAAGC
TAGAAGAGAG
ATCTGAATTT
CAAAGATTAA
TCCCCAAAAC
TCATGAATCA
ACATGGAACG
GACGTTTTAT
TGAGTCTGTG
CCATTTAGTT
TCAAAAAGAA
CTATAGCGTT
GTTGCTCGTT
CGATGCGCTC
TGTTGAGCCA
GGCTGTGATT
TTTTGTGCAT
AGCCTATCAC
TTATGAAAAA
120 180 240 300 360 420 480 540 600 660 720 750
INFORMATION FOR SEQ ID NO:255: SEQUENCE CHARACTERISTICS: LENGTH: 357 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:255:
ATGCAAAACA CCTTTAACTA CACCAACAAC GCTTTGAAAA ACAATGCTAA ATTAACCCCC
ACTGAAATGC AAGCCGAACA ATACTACCTC CAATCCACCC TTCAAAACAT TGAAAAAATA 120
GTCATGCTTA GCGGTGGCGT TGCGTCTAAC CCTAAACTAG TCCAAGCGTT AGAAAAAATG 180
CAAGAACCCA TTACTAACCC TTTAGAATTG GTAGAAAACT TAAAAAATTT AGAATTACAA 240
TTCAGCCAAT CTCAAAACAG CATGCTTTCT TCTTTGTCTT CTCAGATCGC TCAAATTTCA 300
AATTCTTTGA ACGCGCTTGA TCCCAGCTCT TATTCTAAAA ACGTTTCAAG CATGTAT 357
INFORMATION FOR SEQ ID NO:256: SEQUENCE CHARACTERISTICS: LENGTH: 945 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...945 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:256:
ATGGACTTAG ACAAACTCAA AGATTATAGG GCTTTAAGAA ACGCTATTTT AAGGCTATTG
CCTTATTTAG ATAGCGGTAT TACAGAGCTA ATTATGAATA AAGAAAAAGA AATTTGGCTC 120
TATAAGCTTA ATGGAGTAAG AGAAAAAGTC TTTGATGAAA ATTTAGACAA AGCCTTTATT 180
CTGGGGTTTG GGGAACAATT AGCGAGTTTT AGAGATTTGT TTTTTAATGC TAATTACCCC 240
ACCCTTAATA CTTCCATTCC TACTTCTAGA TACAGAGTGA GCATGAACCA CTTTGCTATT 300
AGTGCGGATA ATGAATTGAG TTTGAATATT AGAGTGCCAA GTGATAAAAA GTTTGATTTA 360
AAAGCTTTCA
ATTGATGGCA
AACGCTCTAA
GAATTAGATT
TCTAAATTTA
ATGGTAGGCG
CATAAAGGCA
GCTTTAAATT
AAAAGCAGTG
ATTCAAGAAA
AGCTTTCTAG TATCTGTCAA TACGATTATG AAAACTTACT CATTAGTGGA GGCACAGGCA TTGAGTTTAT CCCCAAACAC ACACGAATAG TA.AGAGCGTT TGAAAATCAT AAATCCTTGT CATATGAAAA CGCCTTGAAT ATGGCAATGA AGATTGACAC ACGAAATTCC ATGCTCTTTT TGGTTTCTAC TTTACATGCA GATAGTGTGC TACAAATGAA TAAA.AGTGGT TTAGATGTCA TGGATGTTGT CGTTCAAATT GTATTAGATA TCTTACCAGC AAkAGATTTA AGAGATAGTC
AGTATTTAAA
GTGGAAAAAC
TAAGCGTTGA
TAGTGGATAA
GAATGTCTCC
TAAGATTTGG
ATGGGGTTAT
ATGTAGCAAA
AAGCCACTAA
TATGA
AAACTTAATG
GAGTTTTTTA
AGATAGCGAA
AACTGAAAGC
TGATAGGCTT
AAACACTGGG
AGAGGCGATT
AAAATTCTTT
CACCAGATAC
420 480 540 600 660 720 780 840 900 945
INFORMATION FOR SEQ ID NO:257: SEQUENCE CHARACTERISTICS: (A) LENGTH: 435 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...435 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:257:
ATGAATATAT
TTGTGGGCGA
GCAGAGATAA
GGCCATCAAA
GAAGCGATTC
CTCAATCAAG
GAGCAATTGC
GGTTTAGGGA
CGGTTAACCC CTATTTAATG GCGGTCGTTT TTGTAGTGTT TGAATGTTTG GGTGTATAGG CCTTTGTTGG CTTTTATGGA AGGACAGCTT GGCTAAAATC AAAACGGATA ACACCCAAAG TTGAGACTCT TCTTAAAGAA GCCGCTGAAA AGCGCAGGGA AAAAAGCTAC AGAGTCTTAT GACGCTGTGA TCAAGCAAAA AGTTTGAAGC GTTCGCAAAG CAATTACAAA ATGAAAAACA AAGCGCAAAT GACGGTATTT GAAGACGAGT TAAACAAGCG
GTTGA
TGTGTTGTTG
TAACAGACAG
CGTGGAGATT
AATGCTAGCA
GGAGAACGAA
AATCTTAAAA
TGTGGCTATG
INFORMATION FOR SEQ ID NO:258: (i) SEQUENCE CHARACTERISTICS: (A) LENGTH: 276 base pairs (B) TYPE: nucleic acid (C) STRANDEDNESS: double (D) TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:258:
ATGGCTAGTG GCCTTTTTGA AAACGATGAA ATCAAAAACA TATAGCCATA GCTCCCTTAT TGTCTTTTTC CTCTTACTGC GGGAAGTTGC TTTTTGGGGG TTCTTCTTTA GAGGTCTATT GAACGATTGC AACAAGAAAT CACCGAATTG CAAAGCAAGA TTGTTTGAGT TGAGGGAATT ACGGCCTAGA GATTAG ACAAAGCGCG AGATTTTTTT TTGGGTTTGG GTATTATTTA TGGATTTAAG AGACAAGCAT ATGTGCGCTT GCAAAAGCGT 120 180 240 276
INFORMATION FOR SEQ ID NO:259: SEQUENCE CHARACTERISTICS: LENGTH: 900 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...900 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:259:
ATGAAAAAAA
GCTAAGCCGG
TCAGCAGGCG
ATTAAGCAAC
TTGATTGAGC
AATCAGACTC
TTTTGGGCTA
CAGGATTTTT
CATATTTTAG
CCAAAGGCTA
AACAGCAAGA
CCGGATTTTT
AAAACAGAGT
ACTTATGAGC
CGCATGAATC
ATATCTTAAA
CTCATAATGC
TGTTAGCGAC
GAAATCCTAA
AAGCTATCCG
CAGAATTTAA
AAAAACAGGC
ACAACGCCAA
TGAAAACCGA
AAAAAGAAGC
ACGCGCAAAA
CTAAAGCCGC
TTGGTTATCA
AAGCTAAACC
AACGCATTGA
TTTAGCGTTA
GAATAACTCT
AGTGGATGGC
TTTTGATTTT
CACCGCGCTT
AGCGATGATG
TGAAGAAGTG
TAAAGATCAG
AGATGAAGCC
CAAATTCATT
TGGCGGTGAT
TTTCGCTTTA
TATTATCTAT
TACCATTAAG
GGAATTAAGG
GTGGGCGCGT
ACGCATAACA
AGACCCATCA
GACAAGCTTA
GTAGAAAATG
GAAGCGGTTA
AAAAAGATCC
CTTTTTGTCA
AAAAGGATTA
GAGTTAGCCA
TTGGGGAAAT
ACTCCTGGGG
TTGATTTCTA
GGGATGTTAC
AAGCACGCTA
TGAGCGCGTC
CGAAAGAAAC
CCAAAAGCGA
AAGAGAAAGA
AGGCTAAGGC
AAAAACAGGC
AAATCCCAGA
AGCAAGAAGC
TTTCTGAGAT
ATCGGGATAC
TCCAAAAGAA
ATTACACTAA
AAGATAGCCC
AAGAAAAGCT
AAATTGTTAT
GTTTTTGATG
GACTGATGCT
TTTTGATATG
AAAAGAAGCC
AGAAAAGCTT
TTTAGTGGAA
AAAAGAAATG
CCATGCTAGG
TGACAAACAG
GATTGATCCT
CCAAATGGCT
AACCCCTGTT
TGTAACTTAT
TTTCCAAGAA
CAACAAGTAG
120 180 240 300 360 420 480 540 600 660 720 780 840 900
INFORMATION FOR SEQ ID NO:260: SEQUENCE CHARACTERISTICS: LENGTH: 918 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...918 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:260:
ATGATAAAGA GTTGGACTAA AAAGTGGTTT
GGTCATTTGG
AGAGGGGATT
GTTGGTTGCA
TCAAAAGCCG
AGTTTAGGCT
TATTATTATA
ATGTATTTCA
TACGCTTGCA
GCCAAAGGCG
TTAAAAGACG
AAACAAAATG
AGCGGTTGTC
GAGAAAGCCA
GTGTTAGAAG
CAAGATAGCG
TGGCTACAAC
ACCATAGAGC
CGAGTTTAGG
TTTTTTATTA
CTATGTATGA
GGAGAGGGTG
ATGGTACGGG
GTTTGAATTA
TGGAGAAGGA
GGGCGAGCTG
AAGAGCAGGC
ATAATGTGGC
CTTCATATTA
TGATTGGCAA
TGCAATAA
CGGTGAGAAA
GGTGGCTTTT
CTCTATGTAT
CAGAAGAGGG
AGATGGCGAT
CCACTTAAAA
CGTTAAACAA
TGGCATTAGT
TTTGAAAAAA
TGTGAGCTTG
TTTGAATCTT
GGTGATGTAT
TAAGAAAGGG
GGAGTCTGAT
TTGATTTTAT
TATTTTAAGA
TATAAGAGGA
GAATATGGCG
TGTAATTTGA
GGCGTGCAAA
GGTGGGGTGA
AATTACGCCA
TGTAACTTTG
GCCCTTGCGA
GGATACTTGT
TATAAAAAGG
TACACGGGTA
TGCGCTTTAG
AATTTGCAAG
TTTTAATGGC
TGGCTAATCA
GCTGTAATTT
ATGGCGTGGA
GGAATCATCT
AAGACTTTCC
GCTGCGGTAG
AAGCCCTTTC
TAGGOTATAT
ATTTTAAAAG
ATGAAGCCGG
GTTGCTCTTT
AGGGAGCTCC
GCTTTAGTGG
ATGACGCGCA
AAGCTGTTTT
AGCCCTTAAG
GAGGATGGGG
TCAGAATATT
CGCTTGCGCG
AAAGGCTATC
TTTGGGTTTT
TTTTTCTAAA
GTATAAGAGC
AGGGTGCCAT
TATGGATGTC
AAAAGAAGGG
AAAAGATTTG
TAGTTGTAAG
AAACGACACG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 918
INFORMATION FOR SEQ ID NO:261: SEQUENCE CHARACTERISTICS: LENGTH: 1509 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1509 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:261:
TTGAAAATCT
TATAAGACTC
AGCTTGTATG
GACGGCTTTG
GACATGCAA\
TTAAACGCGG
TCAGATTTTT
AACCCCTACT
CGCATTAAAA
GGGGGGCGCA
TTAGACGCTT
TGGCGATTCC
AACGTTAAAG
GAATTTGAAA
TATTATGGGA
TCGCCCATCA
GCTTCATCGT
AAGGGGATTG
TATGGGGCGT
ATACGAAACG
CACGGCAAGA
CCGCGCTCTG
AAAAGGGTGC
CGGCATAGAG
CCTGATACCT
GAACTTTAA
TTTTACTTAT
CCATTTCAAA
CTAAAA _ATTT
ATGCCTTGTT
CTTATATCTA
CCAATCGTGG
CAGACATCAT
ATATCCGCAA
AACGCATGCA
ACATTUGGGA
TGTTTTTTGG
ACCGCTCTAT
AAATCGCTAA
AAAAAGTCAA
ATGCCATTTT
AAATCGCTTT
ATTTTATTCC
AATTGAATAT
GGGAAAGGTA
ATTTTTTCAA
CAATTGTTTT
CATATATTAA
GTTTGTCGCT
TGATTTGGGA
CTTTCTTTTT
TCTAAGCGTC
CCCCCCTATC
AAAAGAAAAC
GCACAGAGTG
TAAAAACGAC
GGTAAAAGTG
GCTTTTAAAT
TAAAGGCTTG
CAACAAGCTT
CAATTATTTT
GGGGGTTGCT
CCCTGTTTCA
GCTCCATGAA
TGATTTTATA
TCTAGCCGAT
TGAAAAAGCC
AGGTAAAAAG
TCTTACCAAT
TCGCAATAAA
CCGCCAGATT
TCACGATGCC
CACTGAAAGT
TAAAGATCAT
AGCTACAGAA
GCGTTTGATT
TTTTTTTTTA
TCTTATGACC
CCTAAACATA
GGTCTTATTA
CTTTCTTCTC
CGCATCCTTT
TTCCACAAAA
COTTATTTTG
TTCATCGTGG
GATAACGATT
TCAAAGGCCA
TTATTAAGAA
AAAATCCCTA
GAACGCTTCC
TTGCCCGCCA
CTTAAAAACG
ATAATGAAAA
TCCCTTTCAT
TTAGTGCGAA
AALAGGGCGCT
CTAACGCTTC
GCGGTCTTGT
GCCCAACAAT
GAAGGCATCT
AAAGAATGGT
ATGGGTGTTT
CCTACACTAC
GTGCGGCCAT
GAATGAGCCA
AAGTGATCGC
TAGATGACAA
ACATTGAAGT
AAATGCTTGC
ATAATTTCGC
TAGACACGAA
AAGAAAGCTT
CCCATAAAAG
TCAGCGCTGA
AAAAATACCA
AAATTGACAC
CTAAAGACTC
TCTTTAAAAA
CTACGGACGC
TGGGCGCGAA
TTAGCACCAA
TAGGGAGTTT
TTGACAACCC
CATGGCATTT
TAATCCATGA
CTAAAGTCCT
TGGGCTAGCC
CACCATTGGG
TCTTTTAGAA
AAAAAGCATT
TAAAGAACTT
CGGGTTAGAT
GAAAATCTTT
GGATTATGAA
TGTCATTATA
TTTTTTAGAT
TGAAAATTAT
ACTCAAAAAC
AGACGCAAAC
ATACCCCATT
GCCCTTGTAT
CGTTTTTATC
TCAAATTTCT
TATCGTGGTC
TGTCTATGAA
ACATTCCTTA
TAATATTGAT
GTCTTTTGCC
AGTCTTGTAT
AAAAAACTCG
TCCTGAAAGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1509
INFORMATION FOR SEQ ID NO:262: SEQUENCE CHARACTERISTICS: LENGTH: 510 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...510 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:262:
TTGAAAAATC
TTTAATGAAG
AAAGGCGAAA
GATGTGGATA
TCTCAACACT TCTGGTGTTT TTATTCTTTT GTTTGGGGTG TGTGAGCAAT ACACTTACAC GCTAGACTTA GTTTTAGAAA AAAAGATCCA AGCCAGTAGG TCACCCAAGA TAATGTGCCT ATCATCACGG CTATCGCTAC GCATTTAAAC GCGGCACTTA CTACGATCAT GAGTATTTTT TAGTGGAGAT TTTCACGCAA 120 180 240
AATAACGACT
GGATCAGAAC
ACCACGAACC
GTGCAAGpjAG
GCTTACCAAG
GGATAGATGA
CTTTATGGGT
GGTGGAGCCG
CCAAACTAGA
TCCCCTTACC
TGGCTATATT TCTTATGAAC TTTTTGGCAC
AAAACCTACA
GCGAGAGATT ACAAGAGATG AATTTGATGG
CATTTTAGA
AGCCTTTTTG ATCGCTTTTG ACAA.ATTGGA
TTATTTAGCG
GCTTGATGCC TATAGTTTAG GCAAAATTGT
TTTTAATTTC
TCAATTTTpA 300 360 420 480 510 INFORMATION FOR SEQ ID NO:263: SEQUENCE CHARACTERISTICS: LENGTH: 1536 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1536 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:263:
ATGAAAC
CTTATCCTTT
AATCCGAACG
TTCAAAGCTC
ACGCAAAGCG
TTGATACCCT
AAAGCCCTTG
CTTTTAGATG
GATTTGCTTT
GATTTTAACT
ATCAATAACG
AACCTCTCGC
AGCCCTTTAG
AACGCCCCCT
CCCTTAAAAG
GTCAGTGGCC
TTGAAAGCCC
AAATTTTTCC
GTGTTGAAAG
TATTCCATTT
CAAATCAACC
AAAATCCATA
GAAATCTTAA
TCCCTCATTT
CTTAAAAACG
GAAAAGCTTG
TTCTTTATAC
TTACAGAATG
AGCGTTATTT
AAGCCAACGA
TAAATTTGGA
ACCCCTTAAG
TAGTTCAAGG
ATTTCAAGCT
ATTTAATCAA
CTCTAAAGCC
CCCTAATCAA
ACTCAAGCGA
TTAATTTCAC
ACACTTTAGA
GGAGCTTGAC
ATTCAAATTT
GTTTTTCCAA
AATCCATTGC
CCAATCTCAA
CTCGTTTTGA
AGCAACGCCT
ACGGCTTGTT
AATTCATCTT
TAAACGAAAA
ACACCCTTAA
AAAAAGGGCT
CATGCTCGCG
GGGGAATAAG
GAGCGTTAAA
TGATTCCACG
TTACCACATA
AGGGGCTATT
CGTCTCCAAT
TTCTCACTTA
CCGCCCCGCT
CTTAGAAGGG
TCAAATCTTT
TTTTAAAGGA
AGCCCTAAA
AATACCCCAT
TTTAAAAGGC
ACTAGACGGC
TATTTCCACT
AGACGCTAAC
GAACGCAAGA
TATTACTAAA
GCTCTCTGAT
GGATTTAAAC
TAAAATGAAA
AGCCATTCAG
AAAAGGTTTA
TAAGGGGCTT
CTTCTTTTAA
ATCATCGCTT
ACCTTTAAAT
CTCATTCTTA
GACATTAAAA
GTTACTTCTG
GTCGCTCAAT
AGTTTGAACG
TATGCGAACG
CATTTGATCC
CATTTAAACC
AACAAAGCCA
AGCGAATATT
CTAGCCAAAC
GATATAGAGC
GCGCTAGACT
TCAAAAGCCT
TTGGATTATG
TTCCTCAAAA
GAAATTTATC
TTGAGTTTAA
ACCAAGCAAA
CTTCAAGGCA
CAAAACCTGC
GATCATTTGC
TTTTAA
TCGGCCTTTT
CGTATATAGA
TGAGATTCAA
AGGGGGATTT
ATTTACGCTC
GGAATATCAA
CCCACACTGC
CAAAAGACGC
CAAAAGTGTC
TAACAGCTAA
TTAAAGACAC
TCAGCGATAC
CTTTCCCTGC
TCCAAAACAT
AAAGCCCTAA
TCACGCTTTT
TAGATTTATT
ACCTTATCTC
ATGCATTCAG
ACGATGCCAA~
AAAGCCCCAA
TGGACATACT
GCATGCACCA
AACAAGGCTT
TTAAAGATGA
AACGGCCTAT
GAAAAAAATC
CTCTTTGGAT
CTCGCTTTTA
TTTTAAGGAT
AGGGCATAGA
CTACAACGCC
CAATTTAGAA
CTTACAAGCG
TAACGCTTTA
GCTCGTTTTC
CACCCTAACT
TTTAAAACTC
TACCAACCAC
ACTCTTAAAA
AAATAAAGAT
CCATTACCCT
TAAGCAAGGC
CGATTTCCTC
TCTAGTAAGC
AACCCAATTG
CATGGATGCG
GCCAAAATTT
GAAAGAAATC
TAAACTCAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1536
INFORMATION FOR SEQ ID NO:264: SEQUENCE CHARACTERISTICS: LENGTH: 483 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...483 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:264:
ATGTTTGGCA TGGGCTTTTT TGAAATCCTT GTGGTGTTGA GGACCAGAAA AATTCCCTCA AGCTGTCGTG GATATAGTGA AAAACGCTCA ATGACGCTAA GGACACTTTA GATAAAGAAA AAAGAAACCT TAGAGTATCA AAAACTCTTT GAAAACAAAG AAGATTGAAG AATTAGAAGA CGCTAAAGTA ACTGCAGAAA GATTTGATGC AAGATTATAA ACGCAGCTTA
GAAACCAACA
GAAGAAGTTT CCAATGAAGA AGCTTTAAAT
AAAGAAGTTT
GAAGTCCAAT TAACAACCGA TAACAACGCC AAAGAACACG
TGA
TTGTAGCGAT
AATTTTTTCG
TCAATATTGA
TGGAGAGTCT
ATGAGATTAA
CGATTCCTAA
CAAGCGATGA
ACAAAGAAAA
TATTTTTTTA
CGCGGTTAAA
AGAAATCAAA
TAAAGGCGTT
AAGCATTCAG
CCATTTAAAT
ATCTCCTAAA
AGAGCATGTT
120 180 240 300 360 420 480 483
INFORMATION FOR SEQ ID NO:265: SEQUENCE CHARACTERISTICS: LENGTH: 243 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...243 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:265:
ATGTTAAGAA TTTTAATCCC CTTACTCATT ATTGTGTGGG TTTTATGGCG TTTGTTTTTG
AGGCAAAAAC CCCACAAAGA TGACCACAGA GACAACCACT CTTACACGCA ACAAACCCCC 120
AAAGAATTAG AAGATCACAT GATTGTATGC TCTAAATGCC AAACTTATGT
CTCTAGTAAA
GACGCCATTT ATAGTGGGGC GGTAGCCTAT TGCAGTGAA CTTGTTTGAA
GGATAAGGGG
TAA
INFORMATION FOR SEQ ID NO:266: SEQUENCE CHARACTERISTICS: LENGTH: 645 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...645 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:266: 180 240 243
TTGGATGGTT
TTAATACTGA
TCTTCTTATC
GGGGCTTCAA~
TATTATTTCT
AACATGTTCA
AACCGCTGGG
AAAGTCAAAG
ACTTATTTTA
AAGGTGGATG
GAAAGGAGTT
TGAAAAAAGA
GTATGAGTTT
AAATAGGCAC
TCCTTCAAGG
TTATAGATTA
CTTATGGGGT
CTTTTTTCTT
ATTTGGTGGT
GGGCTATCGG
TGGAAATTGG
TTTTGTTTTT
AAGACAAGGA
TTTAAATATT
GGTGTTTATG
CTATGAAGTG
TGGTAATGTG
GGGAGGAGAT
TGGTTTGCAA
GAATACTTGG
GAAATTTGGG
CATGAAAATC
TGTTTCGCAC
TTTTACAAGC
TTAAATGCTG
CGCCCCTTAA
AATCCTAAAA
CTTTTCAATA
TTTATOGTCG
CTAGCGGCTA
GATTCATTGA
GTGCAGTTTC
TTTTTAACGC
TCGTGGCATT
AAGTGCATTA
AAAATTTGAG
ACACCAACAA
ACGACTGGGC
ACGATTCCAC
CTTACGCTAA
ACACATGGAT
AAGATTTCAA
GCACGATCGT
CTGAAAGGCG
TTTAA
TTTAAGAATT
TTACATGTCT
ACTTTTACAA
TTATTCTAGG
TTTGCAAGCG
AAACCCTATC
ACTCAATAAT
TTTTCATAAC
TTTGTATCAT
CAGCTTGTTT
120 180 240 300 360 420 480 540 600 645 INFORMATION FOR SEQ ID NO:267: SEQUENCE CHARACTERISTICS: LENGTH: 336 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...336 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:267:
ATGAAACCAA
CTTOGAGGGA
CACAATTTCT
AGCAATCATG
GAGGGCAATA
GTTAGACGAG
CGAACGAACC
TTTTACTCAT
TAGCTTCTAG
AAGTGGGAAA
ATCGTGTGAT
AAAAAAATCA
TAAAAAACCT
TTTTTTCCTA
CACTAAAAAT
TGTGAGTATC
TTATATCGCT
ATTATTCTGG
TTTTTTCAAA
CGCTCTTTCA
GTGAGCTACC
GGTCAAACTT
AAACGAGTGC
TTTTGA
GTCCCATTGT
ATTCTOATGG
ATGAAATCAT
TGATCAAAGC
TGATCTACCT
TCTTOCGGTT
CAGTTTTTCG
ACAGCTCATC
CAGCCATAA
TAGTGCCTTT
120 180 240 300 336 INFORMATION FOR SEQ ID NO:268: SEQUENCE CHARACTERISTICS: LENGTH: 624 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...624 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:268:
ATGTTAAGGG
AAAACGACTC
TCCACTTTAA
AAAGAATTTA
CTCACTTTCA
AAAGCCATTA
ATGCGTTTTT
CCATTTGACA
TTGGGCGATA
AAAGCCCTTA
ATAGAAAATG
TTTTAAGCGT
TGCATTTAAA
CCCCCCCTAA
AAAAAGCGCT
ATATTTCAGG
AAGAAAGGCT
TAAACCTTCA
CCCTTTTAAT
ACCCCTCTCT
ATAAGGCTTA
CGGCATGGCT
TGGTGTTGTT
ATATAAGGAT
AATCTTTTTT
CGCCCAGCAA
CAATGTTTTT
TAAAAAGACG
AGCGAGCTTG
CCCCACCGCT
TTTTCCCCAA
CTATTCTCTT
TTAG
TTTATTTTAC
TACCCCAAAA
AACGCTCATT
ATCGCTTATT
TTTTCTTTTG
ATTGAGCCTA
ATTTTAGAAT
CTCAGCGTGC
GAAGATAAAT
ATGGAGGGTT
TAGGGTGTCA
ACAGCCCTTT
TTGTGQCGCC
TTTTAAAGGA
AAGAGAGTCC
ACACTGACCC
GTGTCCCACA
CTATTGATTA
CCTATCATAA
TAGAAAAGCG
GTTTTTCAAC
AAAAACCGCT
CTTTTACCAA
TAAAAGCGCT
TAAGGATTTA
AAAAGCCGTC
AACCGCTTGC
CGCCAATCGT
CGCTTTAATC
TTTGAACGCT
120 180 240 300 360 420 480 540 600 624 INFORMATION FOR SEQ ID NO:269: SEQUENCE CHARACTERISTICS: LENGTH: 528 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...528 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:269:
ATGAAAAATC
GGTTGTAGCC
ACTAAAGGCG
TATTCAGGGG
TCTACGAATC
TTACAAAAAG
ATAAGCGGCA
AAAATGCTTG
CAAATTGTGG
AAGTTAAAAA
ATGCCCCAAA
CTCCTGATTG
TCTTTTTAGG
AGGCTACAGC
ATTTGGAAAA
CTGATACTGA
CCCGCTATGT
ATAAAGTGCG
AATTTTAGGG
ATCAGGTATC
GGTTGTAGGG
AAGGGCTGAA
CAAAGCTAGG
TGAAAAAACC
AAAAATTTCT
CGGTAAAGAT
CGAAGAGTTA
ATGAGTGTGG
AGTAAAAGCA
GATTTGGAAA
GATTTGATCA
GCTAATTTAG
AGAACGGTAG
CAATTAGTGG
AGGGTTTTTG
GGCATGGTTA
TAGCAGCGAT
ATAAGGCATA
AAGTGGCGAA
CTAATAATGA
CGGCGAATCT
ATGCTTCTGG
ATAAGGAATT
TTTTAGTGGG
AAAAGTAG
GGTGATCGTA
CAAAGAAGCG
GTATGAAAAA
TGTGGATTAT
AAAGTCCACT
TAAAAGGTCC
GATCGCTTCT
CTTGGATAAG
120 180 240 300 360 420 480 528 INFORMATION FOR SEQ ID NO:270: SEQUENCE CHARACTERISTICS: LENGTH: 267 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...267 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:270:
ATGAAAAAAA
CAGGCCAATA
ATCATCAGCG
GGGCAATTAG
AGGCCTTATA
TCGTTGTGAG
AAGCGATCAG
CACAAAACAC
GGGACATGCG
TTTATAATTG
TTTATGCGTG GCGTTAGGTT TTTTAAGCGC GGATCCAGCG TGATGCGGAT TTGATTGAAG AGATAAGGGA CTTGAAAAAA GGAGATTAAC CAATTAAGAA AAGTGCAAGA AGTCTTATCT TAAGGATATA TTAAGCACTA GAGATTACTG TATCAGCTTA
GCGCTAG
120 180 240 267 INFORMATION FOR SEQ ID NO:271: SEQUENCE CHARACTERISTICS: LENGTH: 774 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...774 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:271:
TTGAATAGCG
AACGGGAATG
AATCAAGCGA
TTTTTTGCCA
TCAAGCTTGA
GCCTATAGCG
TTAGTGTTAA
AGCAATAGCC
GCTAACGCTA
GCGGGAGTTT
TTTAAAATCA
GAATTGCAAT
ATTTCCAACG
GCTCTAACGC
TGGAAGCCAT
ACTCTCTTZA
ACCAGCATGA
ATTTCAAAAG
CCACAGCAAG
AACCAAGCGT
AATCACAAGT
ACGTGGAAGC
TACAAGAGTT
ATGCCGCTCG
TGGCTAAAGA
CAAGCCATTT
TTCATTGTAT
TGTGGGCGGT
CTCTGGGGCC
ATTTGACTTT
CACTCTATTA
AGCGAGTTAT
GGGCGTGAGC
GGCTTTAAAA
GCGTTATTAT
CGCTCACTTT
CAGTCCTTTA
AGTGTTTTTG
CGCTTCCAAT
GGCACAAGCG
TTTGGAAGCT
AATAACGCTA
GAAGCTCAAG
CAAGATTTGA
GGTTATGACT
TATAACCATT
AATGGCGCGA
TATGGGGACA
GGATCGAATG
AGCACCTATG
AATTTGGGCG
TTAGGAATGA
CCGGCGTAGA
ATGGTTATAG
ATTTTGGCGT
GGGCGCTAGG
ATCAAAGCTA
TCGCGTTTTT
TAGGTTCAAC
GCAGTCAGCA
CTTCATACTT
ATGTGGCGTC
CAAGAGCGAT
TGGTTTATTT
GGTATAGTTT
CGCTTTCCTT
CTCCTTTAGC
GTATAGCCGT
GAGCGATCA
TAATTACTTA
TAGGAACGCT
CAACTTTAAA
TTTATTCAAC
TTATTTGCAT
TTTAAACACC
GATGGGTGGG
GCACAATTTG
CTAA
120 180 240 300 360 420 480 540 600 660 720 774 INFORMATION FOR SEQ ID NO:272: SEQUENCE CHARACTERISTICS: LENGTH: 852 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL
SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...852 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:272: ATGTTTGAAG ATTTTTATCG CACTACCCTC TCTTTTTTAA GGTCTTTATT
GCTTTTATTG
GGTTTATTGT TGCCGTTTTC GCTTTGTATA GCTGATGAAT ATATTAGCAT
AAGTGATGAT
TGGGATGAA
TTTGAAAATG
AAATTGGCGC
GAGGGAAAGG
AAAGGGTATG
GGCTTAGGGA
ATGTTTTCTA
ACTAGTATGC
AAATTTGGAA
CTTTCTTGGG
GGGGAAATCC
AAGGACAGGG
GAAGCTCTGT
GGGCGCGAAA
GTTTAGACCA
AAGAATACAG
GCGTGAAAGT
GGAGCGGGTT
TGAAAAAGGA
ATAAAAGTGC
TTATTGGATC
ATAAATTTGG
AAGACATTGC
TTTATAGGAT
CTAAAAAATT
AG
TCAGTGGGAT
TTTTAATCAA
CATCGGGCTT
GGATTACAAA
GTTAGGGGGC
TTTGAAACAA
AAATTTTGCT
GCGATTCATT
AATACTTGTT
TGAAATTTCA
TGGGATCGCT
CTTGCAAAAA
GAAACTGCGC
GGCCAATACA
GGCAACGTTT
AAAGCGCAAT
GCTCTTATTT
GCACTCAAGA
AACAAATTTG
GATCTTTCAG
AAGAAAGCCC
AGCAATATTA
TATAAAGA\G
TCCGCAGAAT
GAAATCATALA
AGCAAGCCTT
ATTTAGCCA
TCTATGCACA
TAGGACGCAT
CTTACAGGCA
GATCAJAACCT
GTTTGAGCGC
TTCAAATC-
TTTTACTCALA
GGCTTGGCAC
TTGGCTATGA
GACATATTAT
TAAAGATTTT
AATGTATTTG
AAACGCTATC
GCAAGCAGAA
TGTGGTTCGC
TGCGGAATTT
GAATCCTATA
AGATAATACA
ACAACAAATG
TAGAAAGAAA
AAAAGCCATG
180 240 300 360 420 480 540 600 660 720 780 840 INFORMATION FOR SEQ ID NO:273: SEQUENCE CHARACTERISTICS: LENGTH: 2361 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2361 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:273:
TTGTGCAAAA
ATTCGCTCAT
ATTTGTTGTC
AAGATTAACG
AGCTGCGTGC
CAAAATAACG
GATGAAAGCT
GGGGTGGTTG
AATAACGCTC
AACGCTAGCT
CAAAAAATAG
ATTTATGGGT
GAAAATTCTC
ACCACCCAAA
AAAAGCAGCC
CTCTATACCT
TATTTGGGCC
AACTCAAACA
AAAATGTCTT
GTGGGCATTC
TTCCACTTTC
TTTTGCCGGA
TAATCAAGTT
CCACTAATAA
AACAAAGCGA
CTTCTTCTAG
TGGTGGTTAC
ATTTTTCAAA
AATTCCAAGC
TTGTAACGAA
AGGTGCTTCA
TAGAAGTAGG
AAACGCCTGT
CGCCTTTTAT
GTTATATTGA
TAATCAATAT
AACGGGTTTT
ACGCCTCTCA
ATGGTAATAA
AAGGACAAAG
TCTAACGCTA
AACAACCACA
TCTAACATTA
CAATGTTTCC
TCCAACGACA
TAATGCGTCA
GGCGAATGGT
AATTAAAGGC
CAACAACCTC
TAACTTAAAC
AAATTTAGTG
GGGGGCATTG
AAATCCGCTC
GAATGTCAGC
TTACAATATC
CAACGGAAAC
ATTACAAGAT
AAACAACATT
AGTGATGGAC
CGCACTCAAT
ATGGAGGCAC
TCGCTTTCAC
ACGTCACCAT
GTGTCTCAAG
GCTAGCGCCA
AACAACGCGC
TTCAATTTTT
TCTGCAAACG
ACGATTTCCA
ATTCAAGGAG
ATCGCTTCAA
AATAATTTGG
ATTCAAGTAG
GTGGCTAATG
AACCCTAACA
CACATAGAGO
AAGGGGTTAT
TTAAGCCTTT
TTTACCCCTC
CAAATTGAAG
GCTCAGTTTT
TAACCATTCT
GCTTAACGCA
GCAATCTGTT
CAAACCCTTG
CAATCGCCTT
CAGGCAATAT
TTAAAAACCT
ACCAAGCGGT
CGTTTAACAA
ACGCTTCTTT
GAGCGATCCA
GGGGGATCAT
GCGGAACTTA
GCTTGCAATC
AAAAAAACGG
TATTGAGTGT
CTGTCCTTCA
CCACCTTACA
CTGTTGGGGG
AACGCAGGAA
GGAACGCTCA
GCAACGGCCT
TATCAACGCT
CACCACCGCT
AAATAATAAC
TTACGCTAAC
GTATCTTTAC
ATTAGAGAAA
CAACGCCACG
AAGCACCGGG
TTTTAATTTA
TAATCTCAAC
CACTTTATTA
GTATTTGAAG
CGTATTGACT
AGCACTACCT
CAACCAGATT
GGATTACATT
GAATAACGCT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 ATCAAGTGGC TTTCAACATT GATGATGc3AG
ACTAAAGAAA
TATTTAGAAA
GCAAGCTTGA
AGTTACACCC
GAGTCCAATT
CCTAGTGAGG
CAAGGGGTG
GTGGGCTATG
AGCGGTTTTA
GCGAGGGCTT
AATGCGAGTC
TACAACACCT
AAAAGCGTGG
ATGAAAGGTA
AACGAATCGG
TCCTATTATT
AATGTGGTGC
ACTTTTGCGA
GCGGGGGTGG
GGCATGCGAG
ACCACTCTTT
TTTCTAACCC
AACAAACCAG
TTTCAGACG
TTTTTGTCA
GAGGAGCGAG
ACCGATTGGT
ACGGGAACAT
TTTTGAAAAG
ATATCAATTC
GGACAACGAG
TGCTAAAACC
AAATGCAAAA
TTTTAACGCT
TTGTAACGGC
GTTTTGTGGG
GCGTGATCAC
GGCTTAAAAT
TGGCGTTTTA
AAATGAAATC
TAATTTTAGA
CCGTTTGACA
CTTGTTAGAG
ATACTCTCAA
CTTTATTTCT
TAAAAGCGTG
CATGCATTCT
AAACGAATTC
TTCTA.ATTC
CGTGAATGGG
TCAAGTGGGC
TCCAGCTTAC
CAACATGGGG
GAGGTTGGGT
TGAAAACACT
AGGAGGCGAA
GGGCTTGCAA
TTAGGCGTAA
AATAACGCTA
AAACTCTCTG
CTTAAAAACA
CTCAGCAAAC
GGGGGCAATG
ATCCTTGGGG
TTGGCTAATA
ACTTTGAGCG
TTGCTCTCTG
AATTACGGCT
TTGAGCTATC
CAACAATTCG
TTAGAGAGCC
AGGGATCTTT
TTATTGTACC
ATGCATTTGT
TACCAAGATC
ACCCGCTTTT
CAAAAGATCT
CCAGCCTTTT
ATTTTAGGGC
AGCGTTTTAG
ACCCCAATA
GCACGCTTTA
GTTATGTGGC
ATGTGGATGT
CGAATGAAAC
TGTTGAACCA
ATGATTTCAT
ATTTCATAGG
TCATGCATTC
GTAAATATTT
TGATCAAAGC
GCAAGGGGGA
GGCGTTTGAT
TTAATATCAC
TGCGCCGATT
TCAAAACACC
AGAAATGGCG
TAGAGAGGGA
CGATCCTAzAC
CCTTTGGATT
TGGCTTGAAT
TTATGGCTAT
GGGGATGTAT
TTATGGAGGC
ACOCTACAAC
GTTCAAACAA
CTTGAGCGGG
AAACCCTTCT
TGGTAAAAAT
TAAAGGCGAC
AATTTTTA~c
GTATGTGAAT
TGGGAATGTG
1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2361 INFORMATION FOR SEQ ID NO:274: (i) SEQUENCE CHARACTERISTICS: LENGTH: 1446 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1446 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:274:
ATGTTTAGAA
TATGCTAAAG
ACCAAGAAAG
AATAAGAAAC
TATGACACTT
AAAAACAAAA
GAAAACCCTA
GGTTCGTTAG
GCTAAGTTTG
GTGCCAAAAA
AGCAAGCTTT
AGACCCAACG
GATAGCGTTA
AACTAGCAAC
AAATAAGTGA
AGGCTAAACG
CTCAATATGT
TGAATGTGAA
CTTATCTCTA
TTATCAAAAC
AACAAGTCAA
TGTTAAAAAT
GAAGCGATCC
TTGAACAACA
ATGAAGTGTG
CTCAAAAACC
CTCTGTATCG
AGCCGATAAG
CCTCAAAAAA
CTCTGTTGAT
TGACAAAAGC
CGCTATGGAT
AAGAGCGATG
TGGGTATTAC
CAATGACAAA
CAACGCCCAC
AAAGAAAATG
TTCGCC CC TA
CAATATCATT
CTCATAAGCC
GTCATTAAGG
GAAGCTAAAC
GACACAAAAAJ
TTTGGGGATT
CTATTGGATT
GGAACTTATG
AATATTCTAA
ATACCTTATG
ACGCTTGATA
TATTTCAACT
AGAGATGAA
TACTAATCTC
CCACTAAAGA
AGCGCCAACA
CTCAAGCGCT
GGTTTGGTAA
ATAACAACTA
CAGATCTGAT
AAGCGCTCAA
CCCAAGCGAC
AGGGAGCGTC
ACGCCAACGA
CAACGCTCTT
AACTAAAGAG
GATCCCTGAT
TTTTGATATA
TAGCGCTTTG
CTTATCCATA
TATCATCACA
CAAACGCAAC
TTTTTTGAGA
AATTGATGAA
TGTGATCTGC
GCCCACTAAC
ACTAAAAGAG
120 180 240 300 360 420 480 540 600 660 720 780 GCTCCTTATA GCCTGTATAG
ACAAATAACG
AGCAAAGAGA
GAGAGGGAAj
AAGCTCAAAG
AAAAACACTA
GATGAAACAA
GAGACCACGA
AAAACACCCA
CTCAAGTCTA
GATCCCTACA
CTAAGGGACA
AAGTAA
CCAATGAAGC
AGCTCATAGA
AGAAACTCTT
ATTTAGAAAA
AGAAACCTAG
TGAGAGTTAT
TCAAAAGAAG
TCAACCTTGA
ATGGCTTGTG
AAGAGGGAAT
AGCTCAAATA
CCAACCATCA
AGAGCTAATC
AGCAGAAAAA
TCAAAAGAAA
AGTAGTGGAA
CAAAGAAAAA
CTATAAGGGG
GGACTTGAGA
TTACGCTAAT
GCTGTGTGGT
CGACAAGCAG
CCTTATGCCA
GCCAACTCCC
GAAAAACAAG
CTAAAAGCTT
GTGCCTATTC
GAAAACTATA
ACTTTGATCA
AGCTTAGAAG
GGCATTAACC
TATGAGAGCG
AAGTTACAAA
CTCAAACCGC
AACTCATAGC
AGACTGAATT
TAGAAGCAGA
CTCCTAAAAC
ATGGGTTATT
GTGAAAATTC
AAGAAATTAA
TCTATGTAAA
TTCAAAAACT
AAGCGTTACT
TCCTGAAAAC
CAATGAAGzA
GGCTAAATAC
GTTGAAAAAG
AAGTGATTCT
AGTGGATAAA
TTACAGCAAAk
GAGCTACTAT
AATCAAAAAC
GCTTTCACCA
GAAAGATTTA
840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1446 INFORMATION FOR SEQ ID NO:275: SEQUENCE CHARACTERISTICS: LENGTH: 180 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...180 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:275: TTGCTTTCCC GGTCTATGCC CAAAATTCAC GCCGTTTTCA AAGCTTTTAT CCCTATTCCA TTCGCCCTCT TCGCTATCCA CTTCGTAGTA TTGGGAATTG GAAGCGTCTT TAATCTTAAT 120 AGAATCAAAG ATAAAAAATT CATTCTCCGC GCCAAAATAA GCCACATCGC CCAAGCCTAA 180 INFORMATION FOR SEQ ID NO:276: (i) SEQUENCE CHARACTERISTICS: LENGTH: 792 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...792 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:276: ATGAGAAAAA CGATTTCAGC GTTGTTTTTA TCAGCGTGTA GCGAGTAACG
CT
ATGAAAGGCG
TC
ATCCCCCCGT
AT
CCAAAAGGCT CG GTGGTATTGA AA TTGGTAGCGC AA.
TTGAAAGGTT CT GCGCGCTTGG CT GTTGTTGAAA
TT
CCGATTTTGG AT.
CAAGCAGGGA TC~ AATACGAAGG GA TGGTCTATTT
AA
TTGATTTT
GCTTTCAG
AACATTTG
GTATTTGT
ACTAAAAA
ACTTTGGG
GAAAAATC
TCTGGGGC
CCTTACCA
ATTCAATA
AAACGAAA
Z.AATGCCG
ACAAACAGAT
CCTTGATTCT
GGAAGGCGCT
GAGCGTAGTG
CGGCCAGTAT
GATTGATAGC
CTATACTTTC
GATCACATTC
AAAAGCGAAA
TGGCAATGTT
CGATACAGTG
TATGTCGCGA
TTTAGCCTAAJ
AATCTTAAAA
TACCGCTTGT
GATCCGGGCG
TTCGTCTCTC
GTGCGTGAAA
CATGGCCGTG
GAGCAGGTCG
GCCACAAAAG
TGGAGCAATA
TGTGTAACTA
GCTTTGGCGA
TAGGGTTATC
AAGATGGGGC
TCTTTGATTT
ATCAGACCGC
TAGGCACTAA
CAGATAACGG
TTGATGAAA.A
ATGTGTATGC
GGCCAGAGCT
GGGGAGTGAA
TCAGCGATA
TTTTAAAAAT
TGTGCCAGAI.
GTCTGTTCAT
CGTCTCGGCG
AACGCACGAA~
TAGTTATTGG
TCGTAAATCG
CACGCTGACT
AGCCAACCGC
TTACACTGGG
TCCTATAAAAp
AGGCAATATC
ACTGCTCAAT
TCCAAGAAAC
GGCCAGCCGT
120 180 240 300 360 420 480 540 600 660 720 780 792 INFORMATION FOR SEQ ID NO:277: SEQUENCE CHARACTERISTICS: LENGTH: 345 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...345 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:277:
ATGAATATCA
GAAAAGAAAG
GCTTTAATAG
TATTTAAGCA
CGCCTGCTTT
TACTACATAG
AAACCCATTC
AATTATTCGC
GGGTGTTGGT
ATAATATCTA
TAGAAGAACA
AGAATAGTGA
TTCAAATGAA
CGAAGCTACA
TTTTGGGGGT
TTATATTAGC
GCAAATCCTA
AAATATTGGC
AAAGAACGCT
AATGAAAATC
GCGTTTTTAG
CGTAAAATTA
AAAAACGAAT
GATATTGCGT
TTGTACGCAT
CGCACGGCCT
CCTTATTAGT
ATACCCTAGA
TAGAAAAAGA
TTTAA
AGAAGAGGAC
TTCTTTAATG
GCCTAAAATC
AGATCAAAAA
GCGTTTTAAA
120 180 240 300 345 INFORMATION FOR SEQ ID NO:278: SEQUENCE CHARACTERISTICS: LENGTH: 2826 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL
SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2826 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:278:
TTGCAACAT
AAAAATATC
TCGGGTA
GAGAGTTTG'
AAAATTGAA(
AGATCCACT(
GTTGGGGAG(
ATCATTTCT(
ATTAAAGAT2
GTGAGGGCT'
ACCAAAAAA(
TCACGGATC(
OAAATCTTGC
TTTAAGTGCI
AAAGOGGCG9
ATTTTAGATC
CGCAGTTATT
CTTTGTTTCA
GAAATCAGCT
ATCGCTTATG
TGTTCTTCGT
TTOAAA.ATGG
CCCACGCATT
GAGATTTTAG
AGGGATGCGC
GGGAGCGGTT
AAAGACACGC
ATTGTCGTAG
CCAAAGGCTG
CAAAATAACC
TTTGAACCCC
AAGAATTTGA
AGCGGTAAL
CATGCTAAAA
AAAGTGATTT
TACACGGGAG
TTAGGCTATA
CAAGGCGATG
GATAGCTGTA
TCCATTGCTG
AAACCATTAT GGATAAGATC ATTATTCAAG
GGGCTAGGA
T TTTTAGAA' T CCACTCTCG( r CTAGTTATG( 3 GCCTAACCCC 3 TGGGGACGA' 2AATTTTGCCC 2AAATCTGTC2Z
SAAAAAGGTTC
P' TTGTTGATGC ACACCATTG7 CTAGCGCGoq
AAGACAACGC
AGATGAGTTT1
GCGAAAGTTC
CTAACACGCC
ACGCTCAAAT
*ACGAATTGAJA
*TTCATTTTAA
ACATGTTTAA
GTAATGGGCA
CGGATTTTTT
TTAACTATCT
AAAGGGTGTT
GAACGATTAG
TGACAGGGGT
TCAA-CTCAT
AGCATGATA
GAAGGCATGG
ATTCTACCGC
CTAAAGAAAA
GCGTTCAAAT
GCTCGCTGAT
AAAATCAAAG
ATTTAGATCA
TGATGGATGA
GCACGAGCCG
GGGATATTAA
AGGGCGCTA
ATGTGTTGA
F CCCTAAAAAC
GTTTGACAC'
GAGGCAATT'
AGCGATCGC'
P CACTGAGAT' CACATGTTT1 k TTTAGAAGAJ
-GTTTAACGAI]
GGTGATGGTC
AGCGGTGGTC
AGAAAAAGCC
ACCAAGTATI
TGAAGAATTA
TTTGGGTTTA
TTTAAATCAA
GTTTGAAGGC
CAAAGAGCAG
AAACAGCCCC
AGAGCAAAAG
TCGCTTGAAA
AACTAAGCCC
TAACGAGCAA
TTTTTTATAC
CGGAGGGGAG
TTTGTATGTT
CAACACCCTT
AGAGACGATT
GGGTGAAGTG
CTTGTATCTC
GCATTTTTTA
CCCCTTAAAA
TTTACAAACC
CTTGAATGGG
AGCCCCCATA
AATCAGGATT
TTTCAGCTTT
AATAGAAATG
ATACAACCCC
2' CAATTTGTTG r CTATACGCTG r TTAGACAAAG P ATTOATCkAAA P TATGATTATT
SGAGCCTATTA
AATTCTAAAA
AAATTAGAGA
CGTTTAGATG
GATAGGGTGG
CTTAAAGAAA
AGGAAGCATT
GAGCCTTTGA
GGGACAAAAT
GGAGCGATTA
TTTTGCACTT
CAAGACGCTC
TTGAAACGCC
GATTTGAGCG
CCTCAAGTT
ATTGAAGAAG
GAAAAAAAGA
GATOTGGGGC
AGCCAAAGGA
TTAGACGAGC
AGGAATTTAC
AAGCATGCGG
GTTTTTAGCG(
AACGGCACTA2 GAAATTAAAA2
CAATTGGTGTC
CTTTTACCCA
C
GTGGAGATTG 9 GGCAAAACCC
C
TTATTTGCCG AATGTTAAAG
C
CACTTTTTGC
C
TTTTTACCGG
AAGGGCAAAG
TGGGTAAGCC
AALACCACTTC
TAAGGTTGTT
GTTCTATGAG
TCATTATTCT
GTTTGCGTTT
AAGAAATCCA
TTATCAATAG
GCTATGGGGA
ACAGCGAGCA
GTTTTTCCTT
TTAGCTTAGA
AAGTGATTTT
ATAATGGCAT
TTTTGTATGG
CCTGGAAAGG
ATTACATGAG
TGAGCGTCCA
TCTATCATTT
TCGCTGAACC
TAGGGTATTT
rACGAATCGC
CTAGCATTGG
AAAAAAAGGG
%TTTTGTTGT
3GAGCGTAAA Iz UAAAGATTGA
C
%.TGTCAATAT
C
'CATTACGGG
C
~CGCTCAAAC C rAGGGTTGGA
G
ACGAAGCAA
C
~GCAAAAAGA
A
;AGGGCGGTG
T
~TGATGTGTT
A
AAACAATCTC
ATTAAGCGGT
GCGCTATTTA
TAATGTGGAT
TAAAAACCCT
GTTTGCAAGG
CGCGAGCGAT
AGCCCCCATT
GAAGGGGTAT
TTTCCACAA.
CCAAAATGCT
ATTAGAAGTG
TAAGGCATGT
CAATTCGCCT
TATTAGTAAG
TGGCTATAAC
TGACAGCGCG
GAATGGCACT
CATTATCCAA
CGAAAfi.ACC
AGTCGCTGGC
rTTTA.ATGAT
CATTTTAAAA
GACTTTGGGG
CAGTCAAATC
:TTGCATCAA
2PACACGCTC
'GATATTGGC
~GATTTATTG
~CGCCCCAAA
AATAACATT
GTGAGCGGG
CTTTTAAAC
TATTTGGAT
CCCGCCACT
~GCTAAAATT
'GAGAAATC
.GTCCAATGC
AAAGGCAAA
CCTAAATTC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 CATGAGCGTG GAAGAAGCTT
ATGAATTTTT
CCTAATCG CTGTGAAGTT AAAAACGCTT ATAGATOTGG GCTTAGGCTA TATCACTTTA 2460 GGGCAAAACG CTACGACTTT AAGTGGGGGG GAGGCTCAAA GGATCAAATT GGCTAAAGAA 2520 TTGAGTAAAA AAGACACAGG CAAAACCCTT TATATTTTAG ATGAGCCTAC CACCGGTTTG 2580 CATTTTGAAG ATGTGAATCA CCTTTTACAA GTCTTGCATT CTTTGGTGGC GTTAGGCAAT 2640 TCCATGCTAG TGATTGAGCA TAATTTAGAC ATCATCAAAA ACGCTGACTA CATTATAGAC 2700 ATGGGGCCTG ATGGGGGGGA TALAGGGCGGG AAAGTCATTG CGAGCGGCAC GCCTTTAGALA 2760 GTGGCGCAAA ATTGCGAAAA AACCCAJAAGC TATACGGGA. AATTTTTAGC TTTGGAATTG 2820 AAATAG 2826 INFORMATION FOR SEQ ID NO:279: SEQUENCE CHARACTERISTICS: LENGTH: 252 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:279: ATGAATTACG ATGTTTTAAT GGGATTTTTA GCACTAATCT TGCTAATTCT TTGGTATGCC TATGGATTAA GGCAATATCT TAAATTAAAA GATAAGAATA AGAGATTAAA AGAGAAATTA 120 CAACGCTGTA ATTGTAATAT TAAAATTCCT AGTATTCTTG AAATGGCGCA TAAACCTATC 180 ATTATOGATA TTAAGGGGGA ATTOCTACCA CATCTTACAG AGAGTTATAG AAAATCAAAA 240 TTTAAGGAGT AA 252 INFORMATION FOR SEQ ID NO:280: SEQUENCE CHARACTERISTICS: LENGTH: 840 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...840 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:280:
ATGAGAGTZ
AAAATATTCC
GOCTGTACTT
AAAAATCTTA
GCCCCTAATA
GTGATAGGGA
AAGTTTGGCG
AGTTTTATTT
CGCATGGCTG
TTTTGGGAAT
ACGCCCTTAC
CGCAACGAAO
GATCCTATCG
AAACATGAAA
TAATAAAGTT
TAAAAACCTT
TTTTAAACGC
TTTGGCAACA
GCCOTTGGAA.
TTGTGGGGTT
TCAAAAGTTG
TTAATGAAAT
GGTTTAGCTG
ATGGCTTGGA
TAGGTTCCAT
GCAAGCTGTT
GTTTTATCAT
TCCGTTCTAA
TATOCTOATT
CCAAAAGATT
TAACAGCGTT
CTTTAAAAAG
ATATTTAGGC
GTATCTCATG
GTTTGAAAAT
CTTOCACCCT
GATGACATCA
AGCGTTTGTG
TTTAGGGGAA
TGGCTCTTTA
TAGGGATTTA
TTTAAGCCCC
TCATTAAAAA
TGOATAGTTT
CAATTAGAAG
AAGTTTAAAA
ACAAGTATAG
CCAGAGAGCG
GTCCGCATGG
TATTTTGGGG
GCGTTTTTTT
GAAGTGCCTA
GGGTTTTACC
TTTTTAGGGC
GGGCTTGGGG
AATGGTTTGA
CATTCCTAAA AATATTATTG
GCOTGGTTAT
AAACGCTGAG
AGAGTAACAC
GOATTTTGGG
TAACGAATTG
GGCCAAAACT
CTATGTATTA
CTTTTATCAC
GCTGGCAGGA
AGCTCACGCG
GTTTAGCCAT
AAGCCTTAGG
ATTTGACTTA
TTGGOGTTG
ACGAAATCCT
GATCCCTTAT
CGTGTCGTTG
GGATAAAGAA
AGACAATGAT
CATGCAACCG
TTCTACGCTT
TTTGGTGATC
CTATATCCAA
CGCCCTTATG
GATTTATAAT
CAAATTTTAA
120 180 240 300 360 420 480 540 600 660 720 780 840 INFORMATION FOR SEQ ID NO:281: SEQUENCE CHARACTERISTICS: LENGTH: 531 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...531 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:281:
ATGAAACTTT
TTTTCTGTGC
AGGGGGGGGT
TTAAGCCTAG
ATTAAATCCA
TTAGACGCGC
GAGTTTTATA
TTGCAAGTGA
GTCATTCAGC
TTAACCCTCG
CTTCTTTGCT
TGAACATGCT
CGTCTGCTTT
GTTTAGAGGG
TTTTATTGGA
GCGTGAAACT
TAGGGATCAT
AAGGCCGAGT
TTTAATCGTT
AGAAACTAAA
TTTAGGGGTG
AGAATACAAC
GATCAGTTTT
ATTGCAAGGT
CACCCCTTTA
TCGTAACCGC
AGAATTTCGG
TTTATATTCG
GGCCCTAAAA
CAAACCGATG
GCTAAAAAGC
GAGCTTTTAG
CATAGCCAGT
GAGCAAGAGG
TTGGATCGAT
TGCAATTGCC
CGCTTCTTTT
TCACTTTAGG
AGGCTTTAAA
AAAATATCTT
ATGAAGATGA
TTGAAATCAA
AATTOCGTAA
TTGGCTTGGC
CGGCATTAAG
AGGGGTAGGG
TTTGGATTTA
AAACAAGTAT
GCTTAAAGAC
GGCGAAAAAA
AAAAGAAGCG
AAACACGATC
AGAGCCTGTA
120 180 240 300 360 420 480 531 INFORMATION FOR SEQ ID NO:282: SEQUENCE CHARACTERISTICS: LENGTH: 1311 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1311 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:282: ATGATGAAAT TTTTTCTTTT
AAAGAAATTC
AACCTCAAAC
GCCTTTGTTG
GTTTTAAAAA
ATTTTACAAG
TTAGCTTATA
CTCATGATTT
GCTAAAAACG
TTGGATTTTA
GAATTGGAAC
AAAGAAAAAT
CAAACTCAAG
GTGATTTTAA
TTAAACGCTT
TTCTTGTATT
AACTATGTTA
TCTAAAATCA
GGTTTAGGGA
TTGTGGATGC
ACGCCTAAAG
GCGTTTAATG
GCTCATGTCA
GCTTGAACGC
TGGATTTGAG
ACACTTTAGC
CAAACGTCAT
AGAGTGAAAC
TAGATCAAGA
ATATTTTAGG
AGGGATTGTT
ACAAAAAAA
TAGAAAAACT
CCTCATTGTT
AGGATTTTGA
TTATCAATAA
TAGAAGAAGA
GAGACGCTGC
AACGCCCGAT
AAAACCAAAA
ATGTGAGAGA
ATGAGGTCAT
GTTACGAGAT
TTTACTCAAA
GTCTAGTTTT
CGCGCCTTAT
GTTAGATTTT
TGATAACGAT
TTTTATTTTG
AAAATGCGTG
GGCATTGCCT
GGACATTTTA
TCAAATCATC
AGAAGATCCT
GCTCACTTAC
AGATAAAGAA
AAAATTCACT
GAATCTGAAA
AGAAGAAAGC
GAACGGGTAT
AGAGAATATC
TATTCCTGGA
TATGGAATTA
TGACTACACG
ATACCGAACT
AGCGAATTTT
TTATTAGAGA
ATTGGTTTGT
TGTTTGAATA
CGGATTTTAG
CGTTTAGAAA
ATAGAGGCTT
CCTAATATTT
GAAAAAGATT
AAGCGATTAA
AAAACTTTAC
CAGCATTTAA
TGCATGATTG
CTCAGCAAGA
GAAAAAATCG
GTTTTAGAAA
GAAGTGCTGT
AAGCTTTTAC
TCGCATTTGA
GCCAAAATGT
CAACGAAAAT
ATTAGTCTAA
TAAACACTCA
CTTTTTCTAA
CCAAAAAACC
AATTCACCAA
AAATCAAGGG
TGATCCCTAA
TTCGTTTTAA
ACGAGCATCA
TTTTATCCTA
ACGCCCAAAA
AGCTGGAAGC
TCAACAGGCG
AAATTGATAA
AAAAGAAACA
CTTTTAAGGA
TGTTTATGCC
ATTATAAGGA
AAGACGCAAG
TCGTTTTTTG
TGATTAAAT
TTGTCAAAAT
AGGACACTTA
AACGCATTTT
AGAAAAACAC
CCCAGAGAGC
AAACGCCAAA
CGCTAAAGAT
AAAAGCCAAT
TGACAGGGTG
AGAAGAGGAT
TCAGCACAAA
AGAACGCTTG
GAAAGAATTG
TGAAAATCGC
GAGCATGCCC
AAAATCGCAA
AAATCAAATC
GGTAAAAAAC
TTTTAAAATC
AGCGAATGAT
CCAAAAGAAT
GCAAAAAGAT
CATCAAAGGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1311 INFORMATION FOR SEQ ID NO:283: SEQUENCE CHARACTERISTICS: LENGTH: 294 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...294 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:283:
ATGGGACAAA
TTTTATTTTT
GAGGGCTTGA
AAAGCGGAAG
AAAAACTATG
CTAAAGAAAT TATAACGACG
CTTTTACCCC
TGATCGTTCG CCCGCAACGC CAACAGCAAA CTAAGGGCGA TAAAATTGTC
ACTCAAGGAG
CGAATTTTTT TAGCGTGAAA
CTCAATGATG
TAGCGTTCAA ATTAGACGAA
GAAACAACAC
TTTTGGTGTT
AAAAGCATAA
GGCTGATCGT
ACACCACCGC
CCAACAACAA
CTTTCTTATT
AGAAATGATA
TGAAGTGCTT
TAAACTTTCT
CTAA
120 180 240 294 INFORMATION FOR SEQ ID NO:284: SEQUENCE CHARACTERISTICS: LENGTH: 621 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...621 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:284:
TTGAAGTTTC
AAGGGCCATC
ACTTATTGGC
AACCCCAAAA
TTCATGCTTT
CCTGGGTTTT
GCTCCTGGCT
CTAGCCTTTG
ATTAAAACCC
AATACCCCAT
GCATGGGAGT
AAATTGTGAG
ATTCTGGTTT
GCAAAGTGGA
TAAAGCTCAA
TATGGAAAAA
ATTACTTGGA
ATTCATACAC
AAGTCAAACC
CTAGAGGCTT
GGATTGAGGG
TGGAACGATA
TTTGTTATTG
GGTGAATATG
TAGAGGAGTT
AGACCCTAAG
CCGCTATACT
TTCTTTTAGC
CAAAAATGGC
TGACAGCAAG
TTTAGGGGTG
GAGCTTGAAT
GCTTTTTTAT
TATATTGCGC
GTCGCCAAAC
GGGCCTTTGT
TTAGCTAAAG
GTGGAAACTC
TACGATTTTA
ATCGTTCTCC
TTTTTATTTG
TTAAAGCTTA
TAGCCTCTTG
ATCAAGGCCA
ACAATGAAAC
TCATGCTAGG
TCCAATCGTT
AAAAAGGCAT
AGAACAACCG
CTAGCGTGGA
ACAACAACGA
AAAACGCTTC
TTTGCCCCCT
GAGCGTACGC
GCTTAAAGAA
GAGCAATCGT
CAAGCTAGAG
CTTGCAAAGC
CCCCTTTTTC
ATTGAGCTTG
AAAGGGGGCT
TTTTAAAGAC
120 180 240 300 360 420 480 540 600 621 INFORMATION FOR SEQ ID NO:285: SEQUENCE CHARACTERISTICS: LENGTH: 1599 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1599 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:285:
ATGAGAAAGG
TCGGTCGTTT
TTTTTAAAAA
CAAAGCGAAG
AAAAAAACGG
AGAGACGCTA
GGTAACGGGC
CCGTATTCCA
GATGTGATTA
AATGTCATCA
TTTTGGGGGC
GCCCAAACTT
GGTAAATATA
AATAGCCCTIA
AATACTTTTA
AGCGCGCAAG
GGGCGAGCCA
GTGGGGGGCG
TCCAACCAAT
AAGGGAGAAA
AGCCCTTGCT
AAACTCAATC
TTTTTAACTG
AATGGGAGTG
TATGCCAGCG
TACACTTTTT
AAAACCACGA
TTATCATAAT
TGACTTTTTG
AAGTTACAAC
AGGTGCGTAA
GGAATTTGAA
CAGGCACGGG
ATAGCAACAC
ATATTGAACT
AGGGGGGAAC
CTAAAGAAAT
GCTCCTCTAI.
TAGGAAACCA
TAGGCATTAG
CAAAGGTGCA
AAGCTTATTA
ATTACGCCTA
AGCGCTTTGG
ATTTTAAATT
ACCAAAGCGT
TCAGCGCAAA
GGCAATTTTT
TTATCGTCAA
AAGATTTATA
GGTTTGATGC
ATGAAATCAA
TAAATTACGA
AAGAGCGTTA
GAATGGTTAT
GACTTTTAAT
CACCGAGCAA
TTCCACAAGC
TATTGAAAAC
CGTGCTGCCT
CAACATGATT
GGCGATTTTC
GAGCGTCCAA
CCCCAAAGAG
TGGGAATTTT
AATGCTGTTT
CGCTCAAGGC
AAACTACTTG
CCAGTATTAT
TAACCGCTTC
GATCGTGTAT
CACTTATTTC
GTATATGAGC
AAACCCTAAT
TGACAATATC
TACTGGTAAA
CCGCCGATCC
AGGAACTTCA
TTTCAATAAC
AAAAAAAGAC
TAACCAATGG
TTGAGGGTAA
TCTTTTATGA
AAATTCAGTT
TCTCGCACGG
GCCTTGCAAA
AAAATTTCGG
TTAGTCAATG
CCTGTAACTT
TACGGCCCTA
TGGGAAAATC
GTCGATCCCA
AACACTTATG
AATTGGATTA
TTGGATGCGA
CAATACAI\CT
ATCAATGAGC
CAAAACTACT
ACGCATGACA
GGTCAAAATA
TGCGGTCTGT
CGCCGATCCG
GTCAAACAAA
ACCACCAGGA
CTCAATAATT
GGCATGCTAA
GCTCCTCCCT
AATCCGGCA
AAACCCCGTA
GCGCGAAAGA
CTAGTGCCCC
TGATTTCCAA
ATGTGCCAGG
TGCGCGGTTT
GTATCCCCAT
TCCAGTCAGT
ACACTTTTGG
AAGCGGCTGA
AAGAAAAAGG
GGCGAACGGC
ATGGGCAAGG
TTTATAAGAT
CTTACCATCC
GCCCTGACAA
TTGGCGATCC
TGAGTAGGGA
AGATTTTACC
ATTCTTATAG
TGGTGAATGC
CTTTTAACAT
AAAACCCTAG
TCAACAATTA
CGATCACGCC
TTAAGGTGGG
TTTTTTAGCG
TAAGCACCAT
AATTTCATGG
CAAAGAACTC
GATTCAAATC
TGGTGGGGGC
TTATGGCGCG
GGATAGGATT
AGGCGTGGTG
AAGGATCACT
CAAGCCCTTA
TOGGATGTTG
TTTCAGGCAA
CAATGCGACC
AGGCACTTTG
TCAAGATGGA
GGATAGGAAA
TTTTGGGTTT
CTTTAAAGGC
CGACACGAAT
CTTTGAGCCA
GGGAATGCGC
CATGCCTAAT
TACCGCTGTG
GGGCTTGAGA
TCAAACCCCA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1599 INFORMATION FOR SEQ ID NO:286: SEQUENCE CHARACTERISTICS: LENGTH: 768 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...768 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:286:
ATGAAAAAGA
GGCGCGTTTT
ACCCCAAGCG
GCTCCAGCCC
GTAGATGCGA
ACTTATGGAT
ATCATGGATG
TATAACTTCT
TTAGGAGGGG
TGCAACAjACA
GAATTTGGTT
CCTTTATTTA
TATAAAAGGA
TTTTTTTAGG
TAGGAGGGGG
CTAACAATAA
AAGAAACGCC
TGGCAGGGTA
ACTACAGCTA
GCGCATCTCA
ATGAAAGCAA
ATTCGTTTAT
CCGCCGGCTG
TTAGGAGCAA
CCAACCAATT
ATTTCTCTAT
TATGGCATTA
GTTTCAATAT
CACCCCGATA
AAGCGTGATC
TAAGTGGTTC
TAACCATGCG
AGTGAATAAC
AGAGGGCTAT
CGTTCAAGGA
TTCAGCGAGC
TTTCTCTAAA
CTATAAAGAA
CTATTTTAAC
GCCTTTAGTG
TCTAATTTAG
AACACTTCAA
AACACCAATA
TTTGGCAAAA
AATTTGAGCT
TTCACTTATG
AACACAGCAG
GAGAGCTACT
ATGAACACGA
CACAGCGGGA
AGGGGCGTAG
TACATGATCA
TGTCCATGGC
AAAACCAAAA
TGTTTGGCAA
ATTACGGGCA
CCAAACGCTT
TTGTAGGCAG
GCGTGGGCTT
GGTTGTTCGT
TGAAATCTCA
GCTACTTCCA
TTGAAGTGGG
ATGGATCGGT
ACCTCTAJ\
AGAAAAAAGC
CACCACCCGC
CAATCAAGCG
AATGTATOGGG
TGGCTTTAGG
TAAGCTTGGA
TGATGCGCTC
GGGCTTTGGA
GATGCATT
AATGCCTGTG
CTTTAAATTG
AGATGTGTTC
120 180 240 300 360 420 480 540 600 660 720 768 INFORMATION FOR SEQ ID NO:287: SEQUENCE CHARACTERISTICS:
LENGTH: 1002 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1002 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:287: ATGCTAATCG CTCGCTTTAA AAAAGCTTTA
ATTTCCTATT
TCATCGTTAT
GGGGAGCAAA
GTGCCTGCCA
TCTGATATTG
CATGTGGCGG
TTTGTGGGTA
TTCCAAGAAA
GAAGTTGACG
CGTTTGAAAA
AGCGGCCATC
TTGAAATACG
TAGGCGTGGC
CCATAAJAGCT
TGTTCAATAC
TTAAAGCCAC
CGTTGAATGT
ACCCTAAAGC
AAACCATTGT
CTTCTAAAAA
ATGTCAAAAA
AAGCCCTTGA
TCAAGTTTGG
TAACGCTTCT
TCCTGTTTCT
TTGGGATAGG
TCTCAAAGAT
GGAATTATTA
GGTAGAGCAT
AGAGGTCATG
ATTGGCTAAA
GAAAAAGGGG
TTCAGATATT
GCGCGCTGAC
AATCAAGAGA
AAAATAATCT
GTCGTGGGCA
CCTGAACGCA
AAAAAGCTTA
GCGAAAAAAT
GAAGATATTG
ATGCAAGAAA
GTGGAGCTTT
TTAGAAAAAG
CTTTAGGGGT
TCCAAGTCAA
ACTTGGGCAG
TTTCGGATTA
TTAAACCCAT
GCCCTGATCT
TTGGGATTTC
ACGCTCAAGC
CTTTGGATTT
TCCATAAAGC
GGOGCATAGA
TCTTCTTGTT
GGATTATTTT
CTTTGCAGAA
CGCTTTTAAA
GAGCAGTGAT
TGTGGTAACC
ATTCCTTTCT
TAAGGCCTTA
TATCAAAGAG
CAATAAAATC
CAATTTTGGC
TAAAGAAAAC
120 180 240 300 360 420 480 540 600 660 720 ATTAGTGTGG AAAAAATCGT CCTGAGATTA TTTTTATTTG GTGGATAAGC CCGCTTAGCC CCTAAATTTT CCACTATCAA AGCCATTAAA AACAAGCAAG GATATTGGCG GACCTAGAGC CCCACTCATT AGTCTTTTTA GAAGCGTTTA AGGGCGTGGA TATTAATGCG ATAATCAAAG GATTTGAATG ACGCGGAAGT TGAACCCTTT TTGTGGCACT CTGAAGACGT GTTGAACAAC TCTATAAGCT CCCCACAATG TCGCTTTAAA AGCCCACCCT ATTACTATAA AGTGGTCTTT
GA
780 840 900 960 1002 INFORMATION FOR SEQ ID NO:288: SEQUENCE CHARACTERISTICS: LENGTH: 165 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:288:
ATGAAAAAGC
GCTCACAAAG
AAAGATAAAA
AAATCTTGAC AGGTGTTTTG TTATCAGTTT TGGCAGTGAG TTCTGCATAC ATAAAAAAGA CGCTAAAAAA CCTGAGTTAA GCTCTCAATT AGTGGCTCAC AAGACGCCAA AAAACCTAAA AACTCAGTGG CCTAA INFORMATION FOR SEQ ID NO:289: SEQUENCE CHARACTERISTICS: LENGTH: 429 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...429 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:289: ATGTATAAAT TAGGAATATT TTTATTAGCC ACCTTACTAT CAGCTAACAC GCAAAAAGTG
AGCGATATTG
AAAAACCAAC
CAAAAGGTTG
AACGAAAGTT
CAAAAACA.AC
TTCTCACAAG
GGCGTTTGA
CTAAAGACAT
TGAACAGTCG
AGATTGAACG
TGGTGCAAGA
GATCGTTTTT
CCCTAAAGGG
CCAACATAAA
TTTGAGTTCT
CCAAATGGTC
AAAAGTCCTA
ACAAAAGAGA
GCAGAATTTA
GAAACCCTTT
TTAGGCGAAG
GCCTTAAAAA
ACCAACTACC
GTGTTTGATA
GCCTCTTCTA
TGAAAAAAAC
CGATCCGCTC
AGAGTCTTGA
GCAAGTCTTT
CGCTTTTAGA
ATGATGTGAT
CCATGAAGJ\
TAAAGAGCTT
AAAAAATCGT
AGATCATTTG
GGATTTTCTT
TTTGCTAAGT
120 180 240 300 360 420 429 INFORMATION FOR SEQ ID NO:290: SEQUENCE CHARACTERISTICS: LENGTH: 594 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE:
NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...594 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:290:
ATGGATAAA
TTTATCGCTC
AAGCAAGAAA
CAAGATTTTA
GAACATGCCA
AAAAAGTATC
TCCAAAGAAA
AAGCCTTTAG
TATAGCGCCT
GGCACTCTTA
ACAACAATAA
TTTATAGCTA
CAACCAACAA
GCGTTACCCA
AAATTGAAAT
TAACCCCTAA
ACTCGCAACC
AAGTGCGTTT
CAAAAACCAC
GCATCATTAA
TAATCTCCGC
TTTTTTCCAA
CCACACAGCA
AACCATCCCT
TGATTCTTTA
AGAAAAGGGC
CTCCCTAAAA
TTTAGACCCC
CCTTGGGCCT
AACCCTGACT
TTGATTTTAG
GAACCAAACA
ACAAGCCCCA
CAAGAGAGTT
GGGCGCATCA
TTTTTAGAGC
GAGCTCCCCC
ACGCTCAATA
AACGAACAGC
TTTTATGATG
CGATCGCTCT
GTCTTTCTTG
AAACAACAAC CGAAACCACA CCGCGTCCAA
TACCATCACC
TGTTAAGCAC
GATTTCTTTT
AACAGGTTTA
TCTCAAAGAT
ATGTGAGCCA
TCTTTTTAGC
TTTTAGCAGC
CGATAAACTC
ATAAAGCGTT
CAACACCCCT
TCGTTTTAAC
CCAAGATTTA
ATTTGCATTA TTGA 120 180 240 300 360 420 480 540 594 INFORMATION FOR SEQ ID NO:291: SEQUENCE CHARACTERISTICS: LENGTH: 780 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...780 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:291: ATGGCGTGTT GGCATAAAAG ATTGGCAGTT GGTTGTTGTA TCGTTPTTmm
ATGAGTGCTA
GCATGGATTT
GGCTATGCGA
AATGGCTTGT
ATTTGCATAG
AAAACCGCCG
AGGATTTATG
CCCACTTCCG
ACGGACATTT
TTTAAAAAGG
ATGGGAATGG
AAGATTGCGC
ATAATGTTAG
ATTCTGTTGC
CTTTGCTAGA
ATCCTAAAAT
CCAGCTTGCA
ATTTTAGAGA
CCAAATACGC
ATCTCTACTA
CTGTGGCTGA
GGAGCGTGTT
CTAGCAGCAC
GATTTTTAAG
TATCGTTAGA
GTTATTGAAA
AAACGGGCGT
GATTTTAGCC
CCTTGAAGCG
CGATTATTGT
TCAAACTTTC
CCCGGCGTTA
ACTTTTGAAA
GTGGGGAGGG
TTTAGAAAAC
GACGATCCGC
GTGTATTTTA
TATATCGCTT
AAAATGCAAG
ATGGATAGGG
CATAAAAGAG
CATTCAAATC
AATGAGGGGA
TCTAAAAAGT
AGGCCTTATT
CCCTTGATCC
GCGATGGGAC
CTTCTGAAAC
ACAGCAGTGC
ATCAAGGGCT
AAGAGAGCTA
CTTATACCA
ATTCTTTTTC
TTCTTTCGCT
TTAGCGAGGT
TGGTGATTAT
TTTTTCCAAA
GTCATGCGTA
AACGCTCCCT
TTATAAopA
GCTCTATTCT
TAAAGAGCTG
TTCGCTCTTA
TTATCATGCA
TCAAAAAACA
CATACAGACA
TGATGCTTCT
AGGGGAGTTT
CCCTAAAGAA
CATTCCCTAJk 120 180 240 300 360 420 480 540 600 660 720 780 CGCTCTAJAjA
AATCAAAATA
INFORMATION FOR SEQ ID NO:292: SEQUENCE CHARACTERISTICS: LENGTH: 2007 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2007 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:292:
ATGAAAAATC
CATTTTTTAT
TATGCTTTAA
GAAAGCAGGG
AGAAACAATG
TCGTTTCCTT
AAAAAATCCA
GGCGTTAGCA
GACCCTATCT
AATCCTAATA
GOCTATGGGG
GGGAGTCATT
AACACAAAAA
TTTTCTGCTT
GTATTATCAC
TGGTGTTAGG
CGACTAACAG
TGACTAGCAG
CTATCCTTGT
AAGAAGAATA
ATGCTAACAC
GCCC TAGCAA CAAATGGCAjA
CAAATAATAA
TCCCCTAACA
TATTCTAGGA
CACTAAAGAA
CAAGAGGGTG
CATAGATATA
TGCTAAAACC
TTTGAAAGGC
CCAA3AAGCTA
GCCTTTTAGT
TAACGCTATC
TGATGGGGTA
TGCAATAGGC
AAAGCTTTAA
GCGTTTTTAT
ATTOACOCTA
TTTAAAGTAG
GAAATCACCA
AGTTTAAAAA
GAGAACGCTG
GAAAACATCG
AATGGTTCTG
AATGGCAAAG
TGAAAACTTA
TAGGTTTGCT
ATTTGCTTAA
AAGCTCATGG
GTCTTTTAAG
TACCTCCTAA
AAGAAGTGGC
CTCAAACCAA
ATAGTTCCTT
ATGGCGCAAA
TCCATATAAC
CAGTCCAGCT
TGGAGCGATA
GTTTTATTTT
AGACAATCAA
CGCCAAGATT
TAAGATTTTA
AGCGGCTAAT
TTACGATAAC
TGGGAGTAAC
120 180 240 300 360 420 480 540 600 660 720 AGTGGTATTG ATACAGATOG
CGTGTTAGOG
GTGGATGGGG
AATTTCACTA
CGCAGCTCAA
GGAGACACAA
TCTAATAACG
GTGCCAGAGT
TGTATTTCTA
GCCGGTAAAG
AATATCGGTG
GACAGCAAAT
ACCTTTCAAA
TGCAGCGATT
CAAACCTTAA
GAGATTTTAA
GGTCAATGGG
TATAACAAGC
AAAAGCGGTG
GATATTACTA
TGCCAAAAAC
TCACCGATTG
GCCCAGCCCA
CCATCAACAG
TGAATGGCTC
ATCATGGCTC
GTGGTGGGAG
ACAATTCCAA
CCACTAATCC
TAAGCCCC;:A
TGAACGCTTT
CCATCAAGCA
GTTGTGTGGA
GCGCCTTACA
CTGAAATCGT
ATGCAAGGGT
CGCCTATAGT
ATCGTGGGAT
AATACAATGA
TTAATGGAGA
CGGTTGTTAG
CAAGCCATTA
TTTATGGAGT
GAGCGCCACT
AGATAATGAA
GACTCAAAGC
TAGTTCTTCA
TACTAACAAT
TTTAGGGAAT
TAATTCCACT
TAGTTCGCA.
CAACACGATG
AAGAGATGAC
AACGCAATAC
TTTACAAGGC
AACCACGAGC
GTTTCGTGGG
GCAAGACAGG
GGATCAGTAT
TGCCACCCAA
CGCTAAATTA
ATGGGTAGAA
CCCTTATGTG
TTTTAGGATA
CAATCGTTGC
TACAAAACCA
AAACACCCCA
GTTTTGA
AGTGGCGGCT
AACACAGGAG
GGGGGGCTTT
AACACCACTA
GAAAACAATT
AAACTAGATG
ACTAAATGCG
TACTATGTAG
GCTCAATACG
GATAAAGGTT
ATGGACAATT
ATTGTTAGGT
TATAATGATC
TTAAGCTCGC
GAAGCCAAAA
GTTACGCCCT
ATGGGCGTGC
GAAAGGAAAA
CAACCGCAAT
CTACCACCCA
CAACCTATAA
CTGTAGGGGG
GGTATGACAA
TCCCTATTCC
GCCCAACTAA
ACTCCAGCCA
TTATCGCTAA
CTTATAGATA
ATAGGGAAAA
CCATGCAACT
ATGGTATGGG
TAATCCATGT
ATGAAAAAAA
CTAACAACCC
AATATCAAGA
GACCTACAAT
GTAATTTTGA
CTAGTTCTAA
ATTATGGTGA
ATTCCATACT
AACCACTCAA
TCTTATCACC
TTATGAGAAT
TTTTAATAAT
TTTTGGTAAT
TGGCAGTAGT
GTATTGTAAA
AGATGGCTCT
CGATTTTGAA
TAAAACGCAJ\
TTACAAAGAT
GAAAACGCAA
CGCTGTGCCT
TGATAAAACC
TAACAAGCA.
ATTTGCATGC
GCTAAAAAGC
AGCAGGGATT
AGTCTTAAGC
GAGAGAACAA
GATCCTAGTA
CCTTATTTAC
ACTCAAACCA
780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2007 INFORMATION FOR SEQ ID NO:293: SEQUENCE CHARACTERISTICS: LENGTH: 987 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...987 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:293:
ATGACTTATA
AAAACAGAAC
TTTATGTTGG
CTCATGAAAA
TTTGGGGATT
TTTGCTTTCA
GCTCGTTTCA
GCGATCACTT
TATTATGAAA
CACATGATCA
ATGAAAGAAC
AAGAACGACT
TGCGCATTTT
CTAAAATGCC
CTTTTGATAG
CTAAAAGCGA
TTATTTTAGA
CGCTCATCAA
TGTGCATGCA
ATTATGGTAG
TGAGTAGGCG
TCATTAAAAT
CATACACGAA
GAGCGTTTTT
TCATTTTTGG
GCGCCGTTAT
AGAAGAAAAA
AACCATTAGA
TGAAGAAAAC
TTTTGGCTAT
GGGGTGTTTG
AGAGGCGTTT
GTATAATCAA
AAAATATTAA
ATCGTGGAAT
TTTTTAAGGT
TTTGACGCTA
AAAAGGATCA
GTGATCTTTA
GTGTGGAAGT
TGGGAAGCGG
GGGCGTTTGA
GGGGTGCGTT
GGCAATGGCC
ACCAAAACGA
TTTTGGTGAA
GCGTTAAAGC
AGGCCAATTT
TTAAAAAGGG
TCCCTAAAGA
CCTTAAACAA
TAGGCACGAC
CTAAATTCGC
TTGTCAATA3
TGGTGGGGAT
CAAGGGTTTT
TATTCTAGGG
TTTAGCGTGG
GGATTTTGTG
GTATGAAAAT
TGAATACGAC
GGAAGGCCAA
TTTGGCGCAA
TCCCATCAAT
AATAGGGGCG
TTTAGTGGAT
120 180 240 300 360 420 480 540 600 660
CAAAATGTCG
ACCACGATCG
GATTTTAACG
ATCACTGATA
GAAGAGGTGA
ACCCACCCTG
TGCCTAAAGA CGGGGTGGTG GTGAAATTTT TCAATAAAGA CGCTACGACC CTTCTATTTT GTCGCGCCGT TACAATATAG ACATTCAGCC GGTATTCATT ATGATTATTC GCATTACACA GCGACTTATT ACCCAAGTAT CCGCTCTCAA ACGCGCAAAA CGATATTTTA GAATGCACGC AAGCCCAAGC GAGTTTGTGC TTAGAAACCA CCCGGAAAGT TATTTTTGGT TCCACAGGCG TTTTAAAAGC AGATTTATCA AAGATAG 720 780 840 900 960 987 INFORMATION FOR SEQ ID NO:294: SEQUENCE CHARACTERISTICS: LENGTH: 417 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...417 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:294: ATGTTATATC TAAGGAAAGA AGCGTTTTGA ATGGCGATGA AGTGGGATTA TTGGGATTGA ACGATGAAAT CGTTGTTTGG TTGATGAGCA AACACATGAA AATTATGAAA AAGCGGTTAA GCTCCTAGCA ACTTCCAATC
AAATGGGGTT
TCTGAGATTG
TATTGACAAA
TGAAAACTTG
AGGGCCTTTA
TGGTTGTCAA
AGGCTCATAC
AGGACACTAA TAAGTTTAGG GATTTTGTTA TATTCAAAAC CTTTGGTCTA TTCGGCTGGA CGGACATTCT ATAAACGAGC GTTCGCTTTC CTTTTGTTTG TTAAATTGAA GCATTCTGCG GAAAACCGCC ACCACCATTC TTTCGCTAAA AAATATGTCC ATATCAAATT GCCTGAAGGC ATGGCGACTA TGGTGATGCG TTTTTAA 120 180 240 300 360 417 INFORMATION FOR SEQ ID NO:295: SEQUENCE CHARACTERISTICS: LENGTH: 987 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...987 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:295:
ATGCGTAAGG
GATGATTTTT
GATTTAAACG
TTGAACATTG
ACTTTATTGT
GATGCGAAAA
GTGGATTCTA
TCTACCACTT
TATTACACTA
GCCCCTAAAG
GAAAAAGAAG
AAAGACATCC
TGGGCTTGTT
CAATTCACTT
AAGGTCGTTG
GCTGAAGACG
TTTGTCAAAA
TTTTATACGC
TAGAAGAAC
CCATTCAAGG
ATTACTTTCA
TTTTTTCAAA
TTTTAGAAAG
ATATCAGCGT
TCACCAGCTC
ACGCTTTTAT
ATGCCCAAAA
AAGAGACTAT
AAAAAGGCTA
CTAAAAAATC
ATTTCAAATT
ATGGTTATGA
TTTCTACTAA
GGCGCAAAGG
TCTCATGGGC
GAACGAAACA
GAGCTTTTTT
AGGGCAAACT
ACCCATTAGC
CAATGATCGC
GATTGATAGC
CAAACACCCT
CAAGCCCCAA
AAATAATAAG
AATTATTGGC
TAAGGCTTTA
CAAACTCTCC
TGACAAAAGA
TAATCCGGTG
ATGGACTTTA
TGAATAA
TTTTTGTTGG
GCCCCAGCGA
GACAAAAACC
TATAAAATCC
GATTTTGTTT
ATTTTACTCA
GAGGGTAAGA
AATTTACAGG
AAAGAAAATC
CCCCTAAAAG
GATAACACCA
AAAAGCTCTC
TTAATGCCTA
TTAGCACTCT
AATACAAGGA
AGGTTGGGTA
TTTTTAGTGC
ATTTAAACCA
GCTCCAAAAT
GCTTGCGTTA
TGGGGGATAA
TCAAACCCTT
TTTTTTCTTT
TTTTTATAGA
AAGAAAATAT
AAGAAAAAGA
ACGCGATGALA
AAAGGAAATG
AAGAAATCTT
CTAAATTCCC
TTGTGGGCGA
AGGATTATCT
TTTAAAAGCC
CCCCATGCAG
GTCCAACACT
TGCGATGGCG
GGTGGGTTTT
ACAAATCGGC
CTATGTGTTT
AGATAAAALAC
GTCTGAAAAT
AGAAACTAAA
AATTATTAAA
GTATTGTTTA
TAACGACAAG
GGTGATTTAT
TTACATTATC
GTGTATCCGT
INFORMATION FOR SEQ ID NO:296: SEQUENCE CHARACTERISTICS: LENGTH: 1008 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1008 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:296:
GTGATGTTTC
GGGGCTTATG
GGCGTGGCTT
AAACACGCTA
CCAAGCGAGT
GCCACTATAG
GACCGCTACG
TATCAAATGC
CGCTTTTTAA
CATAAGCGTT
AACGATGAAA
GTGGTGAGCA
ACAAAGCCCT
ATTTCAAGCA
TAAAGGAATT
AAGTCATCAA
ATGTTTATAC
GGGCTAATCA
CTCAAATTTC
TAGGGCTAGG
ACCAACAAGA
TGGTGGTAGC
TCCTAGCGAT
ATCTTTCATA
TATTACCTTT
TTGTCAGGCT
GCCTAAGGGC
ATCCGATCCT
ATTAAGGGAT
AGCCACAGAA
ACAAAAAACC
GGTAGGGGGG
GCCTTATTAT
GCAATTTGAT
CAATGATTAC
CCAAAGCCTC
ATCGTTCTAT
TTTTTTAAAA
GTGTATTTGT
TTTATAGGGC
TTAGACAAAG
GCGCGATTAT
CAAAAAAATG
AACGGCTTTA
GGGGATATTG
CCCTTTTTCC
AAGATCCACT
GCCAAAGTGA
GGTTTTTTTT
AAGCGAGCCT
ATTATTCTAA
TGTATTTGTT
ACGCCCTTAT
TGGTAGGGCA
GCGTTATCAG
TAGAAACGAA
GGGTGCGTTT
CTAAAAACCC
CGTTAGCGGA
AAATCAAACG
GAATGGCTTA
TCAAAAAGGG
AACCTACCCC
GCAAAGCGCG
AAGACCCATG
AAAAGGCTAT
CAATATTTGT
ATTCATCAAG
AGAAGAACGT
TTTTTTAAAA
GTTTGAATGG
AAACCATCAA
ATCAAAGAG, TAACGCTCAA AGTCAATA CGCTATGGGG TTTTTAGAGC GCTATGGCAT CGCTTTAGAC GAGCGTTTTA CATTTGCCCA AAGGCTTGGA TTTTTTAAAG CTTGGGGATA AGAAGCGTTT CGTTCAACCC GAAAGCTTTA AGAGAAGCGT TTATTAGTCT GGCGTCAAGG CTTTGAATTT TACATTAAAG GGTTTTTGCT CAAAGACACT TCATCACTAA AATAGGCGCT GGATTTTATG GGTGAATCAT TAAGCACGCC TAAAATTGAA
TCCGTTGA
780 840 900 960 1008 INFORMATION FOR SEQ ID NO:297: (i) SEQUENCE CHARACTERISTICS: LENGTH: 753 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...753 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:297:
ATGCTAGGAA
AGAGGGTTAA
AAGAAACAAG
GGTTTTGGGT
TTGAAAAAAG
CGCCTACTAG
TCACAATATT
AGCCTACACC
GAAAAAGCTT
GTGGCAAAGA
GGTAGGGGGT
GAAAAGCAAG
GACGCTCTCA
GCGTTAAAAA
TGGCAGAGCC
ATTTCGCTCA
GTGTTTTTTT
CCATTCAATT
GAAATTTATA
ACTCTAAATC
ATTATGGCGT
GCGATTTAAA
ATTTTAAGGA
GCTATAATTT
CGGTAGAAumA
AGGAATTAAA
AACCCTTTTT
AGACGCTAAA
AGCTAAAGCG
AGGGGCGTTT
TTACACTAAA
CTATAACGGA
TTGTGAATTG
AGGCACGCCT
AGACAGCCCT
GGCTATCGTT
AGGGGTTATG
CTTTAAAAAA
AATAGAGCTT
GGGGTCTTGT
GAACTTGTTA
CATTTTGAAA
TATGAAGAAG
GGTTGTGAAT
CAAGGCGTTT
AACCATGCTG
AAGGATTTAA
GGGTGTATTA
CGTTATTCTA
CAATACAACG
GGCTGCAAAT
TAG
GTTTGGGCGC
GTTTAGGTAT
AAGCGTGTGA
GGAAAGGAGT
TAAATGATGG
CTAAAGACGC
AAGGGTGTAC
GAAAGGCTCT
ATGCAGGATA
AAGCATGCGA
CCCAAGGCAC
CAAGCGTTAA
GTTGTGTTTG
AGAGAGCGTG
GTTAAAAGAA
GGGAAAAGAC
TTATGGGTGT
CAAAAAkAGCC
GGTCTTAGGA
TGATTTGTAT
CATGTATGGT
ATTAAAAGAT
AGCAAAGGAC
AGAAGCATGC
INFORMATION FOR SEQ ID NO:298: (i) SEQUENCE CHARACTERISTICS: LENGTH: 372 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...372 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:298: ATGCGTTTAT TGATCGCGCT GCGGATTTCA TCTCTGATTG GCGTGCGCAA AATGCCATGG AAAGGCGAAA AAAAAATCCT AAAGACGCCT TGAGTTTGGG ATCCAAGCGA TTTACCTTTA TCCAAGCCTT AA
AGTTTTGTTC
GGAATACGGA
CATTAAAGGC
TTACGCCCCT
CAAkAGGCATG
CATCACCTCT
TTATGGTGGT
CTGGCCCTTT
GAACAACAAG
AAAATCAACC
ATGCCTAAAT
TTAGGGCATA
TAAATTTAGG
ATAAAAACCC
AAATCACCTT
ATTTGGATTT
ACAACCTGAA
AAGACGAGCG
CGCTAAAGkA
TAGGGGTGTT
TTATTATGAA
TAAAACCTTT
TTTAGAAGAA
TAAGGATCCT
120 180 240 300 360 372 INFORMATION FOR SEQ ID NO:299: SEQUENCE CHARACTERISTICS: LENGTH: 543 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...543 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:299:
ATGTTAAACA
TTGCAAGCTA
TTAATTTCTG
CACAGAGATA
ATTTATGATT
TACCCTAACA
GGCATCATGC
AATTGGAGCA
GAGACGATCC
TAA
AGTTTAAAAA
AAAACAGCTT
GGATTAGCAG
TTGCAAGAGC
ATGAAAGCAA
CGAAAGTGTG
ACCAAAAAGT
AAAACGCTTT
TCAAAGCCAA
AATCGTTGGC
ATTTGTCTTA
COCTAGAGAG
GATTAAAAGC
TCACAATAAC
CTTATTAAAG
AGCGATCATT
TGAAAACAAT
GAGTTATTAT
GTGGGTGTGT
CCTTATGAGC
AACGTGAAAA
GTAGCGAGTA
AAGCAATCCA
GGGCTTAAGG
GATGATAAAA
TATGAAGTGC
CAAAAGATGC
TAGTGGGCTG
AAAGAGACGC
TCGCTATTTA
GGGGGATTAA
CTATTGGCTA
CTAAAAACGG
TCGTGTTTTT
TTCTAAAAAC
TAGAAGGTTG
TTTAGGGGTT
TCTCAATTCT
TAGTTTCACG
GGTGCAAATC
TTTAGACAAA
GAATTATTAC
AGGCTCAGCG
CGATGACACA
CGTTGGGTTT
INFORMATION FOR SEQ ID NO:300: SEQUENCE CHARACTERISTICS: LENGTH: 423 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...423 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:300: ATGAGAAGGA GTTTGGCTTT AGAGACTTTT CGCAACTCAA AATGAAGCGA TTGATTATCG GACGCTAAGA AATTCCGCGC AGCGAAGAGG ATTTCAAAAA AAAGGTCTAA GCGCTGAAGA GATACGAGAA AAGTTTGGTG
TGA
TTGCCTTTTA
AAACGAAGAA
CATGGAAGTG
GAATTTCAGC
AATGCGTGAA
AATCAAGGCA
TAGGGCTGTT
GCTTTGCTTG
CTTTTAAAAT
TCTAAACGCC
CGGATCGCTA
GAAGTGCGTA
AAAGGACTTA
AAGAAAAAAG
GATTACAGGT TTTAGGTGCT TAGCAGGAAC TCTGCCTTCT TTAAAGCCTT AAGTGCTGAA GGAAGAATCT TTCTAAAATG AAGAATTAGA AGAAAAAACC ATGTGAGCGT TTGCAGCGGC ACGAACATTG CTCTCCTAAG INFORMATION FOR SEQ ID NO:301: SEQUENCE CHARACTERISTICS: LENGTH: 459 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...459 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:301:
ATGAAAAAAG
AAAGATTTCA
CAGGATATTO
AAGAGAAAGG
ACCGTGGCGG
ATGGAAGACA
CACGATAAGC
CATGACCACC
CGTTGAAAAT ACTTTCTGTT GGCGCGTTGC GCAAAACAAG CGATGAAGAT TTGGCTAAAA TGGATTACAC AAAAGAGTTG AAAAAGCGCA CGTTCCATAA ACAATTGCAT GAATACGCGA ATTTTGAAGC CCGCCAAAAA GCCATTAAAG TGGATGATGA TTTTGGGTTG AGATCATGCA ATGGCAAGAA GCATGGCAAA AAACATGACA ATGATGAAGA TCACAGCGAT AAGCACTAA
TATTTGTGGC
TGGCTGGCGT
TGGAAAAGAT
CTAAAAACAC
AAGCGCTTAA
AGCATGGGAA
AAGATCATGA
TTTAAACGCC
TGTCGCTCCG
GCCTGAAGAC
AGACAAAATG
AAAAGGCAAT
AAAGCACAAA
CGATAAAGAC
120 180 240 300 360 420 459 INFORMATION FOR SEQ ID NO:302: SEQUENCE CHARACTERISTICS: LENGTH: 423 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...423 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:302:
ATGCTAAAAA
CATTATGAAA
GCGATCAATA
GATTTTTTAG
TTTAAAAGCG
TCCAACAAGC
GCGATAACGC
TAA
AGTTATTGCT
TCATCGCTGA
AAACCTGTAT
AGAATTTGTC
TAGGGGTTTT
TCGTATGCGT
AAATCATAGG
CATTTCACTA TTTTTAGGGT ACTTTCCAAG GCTTTTTTGA TGAAACCGGG CATGATCGCA TCAAACAGAG CAACAATTTG AAAAACCTTG CTTAAAGACA TGCCCCAAAA AACGCTAAAA ATTAGAAGAG CAGATGAATC
TTTTAAGAGC
AAGCCAAAGA
CTCAAATACG
ATGATTATTT
TCCAATCCTT
ATTTTGAAAT
AATTTATCAA
AGAGGGCGAA
GGTTTTGACT
CCTTCAAAAC
TGAAAAGGAT
AGAAAAAACT
ATTAGAGGGA
TGGCGCGAAA
INFORMATION FOR SEQ ID NO:303: SEQUENCE CHARACTERISTICS: LENGTH: 201 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...201 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:303: TTGAGTGGGG AACTGAGATT GTCTTGGTGG TTAGTGGCTC GTTGCAAAAC ATTGTTCGTT CTTTTTTGCG AAGGATCAGC GTTTTCTTCA TTATTGATTT CATCAATTTT ATTTTCGTTA 120 TTGATCAGAT CATTCACTTC TTTGGTGACT TTTTCATCTT CTTCTTCATG CTCAATCTTT 180 TTTATATCCT TATTAGGTTG A 201 INFORMATION FOR SEQ ID NO:304: SEQUENCE CHARACTERISTICS: LENGTH: 270 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:304: TTGGTGGTTA GTGGCTCGTT GCAAAACATT GTTCGTTCTT TTTTGCGAAG GATCAGCGTT TTCTTCATTA TTGATTTCAT CAATTTTATT TTCGTTATTG ATCAGATCAT TCACTTCTTT 120 GGTGACTTTT TCATCTTCTT CTTCATGCTC AATCTTTTTT ATATCCTTAT TAGGTTGATT 180 GATTTGATTG TTTTGAGTGG TTTTAGTGTT GTTTTGGTGC GATTGGGAYG ATTTTTTAGG 240 CTCATCTTTG CATGCGCTCA ACACGCTTAA 270 INFORMATION FOR SEQ ID NO:305: SEQUENCE CHARACTERISTICS: LENGTH: 357 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:305: ATGAAAAAAT CCCTTTGTTT GTCTTTCTTT CTGACCTTCT CTAACCCTCT TCAAGCCCTT
GTCATTGAGC
GTCCTTGATT
AAACTCACGC
AAACTTTCTT
ATCGTTTTTT
TTTTAGAAGA GATTAAAACT TCGCCGCATA CTAAAGAACC AAGACAAGTT TTAGGCGTTT TCACTATCAC TCACATATCC ACGGCAATCG TAGAAACGAC CTTAAGCCCT AACCGCCCTA CTTCAAAAGA ATTGAAAGAA CCGCACTCAA
AAGGCACTTT
ATAATATCTC
TCTATCAACC
CTATCCCTAG
ACCCAATACC
TAAGGCTAAA
CCCACACAAA
CCTTGATGAA
AAACACCCAA
TTCTTAA
120 180 240 300 357 INFORMATION FOR SEQ ID NO:306: (i) SEQUENCE CHARACTERISTICS: LENGTH: 738 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...738 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:306:
TTGAAAGCGT
CAAAACACAC
ATCCTAGCTA
GCTCAAATCA
TTAGATCCGG
AGCTTAAAAA
TATGCGCTCA
TTAAGGGATT
GGGGAATTTA
CGATTCAACA
TTACTCGCTC
ATCATCATTC
ATCAAAAAGC
TTCAAACTTT
AGGGCGTGCA
TTAGAGATGA
TTGAATGCGT
TGATGGTGGC
AACGCCTTTT
CAGGCGTTCA
TAGGCGTTAA
GCAACGATTG
CCAAAAACAC
AAGGGCTGGA
AAAACCCCTT
ATGTTTGA
TGGCGTGTTT
TGGGGTGTAT
TTTTTCGATC
AGCGAACGCT
AAAGAATGGG
ACCTAAAACC
AGCGCGAGAC
AAACGCTGTG
GGTGTTTTTA
GCATGGCACG
TTTAAAAAAC
AAACATTGGG
GGGACAAGCG
CCATTGAGTG
AAGGCGTTCA
TTAGAAACTT
GCTTTGCTTT
AATTTACTAA
GATAAAAGCG
ATTAAAGGGG
GAAGACGCTG
GGTTGTACTT
GCTATCACAA
CATGGGCATG
TGATCACTTG
TAGAGAGCGT
AAATGGGGGC
GTGATTTTGG
TAGAAGAGGA
CCCCTAACCT
CTTCAAAAGC
GGCATACAGA
AATTTACTTT
TGTCTAGCTT
AGGCTAAAGA
GGCCTTTGAA
CATCACCGCT
GAAAGCGCAA
GTTATGCAAC
GTTGTGCGTT
GGCGATTTTA
CCCTGAAGTC
GATGGGTGTT
ACATTTTCAA
AAGCGCCAAG
GATTGTGGGC
GCTTTTAACT
TTTGTGGAGC
120 180 240 300 360 420 480 540 600 660 720 738 INFORMATION FOR SEQ ID NO:307: SEQUENCE CHARACTERISTICS: LENGTH: 642 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...642 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:307: GTGCTAGAAA AACAAGATAA ACCAAAAGCG AACAGGAATT CAGCTGGGCG ACATTGTGCT GACGATTTGC CCGTTATTGA ACCCCCCTAA AAGAATACAA CAAGAGTTTT TGAATACAGC GTGGTTGAAA ACGAACGCAA GATTTTTACG CTCTGTACCC AAAGAGAAAC AGCTTTTTTT GCTCAAAACA CCCTTTTAGG CTTTTTGCCT TTGAAGATCG
AGACCTTTTT
AAAAATCAAT
TAAAAATTTA
AATGAAGAAA
AGAACTTTTG
CAACATGCTT
AAAAGCTTTT
TAACTTGGGC
AAAAACCACC
CTTTTCTTTT
CTTATTGGAC
CTTTTGCAAG
CAGCCTTATT
GTGCCTGCCC
TTAAAAGAAA
AGCGAAAAAT
TTAAGCTTGC
TTACAGATGA
GAAATTGGCG
CTCCAAAGGA
GTGGAAATTG
ACGCTAGGGT
TGGGTAACAG
TTGCGACTAT
CAAAAATCTT
TATTGAGCGC
TAATCCATGC
AATCGCAGGT
AAGCCAAAAA
GCGTTATTTA
CCAAAGAGGT
TGTGCGAAAA
AA
GATTATCCCC
GCAAAGAAAT
AGACGCATTG
CAAAGACAAC
TAAAAGCTCG
TTTAAGCTTT
ACAAAGCGTT
TTTAAAAGAA
TTTAAAAGAG
AACCCCCATG
INFORMATION FOR SEQ ID NO:308: SEQUENCE CHARACTERISTICS: LENGTH: 276 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:308: ATGTCAGATG AAATCACGCA AGAAAACGAG TTAGAAATTA ACTCCAATAA TCAAAACCAA GAGCCAAAGG AAGTAGAAAA AATGCCATTG AATAATATTC AAAAAGCTAA GAAATTAAAA AACCACGCCA ATTTAATTGT TCGACGCACT GATGAGTTAG ATAAGGTTAT CAATAAGCGT GAAAGCTTGC AAAGAGAGTT TAAAAGACGC ATTAAGCACT TGGATAACAA GATTGAAACG CTTAGTAACA ATATTGAAGA ATTAAAAAGA AAGTAA 120 180 240 276 INFORMATION FOR SEQ ID NO:309: (i) SEQUENCE CHARACTERISTICS: LENGTH: 519 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...519 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:309:
ATGAAAAAAT
GAATGTAAGG
ATAGAAAGCC
TATGAAAGTT
AATGGTAATA
CCCTTTTATT
AATAAAAATG
GAACAAAAAA
AAACAATCTA
ATCAAAACAA ACGAAAGCTG ACATTTTAGA AAATTATTTA TTTTATTAGC CCTAGACCCT TGTTTAAAGT CATCAGTCTT ACATTAACCA AATAGCCAAG TGCAGAATTT AGCATTCTTA TAGAAGAATG TAAAAAAATC GCCAAACAGA ATTTAACGCC CTACTGAGCC AAAAAAGGAC GGTAAGCGTA TCAATTGCTT AAAGTCAAAA AAATTAGCGT AGCAAAGAAA CACTAAGAGC AACCAACATG TCTATAAAGA AACACCAACA TTGCCATTAA GAAGAGATTT TTAAGGTTAT TCTTTGGAAT TATTACTTCA ATTTCAAAAA TTCTCAACGC
AACGCATGA
TATTTCAAAT
TACTGAGTGT
AAGGGATTTT
ACTCATGGCT
GTATAACAAG
GGATACACTT
AATTTACAAA
TCTTAATGAA
120 180 240 300 360 420 480 519 INFORMATION FOR SEQ ID NO:310: SEQUENCE CHARACTERISTICS: LENGTH: 963 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...963 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:310:
ATGAACACTA
GAGGGCTTTT
AAATTAGAAC
TCAGCTCCTT
CAAAAAAAGT
TCAGAAATTG
ACAGATGATT
TTAATAGGTT
ATTTAAAGAA AGCCTTGTTT TTAACTACTC TTGGTTATTT TAGCCATTAT GCCTGATGAA AATAAACAAG AAGACATTTC ACAAAATGGA GCAAAAAATT GCTCCACAAA ATACCCTTAA TACAAAGTGC TGAGTTGTTA CAAAGTCAAG TTAAAAGAGA CTCAAAAACA CCTAGTAGCT ATGCAAACTA AAAATGGCTT CTATTACAGG CACAAGTTTA AAACAAAGTT TAAGCCTTAT TACTCACTTA TAGCAAGACT TCTAAAAACG CTAATTTTAA ATCAGCATTA TTTTGGAGCT AGTAAAAGGC ATGGGATTAA
GCAAGCTAAT
TCAAGAACAG
AAAGCAAC.A
AGTTAAAGAG
TTTCTTAGGC
TTCTAATGCT
TATGGGACTA
AATCTCAAGC
120 180 240 300 360 420 480
CATTTGTATG
GTTAGCACCA
TTAAATGTCA
ATGCAATATT
AATAAACCTA
TTAAACCATC
GTTGTGCCTA
AATTTTCTTA
TALA
GAGGCACACC
ATTACACGCC
TTAAAAAGALA
ATTTTGCAAA
AAGATATTTT
ATCAATTTGA
GCAAAATTGT
CCAAAGCGAC
TAGTGATGTA
TTTAAATTTT
CCGCCATACA
AATAGACCCC
TAATCATGGT
AGTGAATTAT
CGTTCAAAAC
TAATTCTAGC
AGCACAGACA
GGCATAGATG
CTAGGATTAA
ATAAAAACCA
TTTTATCCCA
CGCTTTGGAG
GGGAATTTTA
TACATAGCGT
TAAAAATTTA
TGAAGTATTT
GTGTGGGCTT
GCCATAGTGA
CTATTGGACT
GAGCALATTAA
AGGGCGAGGT
TCAACTACGC
CAAATCTAAA
GTTTGATTTT
TGGCTGGCGT
GTTTATCACT
ACATTATTAC
CTATCAGTCA
CTTGTTTTCT
TTATTTATTT
540 600 660 720 780 840 900 960 INFORMATION FOR SEQ ID NO:311: SEQUENCE CHARACTERISTICS: LENGTH: 1461 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1461 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:311:
TTGGCTATTA
GGGTTATACG
TTGGCAGAGT
ATTTATCCTA
TTATCCAA~A
GGTGGGATTG
ATCGGTTATT
CATGAGTGCG
AGAGCCAATA
AAGGATATAT
TATACGCAAG
TTATGGTGGT
TACTCTCCAA
GATTACACTT
ACTTATTATA
GGCTTTAGAT
GATACTTATC
CGATTTGATT
AACGCTTACA
TATGATATAG
TTTGGTGGAG
GCTTTAGCCA
ACAGCGAGCG
AGTTACAGAC
AAAGGGAAAA
TTTTAAGTTT
CCTTTTCTAA
CAGAGACTTT
GCCTTAAAGA
CTTATGACAG
GGGATGGCTA
CGCTTGGCTC
AAATCCGCCG
TTGCGGCTAA
GCTTTGAAAT
TTAGCTCATG
GAACCGTGAT
ACGAAAGGAA
GCCCTGGGGT
CCGAAACGAA
GTTACGCTGT
ACAATGAATT
TAGGCACGAC
GGCAAGCCTT
GCGTGCATAA
ATGAAGCGAG
TGAAATTAGA
CCACGCCCGG
AATGAAAAAT
AAGCGCTTCT
AGTTGGCTTT
TGTAACCGCT
TCAAGGGCAT
CACGAAATTC
TTTAGGGGGT
TGATGGCAAG
TAATTACTTG
AGGGGGGCGT
CAGCGCTAAA
GGGTAGGGCG
TAAAAACGGG
AGGGGTTAGC
GGCTGTAGGC
GGCTTATATT
GAAAGCTGGC
TAACTTTGGG
AGGAAACCCT
AAGCCATGTG
AAAATGGCTG
CGCGGCTGTT
ATATTTGGGC
CTCTAAAGCA
AGCGCGCCTT
GTGCAAGCGT
AATAAAAAAA
GTGGGTCAGG
GTTTTAGAGG
AATCAAGGCG
AAAAGGGCCT
GTGATTGATT
ATGAATAACG
TATCAATCCA
ATCAAGGATA
TTCGCTTATG
CGCACTCTAA
GTCAGCCCCT
TATGATAGTA
TTGCTCCCTG
ACCGCTGGGC
GGAGCGTTTT
CTAGGCATTG
GTAACCGCTG
TGGGGGACTT
AATGTGGGCT
GTGATGACGC
CTTTATTCAG
TAAAGAATAA
TTGATTATAA
AGATTGACAT
GCAATATTTA
GAAAAGTCGG
GATCGGTTAT
TGCTTGATGG
CTATAGCGTG
CTTTTTTAGA
ATGCTCCTTA
AAAATGAGGG
GGGAGTGGAT
ATTATGGTAT
TTTTCCAGTT
ACCCTAATTT
TCCATGCCCC
AAAGCCTGCT
ATAAAGTATG
ATTTTTGGAC
ATGCCGTCTC
TGTGGCGTTG
ATAAGATCAG
ATTCAGGCTT
ACAGGAGCCA
AGTTTTTTGT
AATTGAAGTT
AGCTAGGGGG
TGCGGATTTT
CGGCACAATC
TTATAATTAT
CACGAGTATC
CGGGAACGCT
ATACCGCTAT
TATGAGCTCT
GAGCCATAAA
TTATGATTTT
CCATTTAGTG
TTCGCCCGGG
TAATGGCGTG
CCTTAAAAGG
CATTAGGCAA
GAAAAACGCA
CAATAGCGTT
TGGTTGGGTT
GACTAGCGGC
TAAGAGTTTG
TACGGTAGGG
TCTAATGACA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 ACCCTTAGCG CCAAATTCTA A 1461 INFORMATION FOR SEQ ID NO:312: SEQUENCE CHARACTERISTICS: LENGTH: 504 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...504 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:312:
GTGTGCATTA
GATAACAAGC
GAACAAATCT
GAAATTGCTA
AGTAAAGAAC
GAAGCGCTTG
AAAATAAAGA
AGCTCGAATT
AAAACAAATC
TTTTTGATGC AGACATAAAA AAAGAGAACC AAGAAAGCGA TTAAGCATAT TCGTGAAAAA TTCAAAGAAA AAGGGACTGA TTTTATTCCC TAACAATCAA GATGATGGCG ATTTAGAAAC AACACGATGA ATTTCTTAAA TGCTTTGAAG GATACTTAGA ATTATAAACC GATTAAAAAC ATAAGAAAAA ATATGCTGTA GATTAGAGAA TTTGACCAAG ACCAATATAG ATGTTTTTGA GTAGATATGA AGAAAACTAC AAAAAATTAA CAGAAGAAGT CTCTCATTCC CCTTAAAAAT TTTCTAGGGC AATTCGCAGA CTAAAATTTT CTAA
TGCAGGTTTT
TTTTCCCAAA
CCTATTATTA
ATGTATTAAA
TGCCTATTTA
TAGTAAAGGT
TATTGATTTC
AAATAAACAA
120 180 240 300 360 420 480 504 INFORMATION FOR SEQ ID NO:313: SEQUENCE CHARACTERISTICS: LENGTH: 633 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...633 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:313:
ATGGTATTGA
GTGGGTTGCT
ACTGCAAGTA
TTTAGCGACC
ATTAACGACA
CAGTTGCGAT
ATTGAAGCAG
AATCAAGACA
AAAGTATCCA
ACCCTAAGCC
GTGAAAAATG
AAACAAAATT
CAAGCGAAAT
TTAATAGCAC
CTAATTTTCA
CCACTCACC
TGCGATCTAA
ACAGCAGAAT
CCACTGTAGA
GTATCGCAGC
TTACCAATAG
CTAGCAATAA
AAAAATTATA
GGCGACTTAT
GGATTTATTG
GCAACTCAAG
CAATTTGGAC
TGGGAGGTTC
GGTGAAACAG
AAAAGGCACT
CTCTATTAGT
GAAAACGGGT
GCGTATGTTT
AGCTCGGTGA TTTTGAGTAc CAAALATGTGA ATGACGCCAC CTAACCGCTA ACGCGATGTT GGCAAGCATT TGATTGAAGT ATGAATCTTT TGACGACCGA AATATCACAA GGGCGAGTGG CGCGAAAAAG AACGAGAGAG TTAAAAGCCG CTGATTTATC AGTTCTAGGC AACGCTTAGA GAAGAGGTAT GGAGCGATGT
TAA
TTTATTGTGG
TAAAAATACG
AGATTCCATG
TTCAGATGTG
ALATCGCGCGG
AGGGAGTGGT
CGAAGAGTAT
TTTAAGTGGT
CTACGACTTC
TAAGCCTATT
INFORMATION FOR SEQ ID NO:314: SEQUENCE CHARACTERISTICS: LENGTH: 555 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...555 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:314:
ATGAAAGCTT
AATCCTTTAA
CATTTCAAGC
AAAGCCCCTA
AACGATAAGG
AAAGACAAGA
TTTAAAACGA
TCGTTGGAAT
ATCAACCCTA
ATTGTACGCC
TTTTAAAGAT TTGCATGGTT TTGATTTTTG CCCTTTCTAA AGAAGAAGAA GTTTTACAAA AAGTGTTAAA AAATGAAAAG CCCTTAGTTT ATTGGGCTTT ATGGGTTTAT GAAAAACCCT AAGTGGTGGT TTATGAGCCT AATTTATTCC CGGATTTTTT CACCATTCTC AAGCAATTGA CTATCAACAA GACCACTTAT CGTTTGGTTT TTAAAGATGA CATGAATAAT CTTGTTACGA AAATTCCTAA TGAAATCTTT GTTTTTAACC
AGTGA
TGGGTGTCGC
ATTTGCAAAG
ATTATGGGGT
TAAAAAAAGA
AAGCGACTAT
AAAAGCAGAC
TTAAAGACGG
TCACTTTTTC
CTAAAGATGA
TCATGCTAAA
TTTTAGCGCG
TTTAAAGGCT
AATTTACATG
CACGCCCTTA
TGACGGCTCT
CAAGCCTTTT
TCAAGCAGAA
AAATATTGAT
120 180 240 300 360 420 480 540 555 INFORMATION FOR SEQ ID NO:315: SEQUENCE CHARACTERISTICS: LENGTH: 558 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...558 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:315: ATGAGAATTA AGGCTTATTT TTTAGCGCTT GTAAAAACTC GATAGCCCTA AAACCTACAC GATTTAGATT CTCTAAACAT GCTTTAGATA ATTCTTTAAA AAAGATCGTT TGAGGGTGCT GGCTTTATTG CGCCTTCTCA TTTGATCATT TAAACCATGA CACCAATTGA TTAAAATGTA TCCAATTTAA AGGACTGA
TTTGCGTTTT
TCAAAAATCT
CGCTATGGAT
CAGCCCGGAT
AGATTACGCC
TATTTTACTC
AACTGATTTG
CGCTTTAAAC
TCAAGGGATC
ATCGCACTGG
CAAGATTCTC
CTGAATAACC
CCTAACACCC
CCCACCTTTA
AATCAACCCT
ATGATTTTAA
CATTCTTTTA
GTGCCAGCAG
TTTTTATCGT
AAAACAATAC
AAGAATACAC
CTACCCTATT
ACGTCTTAAA
ATTCAAGCGA
ACCCTAAAGA
ACATGCTCTT
AAATGCTCCA
TTTGTTGGGT
CACCCAACAA
GATTATGGGC
GATTTTAAGC
AAAAACTTTT
TGCAATCAAA
TACCGCTCTT
ATATGACAAA
ATTTGATATT
120 180 240 300 360 420 480 540 INFORMATION FOR SEQ ID NO:316: SEQUENCE CHARACTERISTICS: LENGTH: 1092 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1092 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:316:
ATGGTATCAA
CAAGGGGGCA
GGGGCTTTAG
GAAAGGATTG
AATGAGATTT
TTATACGCTA
ATTATCATTA
GATGTGGCGC
TGGAGCGATC
GGGCATCAGG
CACTCAAACC
TGGGTGTGGG
GAGTGATTTC
TGGCTAAAAA
TTGCAAACGC
TCAATGACTA
CAGGGGCTGG
TCATCCCTAT
GCTATAAAAG
GCTTTAAATA
GCTAAAAATC
GATTAGCTGG
AGCCGTAGGG
ACCCTTTGAA
TAGGAAAATT
TGGCCGTGTT
TTTGCCCACT
CATTTCCTCA
AATCCCGGAC
CGAAGATTGT
GGTAAACACA
GATGAACTAG
ACTGGTTATT
GCCTTGAATT
TGCGGGAACA
TTAAGGGACT
AACATGCCTG
GCGAAGGCTT
GCATTCATTG
TTCAAAGAAG
CCATAAAATT
CTGGAAATGT
ATAAAAACAT
TTTACTCCAA
AGCCTTTGGG
CTTGTGAGGC
AATTCGCTAA
TAAAAATCCT
TGGAAGGGCC
AATTCCAATT
CCCTATTTTT
TGCCAAAGAA
GCGTTTTGTA
AAAAGCGTTG
GGCGAATATT
GGGGGCGAAT
GGATTTTAGC
TTGTAAAAGA
TTTGAGTGGG
AGAAAACTTA
120 180 240 300 360 420 480 540 600 GTGCCTAAAG TCGTGGAAGC GGGATTTGGG ATAAGAAAGA ATGGCGACTC
GTTTTTTAGG
CCCACGCTCA AAAAGAAGA GCTATCAATA CGGGGGTGAT GTGAGCAATc GTGTAGCGCC ATCGCTGATG GTTTGGGGCG GGGGCTAATG GCTATAGAGT ACAGAGGGTT AA
TTCTAAAGAA
TATAGACACC
CACGAAAGAA
TATTTTACTC
CAAACGCATT
TTGTAACAGG
CAGTTATTTA
GGATAAGATT
TGGGGGAATA
ATGTTAAGCC
TGCGACGCTA
ATCAAATCGC
GAAGAGGGTA
GGTGAAGAAG
GGAAACAGAG
ATCAGCGTGC
TCCCTATCAT
TTGGAGCGAG
AAGCGTATGC
CTGTAGGCTA
ACGCGCCTAA
CTAAAAAGGT
AAGAGGGGCT
ATGAATTGAT
CGCCGCGGGG
TGGGGTGCAA
CGATCTTTTG
TCCGGCTAGG
AATCGCATGC
GGGCTATTGT
TTATTTTACC
TAAAGAGCTT
660 720 780 840 900 960 1020 1080 1092 INFORMATION FOR SEQ ID NO:317: SEQUENCE CHARACTERISTICS: LENGTH: 1092 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1092 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:317:
ATGGTATCAA
CAAGGGGGCA
GGGGCTTTAG
GAAAGGATTG
AATGAGATTT
TTATACGCTA
ATTATCATTA
GATGTGGCGC
TGGAGCGATC
GGGCATCAGG
GTGCCTAAAG
GGGATTTGGG
ATGGCGACTC
CCCACGCTCA
GCTATCAATA
GTGAGCAATT
ATCGCTGATG
GGGGCTAATG
ACAGAGGGTT
CACTCAAACC
TGGGTGTGGG
GAGTGATTTC
TGGCTAAAAA
TTGCAAACGC
TCAATGACTA
CAGGGGCTGG
TCATCCCTAT
GCTATAAAAG
GCTTTAAATA
TCGTGGAAGC
ATAAGAAAGA
GTTTTTTAGG
AAAAAGAAGA
CGGGGGTGAT
GTGTAGCGCC
GTTTGGGGCG
GCTATAGAGT
GCTAAAAATC
GATTAGCTGG
AGCCGTAGGG
ACCCTTTGAA
TAGGAAAATT
TGGCCGTGTT
TTTGCCCACT
CATTTCCTCA
AATCCCGGAC
CGAAGATTGT
TTCTAAAGAA
TATAGACACC
CACGAAAGAA
TATTTTACTC
CAAACGCATT
TTGTAACAGG
CAGTTATTTA
GGATAAGATT
GGTAAACACA
GATGAACTAG
ACTGGTTATT
GCCTTGAATT
TGCGGGAACA
TTAAGGGACT
AACATGCCTG
GCGAAGGCTT
GCATTCATTG
TTCAAAGAAG
TGGGGGAATA
ATGTTAAGCC
TGCGACGCTA
ATCAAATCGC
GAAGAGGGTA
GGTGAAGAAG
GGAAACAGAG
ATCAGCGTGC
CCATAAAATT
CTGGAAATGT
ATAAAAACAT
TTTACTCCAA
AGCCTTTGGG
CTTGTGAGGC
AATTCGCTAA
TAAAAATCCT
TGGAAGGGCC
AATTCCAATT
TCCCTATCAT
TTGGAGCGAG
AAGCGTATGC
CTGTAGGCTA
ACGCGCCTAA
CTAAAAAGGT
AAGAGGGGCT
ATGAATTGAT
CCCTATTTTT
TGCCAAAGAA
GCGTTTTGTA
AAAAGCGTTG
GGCGAATATT
GGGGGCGAAT
GGATTTTAGC
TTGTAAAAGA
TTTGAGTGGG
AGAAAACTTA
CGCCGCGGGG
TGGGGTGCAA
CGATCTTTTG
TCCGGCTAGG
AATCGCATGC
GGGCTATTGT
TTATTTTACC
TAAAGAGCTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1092
INFORMATION FOR SEQ ID NO:318:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 579 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...579
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:318:
TTGTTAGGGA
AACTGGTCCA
ACCATGCCAA
CATGCCGGAA
GCGATCACTA
TTTGCATGGC
GGGAGGGCTA
CTCCTTAACT
TATTTTGGAG
AGAAGTTACA
GGAATGGCGT
TTGGCTTTTA
GGGGGAATAJ\
TGATCGGCTA
ACGCGAACAC
ATGTTTTTGG
ATGAATATTC
TTAGGATCAC
CGCCCAAATT
TGATGACCA
TACTTTAAAT
TAACACCTTT
CACTTCCTAT
TGATTTTTGG
TTTCACTTTT
GCGCGTCTCT
CTTGCAATTC
TTATTATGGG
CAATAACCCG
CCTCACGCTG
ATCCGCCAGG
GGCAATTCAG
ATCAGTAATG
GATAATACGG
TACACTTCGG
CATGCGAATA
AATGCGAGCT
GCTAGGATCA
GATGGCGATT
AAGTTTTGA
TTTTTTGGTG
ACGCTTTTTT
AAATCTCAGT
CTTATGATGG
TTGGAGGGAT
AAAATGCGTT
ATGCGTTCAC
ATAAA~GGGTA
TTAGCGCTAA
GGATAATTTC
AGGCTCTCAC
AACGACTAGG
GCTGGCTGAT
CCATAAGCGT
AGGGCAAGTG
TGAATCGGTT
TCAAGCAGGG
TTACCAAGAC
120 180 240 300 360 420 480 540 579
INFORMATION FOR SEQ ID NO:319:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1065 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1065
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:319:
ATGCAACAAC
GCTTTAGAGA
GGTTCAGATG
ATCTTAAATA
AGCTCACCTA CTTGAATGCG GGGAATGTGT TTTTTAATGC
GATGAATAAG
AGAATGGGAC TGCTACTGCT AATAGCACTA GTAGCACTAG
CGGTGCGACT
GTCAAACTTA CTCTCAACAA GCTATTCAAT ACCTTCAAGG
CCAACAJAT
ACGCAGCGAA CTTGCTCAAG CAAGATGAAT TGCTCCTAGA
AGCTTTCAAC
TCTGCCGTAG
CTGCAAGGCA
AGCGGGAGCG
CGTGCTAGTC
GCGCTCAACA
CGTGCAACGA
AAGAAAAGGA
GGCTTTAGAT
TTGTATAACA
GGTATCCAAT
TTGCATGGGA
AACTTCGGTA
GTAGTAGTGC
TTCCGTCCTT
CTGCTAACAT
TTATTGATCA
CGGTTAATAA
AGCTCCCTAA
ATCAGGTGAG
ATATTTTAAA
ATATCGGTTT
CCACTCAAAA
TCTTTAGCCG
TAGCCGGTGA
AAATCAATAA
AGTTGGACGG
CTACGATTTA
ATAGCGTTTA
TGGGAATAAG
ATCTCAATTG
CGCTGGGATA
CGCTCTTTAT
AAGCATGCCT
CGGGTTTTAC
GCGCTATTAT
TAATGTAGGG
CTCCTATCAA
GACCTTCCAA
CACGCACTTC
GAAATCCAAC
TAACACTTAT
TTGGTCTTAT
GAATTCAATT
GTTTATAACG
AACTCCAACC
AACGTGCAAG
TACTTGCCCC
ACTAAAGTGG
GGTTTCTTTT
TTATACACTT
AACCGCTCTG
TCCACGCTCA
CAGTTCCTCT
CGCCACAACC
TACAAATCAG
GGGTATTCAT
CAGCCGCTTT
AGCTCACTAA
AAGCTAACGC
TAACTTTGGA
AATTCAGAGC
GCTATAAGCA
CTTATAACGG
ATGGGGTGGG
TGGATATGGG
GAGATGACCC
TTGACTTCGG
AGCACACGGT
CAGGGACTAC
TCTAA
TACAGGTTTG
AAACACCATT
TGTGCAAGGG
TAAAATCAAC
CGGGAACAGC
ATTCTTCGGG
AGCGAGCGTG
GACTGATGTG
CTTTTTTAGC
CAATGTGAAA
TATGAGAATG
GGAATTTGGC
CGTGAAGTAT
300 360 420 480 540 600 660 720 780 840 900 960 1020 1065
INFORMATION FOR SEQ ID NO:320:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2247 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2247
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:320:
ATGGAAGACT
TTTAGTTTCA
AAGGTTTTTA
ATAGATGGGC
CTATCCTATT
GATGATAATA
CAAGATTTTC
TTCAATATAT
TTTGTGTGGT
GATGATCTCT
ATCACACCCA
GGCTTAGGGG
AGGAGCGGTA
GTTGTGTTTG
TTCAACCAAA
TTCGCTTATG
GACACACGCC
TTTGGATTGG
AATCAGGCGC
TTTTGTATAA
TAGGGTTAAT
AAGATAAAGC
TGAAAGAAAG
CTTTTATCAC
GATTGCCTGA
CTGCATACTA
CCTTAGTCGT
TCTTAAAATA
TTGGTAGCGC
ACAATAAAAA
ATTTTATCGC
AGGGTGTGGG
ATCCTAAGGC
AAGTGTTCAT
TGGATTTTGG
TAAAAGGGCA
CTAAGTTGGT
GAAACCTTTT
CACCTTATAT
AGCGTTATTT
TAACCAGCCT
GGTTAAAACC
ATCAGGGTTA
GAGCGATGAT
TTTTAAGGCA
ATATAGTTCT
CTTAACTCGG
GAGTTGGGAA
ACGCGCCTTT
TTACGCAGGA
TTTCATCATG
TGACACTATG
CTATGAACCT
TAATGATGTG
TGGCATGGTG
GTTCCCTGAA
TGTCATCAAT
TTCATAGAGG
TTCCTCTACA
CAAAAGAAAA
TTTGGCTTTT
TTTTTCTTGA
GATCTTTTTG
CTCACTTTTA
ATTTTATGCT
ACTAGGGACA
ACTGAAGAGA
GACAAACGAG
CAGGCGTTTA
CCCAATATGA
GAAACTTGCG
TTCTCCTTAA
GTTTTGACTG
GCTAGTGGAG
AGACCTAATG
TGCAATATTT
ATTATAAGTT
AATTCATAAA
AAAGCTTTAA
GGTTGCAAGC
TTCTCTTAGG
ATATATGGGT
GCTCGCTCAA
CTTATATATT
TAGGAGCGAA
AAATGATCAA
AGGTTATTGT
TTGGCTTGAT
TCAATTACCC
GGAAAATCAG
AAACACACCG
AAGACATACT
GGGATTTTTC
AAAAAGATCC
ATAGGGATCT
GGTTGTTATT
AACTCAAAAA
AGAAATCATT
TATACTATTA
TAATTTTTAT
CTATGCGATA
GATTTATGGG
CATTACCTTT
TAAAAAAGTT
AGCCAAGCTT
AGGCAGGCGT
TGCTCCTACT
TCAAAATATC
AGAAAXACGC
ATTTAATCCT
CTCTCAAATT
CACTCAAATC
TTTTTTTAGC
CATGTGGACT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140
AAAAAGGGGC TTGAGTTTGT CAAAAGAAAA AAAATCATCA TGCCTGAAAC CCCCACGATG 1200
TTTTTCATAG GTTCTATGGC AAGCGGTATC AACTTGATTG ATGAAGACAC AAACATGGAA 1260
AAAGTCGTGT CTTTAATGGA ATTTTTTGGA GGTGALAGAAG ATAAGAGTGG CGACAATCTA 1320
AGAGCCOTTA GTCCTGCCAC TAGAAACATG TGGAATAACT TCALAGACAAT GGGTGGCGCT 1380
AAAGAAACTT ACAGCTCGGT TCAAGGGGTC TATACATCAG CGTTTGCGCC TTACAATAAC 1440
GCCATGATCA GGAATTTCAC GAGCGCTAAT GATTTTGATT TCAGGCGTTT AAGGATCGAT 1500
GCAGTGAGTA TTGGTGTGAT CGCTAATCCT AAAGAAAGCA CTATTGTTGG ACOGATATTA 1560
GAGCTGTTTT TCAACGTGAT GATTTATAGC AATTTGATTC TGCCAATCCA TGATCCACAA 1620
TGCAAAAGAA GCTGCTTGAT GCTCATGGAT GAATTTACGC TCTGTGGCTA TTTAGAGACC 1680
TTTGTTAAAG CGGTAGGGAT CATGGCAGAA TACAACATGC GCCCCGCTTT TGTGTTTCAA 1740
AGTAAGGCGC AATTAGAAAA CGACCCCCCA CTTGGTTATG GTAGGAATGG CGCTAAGACT 1800
ATTTTAGACA ACCTTTCTTT GAATATGTAT TATGGGATCA ACAACGATAA CTACTATGAA 1860
CATTTTGAAA AGCTTTCTAA AGTGTTAGGG AAATACACAA GGCAAGATGT GAGCCGGAGC 1920
ATTGATGATA ATACGGGTAA GACCAACACT TCTATCAGCA ACAAGGAGCG GTTTTTGATG 1980
ACCCCTGATG AATTGATGAC TATGGGCGAT GAGCTTATCA TTTTAGAAAA CACGCTCAAA 2040
CCCATCALAGT GCCACAAGGC GCTTTACTAT GATGATCCCT TCTTCACCGA TGAGCTCATT 2100
AAGGTGAGTC CAAGCTTGAG CAAGAAATAC AAATTGGGGA AAGTGCCTAA TCAAGCAACA 2160
TTTTATGATG ACTTGCAAGC CGCTAAAACC AGAGGCGAAT TGAGCTATGA TAAATCTTTA 2220
GTGCCTGTGG GTTCAAGCGA ACTGTGA 2247
INFORMATION FOR SEQ ID NO:321:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 105 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...105
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:321:
TTGAGCCATT TATTTCTCGT AAAACGCCCT TACAGTCGCC TTTATTTACA TAATCAGAGC
AACCATAAAA TGAGACACCA TTCAATCCTC TTTGTATCAT TTTAG 105
INFORMATION FOR SEQ ID NO:322:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 333 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...333
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:322:
ATGGAACACC ATAAAGCGCA CACAACCATT CAGGCTTTAC AAGCCAAACG CAAAAGGTTG
CTAACCGAAT TAGCCGAGTT AGAAGCAGAA ATAAAGGTGA GCAGCGAACG AAGGAGCAGT 120
TTTAACGTTT CGCTCTCGCC GAGTTTGTTA GCCGAAATAG AAGAGATAGA ATACGAAGAA 180
AAAATGAGCA AAGAGCGAAG AATCCATCAC AATCTTTTGC TTTCGCCCAG TTTCATGGCT 240
AAAGTGGATG AATACATGAA AGAGAAAGGT TTTCCTAATC GATCGCTTCT CTTTGAAAAA 300
GCGTTGGAGT TTTATATGCT AAAACACCCA TAA 333
INFORMATION FOR SEQ ID NO:323:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 477 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...477
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:323:
GTGATGCGTT
GCCCTTAGCC
TCTAGAGTGA
GCCCCCTATG
AACCCCTTGC
AAAGAGCGAG
AGCTATAAAA
GCGAGTTTAG
TCTTTTCATT
CCCTAGAAGA
TGGGTGAAGA
TTTTAGACTA
TCCAAATAAA
TCATTGACTG
ACGGCACCAC
AGGGAGATAT
CTGTTATTTT
ACAAGAATTT
GTATTCTGTT
TCATTGCTCT
GCTAGAACGC
CCTTTTAAAA
TACCACAAGC
GCTGTCTTTA
TTATTTTATT TTTTAGGGGT TTAATTTCGT ACCGCTTGAA TCTAAACCTA TCGTTAGTCG ATCATCACTC GCAACTTGCC TTCCTTTTAG AAATAGCGTT AGCCAAGTCG CTATCACGCA ATTCTTAACC TCAAAGCCTT GATATTTTTA GAAAGGAAGA
TTCTTTGCAT
AATCGTTGAT
CATTAAAACA
CGATTTGAAA
AAAAAAAGAA
TTATGATCAC
AAGCGTTAAA
AGAATGA
INFORMATION FOR SEQ ID NO:324:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 351 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...351
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:324:
ATGGAGTTA TTAAGAAJATT GGAAAGAp.
AGCGAAGTTT
CATTCAAACG AGCTTTTTAA. AATGTTGATT
ATTGATAATG
TTTGAGATTA TGTTTAAAGC ATGGGTTGAI\
ATCGTAAAA
AAAACCAAAT TTGATGGCGA AATGATTGGC TACACAGAAG AGAGATTTTT TTAATGGGAT TTTTAAATCC
AAAGTAATAC
GGTGATGTTA AATGCGAAGA TTTTAATGCC
CTAAGAAGTT
TAAAGAAAGA
AAGATTTGTT
TGATGTTTGA
AACTTTTAAC
CTAAAATGCC
TAGTTTATCT
TTTACAACAA
TAAAGAGCAA~
ATTAACCAAA
CTTTTTAGTT
TATTTTTTGC
T
120 180 240 300 351
INFORMATION FOR SEQ ID NO:325:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 834 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...834
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:325:
TTGAAACGAG
AAAGATGTTT
AAACGCTATG
CAGCACAACT
AGCGATAGTC
GTGGGCTATA
CTGTTTTATG
TCCACTTACG
GCCGGGTTCA
TTGTATCAAA
CGTTATGGTT
TGACTAAAGG
CGGATAGATC
CTAGCAACTT
TAAGCCCTGT
AGTGGGTGGG
ATTTGAGCGC
GCACTTATAT
ATCTGGGGAT
CTCTTCTTCA
AATTTTAGGG
GGATTACACT
GGCGTTTTAT
GAATTTATCC
TTTTAAAAAT
TAAGCATGAA
TTCTCTTTAT
GGATTTATTG
CGCTTTTGCT
AAACACTTTT
CTTTTTTACG
TTTTTTAATA
CTGGGGCTTG
CAACGATTTA
TCGTATGTGT
GAGACGAAAT
GGCTCTCAAG
CTTAACGCTT
GGAGTGTATG
GGCGGGAAAG
CGCTTAATGC
AGAGAGCTTT
AAAAGG'TGGT TTCGCCTATC GGTATCAATT
AGGGAGCATT
ATAAGAGTCA
GATTATTTTC
CTAATGGTCT
TGGCGTTCAA
GGTTTGGCTT CAGGTGGGGG AATCGCAGTC TATCATCATT ATAATGGGGA
TAAGTTTTTT
ACAGATTGAG
CGATGCGTTA
TGAATTTAAA CGGCTTCCAG
TTT7TTGGTGG ATTTAGGGGT TCGTTTAGGG AATGAACACA ACCAATTTGG CTTTGGGATT
AAAATCCCTA CTTATTATTT TAACCATTAT TATTCCATGA ATAACATTAG CAATAATAGT
GAAAATGTCT TAAAAGTTTT ACGATTTTTA GAATACGGGA TCAACAGCTT GTTGTATCAA
GTTGATTTCA GGCGCAATTA CTCGGTTTAT TTCAACTACA CTTATAGTTT TTALA
INFORMATION FOR SEQ ID NO:326:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1482 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1482
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:326:
ATGCCTGCCT
AACCCTACCA
TCAATCCCAA
GCCGCTACTA
GCATGGGGGT
GCTATTAACG
GCTAATGAAA
ACGCAGTTCG
GCCCAACAAG
GAATGCACCG
TTTGTGAAAG
AATCAGGATA
GGGAACGACT
CAAAACATGA
TTGGATACCA
TTTAGGCGCA
GTGCAAGCGG
GGTTTCTTTG
GTGTGGACTT
AACTTTTTAG
GGGACTTCGT
GCTAATGTCA
GCTAGGCCCA
AAAATCCCCA
AGGCTCTATA
TGAATAGCTC
CAGAATACAC
TCCAGCTAAA
TCATCAATGT
TTGGCGGTAA.
AAATGATCAA
ACACCCAAAT
CTCAAGAAAT
TAGCGGACAA
CAGGATCAGC
AGACTCTCAA
GGGCTTTGTC
CAAAAGCGAT
CGCATGCCAC
GCAAATACAA
TCGGCGTGAT
GCTATAAGCA
ATTATAACCA
ATGGGGTGGG
GCAAAAATAA
GGCTTAATTC
GCGCTTCTA.A
AGAAAAAAGA
CCATTAACAC
GCGTGTATCT
CAAAAATATG
TTACCCCGAT
GATTAGTAGC
CCTTACCACC
GACCGGGAAT
AAACGCTCAA
CACGCAACCA
GCTCAATAGA
TTTCCACAGC
TGGTGTGATT
TTCCTTAGAG
TCAAACCATT
CAATAGCGGT
TCAAAACCCT
CCAGCTCCAA
TAACTATCAA
ATTCTTTGGC
TGCTTATATC
TATGGACGCG
CAAGCTTTCT
CCAACAAGTG
CTTCCAATTT
CAGCGATCAT
GGATTATTAT
CAATTATGTG
GTAGTCAATA
GGGA.ATGGCA
GTCAATGACG
CAAAACCCGC
GTGATGGATA
GCCGTTTTAG
GACAATTTCA
GCTAACGCTC
ATTCAAGGGC
AACGACAACA
CAACACACCG
TTGAATTTTA
ATCTCTAACT
A.ATTCCCCAG
ACTGTTGCGC
AACAATAACG
AAAAAAAGGA
AAATCTAATT
CTTTATAACT
GTGGGGCTTT
AATTTGACCA
TTGTTTGATT
GCCGCTCAGC
TCTTTCATGG
TTTGCTTACT
TCAATCAAAC
ATTATTATTC
CTGAAAACCT
ATGTGAATGG
TTTTTGGCGA
AAAAAACCCA.
ACCCCTACAC
AAGCAGAGAT
CTATCCAACA
CTTATGGTTC
CTTATTATGG
AAGAAGCCCT
TGCCTAACGC
AAGGTTTGCT
AAGAATTAGG
GGGCGATGAA
ATTGGGGGTT
TTTTTAACTC
TCATCAACGA
TTGGTGGCTT
TGATGAATGG
TAGGCTTGAG
ATGGCATGGA
GGGCTGA.ACT
AG
TTTCACAAAA
AGGCGGTTCA
TTTGCAACA\
TGGCGGTGGG
TAGTTTTAAC
ACAGCTTAAC
TTCTAAAGAC
TTTGAGCTTA
AGATCTAGAA
AGGTTGCGCG
CAACCAGGTC
TAGCACTTTA
TAAGTCCCTT
CACTTATTCT
CAAAAACCCC
CGGCATCGGC
AAGGTATTAT
GGCTTCTGAT
TAAAAACACC
TGCGTTAGCC
CATTTATAAC
AATGAACCTC
ATTGGGCGTG
CAAATACAGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1482
INFORMATION FOR SEQ ID NO:327:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 810 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...810
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:327:
GTGCTAAACA
GGCGTGGATC
ATAGATATTG
GGGGGTTTGA
CAAAAGGAGC
TCTAATGTTT
GCTATTAA
TTATACGTTT
GCAGAGTCCT
TATCCTACAG
TCCAAAAGCC
GGGATTGCTT
GGTTATTGGG
ACTGCGCGCT
CCACCCAAAA
AAGCGATTAA
TTTTAGTGTT
AATACTTGCA
TTTTTAAAAG,
TAAATCGGTT
GGGAAAAAT
TAAGTTTAAG
TTTCTAAAGT
AGACTTTTGT
TTAAAGATCA
ATGACAGCAC
ATGGCTATTT
TGGCTCTGAT
AAGCCTATTG
ACACGCTATT
CAATAAAGGC
AATCCTTTTA
CCATGCGATA
AGTATCATTG
GAAAAATAGC
CGCTTCTGTG
TGGCTTTAAT
AACCGCTGTG
AGGGCATGTT
GAAATTCAAT
AGGGGGTACA
GGCAAGGTGA.
GTTTTCATGG
TTAGAGGGGT
GTGGCGTTTT
ATTTTAGGGC
GAGTTTGGCA
GGGCAATTGT
GCGCCTTTAA
CAAGCGTTTG
AAAAAAAAGA
GGTCAGGGCA
TTAGAGGGAA
CAAGGCGGAT
AAGGCCTGCT
GGGTTTTTTT
TTCACTATGA
CCTTACTCAG
TTTTTATCTT
TGGTGTTTGG
ATATTCTTTA
AGAATAAAGT
ATTATAAAAT
TTGACATAGC
ATATTTATGC
AAGTCGGCGG
CGOTTATATA
TGATGGCACA
CCTTATCTTT
AAGTTTGGTT
TTTTTTAGAG
TTTAATGCGC
CGCTGGGGTT
TGATGTATTG
TTTTTGTGGG
TGAAGTTTTG
TAGGGGGATT
GGATTTTTTA
CACAATCGGT
TAATTATATC
AGTATCCATG
120 180 240 300 360 420 480 540 600 660 720 780 810
INFORMATION FOR SEQ ID NO:328:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 597 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...597
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:328:
ATGTTAGTTA
GTTGATGAAC
TGGCCAAjApG
AAAGACTTCC
CATTTCGCAT
ATGGTGGCTG
ATCGCTTTGA
AATGACTTGC
CACTTTGAAG
AAAGCAACTC
CAAAACTTC
ACTTTGAGCT
ATTTTACTTT
ACGAAA2\AGG
GGAAAAJ\CAC
ATATCACTAA
GAGGTGCTTT
CATTAGGTAG
AACATGGTGA
ACCAAGGCGT
CCCCGATTTT
TTCTAxAJAJj TG TATC CCC T
CTTTAATGTG
CCCTGTGGAA.
GAGCATTTCT
TTTGATTGAT
GAATGCAGAT
AGTATGCCCA
TGCAGAGTAT
AAAGCGCCTG
TTAGGCAAAA
ACAGAAATCA
ATTGGCGTGT
AAAGGCGGTA
AGAGACTATG
AAAAACATGA
GAALATGCTTC
GCAGGTTGGA
CTTAAAGAAA
CCGTTTTAGG
ATGGTGTGAT
TTGCGTTTGA
CTATTGACAG
TCGGTCAAGT
ATGTGCTGTT
AAGTAAGACA
GCATGGTAGA
GAAAAGGCGA
ATTCCATTAA
AAACAATGAG
CCTTTTCTTT
CAAAAGAGTG
CGAACAAGTG
GTCTTTCCCT
TGAAGAAGCG
CGCAGTGATC
CGCTCTCTTA
TAAAGGCATG
GCTTTAA
120 180 240 300 360 420 480 540 597
INFORMATION FOR SEQ ID NO:329:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 648 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...648
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:329:
ATGTTAAAAC
GAAGCCTTTC
CTATTACA\G
CCCAAACCCA
CAAAGCACCA
ACCTTTTACT
GCTGGCTATG
AACATTGAGC
ACTATCCCTA
A.ACGCCGATG
AGAATCTGTT
TCGCCAGTA
AAAAACACCA
GCACACAAAC
AACCAAAACC
TTTTAAAGAA
CTCAAAACCC
GTAATAACAG
TGAGTTACAC
AACAATCTCA
GCTCCA1C
TAACAAGCTC
AACGATTTGT TTGTCCCT.A
TCAGCTCATT
AAAAGACGGC
CCAAGA.ACAA
CATTACCCCT
TGCGACTGAG
TGTGTATGTA
CTTGATTATG
GGACGATCAA
AATCATTCTG
CTCCAAACCA
TTTTTCATAG
ACCATAGCCA
CAAAGCACCT
TTGTTTGCAG
ACCGCTTATA
ATACAAAACT
GGCA.ATGTGG
CCCGCA.AGCT
AAGCCGGCTT
CCACTCAAGA
ATGGGAAATA
AGGATA.ATAT
ACCAAGAAAG
TCTTGCCTTA
TCAGTTTGGG
TGTTTAACGA
ATTTTCTGAT
TCAAATGA
CACGGCTGTA
TGAAACCGGG
AAAACCCAAA
CTACATCTCC
CACCAACTTA
CGCTGAAGAA
TAACTTGAAC
CGTGATAGAG
CCCACAGCTT
GCCAGCACGC
120 180 240 300 360 420 480 540 600 648 AGCAAGGTTA
CAACCAATCT
INFORMATION FOR SEQ ID NO:330:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 507 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...507
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:330:
ATGTTATCTT CTAATGATTT GTTTATGGTC
GTTTTAGGGG
TGCTTGGTGG
GAAAA.JCTT
TTAGAGGGGC
AAAACGACTC
GAGCGCGATT
CATTATGCCG
TATAGCGTGG
TTGAACATGG
GGTATTTGTA
TAGATGAATC
GTTTGGAAGG
TTTCGCACCT
ACTTAGAAGA
CTAGCGATGA
ATTCTATTTC
CAGGGTTAAA~
TCTTAAAGAA
CTATCAAGAA
CCTTTCTTTA.
TTATAACCAG
AAAAATCATT
AGTCAACGAA
TAAAGAATTT
ATGGTAG
AAAGAGTTTT
AATTATCTCT
GAAAAAZAGCG
TTGCAAGAAA
ACTTTAGAJAA
AAACAGGTTT
AAAGTGAGTA
CGATTTTATT
ACCATAAAT
ATTCTAAGCG
CTAAAGAGGA
TCCAAAAATC
ACAAATTTAJA
TGAAAATGTA
AGGGCGAGGT
GGTGTTGGTG
GAGGCGTTTA
TTTGAGAGAA
CAGCTCATTA
CATGGATAA
AGACATGGGG
TCAAGAAGGC
GGAATTTATA
120 180 240 300 360 420 480 507
INFORMATION FOR SEQ ID NO:331:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1017 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1017
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:331:
GTGGGTTTTG
GAACGCATCC
AATGAAATCA
GCTAAATTAA
AACATTGAM\
GCGTTAGAAA
AATTTAGAAT
ATCGCTCA
TCAAGCATGT
AATCAAGGGT
GGTAATGGCT
AATATTCCAT
AAGCGATTTC
CAGACATGCC
CCCCTACTGA
AAATAGTCAT
AAATGCAAGA
TACAATTCAG
TTTCAAATTC
ATGGGGTAGG
TTCGTTATTA
TTGATGGTTT
CAGTCATGCC GTTGAACACA
TAACGCTCGA
CAACACCTTT
AAAGCAAGCC
GCTTAGCGGG
ACCCATTACT
CCAATCTCAA
TTTGAACGCG
TTTGAGCGTA
CTTATTCTAT
AGGCAAAATG
GGCGAAGTCT
AACTACATCA
GAAACCTACT
GGCGTTGCGT
AACCCTTTAG
AACAGCATGC
CTTGATCCCA
GGGTATAAGC
ATAACCCTTT
GTGGACTCGA
ACAACGCTTT
ACCTGCAATC
CTAACCCTAA
AATTGGTAGA
TTTCTTCTTT
GCTCTTATTC
ATTTCTTTAC
TTTGGATCA
TCGAGTCAA
AAAAAACAAT
TACCCTTCAA
ATTAGCCCA
AAACTTAAA
GTCTTCTCAG
TAAAAACGTT
CAAGAAAAA
TGGTTTTGTG
TGGCATAGAT
120 180 240 300 360 420 480 540 600 660
AATAACCACC TCTATGGGCT
TATCTTTTCA ATTTCATTGA TAATGCGCAA AAACATTCGA GCGTGGGGTT TTATGTAGGC 720
TTTGCTTTAG CGGGGAGTTC GTGGGTAGGG AGTGGTTTAG GCATGTGGGT GAGTCAAATG 780
GATTTCATCA ACAACTATTT GACGGATTAT CGGGCTAAAA TGCACACGAG TTTTTTCCA_ 840
ATCCCTTTGA ATTTTGGGGT TCGTGTGAAT GTGGATAGGC ACAATGGTTT TGAAATGGGC 900
TT.A-AGATCC CTTTAGCGGT CAATTCCTTT TATGAAACGC ATGGCAAGGG GTTAAACGCT 960
TCCCTCTTTT TCAAACGCCT TGTCATGTTT AACGTGAGTT ATGTTTATAG TTTTTAG 1017
INFORMATION FOR SEQ ID NO:332:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 243 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...243
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:332:
ATGATGTTTG AATTAACCAA AAAAACCAAA TTTGATGGCG AAATGATTGG CTACACAGAA
GAACTTTTA CCTTTTTAGT TAGAGATTTT TTTAATGGGA TTTTTAAATC CAAAGTAATA 120
CCTAAAATGC CTATTTTTTG CGGTGATGTT AAATGCGAAG ATTTTAATGC CCTAAGAAGT 180
TTAGTTTATC TTTCTGTGCT TGAGTTGGAA GAAACGATAA ATCCTAATAA AATCCCATTT 240
TAA 243
INFORMATION FOR SEQ ID NO:333:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 603 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...603
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:333:
GTGCAATTAC TCkAAGACAA TAAAGAAGTG GTTGTTTTAG ATACAGATAG CCAAAAGTCT
ATGGAAACTT TTGCTACAAT TCGTGCTGAJ\ AAAGAACGCC CCACTTTTAG CTTATTTAT 120
CGTAGTAGTG GCTTTAGCGA TACTTTAAAG CAAATGGTAT CTAAGTATGA AAATATCCTT 180
ATTGATACTA AGGGGGAATA CAGCAAAGAA ACCCAAJG CTATGCTTTT AAGTAATATT 240
CTGCTAGTGC CAACAACTCC TAGCCAJJTA GACACTGAJAo TCTTAGCTAA TATGCTAGAA 300
AGAATTGAGC AACTCCAAGA GCTTAATGAA AATCTAAGAG CCTTAATTGT CATCAATAGA 360
ATGCCTACTA TTCCTACCCT TAAAGAAAGA CAAGCCTTAA TAGAGTTTAT TAAAGAAAAT 420
AACCCTAGCG ATAGGATTAC ACTTTTAGAA AGCTCTTTGA GTGAGCGCAT TGTTTATAAG 480
CGCAGTOTAA GCGAAGGCTT AGGGGTCATA GAATACAGCG ATAAAAAGGC TATCAATGAG 540
TGGGTTAATT TTTATAACGA ATTAAAAAGC CATTTAGAAA AAGAGAAAAT ACATACGTTT 600
INFORMATION FOR SEQ ID NO:334:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 213 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...213
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:334:
ATGCGGTCGG ATGTTGAAGT TTTATCGCCT TTGCATAAAA TAGATGAAAA ATACCTTTTC
CATTTAGA TTGCGGGGGA ATTGGCGAGC ATGGGTAAGA TTTTAAGTGT ATATTTAGCC 120
CACAAGCACA GCGCGTATTT CATTTTAAAC GCTTTGAGTT ACGGCTTTAG CCACCAGGAT 180
AGGGCGATCA TTTGCTTATT GGGCGCAATT TAG 213
INFORMATION FOR SEQ ID NO:335:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 213 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...213
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:335:
ATGAGTGCGA
GCCGAAAATT
CTTGTGATCC
AAGCCCATTC
TGATGCCAAG CCTTTTGACC TTGCAGTGGC TGAGTTTTAT
CCTTTCTTTA
TGTGCCTGAc AGACAGCCAT CATTTAAAAT ACACGCTAGA
AAAAAACAAG
ATTCTAATGA TGCGCTTTAC TTGGCTAAAG AAATGCTCCC
CAAACTCATT
CTTTGACGAT AGAGTTTGCT
TGA
120 180 213
INFORMATION FOR SEQ ID NO:336:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1437 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1437
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:336:
TTGAAGAAAAJ TCGCCCTTAT TTTAGATGGC
ATT(GTAGCAA
CTAAGGCATT
CCTAAAAATT
CTTTTGCAGG
GAGCAGCGCA
AGCGTGAAAA
CTCATTTTGA
ATCCCTAGCA
CCTTTTGGGA
ATTGTAGGGC
CCACGAGACA
GTCAAAAGCA
GATATGCGAT
CACAAGCGTT
TTTTATCATA
GGGAAGAGTT
GTAGGCAGAG
CCGGTTTATA
AATGAAAGTT
ATGGATTTGG
GTCAATCATT
ATTCTAATCA
ACCCGAGCAC
TGTTAAATGA
TCATCCATA
GAGATAGCGA
TTGATGAATT
CCCCTAGAGA
GCATTTTTGC
TTTATCGCA.A
TTCTTTTAGT
ATGTGGGGCA
TGCAGAGCCG
TAAAGAGCTA
AATTTTTATC
TTATCCAAAA
AGCTTTTTTT
AAACCAACAC
TAAGCATTAAz
GCTTGTTGCT
ACGAAAATTT
TAATTTTTAT
TTTCGCTTTT
TGAGGTGAGC
AATCATTCAA
AAAAACTTTA
TGAAGTTTTA
GTTTGGGTTA
TTATAGGCAT
CGATGTTTTG
GGCAGGCAAT
GTTCCCAGCC
AAAAGCGATG
CAAGCTCTAC
GCTAGAAJACC
ACTCCATGA
CTTGAAA.AjA
CTCTGGCCTG
TGAAGACATG
TTATGATTTT
AGCCAACACC
ATAGTGGTTG
CATTGTTTTG
GATGCGTTTT
ACCCATTTCA
GAAAACAATG
GCCAATAAAT
GGCAAGGGCG
ATTGGCTCTA
TTGCTCTCCA
CCGGAAATTT
CCCTTTGGTA
ATGCGCGATG
ATTCAGGTTT
GAAAGCATTG
GACCACCAAA
CACCGAAGAG
TCTAAAACCA
TCTTCAGTGA
GACCCTAACA
TTTA.ACCGCA
AAAATTTTTT
AGACTTGGTG
TCAAAGATGA
GAGCCTTATC
ATGCGACTTC
CAGTTTTAGG
TAATCATACA
AGATTTTAA
AACGCATGCG
TGTGGTTTTG
AAGAJAAT
AGATGAAAG
TCATTTCTCG
TTTGCCTA.AT
AGATCATGGA
GATTGATGTG
TCAGGCAAjA.
AGA.ATACAGG
CTAAATCTTT
AGTCATCCAA
TAAACGCGGT
GTACCATCAA
AGAGCATTTA
TTTATACATT
TGTATCAAGC
CTTGTTTTTG
TACACCCCAC
TAGCCCTAAG
AAGTGAATTT
TGATTTTTAT
AAAAAATGGG
TTTGATTGTG
CCTTGCACAA~
AACAGCCACA
CTCAAAGCAT
TGTGGTTTTG
TTTTTGATGT GTCTATGCAjA AGCACTATAA
AAACGAGATT
AGATTGAGAT TTTTCAAACC 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 C-ATATTAAAA ACCCTATCAT GTATCTCAAT TCTTTAAGAA ATCCCATTTT
GCATTTCATG
CCTTTTGAAG AGTGCATCAC GCA-AkCGCGC TATTTGTGGT TTTTATCCAC
TAAAGTGGA
AAATTAGCGT TTTTGAACGA TGATCACCCT CAAATTTTTA TCCCTGTAGC
GGAGTGA
1320 1380 1437
INFORMATION FOR SEQ ID NO:337:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1257 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1257
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:337:
ATGAAAAAAG
ATGGGCGAAA
ATTATCAATT
AAA.ATGGCAC
AAAGAGCTTT
ATTAACGCGC
CATTTAGACA
CAAGAGGGCT
AGGAGTTTTG
CAAGAAGTGG
ATCGCGCGAT
AGTTTAGAGC
GAAAAACATT
CGAAGAAACC
GCTATCGGCA
GCGTTTAAAA
CGAAAAGACA
CGTTTGAATG
CTCAAACTCA
GCCTTAGATC
TTTTTAGAAA
TCTATTTTAA
ATTTAAAGGA
CTTGCACCGT
GATTGGATAA
TTGAAAAAGG
TTTTACAAGA
CCACGATGGT
GTGATTTTGA
AAGAGAGGAA
TTTTAACCGG
TGATTAAAAA
CTAATCAAAT
TGCATATCGC
GCACTAAAAG
CGGATTTTAT
ATTTAGAAAG
CCCCCTCTAG
CGATTAAAGA
ACACGCCCCT
AATTTTTCAA
TCAAAGAGTA
AACTTTTGGG
TTTTAGCGCG
AACCAATGGG
AGAAGTGCTA
GCTTTTAAAG
AAAAAAGCGT
GAGCGAGTTT
TTGCAATTAT
AATTTTAGAG
CACTAACGTG
ATTAAGCCAG
CAACGATGAA
TTTACAGCAC
CGATAGGGAA
TGTGGGGCAT
CTTGCCTTTA
TTTGATGCGT
TTTGATTTTT
AAAAGCCCTA
CCCCATTAAA
TGAAATTAAG
TGTAGGACGA
ACTTTAGAAG
ACCGATAGCG
TTTACTGGTT
GGCGTTTTTG
TTTTTCATAG
GTGGGAAAAA
TGCATTATCC
CAAGTGGGCC
GGGAGCTATG
ATTACTGGAT
TTTTTAGAGC
AGCCATGATT
TTATTAGAGA
CCGGGCGAGA
ACGCATATCC
GATAGCGTGA
CATAAAAATA
GTGGAAGCGC
ATCAAAAGCG
*ATCTTTTTGA
AACAAGAAGC
CGGTAAGGAG
GCGGGGTGAA
GGCATGAC.A
ATGACAATTT
CCAGGGCGTT
CAAGCGTGAG
TTTTATGCTC
GGAAAGATAG
TGAAACGCAT
TTTTAGAAGA
TCATGCTAGA
TAATCGCTTC
GTGAAAGCGT
ACCCTTTTAT
GTTTGGAAGA
AGGCGTTCAG
AAAAAGACGG
CACGCAAGTG
CGATATTATT
TTACGCTAGA
AACCCAAGGC
TAAAGAAAAG
AGAAAACAAA
TATCAAGATC
AGGGAGGGCT
TAAAGGGGTT
AGGAAGCAT
TAGGATTGGG
GGATTTTTTA
GAGAATGAAT
TAAGAATTTT
TTTTGAAAAG
TTACAGCAAG
TTCTAAAAAG
GCAATTGCAA
CGAATTTAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1257
GAGAGGGAAA ATCATGCCGT TTTCTAA
INFORMATION FOR SEQ ID NO:338:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 528 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...528
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:338:
TTGAAAATCG
GTTTTAATCA
CAGGACAAGC
TTGGCTCGTT
GATAAATCCC
GAAGATTTGA
CATATCGTCA
GCGGCTAAAC
AGCTATTTGT
CTTACAGGCT
GTATCCTGCC
ATTACGCCAT
CGCTCATTGG
GCTATGAACT
TTAAAGCCCA
ATATTGCGAT
TTTTGA.ATGA
TTGACACCCC
TTTAGGTTTG
CTTACAAAAJ\
TATCCAAAGA
GGCGTATGTG
GGTGCGCTTG
AAACAGCATC
CTATCAGCAA
AAACAAGCTC
GGATTTTGAT
ATGAGCTTTA
ACCGAACACC
GCGGATAXAjA
TTAAACCGAG
CAAAGCAGCT
TATGTGCAAA
GACAATAACC
GTGTATGAAA
TACGCTTCCA
TCGCGCTTGT
ATTTCGTGGA
GCATTTCCAG
AGAGCATTA
CTAAAGTGTG
GCCATTTAGA
CCATTGCGAG
AGCGTTATAA
TGCCCTAA.
CTTAGCGATC
TTTTTTAAAC
TAATGAAGCG
TCGCATTGAC
GCAACGCTTT
AAGAGAAGTC
CGTGTCCATT
AATCGTATTG
120 180 240 300 360 420 480 528
INFORMATION FOR SEQ ID NO:339:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 561 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...561
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:339:
TTGTTGCTTG
AGCCGTTTCC
TTTACCCCTG
AAGCATCAAA
TTCATTTTAG
CTGATTATAG
AACTCCCAC
TTTAGCGATG
GCCGTGCCTC
TTTCTCGCTT
AAATTAAAAA
GATATGAAGA
TTTGGCAATC
AAAATGACCT
ACAATGACTT
CTGAATTGCA
ATTATATCTT
CTCTTCTAGC
TTTAAACGCC ATTGATCCTT
TTAATTTAGG
TGGTTGTATT
AAGCAAAGCG
CAATCAAGAA
GCATTTAGAC
TTTTAACCGC
CCGCTCTATC
TTATGGCAAT
CCTAGCCTTA
TATGGGGTGT
CGAGTTTTAA
AGCGTTACAA
GAAAACTCTT
TCTCATTTAG
ACGCCCAGAG
AAAACGAGCG
GGCAAAGCGA
GTTCTTACAA.
ACGCTCTCAA
AAGTCAAAGG
TTTATAAGAA
TAACTCCGAG
AAGCCGCTAG
TTTGCA.AGCA
TCTTAAAAJkG
GGTGTTGTTG
GGTTTCAAA
TATTTTAAGC
AACTTTTGTT
ACTTTTAAAT
CAATGGCACC
GATACAAAGT
AATCGGTAAC
CGCAAGAAAT
120 180 240 300 360 420 480 540 GATACAAATC CATCACGCTA
A
INFORMATION FOR SEQ ID NO:340:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 522 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...522
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:340:
ATGGGGTTAA
TGCGTTCGTG,
TATTGTGGAA
CAAGAAAAJAG
TTAAATA-ATG
TATAATTATG
CCCACGCATT
AACCCATGGG
TCTATGCTTC
AGAATAAAAT
GTTTAAAAGG
TCAATCGTGA
CTTTATTCAA
AAAGTTTCAT
GGGGGCATTT
CGGGTTATTT
CTTTTATTAG
CTGCCGTTC.A
CAAGGGTTTT
GGCCAAAGAC
AATTAAGGAA
CCACGACCAT
TGAATACAAC
AGAAGATAGG
TGACTATGAT
AGTCAAAAAC
AAGGGGGGGT
GTTAAAGAGA
ACCCATGAAA
ATGCTAGAGG
CAAGAAAGCG
AAGAGTATTT
GTCATCCATC
AAAAA.AAGTC
GAAATCGTTA
CATTGGTTTT
GAATGCCTTT
ACGCCCATGA
CTAAAAAACT
TGTTTTTAGC
ATAAAAATAG
CCACTTTAAC
AAAACCCTAA
CTTTAGAAGA
AA
TGTAATGAGA
TAGGGACGCT
TCATTTTCTT
TATCGCCTCT
TTCTCTTAAT
TTTACCCAAT
AAGCCCTCTA
GAGTTTGTTT
120 180 240 300 360 420 480 522
INFORMATION FOR SEQ ID NO:341:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 528 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...528
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:341:
TTGTGGCACC
TGGTAG TA.A
TATATTCCTA
GATTTTGAAG
CATTGGTTAT
TATGAACTcC
CACTTCCCTT
GAAGAATTCA
GATGAAAAGA
AACTCTATCA TTACTGTAT
TATACGCTTT
;AATTGATTG
AGAATATTA.A
TCTTTGTTCG
TGTATAACGA
TAAACCTTAA
TGGCTAAAAA
AAAAACGCCA
GGATTTTAGA
CGATCATATC
AGAGGTTGTG
TAACGATGGG
TTCCGAGCCT
AGATAAACAC
GAGGCGAAAC
TGCCGATTTG
TATGACGCTA
ATGTATTCOC
GATTTTGGGT
TTTGAAATTT
CATATTAAAG
GCTATTGTTT
ATAGGCACAA
CTTTTATCCC
AAAAGCTTTA
GCATTAATTT
TTTTAGACGC
GGCGCTACA
ATAAGGAAT
ATGACGATTT
GGATTGAAGA
TGCCTTA.A
TAAAAATGAG
TAAAAGTTTT
TGTGGTGCGA
ATGGGGGAT
TGATGA.AACT
GGTGCAATGC
GATCCCTTTA
AAGCATGCTA
120 180 240 300 360 420 480 528
GGTATATCAA AAATTCCGCT
INFORMATION FOR SEQ ID NO:342:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1074 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1074
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:342:
ATGCAAAAAC
CCAGAGTCCA
CCTAGCAAAG
TTAAAGCATT
ACGCTCCCTA
AGGGATAATG
AAGTGGGTTT
OATAAAGACA
AAAAGCGGTA
GAAGACCCTC
ACAGAAAAGT
ATAGAAGAOC
AAAGCCATAT
AGCGAATTG
A.ATGAAAAAT
AAAGAGAGCA
TGCATGGCCT
AACTCTCGTG
CACAAAATAA
AACTAGGCTC
TAAGCCCCAC
CTTCACA.AGA.
ACAACACCTC
TGGAAAAACA
ATGATGATGA
AAGAGATTAC
AAATCATTAC
AAACTTTTGA
GTAAAAGGGC
CTTTAAAACA
ATGAACGCCC
CTTATTCTTC
TCATGGAATT
GCGAATACAA
TAGAAATAGA
TTGTGTGTGT
ACCCAGCTCA
TAAAAACTCT
TAACGAAGTT
TCAAGAAA.AC
TAGTGCTGAT
AGCGATTAGA
AAATTTACAA
AACAGACATT
CCCCTATACT
AGCTAAAAAT
TAGACAGA
AGCGTGGGAG
CAA.ACAAGAC
CACACGAAkA
TGTGGAA.GTG
AGAATGGGTT
AGAGCAACCA
CAAAAAGGGG
TCGCAACAAT
AAAAACAGCC
AAAACGCCAA
AACCTCTTTG
GCGAGTGAAA
GATCCTAATA
GCCTATCGCC
ACCCCTTGCG
AA.AATCTCTG
AATTTCGCCA
AAAGACGGCA
AGCOAGTATG
GATCAAGTAG
AGCGAAATA.A
TATGAGGGGC
AAAAACCATG
CGAGCCAAAA
AATTATTTAT
CTCCTCA.AAA
TTTTACAGCC
CAAACGACGC
TAGCGCCACC
ACAATGAAAG
TCAAAGAATT
CAAGCATTTT
ATTACAGCAC
TTCATAAAAC
TTCTCCAAGC
CGACCAGOCA
AAATCACCAC
AGCCGACTTT
CGCGTAATGA
ATTATTTAA.A
TGCGCTTTA.A
GCACGCCTTT
TCAATGAAGT
CTTTTCTTAC
TTTAGTAACT
TAATCCCCCT
CACTGAAAAA~
CAACGAAAAT
CGCATGCGGG
AAAACGCGTT
COCTGAAAAT
AGAGCCTTTA
CAGAAGCTCT
ATGCTATCTG
GCAATTAGTG
TTATGAAACC
ATTGAATTTG
CGATATAATC
AGAAGGGGTG
GAGTATTGAA
TTAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1074
INFORMATION FOR SEQ ID NO:343:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 432 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...432
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:343:
ATACAATACC CCAAGTCATC TTTTTTTCAA AAAAGGGGTA ATTCAACCAT TCAACCATTC AACCATTCAA CCATTCAACC AATCATTCAA TCATTCAATC ATTCAATCAT TCAATCATTC ATTCAAGCAA CGCTACCTTA TTTTTATAAC TATTTATCTT AATCCCCTAT TTTTTATTAT TCCCCCTTTT ATTAACCCTT CCTTTTATTA ACCCTTTTAT TAACCCTTTT ATTAACCCTT CCTTTTATTA ACCCTTTTAT TAACCCTTTT ATTAACCCTT CCTTTCATAT AA
GGAATGGCAA
ATTCAATCAT
AATCATTCAA
TTTACAAAAA
TTATTAACCC
TTATTAACCC
TTATTAACCC
CCATTCAACC
TCAATCATTC
TCATTCAACC
CCTATTTAAA
TTTTATTAAC
TTTTATTAAC
TTTTATTAAC
120 180 240 300 360 420 432
INFORMATION FOR SEQ ID NO:344: SEQUENCE CHARACTERISTICS: LENGTH: 729 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...729 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:344:
TTGAAAGTCA ATTTCTTTGC CACTTGTCTA GGAGCAGCTA TTTATAGCAA AACGCTATCA AATTACTCCG CAAGGAAAAT TTGGAAGTGG TTTTTAAAAA TGTTGTGGCC AGCCAAGCTA CAACTCAGGA TACTATGAAG AGACTAAAAA TACAATATCA AACTTTATTC CAATAACGAC TACCCTATTA TTTTGCCTAG ACAGGGATGA TGCGGCATGA TTATTTGGAA TTGTTTGAAG GGCATGCGGA GTTAAAGATT TTTGCTCTAG GGTGTATGAA TTGAGCGAGT TTTTGGATAA
CGCATCGCTT
AGACCAGACA
AGTCGTTTTA
CGGTTCATGC
ATTTAACATG
AAAATTGCAA
120 180 240 300 360
GTCAAATACG
TTAAGGGTGG
GAACTCATTG
AAAGAGCCTG
CATGTGGATG
CAAAAAATGG
GGACTTTAA
AAGATAAGGG
CTAAAGTGAT
AATTGGAAAA
.AAATTTCAGC
TGATCGTTTC
GCTCTTTGAC
CGAACCTCTC
TGACTCGGCG
AGAAGAAcpJA
GGTTATGGTC
AGCGGATGCT
AAAACCCATG
AAA-ATCACAT
AAAAATCTCA
TGCTGCGGGT
AAAGAAAAGA
GGGTGCTTGA
CATTTTTATG
GGCATTCTAA
TCAGACAGCT
TTGGGGGGAC
TTAAAGACAT
TGAATATCAG
ACTTTTTAGC
TTGCCATGCC
TAAAAATGTG
TTTTTCAOTT
AGAGAGCCGT
CACCGCTATG
CTCAAGGCTT
420 480 540 600 660 720 729
INFORMATION FOR SEQ ID NO:345: SEQUENCE CHARACTERISTICS: LENGTH: 1206 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:345:
ATGATTTTTG
AACGAGCTTA
GGGTATCTTT
CCTTTTTTGT
GAGCATGCGT
TTTAAGGCCG
GATTTATTGT
CAAAACACGC
CCGGAATTGT
ATGAAAGGCA
TTGCAAAATG
GATTTGAGCC
AGTTTGCCGA
AGCTTGTTTG
ATCAAAACCA
GCGATAGGCA
AAAAGAGCGC
AAAGCTTCAA
GAATTTGAGA
AATAAAAACG
AGATGA
GGGATTTTAA
AAAACGCCCT
TGTATGAAGC
ATTTTGAACA
TTTACCCTAA
TTAAAGAGCA
TAAACACTAA
CTTTTAAGGC
TTTTTGAATT
CGATCGCTCG
ATGACAAAAA
GCTTGGCCTT
GCGTGTATCA
AGATTTTTAA
TGCAAATCAT
TGGTTGGAGG
ATGAAGATTT
AGGAATATGA
TCGTGGAA~c
CCCATAAAGA
ATATCAAAAA
GGATTTTATC
GCGCTTAGCG
ATTTTTAGAA
AATCCACAGC
TCTCAAAAAC
AGCCAAACCA
TTTTATAGAA
AGAGTTTTTA
CTCAAACAAC
TAGAAGCGAA
AAAA.AATAGC
AATGATAAGC
GGCGTTGTTC
TGAAAGTTTA
AAAAAAAGCC
TTTGCATTTA
AGAGAGCTTT
GATGAGAGTT
1CGCTTAATG
AGCGTTAAAA
TCTCAAAATA
TTTTTAGATG
AGAAAAAAAT
TCTTTAGATC
GGCGATACCT
AAGCGCGTTT
AATGAGTTTG
GACACAGCGA
CCCTTGATAG
AATGTGATGA
GTGAAAGTCA
GAGATTGAAG
CCTTGCGGCT
GAAAAACGCC
CTTTTTAGCG
GGGGTAGGGA
TTAAAATCCT
ATCAAAAGGG
CATAGCGCCC
AACTCACAGC
GGGGGAAGGG
AAAATTTTCA
ACCCTTTAGA
AAAAAACTTA
ATCAAGTGAA
TTAAGGAAGT
GGAGCGTTTT
TTAAGATTAT
ATGAAAAAAA
TCGTGGATTT
ATCAATTGTT
CTCAATTGCC
CTGTGACCGG
CTAGGGGGGT
TGCCTATCCG
GTGGGGTAAC
rTTTTGTGAT
ATCAAAAATT
AATATTTTA
CACTAACCTT
GTATTTTGTG
AAGCCAAACC
GCCTTTAAAAJ
TTTCAAGCAG
TTTGACAATG
GATACACAAC
AAGCTTTTCG
CACAAAACCC
CCGATTGTTT
ATTGCGTAAC
TGAAATCATC
CCTAAAAACA
ATGCCCTAAA
GTATTGTGGG
CACTTTAGAA
rTATAAAAGT
GCCCAAAATA
ASGAGATTAAC
CTTTAAATAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1206
INFORMATION FOR SEQ ID NO:346: SEQUENCE CHARACTERISTICS: LENGTH: 795 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...795 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:346:
ATGAAAATCA GTGT1 TTGGATAAAA
AGGAC
AAGCTCTTTT
TAAAA
TCTACCGATA
AAGAG
TGTTTGAAAG
ACTCT
AATAAAAGCT
CTTTC
ATTGATCCAA
A.AGTG
ATCGCTCCTG
TGATT
TTCAATCA
AACAC
ACGCAGTTAG
AAAAA
AAAAGGGCTT
TATTG
GGCATGTTAG
CGGTG(
AATTACCCTG
ATTATC
AAGGAAGAAT
TTTAA
'AGTAA
GCTTC
~GCCAG
GGCGT
AATAT
AAACT
AGTTT
GAGCA
CAAAC
~TCTC
GAAAT
AAACGATTTA GAAAACACTT
TGCGCTACTT
CTCTATCGCT
CGATTCAGAT
AGGCACGATC
TGTTTTAGAG
CCCCATGTTT
AGAAATCAAT
AACTAGCCAT
CCTTTCAGTG
TATCCATTCC
CCTTAAACTT
TCACACATCC
ATTGGACT.
AACGGGAAGA
ACTAAAGATG
GACGCTGATG
GCCCCCTTTT
AAAAGGGAAT
GTAGGCACGG
ACCGAAGAAG
TTTTATGA
CACGCTTTTT
AAAGAATACA
ATTTAGAAGT
AAAGCTATAT
AGTTTTTAGA
ACAGCTTGGT
AGTTCCCTGA
TGGTGGATGC
TAGCCGGTGT
ATACCAAGCG
ACATCTCTTG
ATTTCAGTTT
TCACCAAGCT
CTTCTTCTTT
GCAAGCTTTT
CATTAAAGA
TTCTACGCAJA
TATTATCTCA
GATCAAACA
ATTCCCTGTT
GTTTAAAAG
TTTAATGCA
GCTCTCTTAC
CATTTTGCCT
TAAAAGCGAC
CATTGATGGG
CACTTTAGGC
120 180 240 300 360 420 480 540 600 660 720 780 795 CAAAA AATCCTCCCT
INFORMATION FOR SEQ ID NO:347: (i) SEQUENCE CHARACTERISTICS: LENGTH: 939 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...939 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:347:
TTGAATAATT
ATGATAGAGG
AAAGCGAGCT
TTAGTGAGCG
TTTGGTTTGT
GCGAGTGGGG
AACGCGCTCA
ATGGTCAATA
ATTGACGCGC
TCTGTCATTG
CCTAAATTAG
ATTAAATTCG
TTGGAAAACC
TTCTTGA-ATC
TTGTATGCTA
GAAGTTAGGG
TAGAGCCTTT
GGGCGCGGAA
AGTGCGTTTC
TTAAAAGCTP
CGGGGCATTT
GGGCGTTGTT
GGGCTTTTAG
AGCACCCTAA
AACTTTTTGA
TGATTGAAAG
CGCATTTAGC
CTTCTTTGAG
AACCCATTAG
CAAAGCTTGG
AAGAAATCGC
AAAAATCTTT
AGCGTTTTGG
CAATTTAGTC
ACAAAGGAAC
ATTAGACA.TT
GCAA-AACAAG
GGACACGCCC
TTGGAGCATG
AATTTTAGA.A
AAGTTCTATC
GAGTGCGGAC
TGCTTCCATT
CGTGGAGCGC
CGTGGTGCTG
TTTTGGTGGC
CAAGCTCTCT
AAGTGTAGCG
GTGTTTGTGG
ATTAACGCCA
AAAAAACTAG
TCACAAATCA
ACTTTAAAGC
TTAGATCTTA
GGGGATAAAA
ACCTATTGGC
GTGCATGCTT
ATGCAATTAC
AAACCCTTAG
TACACTTTGT
AATGCGAGCA
TTTATCCAAA
AGTTTAGATG
AGAGTGTAG
GGTTAGACGG
TTGTGGGCGT
CCCTAGCGAJA
CGCCCGTTGA
CTAAAkTCTTT
TCGCTATTCA
TCACCATTGA.
TTTTTGGTGC
TGGTGGAGTT
CCATAAGCTA
ATTTATACGC
GGCGTTATAA
ATGAAGTGGC
TCATCTCTCA
AAGTGCTAGC
CATTGATGCG
AGCCGGATTG
TAAAGAAAGT
TAGCGAGCAT
AATCATTAGC
AAATGCGCAA~
TTCAGCGAGC
GTCTTTAAAG
TGAAGACAAC
TGCCATTAAC
TTTAAGCGCG
AGATTTGTTG
GATGAAGAAG
AGCTTTAGAA
GTTAGATAAG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 939
INFORMATION FOR SEQ ID NO:348: SEQUENCE CHARACTERISTICS: LENGTH: 873 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...873 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:348:
ATGTTAGAAA
GGGGGGCTAA
AAAGAGCAGG
AATAGCGGGT
TTGAAAAAA.G
CATTTGCTAG
TTGCAATACT
GGGATCTATC
ACTAAAGCTT
GCAGGCAGAG
GACTTAAAAG
GCGGCGAAGA
GGCGGAGGGT
GAAAAGCAAG
GATATTCTCA
ATGTCAAAAA ATCCCTTTTT
AGGGTTTTGT
TGGCAGAGCA
ATTTCACTCA
GTTTTAATTT
CCGCTTCATT
GGAATTTATA
ACTCTAAAGC
ATGATGGCAA
GCGATTTGAA
GCACGCCTAA
ACAGCCCAGG
ATTTTAAAGA
GTTTCAATTT
CCATAGAAAA
AGCAGCTCAA
AGACCCTA
GGCTAAGAAA
AGGGGTGCTT
TTACTCTAAA
TTATAGCGGG
GTGCGATTTG
AGTGGTTACT
CGATGGCGAT
AGATTTGAAA
GTGCTTTAAC
GGCCCTCGCT
AGGCTATG
CTTTAAAAA
AATCAAAGTT
GAGCTTGTCG
TATTTTGAAA
TATTATCAAG
GCTTGCGATT
CAAGGCGTGT
AAATACGCTG
AGGGATTTTA
GGTTGCACGA
AAAGCGCTCG
GCAGGGAACA
CGTTATTCTA
CAATACAATG
GGCTGTAAAT
TAG
GCTTGGGCGC
GTTTGGGGGC
AAGCTTGCGA
GGCATGGGGT
TAAATTACAG
CCCAAAACAC
AAGGGTGTGC
AAAAkAGCGGT
TATTAGGGAG
CTTCGTATGA
TGTATCATCA
AGGCATGCGA
GCGAAGGCGC
TAGGCGCTAA
GTTGTGTTTA
AAAGAGCTAC
TTTGAAAGAA
GGAAAAGAAT
CAATGGGTGT
CAACAAAGCC
GAGCTTAGGG
GGAATATTTC
CTTGTATGAT
TAAAGCTTGC
TGGCGAAGGT
GTTGGAAAAT
AACAAGGAAT
AGGGGCATGC
120 180 240 300 360 420 480 540 600 660 720 780 840 873
INFORMATION FOR SEQ ID NO:349: SEQUENCE CHARACTERISTICS: LENGTH: 714 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...714 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:349:
ATGAAGTCAA
ATAGGCTTAG
TCCAGCCGTA
GGTTCCTTTG
ATTTATTTAG
GAAATCCCTT
GATAAAAAGG
CCTTTAGTGA
AAGCAAGGGG
ATCAATTCCA
TTGGTCATGC
CAAGAACGCA
ATAAM.AGTC CAATCATTTA AGAGCGATTT
ATAGGGCTTT
CCGTTATCAT
GGGCTTGTTC
ACACGGGCGC
AAGCCTACCA
TTTATGGGCA
GGATAGTGAG
TTTTTCCTGA
CTAAAATCAT
TTAAAATCTT
TAGAAAGCTA
TGCAAAAAGA
TGTTTTCAAT
GTGTTTTTTT
TAAACTCATT
CCCTAGTAAT
TGCTCTAACG
CCTTTTGAAA
AGGCACTAGA
CGCTGAAAAA
TAATTCCAAG
TACGCCTGAT
ATATTTAAAA
TATTTTAACC
TCTCTCACCG
GTCTTAAACC
ATTTGCTGGA
GATACCGGAA
GCATGTAXAG
GGCAAAGGAG
TTCCAGCTCA
CCCCTAGAAG
TTTAGCTCGC
CACTACCACG
GCAAAAACAA
GGGTTAATTT
ACCAAAGCTT
TCGCTAAAA
TGATCTTAAT
AAAAGCTAGA
GAGAAAAATT
AAATCCAGCC
CTTATAAAGC
CCACCTGGTA
AACTAAACGC
AGTGATCGCT
TAACGCCCGC
AGAAAAAATA
ACTAGACATT
AGAGCTGGGC
TGACAGAGAG
TCAAAACCGC
CCTCCCCTTT
CATGGTGTTA
GCGCACCCGT
TGAAGAATTA
ATGA
120 180 240 300 360 420 480 540 600 660 714
INFORMATION FOR SEQ ID NO:350: SEQUENCE CHARACTERISTICS: LENGTH: 2046 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2046 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:350:
ATGGGTTTTG
CACATTCAAG
ACGAGCCGTT
CTCACTTTCA
AACCAAGCCC
AGGCA-ACACA
GTGAAAACGC
AAAAACGGCA
TATCAAAACG
CTTA-AGATTT
ATTATGGTAG
AGTTTCACGC
AGGGGGGCTG
CTGAAATTAG
ATCAGCCATA
AGCGTGATTT
ATCAAAGCCC
AATGGGCTTA
ATTGGTGCGG
GTGGTGGCGA
GGCCTTGGCA
TTAGAAGAAG
GCTTTAAAGA
GAGAAGTTTT
GACAATTACG
TTTAAAACTA
CATAATACAG
GAGTTTAAGC2 AATCAAGAAA C AAAGAAGAAT 9] TGCTCGCCCT
C
CAAAATCACC
GGCACCGGGA
C
TGCGGAGGGA
A~
TTTTGA
AAAAAAGCAT TTTAGACAAT
TTGAACGGAC
GGCCTTTAT
TAGCGTATT2
CCAATAAAGC
TTATCCCCCC
TGAATCTTT'I
TCTGCAAACA
TGATGGATTT
CGCTCAAAAA
TACAAGATAJ\
ATGAGTATCA
ACCATAATTT
ATATTTCTAA
AGACCAACTA
ACCAACACCG
GTAAAGAATA
TTTTAAAGAJA
GCCGCAGCAT
TGAGTTTCTA
M.AAA GACGA
AGATCACTCA
CGCTAAAAAT
AATTCACCGC
GCGAGCGGTT
kAGAAAGAGA
%.CCCCACGCA
V.AACGCGCA
.TGTGTTTGT
CGATTTGGA
?ACAACTTTC
TGTGTTTTT
LAAAAGATAC
GGTTTTAGG
~TGTCTATGA:
GATTTTAGCC
GATTGGCGCI
GAGTAAAGKz
CTTGCTTTGC
AAAAAGGGCC
GCTCAAAAT'I
GAGCGTGCAA
AGACAATTTA
TGAAAAACTC
AGACACGAJAC
GTGCGTGGTG
CATTTTAAAT
CCGCTCTAGC
CCACATTAAA
CCCCACGCAA
GGGCGAGAAT
TGAAGAGAGC
TGAAAGAGCC
TCGCTTTTTT
AGAATGGATT
TGGGGCGTTT
TATGATAGGG
TTTAGAAGAG
GGGCTTTGTT
TTCTTTACTG
AAAAGTGAGT
GATCGGGTTA
AGAAGAAAGG
TTATGTCAAA
AGAAGAAGCC
GCCCATTAAA
CGTAGAAAA
TAAAATTTCA
3GGAGCTGGGi
TGTGGCGTGC
ATGCAAGAAP
ACTTTCCATC
TGCGATTTT'I
TCAAATTTCA
*GATAGCGAAT
*GTGGATTTTG
GCCAAAGAGA
GCCCTGCAAC
GGCGATGACG
TTTTCCAAGC
GCTGAAATCT
ACGCTTCAAA
AAAGAAGAGA
TTAGAAAATA
TTGAACGCTT
GAGGTTAAAG
ATTAAACGCG
TTTTCTCTTT
AAAGACAAAC
CGTTTGAGGG
ACTAACCTTT
AAAGAGCTTT
GATTTTTTGA.
TGCATGAGCG
GAAGAGGGGT
CGCTTGGCTT
GAGCGTTCGT
CAGTTGCTCC
GTGGGGGATT
GGCTTGAGCG
GAAAAATTTG
CGCAAAAAA',
GCGGTAAGA(
-CTAGCGAAA)
GGGCTTTGA-Z
GTTTTGGTT]
CGGTGCTAG~z
*GGGCGAGCA'I
CTTACAAAGC
ACGATTTGC'I
CCAGCGA-ACC
TGGAATTTTT
ATCAGAGCAT
ATTTTAAAGG
TAGCGTGCGC
GTTTCAAAGG
GCCTGGATGT
TCGCTATTTT
TGAATATCCC
ACGCTTTGGC
TTTTAACA
TAGATGAAGA
TCAACCCTAA
AGGCTTTTGA
TAAAAAGCTA
TAAGTTTAGT
ATGAA.AGCGC
TGCATATGAG
TTTTCCCGCA
ACGTGGCGAT
ATTTTGGGAG
A.ACAAGACAA
TGATCAAACA
GTTTGTGCTT
TAGAAAAAGT
P TGCCGCATGC TAAGACTTTAp
SCACTTTAACG
kATTGTTGAAJA
GCTGTTTTTA
TAGCGATGA
TTCTCAAATC
GTATGAGCTT
TTGTTTGAGC
CTACCATTAC
AAAACAATTG
TTATGGGTTT
GCCTAAAATA
TAATTCCCTG
TTCGCACAAA
GGCTTATCAG
GTATCGTTTA
TTATAGGCTC
ACTCATGCAT
GCCCCCAAGA
GGGTTTGAAT
AAACGAATAC
AATTTCAGTA
TGAAAAAGAA
GAAAGAGCAT
ACTGGATGTC
TAAGGGCTTA
TAGGGGGTTC
CACTAGGGCT
GAAAATTTCT
GCCTCCTAAA
TAAGATTTTT
GAAAATCAAT
GGATAACGAG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2046
INFORMATION FOR SEQ ID NO:351: SEQUENCE CHARACTERISTICS: LENGTH: 831 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...831 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:351:
ATGAAkCTT CTAACACAAA AACCCCTAAA CCCGTTTTmA
GAGAGTTTAG
CGGTTGGAT:
TACAGAGGAC
GGTTATAAAA
GCGGATATTT
AGCCAAACTA
CAATATTCCG
GAAACAGCGT
AATTTAGTGG
GACGCTACCC
TCTTTTCCCC
GAAACGCATA
GAGCTAGAAC
AAAATCTAAG
TTTATTTTA
CTGGTTTAGA
TTTTAACCGA
TACAAATCCC
ACGCCATTGT
TTCTAAAAGC
TAAAAAATGG
TGGATATGCG
ATAGCGTGCA
CTATTTTACC
TTGATCCTAA
ACTTAGTAAC
AAGTATCGCT
AGCGAGTTTT
AAAAGGCTTA
TGTGCATGAG
GGCGTTTTTG
CAATATCAAA
CCTTAAAACA
CGTGTGGCTG
CTCTTTAAAA
AATGCCAGGG
CAGAGCGGCG
A.AACGCCCTA
CGACATGTTA
ATTAAATTAC
GATAAGGCTA
GAAATGTTAC
AGCTATCAAG
TGCCGCCAAA
AAAGGGCAAT
AGGGATAGTA
TGTGMAAGGG
ATCATGCGAG
GGAGCGAACG
GCGGCGGTGG
AGCGATGGAG
AAAATCCAAA
TCGCCGGGCC
AACCCCTAGC
ACCGCACGAG
AAACTATCAA
CAAGCGTGGC
CGGATCTGAT
TCATGAACCC
GCATTCAAAG
GGAGCAGCTT
AATTTGCCCC
GGAAAAGTTC
GGATTGATGG
CAALACATGCT
ATTTATTTTA
ATGCGTCATT
CAACAACGAG
TTTGGAGAGT
AGATGAATTT
AGCCAAkGGTA
TGTAGAAGTG
AAAAGACATG
CCCCACTTAT
TGGGTATGGG
TGTGATTTTT
AGGAGACAGC
GCTGTTCGCT
AAAACCTGAC
120 180 240 300 360 420 480 540 600 660 720 780
INFORMATION FOR SEQ ID NO:352: SEQUENCE CHARACTERISTICS: LENGTH: 1311 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1311 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:352:
ATGATOAAAT
AACCTCAAAC
GCCTTTGTTG
GTTTTAAAAA
ATTTTACAAG
TTAGCTTATA
CTCATGATTT
GCTAAAAACG
TTGGATTTTA
GAATTGGAAC
AAAGAAAAAT
CAAACTCA.AG
GTGATTTTAA
TTAAACGCTT
TTCTTGTATT
TTTTTCTTTT
GCTTGAACGC
TGGATTTGAG
ACACTTTAGC
CAAACGTCAT
AGAGTGAA.AC
TAGATCAAGA
ATATTTTAGG
AGGGATTGTT
ACAAAAAAAA
TAGAAAAACT
CCTCATTGTT
AGGATTTTGA
TTATCAATAA
TAGAAGAAGA
AAAGAAATTC
GTCTAGTTTT
CGCGCCTTAT
GTTAGATTTT
TGATAACGAT
TTTTATTTTG
AAAATGCGTG
GGCATTGCCT
GGACATTTTA
TCAAATCATC
AGAAGATCCT
GCTCACTTAC
AGATAAAGAA
AAAATTCACT
GAATCTGAAA
AGAAGAAAGC
AGCGAATTTT
TTATTAGAGA
ATTGGTTTGT
TGTTTGA.ATA
CGGATTTTAG
CGTTTAGAAA
ATAGAGGCTT
CCTAATATTT
GAAAAAGATT
AAGCGATTAA
AAAACTTTAC
CAGCATTTAA
TGCATGATTG
CTCAGCAAGA
GAAAAAATCG
GTTTTAGAAA
TAAACACTCA
CTTTTTCTAA
CCAA.AAAACC
AATTCACCAA
AAATCAAGGG
TGATCCCTAA
TTCGTTTTA.A
ACGAGCATCA
TTTTATCCTA
ACGCCCAAAA
AGCTGGAAGC
TCAACAGGCG
AAATTGATAA
AAAAGAAACA
CTTTTAAGGA
TGTTTATGCC
AACGCATTTT
AGAAAA.ACAC
CCCAGAGAGC
AAkACGCCAAA
CGCTAAA.GAT
AXAAAGCCAAT
TGACAGGGTG
AGAAGAGGAT
TCAGCACAAA
AGAACGCTTG
GAAAGAATTG
TGAAAATCGC
GAGCATGCCC
AAAATCGCAA
AA.ATCAAATC
GGTAAAAAAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 AACTATGTTA
GAGACGCTGC
TCTAAAATCA
GGTTTAGGGA
TTGTGGATGC
ACGCCTAAAG
GCGTTTAATG
GCTCATGTCA
AACGCCCGAT
GAACGGGTAT
AAAACCAAAA AGAGAATATC ATGTGAGAGA
TATTCCTGGA
ATGAGGTCAT
TATGGAATTA
GTTACGAGAT
TGACTACACG
TTTACTCAAA
ATACCGAACT
GAAGTGCTGT
AAGCTTTTAC
TCGCATTTCA
GCCAAAATGT
CAACGAAAAT
ATTAGTCTAA
ATTATAAGGA
AAGACGCAAG
TCGTTTTTTG
TGATTAAAAkT
TTGTCAAXAT
AGGACACTTA
TTTTAAAATC
AGCGAATGAT
CCAAAAGAAT
GCAAAAAGAT
CATCAAAGGA
A
1020 1080 1140 1200 1260 1311
INFORMATION FOR SEQ ID NO:353: SEQUENCE CHARACTERISTICS: LENGTH: 696 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...696 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:353:
ATGTTTAAA
CTTGAAGCCA
GAAAAGCTCT
TATTTTTTAG
AAAGATTGGA
ATCGGGATTA
TATAAAGATC
GAATTTCCTC
TTAGTTGTTT
GGATTAGCGA
GGGTTTA3AGA
CCCCTATTTT
AAATCATTTT
TGCCTATTTT
ATAGAAGCAT
GGGGGGTTGG
TCGCCACGCT
GGGGGGTTTT
CTACCAACTC
TAGGGAGTTA
ATACAGACAA
TTAGTGGAGG
TCTTGCCCAC
ATGCGGCATA
TTTGTGCGTT
GCGCAATAA-A
CATTAACCGC
CGCTGTAGAAI
CAATTTAAAA
TGCATGGGAT
TTTTTTTACC
TAAGCATTAT
GCAAAATTTC
GGTCATGCTC
CGCCAGATTG
CAGCTATAAA
TTTTTGATAG
ACCCCCAAAA
CAAAAGCTCA
GCCATTAAGG
ACCGGCGTGC
CTTGGGTCAG,
ATGCTTGCGG
TTAGGGGCGT
AAGTTTTTTA
ACGCTTTTTT
CTTTCTAGCT
TTTTAA
GGGGATTTGT
AAAATTACCA
CGCGTAAAAA
ACTATCAAGG
AAAGTTTTTT
GAAAAGTGAA
TGGGTTTGGA
TTGGGGGAGC
AACATTCTGT
TAAGGCACCG
CTAGTCGCTT
CATTCCACCC
AGAAGCCCAT
AAGCGGGTGG
CAAGGAAATG
TAAAAAATAC
TTATCAAAGC
TGTGATCATG
TAGGGGGGCT
GGTTTCAGGG
CATTGAATTA
TGAAACTTCG
120 180 240 300 360 420 480 540 600 660 696
INFORMATION FOR SEQ ID NO:354: SEQUENCE CHARACTERISTICS: LENGTH: 1983 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1983 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:354:
ATGCTAAA
GCCGTTCTT
TATCGCCCC.
TATGATAAG,
AGCCTTCTA1
GTCATGCGC(
CTAACCCAA(
CTCAAAGAAC
GAGCGTTAT'
TTAGGGTAT'
GCCTTACCTI
AGGGCTAATC
AAATCCGCTC
CCCTATGTCC
GGCTATACCA
CGTTTTGGGC
TCTAATGATA
GGTAAGATTT
ACGCAAGCCA
GATAATGGCT
AATTATAGTA
TTTTTAGGGC
AATTTAAGCG
AAAAACCTCC
GCGGCTGAAA
GAAAGCATCA
ATCACTTCCA
GGCACAGGGA
AACAACAATA
TTTGGGAGAG
CCTGTGTATT
TTTGATGTCC
CCTAATrCCA
TAA
A AGATTTTTT G TCGCTCAAG A GTGTCGCTT,
SAATTTCGTT'
3 CGGTAGAAG, 3 CTATGATTA AACTCGTTAj 3 CTATCAT'CT P7 TGAACCAAAC P7 TTAAXAAACC SGGGCTCCA.Ac 3ATATTTTAAC
TCAATGAG
STGGATGAG'
TAAAACTCAC
ATCAAAAAAT
AAGATGAAGA
TAGCCTTAGT
AACGGCAGTT
ATTCCACCAC
AAAACAGCGT
TTGTAACCTT
ATCAGCTTGG
CTAAAGATTT
AGTATTCTCT
CCAACCAACA
AAGAACAAGC
GTTTGGCTCG
TTGACGCTTG
ACGATAACAC
CGTATTTCAT
CCAAAGGCTT
TCACCCCCAC
A TGGTTTTATC T TTGGGTAACT 2 ACAGATTTTA I TTATGCGCGT k CACCCTCTTT
SAAACGCTA
~AAACATGGTc3 2CATACGCATT 2TTTTTTTGGG 2CCTTGACAAA
TTTTTATGAC
GCGGTTGTAT
GCCAATCGTC
GTTGAAGCAA
GATAGATTTG
CTTAGAAAAA
CAACTTAAAC
GGGGGGGATT(
TGGGAGCGCG
TTCCAAAATC(
GCAAAACCACC
GCA.AGAAGCC 9 CTTTGAAAAA GTCTATTGTG
I
ATTTTCTAAT
AAACGAAGTC
TTTTTTAACC
C
CATTAAAGGT
I
GTTCATTGGC
T
GCCTATTGGCA
GCGCAATATC
T
GCGTAAAGAA
A
CCCCAAAAAA A
GTTTTATTTT
ACGGATAAGG
GACAGAAAG
TTTGAAGAAA~
TTTGAGCATG
AGTGGTCGTT
CTCACACGGG
GAAAAAGTCT
CATGGGTATT
CTCACGCTTA
CCTACCAAAA
TCTTTAGGCT
TATAACCAA
TTGGATCAAT
GATTACCAAC
ATCGCTAAAG
GCCAGCATGA
GATTATAAAA
k.TCAAGCCTT
CTGATACCGC
"CATGGCACC
C
ETGAGCCATT
C
~TTTATCAAT
C
7TAGGGAGCT qI 7ACGGCACCA
T
LAAACTTTCA
C
TTTCAGCGC
T
TAGAA.ATTG
C
TTACCCCCA
C
AAGGAGCGA
C
TAGCGATTG
A
TCGTGGATAA
CAGACGATA G
TGATTATCGT
ATATTGCTAA
GGCGTTTGAT
TCCCCCCACG
GGGGGATCAA
ACACTGAAGG
AAAAAACCCT
TAAGCAAAGA
ATGGCGTGAA
AAGAAATCAC
ATTTAGAATT
GGATTTCTTC
CTTCCACGCA
TAGACGGGTT
GCTTAGCGTT
AGAAGCCAAA
TAGTTACAGA
kIAAGCGCTTT 17TGTGTATCA 2GCGAA.ATTT 2TAGCAATTA 2GTTAAATCT( 2TTTAAGCGAC 'TGCTATCTC
I
~GCTCAAACC C
GCCCATTGA
GATGGATGC
C
CGGTAAAAC C CTTACAAAG
C
AGGAGGCGT
TI
ACCTTCTTTA
AATCCCCTA
C
CGAAGAACG
C
AGGGTTGTTG
AATTAAAGAT
CGCTAATATT
ATTTGTTGAA
TTTAGACGCT
GGGTAGCACT
AACCAGAAAA
AGAAATTTTA
AACCGCAAGT
CATGTTAGTC
TTCACTCTCT
TAACGAGCTC
AAATATCGCT
AAAAACTCAA
GGAGTCTTTG
A.ACTAACGCT
AACGAGCACC
27AATCGCGCC
AATCGCTTTT
TGAAAATGGC
TACTCGCAAA
3GCTACGATT
'ATGGGGTTT
.CCGATTGAT
2ATGCTCATT
ACCAAAAAG
GTAGAAAAC
GGGACTTCT
~GTGATCTGG
'GTGAGCGCG
AAAAGAAAG
TACTCAAGC
TTGTTGTTr 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1983
INFORMATION FOR SEQ ID NO:355: SEQUENCE CHARACTERISTICS: LENGTH: 972 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...972 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:355:
ATGAAATATT
TATGGCTTTT
CTAGATATTA
TTAAZAACCAT
AACGATGCCA
ATTACCCTTT
CAGCATTTTG
ATGTTTCTCA
GATAAAM.AG
TCCATCTTGT
TCACGATTZA
AACTTTATGA
ATTGTAAGTT
TAGATCTATA
TATTTAAAjAG
CCCCTTCTAT
GGACCAGGAA
TCACAAACAT
ATCAAAAACA
ACCGACTCA
TATTACACGC
AGTATGGCGA
ACTCATGAGA
ATATCGCTAG
GGCTCTAAAA
TTTTALATGTC
ACGCATAAAA
AAACTAAAAC
TTTTTAGAAG
ATGACAAGTC
TATATGAAGA
AATTGGGGTT
AATGAATCCA
TT
TTTALATATAC
TCAAAAACTT
AAAATTGCAT
AAACAAAGAG
AGCGCTTGTG
TGTOGATACA
CCCTTTTGCA
CGCTTGGATG
CGCACTAGCG
TATTTTCCCC
AAAAACGCCC
CTCTCAAGGA
TTTAGCCCCT
CAAAATAACG
TATGGCTTTT
AAAAGAGAGC
GCTATAGGGC
CCTAAGATTG
GAAGTCTTAG
CAAGGTGCTA
AGCGTGGCGG
GGAACGCTCA
GCGCACAACA
AAGCGCCTGA
GATTATACGA
AA.ATGGGCGA
ATGATTTTAA
ATGGCTGTGG,
GATGGCCA-C
CGCTATCCGG
GTTTCAGATA
GCGGAGCAAC
TTTTTGCAAC
A.AGTGCGCTA
CGAACGATTT
TCAATTACGC
TAGAAAACGG
AAAAGACTTT
TGGCCATTGT
TTGTTTTTTC
TGCGTTATCA
ACGCTGAGCA
AATACAACAT
TCTCTAGCGT
CGGATGTTTA
GGATAGATGT
GATCCGGTTA
TCCTTTATGA
AGATAAAACG
CTCCATAGAT
AAAGACTAGC
AGAACTCPJAG
CAATAAAATT
TGACTACCCC
GGTGAATGAT
TAAATACATT
AAAAGAAATC
AACGGAGTTT
TCAAAAAGCC
GAGTTCTGAT
TTTGTATGAC
TTCAGGCGTG
TCCTAACATC
AGGAAGAAGC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 972
INFORMATION FOR SEQ ID NO:356: SEQUENCE CHARACTERISTICS: LENGTH: 3867 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...3867 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:356:
ATGGAAATAC
GGAGCGTTGA
ATTCCAGCCA
CTTAGTTGGG
GTTTGGCGCA
TACAkATCCC
AACAAACACA
TTAGCGCCAT
TTGTTGGGGG
GACTCAAACA
TTCAAGCAGG
TTTTATCCAG
CCGCAAAATC
ACCGCAAGAG
TATCGCCACA
AGCCGAAGAJ\
AAAAGGCTTT
TAAGATTGAT
AATCGCCCTT
TAGTTTCTCT
AGTCATGCCG
CCTTTTTCAC
OGCACTGCTG
TAGGAACGGT
GCGAATpApA
CCCCAGATAA
AATGAATTTC
CTAACAAGGA
GGAGGTTGGG ACTGGGGGAjA
CGTTTTAGCA
GACCGTGATC
CTCAGGGCTT
ACCCGATAAA
ATACGACTTA
CGCCGCTAGG
120 180 240 300 360
CATTATTGC
GGGACTTAT
CAAAAACC
AGCGCTGAT
GTAGAAATC
ACTTTGCAJ\
GGCGCCACC
CGTTTGCAA
GTTCAAOGG
GGCATTATC(
TTAAATATC,
CAMAGTGGC,
GTCATTAAC(
GGGCCTTTT(
GATGGCACG;
AATATCGGCI
GAAAATCTA)Z
GGCTATGCT'
AACGGCACA(
GATGCTCATI
GATTTTAGTC
GTGGCCGTT
GGGGAATACI
TTGGAAACTC
CTAGTTATCP
AATGTTGAAZ
AAGCTCATGL
TCAAATTTA
CGAGGCGGGA
GATATAGACA
CTCATTAAAA~J
TCTACAGGTA
GCCCTTTATA
AAAGCATGCG
TATCTTATCG
ATTTCGGTGT
TTACCCACAA
CCTTTCGCTC
ATTGAGAGCG
TCAGGCGCGC
TATGCCAGAC
GCCACTGACG
TTGAGCTTGA
ACCAACCATA
TCTTTAGAGA
AATGTTTGGG
TATGGCACAA
GGTTTTGGAAi
GCCAATAACG
TTTGAAGCTC
TTACAAGATT
TATGGTTATG
AGCTATAACC
AAAAATGGCG
TATTATGGGG
TTTGGATCGA
TTAAGCACCT
G TCAAAGGCG 'A AACTATCAG A CTTTGCGTT C GCACCACGA A ATAATCGTG G CTTCAGAjAG C TCAATTTGG T ACGTGGGAG 3AAGTGGATT' 3 CTAGCAATAK k. TTGCCCCTC( k CTAAAAACG CACCCAATAj 3 CTGGCGGCAj
TTAAAGTGGC
k~ AAGGCGGTG'.
k CCGGGA.ATA'.
PTGGCAGGAT(
3CCACTTTCAJ CAGCTAATT9
GTGTTACAGI
AAAACTTCA.2
CTCATTTTAC
GCACTAGGTC
ATGATTTTTPA
TCACCAGAAA
TTAATAATCT
*CCATTCAGGG
AAGTGGCAAC
*GCGCGACCGG
ATACAGAGCA
CCAATGGCAT
ACAATAATAA~
GTATGGCTAT
GTAAGGCATG
ATTATTTAGG
ACACCACTAA
ACAGCGCCAC
TGTTTGAATT
AAGGCAGGGA
AALATGATTGA
CTTTAA~cAA
GTAATGCGAT
TTAACTCGTT
GCGCGGCAGA
CTAACGCTAT
GCGCCGGCGT
GCTATGGTTA
CTAATTTTGG
AAGGGGCGCT
TGAATCAA.AG
ACTTCGCGTT
ATTTAGGTTC
CGAGCAGTCA
ACACTTCATA
ATGATGTGGC
ATGCAAGAGC
G GCA-ATGGAA G GCTTAGAAA T GGGCCAATT G AGTCAATTT T GGGTTCTGG.
G GATCACTAG, C TTCA-AACAG
CGTATTTAGC(
r TAACCATCT( k GACTCATAT'
AGAAGGTGG(
k CAAGAAAGA(
CACGCAAAAJ
AGACACGGT'
3 AGGGTTTAAI P~ CAATCTGTCC P CACCGTTGAI
ALAGCGCGAA]
TAACGATATq
TAAAGGTAT'I
CAAAGTCALA'
CATTAATGAA
CGAAGATATA
AATCTTTTCT
LCTATAGCCCT
ATTCGCTTCT
AACCTTGGGT
GGATTTTATC
CTTAAATGTA
ATTTTACAAA
TGTTTTATTG
TAGTAATGTT
CCGCATGGAT
CGGCAATCAA
GAGAAATATA
CAATTCTACG
TAATGCGCAT
TCCTAATTTA
GGCTAACCGC
TCTCTTGCAA
TAACACAAGC
CGTAGCCAGT
GATTTTAAAT
CGCTCAACGC
AGTGTTGTAT
TGGGGGAC
AGACGCTTTC
TAGCTCCTTT
CGTGTATAGC
AGGGAGCGAT
CTATALATTAC
TTTTAGGA.AC
AACCAACTTT
GCATTTATTC
CTTTTATTTG
GTCTTTAAAC
GATGATGGGT
C AAGCTTGAA C TTTACTGGT C AATGGCAAT C AACGCTAAA.
~GCCGGGAGk.
2AGTAAAAAT(
GTTAAATTA,
2 CCTTCATACJ
ACTGTGGGG(
r GGCACACTGC -TACAAGGAT2 3 ATCAGTCAA2 k. ACAGAAAC'pc
PGTCAATATT'
G CTTCTCTTI
AATCAAGCG;
GGGCCTTTA2
TTTGAGTTTV
AGTTTGGGA.P
GATACGGGTP
ATCAACAAGC
TTGATTGTTA
GGCAGTCAAJI
GGGGGTGTCA
TGGAATTATT
TCAACCCCAG
CAAAATGCGG
AACAATCAAG
GGCAATGCAG
CCGCTCATCA
AAAGCGAAAA
AATCTAGAAG
ACTTGTGTGG
AGCATGGTGA
GGCATCAGTA
CCTACTGAGA
TCTGCTAACT
GTCGCTATCA
TCTAAAGATA
ACTTTATTGA
ACCGGTGAAA
TTAGAGCATA
TCTCGTTTAG
TTACAAGCTT
CAATTTGCCC
AGCTTGAATA
CTTAACGGGA
AGCAATCAAG
CGTTTTTTTG
CAATCAAGCT
TTAGCCTATA
GCTTTAGTGT
AAAAGCAATA
AACGCTAACG
CATGCGGGAG
ACCTTTAAAA
G TGGATATGAA G GGGATTTAGA T CTTTCACALAG A ATATTTCAAT k AAGCCAGCTC 3 CCGAAAT'TC k ATGGTA.ATGT k GCACGATCAAi 3 ATCAAAACGC 3ATTTGTGGCA.
AACCTAATAG
k ATAACAATAG 3AACCCACGCA
TCCACTTAAA
SCCACGAATGC
GCGGGCGCAC
GAGTGAATAA
AGGCTGGTGT
GATTTGTGAA
ATGGTGGTTT
*TCATCACAGC
*AAACCAATGG
CGCGTATCA
*AATTTAAAAG
TTGACGCTAG
AAAACCCTTG
TCATGGACTA
GCACTATCAA
CAGCTATGAT
AGATTAACAG
TCATTGGTTA
AGCAATTCAA I
TGCGAAATACI
ACAACCCTGA C AAACGGCTAkA C ATGGTGGCAA
I
ACGCTCTCGT C ATCAGCATGA TI TTGACACGCT C TTGATAGCCA TI TCACCALAGCA A AACAAAGCGG C TCAATCTCTC
T
TAAAAGGCCA
A
CTAAATATGAA
GCGGCTCTAA C ATGTCGAAGC C CGAACTCTCT T CCAACCAGCA T TGAATTTCAA
A
GCGCCACAGC A TAAAACCAAG
C
GCCAATCACA A, CTAACGTGGA
A,
TTTTACAAGA
G'
TCAATGCCGC T~
AGACGCTCTA
CGTGALATATG
CTATA-AGGAT
TGATAATTTT
TACGGTTTTC
TCTTTATGAT
GTGGATGGGC
CACTTCAAAA
CGCTCAAGCG
AAGCGCCGGG
TACCACTTCT
CAACACAGAG
AGTCATTGAT
CACTAAAGCC
GGCTCATTTG
CCTTTTAGTG
TCA.AGTGGGT
GGATACTAL
TTTAAAGGTG
CAACACCTTA
TTCCACTAAT
GATAAGTGTG
rACCGTGCGT
CGGTGAAAAA~
GAATGTTAAA
GGGCACATCA
rAGTCAATTT
CTATCTGGTC
-TTTAATAAT
2GCTCAAGAT
PGGTAATGTT
.GAGCGCCTA
2GATGACATT
A.ATTACAAG
GGCTCTAAA
ACCACCAAC
;AAGAACGCT
'TTTGGCACT
TATACTCAT
'GATGCGCGT
*TTGAATCCG
TTACAAACC
'AGGAAGCAC
.GAATTCGCT
AAACCTACC
GCTTCATTG
ATTGTGGGC
AACTCTGGG
GAATTTGAC
AGCACTCTA
AGAGCGAGT
GTGGGCGTG
GTGGCTTTA
GCGCGTTAT
TTCGCTCAC
CCAGTCCT
GAAGTGTTT
420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 GGGGAATTGC AATTGGCTAA TTGAATTTGG GCGTGGTTTA TTTGCACAAT TTGATTTCCA ACGCAAGCCA
TTTCGCTTCC
AATTTAGGA TGAGGTATAG
TTTCTAA
INFORMATION FOR SEQ ID NO:357: SEQUENCE CHARACTERISTICS: LENGTH: 1158 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1158 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:357:
3840 3867 ATGAAAGAT~p
GCCGATCAGA
GTCGCATGCG
TCTGTTTATG
GACGCTCTTT
AGCCCTGATA
GGGCTTATGT
TTAGCGCACC
TTAAGGCCTG
GTTGATACGA
GAAGCGGTGA
ATCAAGTTTT
GGTTTGACAG
GGAGCGTTTA
TATGTGGCTA
TATGCGATTG
CATTCAAGCG
ATCATTGA3A
TTTGGGCGCG
GCGTTCTTTA
GTTTTCTTTT CACTTCCGAA TCAGTAACrm
TCAGCGATGC
AGACTTTAGT
CCCCGATGCA
ATGGCTTTGA
TTAATCAAGG
TTGGTTATGC
AGCTCGCTTT
ATGGCAAGTC
TTGTCATTTC
TTGAAGAGAT
TTATAAACCC
GTAGAAAAAT
GCGGGAAAGA
AAAATTTGGT
GGGTGATAGA
CGGAGTTGGA
GCTTGGATTT
AGTTAGAAGA
AGCGTTAA
GGTTTTAGAT
TTCTAATGGT
AGAGATCGCA
TTACAGAAGC
CGTGGATAGA
ATGCAAAGAG
CGCTCTAGCT
TCAAGTGAGC
CACCCAACAT
CGTGTATAAG
TACAGGAAAA
CATCGTGGAT
CCCTAGCAAG
AGCGAGCGGG
GCCTGTGTCT
AAAATGCGTG
GTTAAGACCC2
ATTCACTTGGC
TACATTATTG
TTTTGCATGA
AGAGAAGTGG
GCGGCGGTTT
GAAGATGGCG
ACTGAAACGC
CAAAAAAGAAz
GTGCGTTATG
TCCCCAGAAG
GTTTTACCCA
TTCGTCATCG
ACTTATGGAG
GTGGATAGGA
GTTTGCGATA
kTTTATGTGA U ATCGGTTT
~TTTATTCGC
~AAAAGACTA
AGCGGGATAA
TCACTGGCGA
TTAAAAAAAT
TAAATGGCAT
AGATTGGGGC
TCATGCCTTT
AAGACAACAC
AAAACAACAA
TTTCACAAzA
AAGAATATTT
GTGGGCCTCA
GGTTTTGCCC
GCGCGGCTTA
AAGCGACCGT
ACACGCATAA
rCAAACTCAC
TCACTTCAGC
%CAAGGTTGA2
TGATAAALATG
AAAAGCCAAA
GTTAAAAACT
TGGCTATACA
TGGCGAGCAA
GGGGGATCAA
ACCCATCCAT
CTTGCCTTTT
GCCTGTAAGC
GCATTTAAAA
GCATGACAAT
AGGCGATGCG
GCATGGAGGG
TGCGGCCCGC
GCAGCTTGCT
CACGAGCAAG
GCCAAAAGGC
PTATGGGCAT
kGAGATTAAA 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1158
INFORMATION FOR SEQ ID NO:358: SEQUENCE CHARACTERISTICS: LENGTH: 207 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:358:
ATGAAATTTT
GTCGTTTTTT
CGCTATAAGG
TTAGTCAATC
TAAACGGATT AGCAGGGAAT TTACTGATTG TGGTTATCTT
ATTGTGTGTG
TCGCGCTCA AGCGATCCAT ATCCAAAAG AGCAAGCCAC
CAATTATTAC
ATATTAACGC TTTAGAGGCA AAAAACACCC AAAACCACGC
TAATTATGAA
AAGGGAGTAA
AAAATGA
120 180 207
INFORMATION FOR SEQ ID NO:359: SEQUENCE CHARACTERISTICS: LENGTH: 1509 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1509 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:359:
TTGAAAATCT
TATAAGACTC
AGCTTGTATG
GACGGCTTTG
GACATGCAAA
TTAAACGCGG
TCAGATTTTT
AACCCCTACT
CGCATTAAAA
GGGGGGCGCA
TTAGACGCTT
TGGCGATTCC
AACGTTAAAG
GAATTTGAA
TATTATOGGA
TCGCCCATCA
TTTTAGTTAT
CCATTTCAAA
CTAAAA.AATTT
ATGCCTTGTT
CTTATATCTA
CCAATCGTGG
CAGACATCAT
ATATCCGCAA
AACGCATGCA
ACATTOGGGA
TGTTTTTTGG
ACCGCTCTAT
AAATCGCTAA
AAAA.AGTCAA
ATGCCATTTT
AA.ATCGCTTT
TCTAAGCGTC TTTTTTTTTA
ATGGGTGTTT
CCCCCCTATC
AAAAGAAAAC
GCACAGAGTG
TAAAAACGAC
GGTAAAAGTG
GCTTTTAAAT
TAAAGGCTTG
CAACAAGCTT
CAATTATTTT
GGGGGTTGCT
CCCTGTTTCA
GCTCCATAA
TGATTTTATA
TCTAGCCGAT
TGAAAAAGCC
TCTTATGACC
CCTAAACATA
GGTCTTATTA
CTTTCTTCTC
CGCATCCTTT
TTCCACAAA
CGTTATTTTG
TTCATCGTGG
GATAACGATT
TCAAAGGCCA
TTATTAAGAA
AAAATCCCTA
GAACGCTTCC
TTGCCCGCCA
CTTAAAAAcG
CCTACACTAC
GTGCGGCCAT
GAATGAGCCA
AAGTGATCGC
TAGATGAC;JA
ACATTGAAGT
AAATGCTTGC
ATAATTTCC
TAGACACGAA
AAGAAAGCTT
CCCATAAAAG
TCAGCGCTGA
AAAAATACCA
AAATTGACAC
CTAAAGACTC
TGGGCTAGTC
CACCATTGGG
TCTTTTAGAA
AAAAAGCATT
TAAAGAACTT
CGGGTTAGAT
GAAAATCTTT
GGATTATGAA
TGTCATTATA
TTTTTTAGAT
TGAAAATTAT
ACTCAAAAAC
AGACGCAAAC
ATACCCCATT
GCCCTTGTAT
CGTTTTTATC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960
GCTTCATCGT
AAGGGGATTG
TATGGGGCCT
ATACGAAACG
CACGGCAAGA
CCGCGCTCTG
AAAAGGGTGC
CGGCATAGAG
CCTGATACCT
GAACTTTAA
ATTTTATTCC
AATTGAATAT
GGGAAAGGTA
ATTTTTTCAA
CAATTGTTTT
CATATATTAA
GTTTGTCGCT
TGATTTGGGA
CTTTCTTTTT
AGGTAAAAAG
TCTTACCAAJT
TCGCAATAAA
CCGCCAGATT
TGACGATGCC
CACTGAAAGT
TAAAGATCAT
AGCTACAGAA
GCGTTTGATT
ATAATGAAAA
TCCCTTTCAT
TTAGTGCGAA
AAAGGGCGCT
CTAACGCTTC
GCGGTCTTGT
GCCCAACAAT
GAAGGCATCT
AAAGAATGGT
TCTTTAAAAA
CTACGGACGC
TGGGCGCGAA
TTAGCACCAA
TAGGGAGTTT
TTGACAACCC
CATGGCATTT
TAATCCATGA
CTAAAGTCCT
TCAAATTTCT
TATCGTGGTC
TGTCTATGAA
ACATTCCTTA
TAATATTGAT
GTCTTTTGCC
AGTCTTGTAT
AAAAAACTCG
TCCTGAAAGA
1020 1080 1140 1200 1260 1320 1380 1440 1500 1509
INFORMATION FOR SEQ ID NO:360: SEQUENCE CHARACTERISTICS: LENGTH: 534 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...534 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:360:
ATGCGTTTTA
TGGGTTTTTT
AACGCCATTA
CGCCATGAAA
GCCAAAAAGC
ATTGTGCCGG
AGCGGTAAAA
ATGTTTGATT
TCCACCAACT
GTTACATTGA GCCAAGAGCG ACATTTTTTT ATCTTTTGTG AACGCGCTCA AGACAACGCG TCAGCCGCTT GCAGGTAAAG GTTTGAATTA TAACGATGAT ATCTTATCAC CATTAATAGC CCCCCTCTAA AGAAGCCTTT ATTCTAGGGC GGAATTTTTC TTTCTA.ATTC CTTACTGATA AAATACTTAA TCAGCAAGCT CTAATAGGGG GGTTAGTGTG TCTAGTTTGA CGATCCAAGA ACTGATGAAA CTTTAAAACT ATACGAGATG TTTTGCAAGG ATTGAAATAG ACCAACAAAG TACTTTTTGT TTCAAAACAA CCTTTAAGCG ACGGGTGGTT AAAAACCCGG AGTCTATTAA
TTCTAAAATT
GTTTATGCAC
AAGGCTTTAC
CATTAAAGAA
GCTTTTGAAT
CGTGGTGGTT
ACTAA.ACCCC
TAATTTTGTC
ATGA
120 180 240 300 360 420 480 534
INFORMATION FOR SEQ ID NO:361: SEQUENCE CHARACTERISTICS: LENGTH: 669 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: (A) NAME/KEY: misc_feature LOCATION 1...669 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:361:
ATGCGTAAGA
CAAGAATTGT
TTGCGCCAAT
CTTTTAGATG
AAAAATTTAG
CTTAAAAATT
AGCAAGATTG
AATTTACCCA
AAAATTCTAG
CCCCCAAAAG
ACACCACCCA
GGGGTATGA
TCTTGTTAAT
TGCAATGCTC
TGAGTGAAAA
AAAAAAGCGA
CCGCTAAAGA
TGATAGAAGA
GCGAGACTTA
CTCAAAACGC
CCAAAATGGA
AAAATAAAGA
CGCCTCCTAA
GGGCTTGATT
TGCGATTTTT
AGAGCAGTCT
TCTGTTAAAC
AGAAGCCTTT
AAATGAAGGC
TTCTAAAATG
TTTAGAAATT
TCCTAAAAAA
AAATAAAGAA
AGAGCCAACC
TTACAAGCGC
GAATCTAAAA
TTGAGGATCT
AAGAAAGAA.A
AAAACCTTAC
ATTTTAAGAG
AAAGATTCTA
TTAATGGCGC
GCGGCGGCTT
AGCCAAAAAA
CTAAAAGATC
TCTTTGGCGA
AAGCCGAATT
TGCAAACCGA
AAGAAATAGA
AAACAGAAGA
AAATCAAGCA
AATCGGCTCT
TAAAGCCTCA
TGACAGAGTT
CCACAGATCC
CTAATGTAAA
AGAAGCCGCG
GAAAGAGGAT
AAACGCCCGC
CGAAAAACTG
AAAAAAACGC
GGCTAAAGAC
GATTTTAGAA
AGAATTAGGC
GTGGCAAAAA
CACGCCCCCC
AGAGCCAACA
120 180 240 300 360 420 480 540 600 660 INFORMATION FOR SEQ ID NO:362: SEQUENCE CHARACTERISTICS: LENGTH: 1296 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1296 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:362:
ATGAATATTC AAACAAAGAA AAGATTTTTA GCAAATTTAT TGCTTTTTAG
TGCCTTAAGG
CATAGAGGGG
AAAGTGGTGT
GCTATGCATT
AATAAGATTT
TTACACAAGA
TATTTGACTC
GTGCATGATG
AAAGAGGGTT
TGCCAAAAAG
CTGAAACCCT
ATTTTGCCAC
ATGCTAAAGA
TAGCCATGCT
TAGTGGATGG
TTCGTAAGGA
AAAAGCGTTT
AAGACAGCCT
TGGATCTGTT
CGCTCAACAC
TTCAGAAGAC
CGCTCAAAAA
GGCGGCCATT
TTATCAAAAA
CTATGCACAA
AGAAAAAACC
GGATAAGGCT
AGAAAAACTC
GCAATCTCAT
CTTCACGCAA
CATCAAATCC
GGCTATATGA
TCAGCGGCGA
ATCACCAATA
ATGGGGCAGA
ATAGCCACAG
TTCCCGTTGT
ATTACGATCT
ATAGACAGGT
TTTAACGAGC
TGTTGAGTTC
ATCTCTATAA
GTTTAGGGGA
ATCGTAACGA
TTGATAAGGC
ACAATGTGTT
TGAATAAGTT
ATTTTTTACA
ATGGTTGCTC
TTGATTTGGC
CCTGTTTTCT
AGACGCTTTC
GCAAACCAAT
CATTAAAACC
TGTTTCTATC
GATTGAACTA
AGGGACTTTG
TTATAACCA.
AAACCGCAAA
AGAGCAATTG
TAAAACGACT
120 180 240 300 360 420 480 540 600 660
TTCGCTCGTT TGTATGAAAA AAACCCTATT GTTCAAAACC
TTAATCTTGT
AGGCGTTTGT
GCTTCTTTA
CATTATGA
ATTCAAAAAT
AAAGAAGACG
GACATGGATG
TCAGTGCTTT
GCTAAAAAIA
GAACACAATA
TAAA-AGAGTT
TGCTAGACTT
TCTATCAAGA
GCTTGAGCGC
TAGAGCAAGC
CGCAAGACGC
TTAAAAGGGG
ATTTGGATTC
TCTTTTCTAG
AAATCATTCA
TGATAAGGCC
ATACACCGCG
AAGAA-AAGAC
GAATAAGAAA
CACCAAAGAG
TTTCTTTTAT
CATGGATCTT
TTTAGCATGG
CATCGCTAAA
AGAATGC7AAG
CAGCAAATCG
CAAAAAAAAT
CCTAkAATTrCT
AAGCTCACCA
CGCCAAGCAT
AATTTTTTAG
GTGAGGAAAG
GGTTATTACA
GAGCTTATCC
AAATAG
CCCAATTTTA
CAGAATTAT'r
TTGATCAAGC
TAGGCTTAGA
AAGAAGAGAT
GGCTTGCTAA
GGTATTCCTT
CCTTAGCGTT
AATTAGGGAA
AAAACGAACC
cATAGGGGTA
CCCCTTTGAC
TTCCAAACAG
GGCGATTTAT
GTTGCCCATC
AACTAAAGAT
A-ATAGATTAT
AGATTCTAC
TTGTTTGGAG
CGAATTGAAA
720 780 840 900 960 1020 1080 1140 1200 1260 1296 INFORMATION FOR SEQ ID NO:363: SEQUENCE CHARACTERISTICS: LENGTH: 294 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...294 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:363:
ATGCTTTATG CATCAAAAGC GCGTTTATTT TTACAAATCA AAGGAAAGTT TATGTTAAGA 60
ATTTTAATCC CCTTACTCAT TATTOTOTOG GTTTTATGGC GTTTGTTTTT GAGGCAAAAA 120
CCCCACAAAG ATGACCACAG AGACAACCAC TCTTACACGC AACAAACCCC CAAAGAATTA 180
GAAGATCACA TGATTGTATG CTCTAAATGC CAAACTTATG TCTCTAGTA AGACGCCATT 240
TATAGTGGGG CGGTAGCCTA TTGCAGTGAA ACTTGTTTGA AGGATAAGGG GTAA 294
INFORMATION FOR SEQ ID NO:364: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:364:
ATGAAMA.ATA
AGCCCTTCTG
GTTGGCTTTA
GTAACCGCTG
CAAGGGCATG
ACGAAATTCA
TTAGGGGGTA
GATGGCAAGG
AATTACTTGA
GGGGGGCGTT
AGCGCTAAA
GGTAGGGCGT
AAAAACGGOC
GGGGTTAGCG
GCTGTAGGCT
GCTTATATTT
AAAGCTGGCA
AACTTTGGGG
GGAAACCCTC
AGCCATGTGG
AAATGGCTGT
GCGGCTGTTA
TATTTGGGCG
TCTAAAGCAC
*GCGCGCCTTT
TGCAAGCGTT
ATAAAAAAAA
TGGGTCAGGG
TTTTAGAGGG
ATCAAGGCGG
AAAGGGCCTT
TGATTGATTC
TGAATAACGC
ATCAATCCAJA
TCAAGGATAA
TCGCTTATGG
GCACTCTA
TCAGCCCCTT
ATGATAGTAA
TGCTCCCTGT
CCGCTGGGCA
GAGCGTTTTA
TAGGCATTGA
TAACCGCTGA
GGGGGACTTT
ATGTGGGCTA
TGATGACGCA
TTTATTCAGA
AAAGAATAA.A
TGATTATAA
GATTGACATA
CAATATTTAT
AAAAGTCGGC
ATCGGTTATT
GCTTGATGGC
TATAGCGTGC
TTTTTTAGAA
TGCTCCTTAT
AAATGAGGGG
GGAGTGGATT
TTATGGTATC
TTTCCAGTTT
CCCTAATTTT
CCATGCCCCC
AAGCCTGCTC
TAAAGTATGG
TTTTTGGACC
TGCCGTCTCT
GTGGCGTTGG
rAAGATCAGT rTCAGGCTTT
CAGGAGCCAT
GTTTTTTGTG
ATTGAAGTTT
GCTAGGGGGA
GCGGATTTTT
GGCACAATCG
TATAATTATA
ACGAGTATCC
GGGAACGCTA
TACCGCTATA
ATGAGCTCTT
AGCCATAAAT
TATGATTTTT
CATTTAGTGG
TCGCCCGGGA
AATGGCGTGG
CTTAAAAGGG
ATTAGGCAAC
A.AAAACGCAA
AATAGCGTTT
GGTTGGGTTT
A.CTAGCGGCG
kAGAGTTTGA
%CGGTAGGGA
TAATGAC
C
GGTTATACGT
TGGCAGAGTC
TTTATCCTAC
TATCCAAAAG
GTGGGATTGC
TCGGTTATTG
ATGAGTGCGC
GAGCCAATAA
AOGATATATT
ATACGCAAGG
TATGGTGGTT
ACTCTCCAAG
ATTACACTTA
CTTATTATAG
GCTTTAGATC
ATACTTATCG
GATTTGATTA
ACGCTTACAT
A.TGATATAGG
rTGGTGGAGG
CTTTAGCCAA
CAGCGAGCGT
C
'TTACAGACC
C
TTTAAGTTTA
CTTTTCTAAA
AGAGACTTTT
CCTTAAAGAT
TTATGACAGC
GGATGGCTAT
GCTTGGCTCT
AATCCGCCGT
TGCGGCTA
CTTTGAAATC
TAGCTCATGG
AACCGTGATT
CGAAAGGAAA
CCCTGGGGTG
CGAAACGAAG
rTACGCTGTG
CAATGAATTT
kGGCACGACA
"CAAGCCTTA
'GTGCATAAA
E'GAAGCGAGC
AAATTAGAA
ACGCCCGGC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 INFORMATION FOR SEQ ID NO:365: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:365:
ATGAAAAATA GCGCGCCTTT AAAGAATAAA GTTTTTTGTG CGTTATACGT TTTAAGTTTA 60
AGCGCTTCTG TGCAAGCGTT TGATTATAAA ATTGAAGTTT TGGCAGAGTC CTTTTCTAAA 120
GTTGGCTTT,
GTAACCGCTC
CAAGGGCATG
ACGAAATTCA
TTAGGGGGTA
GATGGCAAGG
AATTACTTGA
GGGGGGCGTT
AGCGCTAAAA
GGTAGGGCGT
AAAAACGGGC
GGGGTTAGCG
GCTGTAGGCT
GCTTATATTT
AA.AGCTGGCA
AACTTTGGGG
GGAAACCCTC
AGCCATGTGG
AAATGGCTGT
GCGGCTGTTA
TATTTGGGCG
TCTAAAGCAC
ATAAAAAAAA GATTGACATA
GCTAGGGGGA
TGGGTCAGGC
TTTTAGAGGC
ATCAAGGCGC
AAAGGGCCT'I
TGATTGATTC
TGAATAACGC
ATCAATCCAA
TCAAGGATAA
TCGCTTATGG
GCACTCTAAA
TCAGCCCCTT
ATGATAGTAA
TGCTCCCTGT
CCGCTGGGCA
GAGCGTTTTA
TAGGCATTGA
TAACCGCTGA
GGGGGACTTT
ATGTGGGCTA
TGATGACGCA
TTTATTCAGA
CAATATTTA'I
AAAAGTCGGC
ATCGGTTATI
GCTTGATGGC
TATAGCGTGC
TTTTTTAGAA
TGCTCCTTAT
AAATGAGGGG
GGAGTGGATT
TTATGGTATC
TTTCCAGTTT
CCCTAATTTT
CCATGCCCCC
AAGCCTGCTC
TAAAGTATGG
TTTTTGGACC
TGCCGTCTCT
GTGGCGTTGG
TAAGATCAGT
TTCAGGCTTT
CAGGAGCCAT
GCGGATTTTT
GGCACAJ\TCG
TATAATTATA
ACGAGTATCC
*GGGAACGCTA
*TACCGCTATA
ATGAGCTCTT
AGCCATAAAT
TATGATTTTT
CATTTAGTGG
TCGCCCGGGA
AATGGCGTGG
CTTAAAAGGG
ATTAGGCAAC
AAAAACGCAA
AATAGCGTTT
GGTTGGGTTT
ACTAGCGGCG
AAGAGTTTGA
ACGGTAGGGA
CTAATGACAA
TTTATCCTAC
TATCCAAAAG
GTGGGATTGC
TCGGTTATTG
ATGAGTGCGC
GAGCCAATAjA
AGGATATATT
ATACGCAAGG
TATGGTGGTT
ACTCTCCAAG
ATTACACTTA
CTTATTATAG
GCTTTAGATC
ATACTTATCG
GATTTGATTA
ACGCTTACAT
ATGATATAGG
TTGGTGGAGG
CTTTAGCCAA
CAGCGAGCGT
GTTACAGACC
CCCTTAGCGC
AGAGACTTTT
CCTTAAAGAT
TTATGACAGC
GGATGGC
TAT
GCTTGGCTCT
AATCCGCCc3T
TGCGGCTAA
CTTTGAAATC
TAGCTCATGG
AACCGTGATT
CGAAAGGAAA
CCCTGGGGTG
CGAAACGAAG
TTACGCTGTG
CAATGAATTT
AGGCACGACA
GCAAGCCTTA
CGTGCATAA.A
TGAAGCGAGC
GAAATTAGA.A
CACGCCCGGC
CAAATTCTAA
180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 INFORMATION FOR SEQ ID NO:366: SEQUENCE CHARACTERISTICS: LENGTH: 516 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...516 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:366:
ATGGGCTTGA AAAATCTCTC AACACTTCTG GTGTTTTTAT TCTTmC-mm
AGCAATTTTA
AGTAGGAAAG
TTAAACGATG
ACGCAAAATA
CCTACAGGAT
TTAGAAACCA
TTAGCGGTGC
AATTTCGCTT
ATGAAGACAC
GCGAAATCAC
TGGATAGCGG
ACGACTGGAT
CAGAACCTTT
CGAACCGGTG
AAGAAGCCAA
ACCAAGTCCC
TTACACGCTA
CCAAGATAAT
CACTTACTAC
AGATGATGGC
ATGGGTGCGA
GAGCCGAGCC
ACTAGAGCTT
CTTACCTCA\
GACTTAGTTT
GTGCCTATCA
GATCATGAGT
TATATTTCTT
GAGATTACAA
TTTTTGATCG
GATGCCTATA
TTTTAA
TAGAAAAA
TCACGGCTAT
ATTTTTTAGT
ATGAACTTTT
GAGATGAATT
CTTTTGACA-A
GTTTAGGCAA
GGGGTGTGTG
GATCCAAGCC
CGCTACGCAT
GGAGATTTTC
TGGCACAAAA
TGATGGCATT
ATTGGATTAT
AATTGTTTTT
120 180 240 300 360 420 480 516 INFORMATION FOR SEQ ID NO:367: SEQUENCE CHARACTERISTICS: LENGTH: 255 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...255 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:367:
GTGAATAAAA AGCACCGCTT GGCTTTTTTA GGGCTAATTG TTGGGGTTCT ATTCTTCTTT 60
AGCGCGTGCC AACACCGCCT GCACATGGGG TATTATTCAG AAGTTACAGG GGATTATTTG 120
TTCAATTATA ATTCCACTAT CGTGGTGGCT TATGACAGAA GCGATGCGAT GACTTCTTAT 180
TATATCAATG TGATTGTTTA TGAATTGCAA AAATTAGGCT TTTATAATGT CTTCACGCAA 240
GCGAATTCCC GCTAG 255
INFORMATION FOR SEQ ID NO:368: SEQUENCE CHARACTERISTICS: LENGTH: 756 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...756 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:368:
ATGCAATATA AGAAAAATAA GAAAAGATAT TATCATTTAG CGTTAGGGAT CCTTTTTTGT 60
AATGGTCTGT CTTTGAAAGC TTTAGAAATT GCCGTCAAAC CCTTTGGCTA TCTGGGGCTA 120
TTGTATAATC AAGGGACGCA AAAAAACCCT CACAGCTATG TGGGGGCTTT AGCGCGTCTT 180
GGGGTGGATT TTTCTTATAG CAATGGGTGG TCGTTTGGTA TTGGGGCTAT TGGGGCTTGG 240
AATATTTATA ACAAACAGCG TTTGGCTAAC CTTTACATCA GTCTAGGGAA TTTTTTTGGT 300
AACCCTAACA ACGTTAAGCC TTATTTGAGC GCCGGCGATG TTTCTGATGC GTATCTTCAA 360
TACGCTAACC
GATTGOATAG
TATTTTGGGA
GGGAATCGGA
TTGTATGTAG
TTCGTTCCTT
CAAGTGGGGG
AGCGTTTTAA
GGGGCAATAT
TTTTTATGGA
TCGCCACTTC
GGGGGGAAGT
TTATTTTAAC
GTAAGTTGGA
AATCGCTTTA
TCAAGGGGTC
TAGCATGCTT
CCTAAACGCT
GTTTGTTTTG
GGACACCCGC
GTATCGACGC
GGGCGTTTTA
TCTGTGGCTT
TATAATGGGC
CTAGCGTCTT
GGCGCAGAAT
TTGCCTTTGC
TTCTTT
ATACCGATTT
TCAAGC~AA
ATCAAATCA
ATGATCCTGT
ACAAkAAATAA
CCACCCAAAA
TGTGGATTTT
TTCCATGCGT
TAAAGAGCAA
GTCTAAACGC
AAATTTGATA
TGTTTTAGTG
INFORMATION FOR SEQ ID NO:369: SEQUENCE CHARACTERISTICS: LENGTH: 696 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...696 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:369:
ATGGTGCAA AAATTGGCAT
TTTAGGGGCG
ATGAGA
TTGTTTGGCG
TATCATAATA
ACCACAACAA
GGAAGCTTAG
CACGATGTGG
TTTATTGAAA
ATCGCGCTCA
AAAGAATTTT
OCGTTTGTGT
GCCGATGAAA
GCGAAATTCT
TGGATTTTGA
AGGAAATCAT
GCATGATTTT
TTAAAGATTT
ATTTGAGCGC
CGAGTGGAAG
AAGAAGGCGT
TAGTTAGCGA
GCCAAAAATT
AAGCCGGTAT
TAAAAAGCAT
AGAGATCCCT
TTAGGG
TGTCGCTTA.T
AGCAAG
AGCGTTTGGC
GTTCAG
AAAAATCAAT
GATTTG
GTTTGATCAC
CCTTTA
TTTAAACGCT
TTAGCT
CATCGCATCA
GGCGAT
GTTTAAAGCG AGCGCCi TGGCGTGCCA
TGCTGC
GAGTTTTGAT
GAATTT'
GTGATOAG CTTTAG
.GAAG
OGGA
ATTG
AAGG
TTAG
GOT
~AA
CACT
GTGG
GTGC
TTAG
AAATAACCCC
ATGTTTTCCA
GCAAGGTGCA
TGCTTTTTAG
TGGCTACTCA
TTATCCCCGA
AGATCGCTAA
TTGTGCATAG
AAATGGAGGG
TATACTAGAA
TAAAGGCGTT
TTCCACTTTA
CGGGGTGGCT
ATTAGTCCAG
AAGCGCGATT
TGAGCAACAT
CAAAGAAAGG
GGCGAGCGTG
120 180 240 300 360 420 480 540 600 660 696 AAAAAAGCGC TCACACTTCA
INFORMATION FOR SEQ ID NO:370: SEQUENCE CHARACTERISTICS: LENGTH: 678 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...678 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:370:
ATGAGAGCTA
GGTTGCTTGA
GCGAGTTCTT
ATTTTGAGCG
ATCACGCATG
TTCATGCAAG
GCGCCCACTT
TCTACTTACA
TCTGGAGTGA
AGTAAAAATG
ACGCAAGCGA
AGCCCGTTAA
CGGCGATAAA
GCATTAATTT
TTGAAATGAC
CGGATTTATT
GGAAGCACCA
AAGCGCAAAJA
ATGCGGTTCG
GGGCGGAATT
TCATTAAGCA
GCAGTCAAGA
TGCAAGAAGC
AAAAATAA
AATCTTTTCA
AAAACAAATG
GCAATOCCCT
CAACACTAAA
AAAATGGATA
AGCATGCTTA
TTTTACGATT
CGCGCTAGGC
TGAAAATATT
TTTCCA\GAA
GATTTCTTTG
TTCTCATCAG CGCTCGCCCT
ATTGCTTCAG
CTGCCAGAAA
AAACCTTTGA
GAGATCGTGT
GACTTGCCTC
GGCGTGGCTT
TTATCGTTTT
TATGATATTA
TCTAGCTTGG
AGCGCGATAC
ATTAAAAG
TCAGAACTTA
CTGAAGTGAG
TTAAAGCGCA
GCAACATGTT
TGCCCCCTTA
CTCTTTTAGA
GCGTGAAAGG
AAAATAAAAC
AATCCCTCCA
CGGTTGAAGC
TGATTTAAAT
GCTCATTAGC
AGACGGGCAG
AAAA-ACCATG
TGGCGCGGGC
AAAAGAAAAT
CGATTCGCAT
GACCAAAACG
ACATGTCAGC
GCAAAGCGTA
120 180 240 300 360 420 480 540 600 660 678 INFORMATION FOR SEQ ID NO:371: SEQUENCE CHARACTERISTICS: LENGTH: 816 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...816 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:371:
TTGGAAAGGC ATGTGAATTA CACTTTAATC
GGCGGGCTCT
ATGGTAGGCT
TATGTGGTCT
AAAGGGATTC
GTCCGTTTGG
GTTTCTTCTA
GAATTTTATG
GATCGCTTGA
GTGAATAGGA
GATGATCTCA
CCTAACAACC
TTATTTTATG
ATACGGACAA
AAGTGGGTAA
ATTTGATGAT
GAGGGTTTAT
GTAGCGGCGA
GCGGTGATGC
TTTTAGACGA
TCGCTAACTT
TTGTTTCTAA
GCTAGGCCAT
AGACTTGGGA
TGTCATCAAA
CAAATCCAGC
GGGGTTAAAA
TAAAGGGGAG
TAATCAAGTG
TGAA.AACGTG
GGATTCAAGA
TGTCATAT
TTGGGATTAG
GGCATTGCGA
GTGGGTTTTG
GTTAAGATCC
TTTTTAGCCT
CGGATTTTAA
GTGCAAGAAG
GAGAAATTCA
TCTTTTTATG
ATGATGGG.
CCAACTCGCC
CAAAGGATA
GTAAAGATTC
TAGAGCAG
TCTTTAAAGA
TGATGAAAGC
AGCACATTCT
CTTAGTGTGC
GTATTATGAA
CATTAATTAC
AGTGGGGGTG
CAAAGTGGCG
CCACAATGA-A
AGGGCTTATG
GATCAGGAAT
CGCTTCAGTG
120 180 240 300 360 420 480 540 600 660 GTGGCTTTGG ATGTGGATAJ\ ACGCGTCAAp.
CAGGGGCAT ACGACTTTA GGCGATGTTC ACTCCTTTAJ\ TCATGCAAGC GCAGTTGAGC 720
TTAAGAAACA TTGATAATTT TGTGGAAAAA GGCTCTGCTT TGATAGATAA ATTTGACGCT 780
AACCCCTATA AAACGATTTT TGGAGAAAGG AAATAA 816
INFORMATION FOR SEQ ID NO:372: SEQUENCE CHARACTERISTICS: LENGTH: 1053 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1053 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:372:
ATGAAAAAAA
GATATGGATT
GCGCTCAAAG
GAAATCATGA
TCCACTTCCT
CTAATCCCAA
ACGGTGTATA
AACGAACAAG
GTGGTAGAAA
TCTAAAGTCA
CTCATTACCC
GCCGAAAAAG
ATCAATCTTG
GTGGGCGATT
ACCGAAACGA
GAAATGGGCT
ACCTTCTTTG
ATGACAGACA
CTATTCTATT
TGATTAAAAA
AATACCAGAT
CCTCCGCTCA
ATCTTGTGTC
GCGCTATAGG
ACTCTGTGTT
CCCAAGGGCC
AAATCAATTC
AAATTGATTT
CAAGCCGTTA
AAGGGCTGGA
GGGGGACGAT
TCAAGGGCGA
TGCCTTATTT
CTATCCAGTT
AAGCCTTAAA
GAGCGCTCTT
AGCGAAGGAA
CAAAAAAACC
AGTGGAATTA
TTGCAACACA
CTCTCAATGG
TAACGATGTG
AATCCAATCT
CATACCAGGC
TAAATTGATC
CGATGATTTT
TTTATTCATT
GCAGCCTTTT
TAAAAACGGG
CCATAACGGG
AGGCATTGAA
GGGTAAAAAA
TTAGCTTTTT
AGCCAATTAG
AGAGATGTGG
GGCAAAATGC
TGCCATAATT
AAGAAAAACC
CAGTTTTGGG
TCTTTTGAAA
TATGTCAAGC
GCTGATAGTA
TTAAGGGGCA
TCTAAAGGCT
GGGGTGGTCA
CTTGTGAAAG
CA.ATTCTGGG
ATCAGCGATG
CCTAA.AATAA
CTTGCGCTCA
AACCCATGCC
GCATTGGCGC
TCTATTTTGA
TGGGCTTAGG
CCCACCTTTT
ATGGTAGGGT
TGGGGGCTGA
TCTTTAGAAA
TCGCTATGTT
ATCCTAAAGC
GTGTGGCGTG
AACCTTATAA
TGCCTACTTT
ATGTTAAGGA
CAGAAGCGAA
TCTATCCAGA
TGCTTTAAGC
TATGGGCAAA
TAAAAACAGC
CCCTAGGATT
CGGGGTGGAT
AAGTTCCCCA
TACGCATTTA
TCCTAAAGTC
AGCTTATGGC
TGAAGCCACG
GCTCAGTAAA
TCATAATGGC
ATTCGCTAAT
AAGGAATATC
TGCGATTAAA
AAAGATTGAA
ACTCCCTGTG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1053 AAACCCCTAA ACCCTCTTTT
TGA
INFORMATION FOR SEQ ID NO:373: SEQUENCE CHARACTERISTICS: LENGTH: 222 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:373:
ATGATGGATT
CTCTTTTTGT
TATGAGCGAT
CGCCATAAAG
TAGAAAGTTT GAGAGGTTTT
GCGTATGCGT
ATGCTTATAT TTTTAGCATG
TATAGAAAGC
ACGGGTATTT AGCGTTAAAT
GATGCTTTAG
AAGTTCATGA TAAAGGCATA
AAGGAAAGTT
TTTTTACCAT
TCTTTTTACG
AAAAAAAGGG
CATCGTGGAT
AAGACGAGTT
GATTGAACCA
GA
120 180 222 INFORMATION FOR SEQ ID NO:374: SEQUENCE CHARACTERISTICS: LENGTH: 828 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...828 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:374:
TTGCTAGATG TCAAAGCCAA
TTCCCCCGCC
GCAGTGGGCT
GAGAACGCGA
ATCACTTGTA
GCGATCATCA
ATCCCAGTTT
ACGGGTGGCG
ACCTCGTGGA
GCTCAAGAGC
AACTTCCAAA
GGGATGTTTA
GTCGCGCAAG
AACCATTCAA
CCAAGCGGAG
TGTGGCAJAGT
ATGGAGGTAT
ATTCGTATTA
ACAAGGCTTA
TAAGCAACAC
AACCAAATAA
ATGCAACCAT
TTTTAAAACA
ATGGTGGTAG
AGAATGAAAT
CCAAAATCGT
CCCCTACACA
ATTTTAXACC
TACAAGCTAT
CCAAACCTTT
TGAGCCAGGA
TCAAATCATT
CACTACAAAA
AAAATTAGTA
AACAGTACCA
AGCGAGCATC
CGGTTATTGG
CAGCGCTATC
TAGTGAAAAC
GACGCTAGTT
AAGCCGAACA
TATCAAGCGG
TGCTTTTAGC
GCCTTTACCG
CTTGTGGTCC
AATAATGTGC
CAGGACAAA
CATGGCGGGC
CAATATCCAC
CAAAAGGCTT
TGACAGCC-
CTTGATTTCA
CTATCAATGG
TACCCATGGA
GTCATGGGAA
ACAACAGAAA
ATATCAATAC
ATTATCACTA
CCCTGAATAG
GCAGGGATAA
GTGGCAATGG
CAAGGCATGA
TCGCTAACGC
ACGCAAAATC
AAAACAGCCT
TTGCTGAAAG
CATGCTCA
AGTGGTGAA AACTTTGA
ATTGAACGCT
TGGTAGCAAT
CACGACGACC
TGAAAATTAT
TGGAGAAGGG
AGACAAAAGA
ACCTATTTCA
AACCAATAGC
TGCATGCCCA
GACAATGTGT
GCAAGAAGCT
AGACGCTGAA
AACGCGAAGC
120 180 240 300 360 420 480 540 600 660 720 780 INFORMATION FOR SEQ ID NO:375: SEQUENCE CHARACTERISTICS: LENGTH: 273 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...273 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:375:
ATGCTGACGA TTGAAACCAG TAAAAAATTT GATAAGGATC TTAAAATTCT TGTTAAAAAT 60
GGGTTTGATT TAAAGCTTTT GTATAAAGTG GTTGGAAATT TAGCCACAGA GCAACCCCTA 120
GAACCCAAAT ACAAAGACCA CCCACTCAAA GGCGCTTTAA AAGACTTTAG GGAATGCCAC 180
CTAAAACCGG ATTTATTGCT TGTCTATCAA ATTAAAAAAC AAGAAAACAC TCTTTTTTTA 240
GTAAGGCTAG GCAGTCATAG CGAGCTGTTT TGA 273
INFORMATION FOR SEQ ID NO:376: SEQUENCE CHARACTERISTICS: LENGTH: 303 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...303 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:376:
ATGCTAGAAA TTGAATTAAA AAAGAAATTC ACTAAGGATT TAAAAAAGCA TATTTTAAAT 60
CAAAAAATTG AGTTAGAAGT TTTTGACTTA GTGGTTGAAA ATTTAAGAAA TCAAATTCCA 120
TTGGACAAAA GATTTAAAGA CCATGCTTTA AGTGGAACAT ACAAAGGCTG TAGAGAGCGC 180
CACATTAAGC CTGATGTTTT GCTTGTGTAT AGAGTGAAAG GCAATGTTTT AACTTTGGTT 240
AGGCTTGGCA GTCATAGCGA GCTGTTTTGT AAACCGCCCA CACCACTCAT AACGCTTAAA 300
TGA 303
INFORMATION FOR SEQ ID NO:377: SEQUENCE CHARACTERISTICS: LENGTH: 270 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:377:
TTGGTGGTTA GTGGCTCGTT GCAAAACATT GTTCGTTCTT TTCTTCATTA TTGATTTCAT CAATTTTATT TTCGTTATTG GGTGACTTTT TCATCTTCTT CTTCATGCTC AATCTTTTTT GATTTGATTG TTTTGAGTGG TTTTAGTGTT GTTTTGGTGC CTCATCTTTG CAGGCGCTCA ACACGCTTAA TTTTGCGAAG GATCAGCGTT ATCAGATCAT TCACTTCTTT ATATCCTTAT TAGGTTGATT GATTGGGACG ATTTTTTAGG 120 180 240 270
INFORMATION FOR SEQ ID NO:378: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:378:
ATGAAAAAAT CCCTTTGTTT GTCTTTCTTT CTGACCTTCT CTAACCCTCT TCAAGCCCTT 60
GTCATTGAGC TTTTAGAAGA GATTAAAACT TCGCCGCATA AAGGCACTTT TAAGGCTAAA 120
GTCCTTGATT CTAAAGAACC AAGACAAGTT TTAGGCGTTT ATAATATCTC CCCACACAAA 180
AAACTCACGC TCACTATCAC TCACATATCC ACGGCAATCG TCTATCAACC CCTTGATGAA 240
AAACTTTCTT TAGAAACGAC CTTAAGCCCT AACCGCCCTA CTATCCCTAG AAACACCCAA 300
ATCGTTTTTT
GCGCCCATGC
TCTTACCCAG
GTAACTCCTA
CCCCCTTTAA
GAAAAAACGC
GAAAATAGGG
TGCGGGAAGT
CGCGTTGATA
GAAAATAAAA
CCTTTAGAAG
AGCTCTACAG
TATCTGATAG
TTAGTGAAAG
GAAACCAGCG
AATTTGAATG
ATAATCAAAG
GGGGTGTGCA
ATTGAA'AACT
CTTCAAAAGA
AAAAACCACA
AGTCCAAACT
GCAAAGTAAG
AGCATTCTTC
TCCCTAACAA
ATAATGTGGA
GGGTTTATGA
AAGACAALAGA
GCGGTAAAAT
ACCCTCAA.AC
AAAAGTGTAA
AAGAGCCTTT
CCATATATGA
AATTGGCTTA
AAAAATTCAT
AGAGCAGCGA
TGGCCTTAGA
CTCGTGTTGT
ATTGAAAGAA
AAATAAACCC
AGGCTCTAA
CCCCACTAAC
ACAAGATCAA~
CACCTCTAGT
AAAAcAAGCG
TGATGAAAAT
GATTACAACA
CATTACCCCC
TTTTGAAGCT
AAGGGCTAGA
AAAACAAGCG
ACGCCCCAAA
TTCTTCCACA
GGAATTTGTG
ATACAAAGAA
AATAGAAGAG
GTGTGTCAA.A
CCGCACTCAA
AGCTCATCGC
AACTCTAAAA
GAAGTTAAAA
GAAAACAACC
GCTGATGCGA
ATTAGAGATC
TTACAAGCCT
GACATTACCC
TATACTAAAA
AAAAATAATT
GCGAGAAAAG
TGGGAGAGCG
CAAGACGATC
CGAAAAAGCG
GAAGTGTATG
TGGGTTAAAA
CAACCACGAG
AAGGGGAATT
ACCCAATACC
AACAATCTCC
ACAGCCTTTT
CGCCAACAAA
TCTTTGTAGC
GTGAAAACAA
CTAATATCAA
ATCGCCCAAG
CTTGCGATTA
TCTCTGTTCA.
TCGCCATTCT
ACGGCACGAC
AGTATGAAAT
ALAGTAGAGCC
AAATAACGCG
AGGGGCATTA
ACCATGTGCG
CCAAAAGCAC
ATTTATTCAA
TTCTTTAAAC
TCAAAACTTT
ACAGCCTTTA
CGACGCTAAT
GCCACCCACT
TGAAAGCAAC
AGAATTCGCA
CATTTTAA
CAGCACCGCT
TAAAACAGAG
CCAAGCCAGA
CAGGCAATGC
CACCACGCAA
GACTTTTTAT
TAATGAATTG
TTTAAACGAT
CTTTAAAGAA
GCCTTTGAGT
TGAAGTTTA.A
360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 INFORMATION FOR SEQ ID NO:379: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:379:
ATGAAAAAAT
GTCATTGAGC
GTCCTTGATT
AAACTCACGC
AAACTTTCTT
ATCGTTTTTT
GCGCCCATGC
TCTTACCCAG
GTAACTCCTA
CCCCCTTTAA
GAAAAAACGC
GAAAATAGGG
TGCGGGAAGT
CGCGTTGATA
CCCTTTGTTT
TTTTAGAAGA
CTAAAGAACC
TCACTATCAC
TAGAAACGAC
CTTCAAAAGA
AAAAACCACA
AGTCCAAACT
GCAAAGTAAG
AGCATTCTTC
TCCCTAACA.A
ATAATGTGGA
GGGTTTATGA
AAGACAAAGA
GTCTTTCTTT
GATTAAAACT
AAGACAAGTT
TCACATATCC
CTTAAGCCCT
ATTGAAAGAA
AAATAAACCC
AGGCTCTAAA
CCCCACTAAC
ACAAGATCAA
CACCTCTAGT
AAAACAAGCG
TGATGAAAAT
GATTACAACA
CTGACCTTCT
TCGCCGCATA
TTAGGCGTTT
ACGGCAATCG
AACCGCCCTA
CCGCACTCAA
AGCTCATCGC
AACTCTAAAA.
GAAGTTAAAA.
GAAAACAACC
GCTGATGCGA
ATTAGAGATC
TTACAAGCCT
GACATTACCC
CTAACCCTCT
AAGGCACTTT
ATA.ATATCTC
TCTATCAACC
CTATCCCTAG
ACCCAATACC
AACAATCTCC
ACAGCCTTTT
CGCCAACAAA
TCTTTGTAGC
GTGAAAACAA
CTAATATCAA
ATCGCCCAAG
CTTGCGATTA
TCAAGCCCTT
TAAGGCTAAA
CCCACACAAA
CCTTGATGAA
AAACACCCAA
TTCTTTAAAC
TCAAAACTTT
ACAGCCTTTA
CGACGCTAAT
GCCACCCACT
TGAAAGCAAC
AGAATTCGCA
CATTTTAAAA
CAGCACCGCT
120 180 240 300 360 420 480 540 600 660 720 780 840
GAAAATAAAA
CCTTTAGAJ\G
AGCTCTACAG
TATCTGATAG
TTAGTGAAAG
GAAACCAGCG
AATTTGAATG
ATAATCAAAG
GGGGTGTGCA
ATTGAAAACT
GCGGTAAAAT
ACCCTCAAAC
AAAAGTGTAJA
AAGAGCCTTT
CCATATATGA
AATTGGCTTA
AAAAATTCAT
AGAGCAGCGA
TGGCCTTAGA
CTCGTGTTGT
CATTACCCCO
TTTTGAACCT
AAGGGCTAGA
AAAACAAOCG
ACGCCCCAAA
TTCTTCCACA
GGAATTTGTG
ATACAAAGAJA
AATAGAAGAG
GTGTGTCAAA
TATACTAAAA
AAAAATAATT
GCGAGAAAAG
TGGGAGAGCG
CAAGACGATC
CGAAAAAGCG
GAAGTGTATG
TGGGTTAAAAJ
CAACCACGAG
AAGGGGAATT
TCTCTGTTCA
TCGCCATTCT
ACGGCACGAC
AGTATGAAAT
AAGTAGAGCC
AAATAACGCG
AGGGGCATTA
ACCATGTGCG
CCAAAAGCAC
ATTTATTCAjA
TAAAACAGAG
CCAAGCCAGA
CAGGCAATGC
CACCACGCAA
GACTTTTTAT
TAATGAATTG
TTTAAACGAT
CTTTAAAGAA
GCCTTTGAGT
TGAAGTTTAA
900 960 1020 1080 1140 1200 1260 1320 1380 1440 INFORMATION FOR SEQ ID NO:380: SEQUENCE CHARACTERISTICS: LENGTH: 903 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...903 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:380:
ATGAGAAAAA
GCGAGTAACG
ATGAAAGGCG
ATCCCCCCGT
CCAAAAGGCT
GTGGTATTGA
TTGGTAGCGC
TTGAAAGGTT
GCGCGCTTGG
GTTGTTGp.A
CCGATTTTGG
CAAGCAGGGA
CAATACGAAG
TTGGTCTATT
CAAAAACATC
TAA
CGATTTCAGC
CTTTGATTTT
TCGCTTTCAG
ATAACATTTG
CGGTATTTGT
AAACTAAAAA
AAACTTTGGG
CTGAAAAATC
CTTCTGGGGC
TTCCTTACCA
ATATTCAATA
TCAAACGAA
GGAAAATGCC
TAAACAGCTT
AAATTAAATC
GTTGTTTTTA TCAGCGTGTA
TAGGOTTATC
ACAAACAGAT
CGTTGATTCT
GGAAGGCGCT
GAGCGTAGTG
CGGCCAGTAT
GATTGATAGC
CTATACTTTC
GATCACATTC
AAAAGCGAAA
TGGCAATGTT
CGATACAGTG
GTATGTCGCG
GTTGAATGTT
TGGTGCTGAC
TTTAGCCTAA
AATCTTAAAA
TACCGCTTGT
GATCCGGGCG
TTCGTCTCTC
GTGCGTGAAA
CATGGCCGTG
GAGCACGTCG
GCCACAAAAG
TGGAGCAATA
TGTGTAACTA
AGCTTTGGCG
TCCGTGGCGT
TGGAATATTG
AAGATGGGGC
TCTTTGATTT
ATCAGACCGC
TAGGCACTAA
CAGATAACGG
TTGATGAAAJA
ATGTGTATGC
GGCCAGAGCT
GGGGAGTGAA
TCAGCGATAA
TTTTTAAAAA
ATGTGCCAGA
TGAATATGGA
ATATAAAGA
GTCTGTTCAT
CGTCTCGGCG
AACGCACGAA
TAGTTATTGG
TCGTAAATCG
CACGCTGACT
AGCCAACCGC
TTACACTGGG
TCCTATAAAA
AGGCAATATC
ACTGCTCAAT
TTCCAAGAAA
AGGCCAGCCG
TAATTTCGCG
GTGCGCTAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 INFORMATION FOR SEQ ID NO:381: SEQUENCE CHARACTERISTICS: LENGTH: 471 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...471 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:381:
TTGACTAJ
GGGGTGTTTA
GCTTATTTAG
TGCGAAAGGT
AACTCTCCTA
TCTCTTACCC
CAAAAAATCA
ACGCGCTTGA
AATTCATGTC
TTGGATGGTG
TCTTCTTCAC
TAGCATGTCG
AGCAACGCCC
AAATATCGAG
TTTTTAAAAT
CGCATGCGTT
TTATGGATTT
TAAAAATTTA
TTTCTATCCA
TTCTCAAATC
GCCAAATCCC
CTTAAAAAAC
ATTGCTCTTT
AACATTCAAC
GTTATCGGGG CTTTAATTTG
CGTGCTTTTA
GTTAAAAAAT CTTTAACCGC
TTATCTTAAC
GGCATGGGGA TTATAGGAGT
TCCTTTTAAA
TCTAAAGAAC TTCGTTTTTT
AGATCCTCAJA
AAGATCAAGC TCCATTCTTT
AGATAAAAGC
CAATCCCCTA TTTTAGAACA
ATCCATTCAG
TTGAATGCCT TATTGGAAA
ATTCAAACCC
GCTTTAGATG AAAAAACCTA
A
120 180 240 300 360 420 471 INFORMATION FOR SEQ ID NO:382: SEQUENCE CHARACTERISTICS: LENGTH: 858 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...858 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:382:
ATGCAAGATT
GGTAAGGCTC
ATCAGCACTC
GAATTACTGG
GGGGCGAGCA
AATATTTTTG
TTCACCACCA
GCGATGGTGT
TTATTAAGAT
CAAGCGTAGG
CTTATGCAAG
CTCCGGTAGA
AGGAAGAAAT
GTGCGATCGC
CGAACGCTGA
TTTCTTTTA
TTTTATTCAA
ATTAGAAAA
GGTTAAGATT
TTTAGTTACC
GGATAATGAC
TACAAGCTTG
AATCGCTAAA
AATGGAAGCC
GAGGTTGTCT
GAAGTTTCTA
AGCGCGATTG
GCTTTGAGCG
GATTTAGACG
AAGTCTCAAG
GAGCTTCCTA
ATCAAAGAAA
CTACTTTAGA
ACAATGAAGA
AAAAAAATGA
ATTTGATGCT
CTTTTAAAGA
AATTGCTCCC
AAAAAGAAGA
GCCAAATCGT
AGGGTTAGTG
AGCGAGTTTA
AAGTCCTATT
AGGAGGTGAG
AATGGCTTCT
TAAACTCAAT
TTACGCTAAG
TTTATTGATC
120 180 240 300 360 420 480
ACTTCAGCGT
TTGAGGGCCA
AAAAGCGCTA
CTGAAGAGAC
AATATCAGCA
TGCTTTTAGA
ATGATTTTA
AAGATGTGGT
GTGAATGACC
CTTTAGAAT
ATCGTGGATG
GGAATTTTGG
GAGCAACTGA
AAAATTAA
ATTTGAAAAA
TAAAACCCAC
CGTGAAATTG
CTCTATGGAT
CCTCGTAGAT
CATTCAA-ATC
ACGCATAAAG
GACGCGTCTT
AACGTTAAGG
ATAGGGAC
GACAAGGTGA
ACGGATATTG
AAGAAAAAGA
TAGAAAATAT
TGCGTATCGG
TGGTGGAGTT
TCGCTAXGGG
GCACTAAAAA~
AGAAACGACA
AGAAATCCGC
TCAAAAAAAG
GGATCAATTG
CGAAGTGGTG
GGAACGATTA
540 600 660 720 780 840 858 INFORMATION FOR SEQ ID NO:383: SEQUENCE CHARACTERISTICS: LENGTH: 990 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...990 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:383:
ATGACGACTA
GATACATGTT
AACAAAGTGG
CA.AGGCGTGG
AAAGGCTATA
ATCAAAGCGA
TTCACGCAAA
GAAAGCATCG
ACTTATCATT
TCCATGAAAA
TGGTTTGAAA
ATGCCCCTGA
GATGGGGCTT
ACCGATAACA
AGCGTGAGCT
TTTGTGAAAA
AACATTAATA
AAAGAGTGAA
TTCTTTTCTT
TGGTTGTCCC
ATATTAACGC
TTGACATGGG
AAACGGCGCA
TTTTGAGCGA
CTCCACGATT
TACCTTTAGG
AACACGA.AGC
AAATCATTCT
TTGCGAGCGT
TGAATTATCA
CCCCCTACAA
TAGAAGCGGT
TGCCGGATAA
TTTCTAATA.
TACTGCCACA
CATCAGTATT
GCAAGGTTCG
TTTAGATTTG
CGATGGGGCT
AAAAAGCGTT
GACTTACCAA
AAACGGCGCT
GGAGGACGCT
CTTAAGCAAA
CGCTTCCATT
GATTTTTAAC
GGA.ATTTTCA
TACCTATAAA
TAGAGCCGTG
AAAACATGCT
TCATTTTTA-A
AACAAGATAA
CTTTTTTACC
CTCAAAAAAG
CTTTTGTTGC
TTAAGGAAGG
ACTTTAATCC
CTAGAAACAA
GTGATAGAAG
TTTAAAATCA
CAATGGCTTG
GTGCAAAAAG
CGCTTGAAAA
CACGCTAAAG
TTTA.AGGGCT
GTTTTCCCTA.
TTCAGCGCAA
TGACATTAAA
TAAGTATACC
TGTTTTTTTC
GCCTAATGGG
GGGATTTTTT
CTGGAGAAAC
GCGATCTCAA
ATGGGGTGAT
TGCAAACTTT
GATACTACCA
AAGCCGCTAA
AAGGCATGCC
TAACCAAAGA
TGCCTAAAAA
AAAAAACGGA
CTTATAAGGA
CACTTTCTTG
AATTTATCCT
TTTGAAAGAG
CATGCCTAAA
AGTCCGTTTG
CCGCTATTTT
TGAAGCTTAT
ATGGCCAGAC
GATTGGTCAA
TAAAGAAGAG
TGTTGAAGAjA
TTTACAAATG
GCGCATTAAA
TCCTGTAGGG
TTTCTTGTAT
GCATTTAAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 990 INFORMATION FOR SEQ ID NO:384: SEQUENCE CHARACTERISTICS: LENGTH: 441 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...441 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:384:
ATGAGATTGT
CCTTTAAGCG
CAACCCGATT
TTTGAAACGC
GTGTTAAAAA
AATGCCTTAG
GCGGGATTAG
TACCCAAAGG
TGTTCTTGTT
ATGACGCCCC
CAAACGCTCC
CTTTTAATA.A
ACGCTCAATT
AAATGTTTTC
AAATCCAAGC
TTTTCATTTA
ACTGAGCGCT
CATTAAATTG
GGCA.ACGCCA
AACGCCTAAA
GGATTCTAAA
CTACCAAAAT
TTCAAGCAGC
ACTTTGATGT
GTTCATTGGC
CCTATAAAGG
ATCATGGAGG
AAAACGATGG
GACATCTACC
AAGGATAAA
TACTGGCTGA
AAAATGCGCT
CCGTGCAAAC
TTGAAGGGCA
ATTTTAA3AGA
TCTTGTCTAA
AGCAACTCGC
AGAAAAAATA
AAAAGAAGTC
CACGCTCACT
AAAAGTCATC
AGCCTCTTTG
AAAAGCTAAA
TTTCTTTTTT
120 180 240 300 360 420 441 INFORMATION FOR SEQ ID NO:385: SEQUENCE CHARACTERISTICS: LENGTH: 1983 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1983 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:385:
ATGCTAAAAA
GCCGTTCTTG
TATCGCCCCA
TATGATA.AGG
AGCCTTCTAG
GTCATGCGCG
CTAACCCAAC
CTCAAAGAAG
GAGCGTTATT
TTAGGGTATT
AGATTTTTTA.
TCGCTCAAGT
GTGTCGCTTC
AATTTCGTTT
CGGTAGAAGA
CTATGATTAA
AACTCGTTA.A
CTATCATCTC
TGAACCAAAC
TTAAAAAACC
TGGTTTTATC
TTGGGTAACT
ACAGATTTTA
TTA.TGCGCGT
CACCCTCTTT
AAACGCTAAA
AAACATGGTG
CATACGCATT
TTTTTTTGGG
CCTTGACAAA
GTTTTATTTT
ACGGATAAGG
GACAGAAAAG
TTTGAAGAAA
TTTGAGCATG
AGTGGTCGTT
CTCACACGGG
GAAAAAGTCT
CATGGGTATT
CTCACGCTTA
TGATTATCGT AGGGTTGTTG
ATATTGCTAA
GGCGTTTGAT
TCCCCCCACG
GGGGGATCAA
ACACTGAAGG
AAAA.AACCCT
TAAGCAAAGA
ATGGCGTGA
AAGAAATCAC
AATTAAAGAT
CGCTAATATT
ATTTGTTGAA
TTTAGACGCT
GGGTAGCACT
AACCAGAAAA
AGAAATTTTA
AACCGCAAGT
CATGTTAGTC
120 180 240 300 360 420 480 540 600
GCCTTACCTA
AGGGCTAJ\TG
AAATCCGCTC
CCCTATGTCG
GGCTATACCA
CGTTTTGGGC
TCTAATGATA
GGTAAGATTT
ACGCAAGCCA
GATAATGGCT
AATTATAGTA
TTTTTAGGGC
AATTTAAGCG
AAAAACCTCC
GCGGCTGAAA
GAA.AGCATCA
ATCACTTCCA
GGCACAGGGA
AACAACAATA
TTTGGGAGAG
CCTGTGTATT
TTTGATGTCC
CCTAATTCCA
TAA
GGGCTCCAAG
ATATTTTAAG
TCAATGAAGT
TGGATGAAGT
TAAAACTCAC
ATCA.AAAAAT
AAGATGAAGA
TAGCCTTAGT
AACGGCAGTT
ATTCCACCAC
AAAACAGCGT
TTGTAACCTT
ATCAGCTTGG
CTAAAGATTT
AGTATTCTCT
CCAACCAACA
AAGAACAAGC
GTTTGGCTCG
TTGACGCTTG
ACGATAACAC
CGTATTTCAT
CCAAAGGCTT
TCACCCCCAC
TTTTTATGAC
GCGGTTGTAT
GCCAATCGTC
GTTGAAGCAA
GATAGATTTG
CTTAGAAJApA CAACTTAAAjC
GGGGGGGATT
TGGGAGCGCG
TTCCAJAj\TC
GCAAAACCAC
GCAAGAAGCC
CTTTGAAAA
GTCTATTGTG
ATTTTCTAAT
AAACGAAGTC
TTTTTTAACC
CATTAAAGGT
GTTCATTGGC
GCCTATTGGC
GCGCAATATC
GCGTAAAGAA
CCCCAAAAAA
CC TAOCCAAAA
TCTTTACGCT
TATAACCAAA
TTGGATCAAT
GATTACCAJAC
ATCGCTAAAG
GCCAGCATGA
GATTATAAAA
ATCAAGCCTT
CCTGATACCG
GCATGGCACC
TTGAGCCATT
ATTTATCAAT
TTAGGGAGCT
TACGGCACCA
AAAACTTTCA
CTTTCAGCGC
TTAGAAATTG
TTTACCCCCA
AAAGGAGCGA
TTAGCGATTG
ATCGTGGATA
ACAGACGATA
ATTTAGAATT
GGATTTCTTC
CTTCCACGCA
TAGACGGGTT
GCTTAGCGTT
AGAAGCCAAA
TAGTTACAGA
AAAGCGCTTT
TTGTGTATCA
CGCGAAATTT
CTAGCAATTA
CGTTAXATCT
CTTTAAGCGA
TTGCTATCTC
TGCTCAA.ACc
CGCCCATTGA
TGATGGATGC
CCGGTAAAAC
CCTTACAAAG
CAGGAGGCGT
AACCTTCTTT
AAATCCCCTA
GCGAAGAACG
TTCACTCTCT
TAACGAGCTC
AAATATCGCT
AAAAACTCAA
GGAGTCTTTG
AACTAACGCT
AACGAGCACC
CAATCGCGCC
AATCGCTTTT
TGAAAATGGC
TACTCGCAAA
GGCTACGATT
CATGGGGTTT
ACCGATTGAT
CATGCTCATT
AACCAAAAAG
GGTAGAAAAC
CGGGACTTCT
CGTGATCTGG
TGTGAGCGCG
AAAAAGAAAG
CTACTCAAGC
CTTGTTGTTC
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 1983
INFORMATION FOR SEQ ID NO:386:
SEQUENCE CHARACTERISTICS:
LENGTH: 342 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...342
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:386:
ATGGCTAAAA
GGTTGTGATA
GAAAGGGTGC
CAATGCGAGT
TTCGCTAAAG
ATGCTCTTAG
TGAACGCTCC
TTTGCGTGTC
TTGGAAAAGT
TGCACTGCCC
TTTCTAAAGA
AAGAGACTAT
AGATGGGGTT
GGTATGCCCG
GGCCAAAGTA
GGATTTTGCG
AGCCCAAGAA
TTTAGAAGGG
GCCGTTTGGG
TGAATGAAGA
GCTGGAGTTC
TTGGCATGGG
GCCTATCCAG
AGAGCTGTAT
ATTTACGTGG
CTGACAGGAA
AGAAGCGAAA
AGGTTAAGGC
AGAGGCAAAT AA
CAGGTGTAAG
GATTGAAAAA
TGGTTGCGTG
GGATTTCAAA
CAATAAATAC
120 180 240 300 342
INFORMATION FOR SEQ ID NO:387:
SEQUENCE CHARACTERISTICS:
LENGTH: 1020 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1020
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:387:
ATGATTTTCA
AGGCAAGCGG
TTGGAATTGT
TTAGGCGTGG
TTGAATTTGG
AAAAGCGTGG
ATTTCTCAAA
CCTTGGACTT
AAGAAAATGC
GAATTGATAG
GACTCATGGG
AAAAAAATCT
AAAGGGATTG
TGGGGCACGC
AATTTAGAGC
CTAAAAGTCA
TTGCCCAGAG
TTGACGCATG
GGCGTTACCT
GTAAAAATAG
ATGCGGCTAT
AGTTTATCCC
AAAGCCTAAA
CGCGCCAAAA
TAGCGACTTA
TTTATAGCGA
AGTATTTGAG
CTAGCGCTTT
CTAAAGAGCT
GCGCTTATTT
CTTTAACTGC
CCACCCGCCT
TGGGCAATCA
AAAACGCCAA
TTTTAGAAAG
TAGCGAATAC
CGATTTAGCC
TTTGTTTAGC
CAAAAAGGGG
AGTAGGGGTT
GCTTTCTAA
CATGATAGAA
GCCTGAAGTT
CCTTCAAATC
AGAAAAAGAA
TAAAAAACGC
GGATAGCATA
GGCAAAJAG
TTATGATAAA
AGGGCATATT
GTATTTAGTG
GAAACGCCTT ACACACCCAT
CAAGAGAGCC
ACAGAAGTTA
GATATTTTAG
CCGCATTTTT
TATAAACAAC
GAGAAAGCGC
GGCGAGGGGA
TTAAAAGCGC
CAAGCAGGGG
GCGTATTTGG
TATGCGCACA
GATGGGGAT
ATTTTAGGCG
AACGCTTTAG
TTTAATTTAG
CAATTAGTGC
GTAAAAAAGC
CCCTACAGCC
TAGTGCCTTT
TAGAGACGAT
TAAACTATGT
TAATCGGTTT
GCAAATCGTA
TTTTAGAA-
TCAATGCAGT
AATTCAGTTG
TCCCAGTTAT
TTGATGTGTT
GTAAGTATGT
AAGAAGGGGT
GGCATGGGAT
ATGCTAAAAC
TTGGATGATG
GGGGAGCTTC
GGTAGAGATT
GGAAATGGGC
TACGGATTTA
CTATGATACG
TTGCGGATCG
TGCCAAAAGC
GTTAAGCCTT
GATGATCTTT
GGATTATTTG
CCTTTTCCCT
TGGCGTGGAT
TTTGCAAGGG
TGAAAAGATT
GTTGCCGGAT
CAGGCGATAG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020
INFORMATION FOR SEQ ID NO:388:
SEQUENCE CHARACTERISTICS:
LENGTH: 579 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...579
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:388:
GTGTTAGAAA
ATGTTGCAGG
CCTTATTGGA
ATTGTAGATG
AGAGTGGCGC
ATCAAGCAAA
TTGATGAATO
TTOGTGCOCG
TTAGACGATC
ATCAATTACG
AATCTTTTTT
CTTGCACTTG
TGTTGCAAAA
GTAAGGCAAC
AAAATATCGT
AGATCACTAA
TGGATCGTTT
CGCGTTCCTT
AAGCGGTGAG
GAGACGTTAA
AAAAAGCAAG
CCCAAACACT
TCGCAGCCAG
TGAAGAGATA
GCATAAACTT
TGAAATGTTT
AGGGATTTAT
TGATAAGGAC
TATCCTTGTT
AGTCCCTATA
CAATTGGTTT
TCACAAAGAA
TATCTCACGC
GAGAAAATCG
AAAGAAGCTT
ATCCAAATGA
ATCAATCCTA
GCTTTGAOCG
TCTAAAGTGG
GCCATGTAG
TATOCGGGTT
ATTCTTTTTT
AAGGGGTGGA
CTACCAAAAG
ATCTTTCTA
CAAAGCCCAT
ACAATGAAGA
AAGGGTTGCA
AAGAAATCTT
GGGTGTTTTA
ACAAGATGTG
TAOCTCGCAC
AGCGACAJATA
ATCCAACCGC
TTTTGACAGC
AOTGTTTGCG
TAAAATGTCT
TAAAGATTCT
120 180 240 300 360 420 480 540 579
INFORMATION FOR SEQ ID NO:389:
SEQUENCE CHARACTERISTICS:
LENGTH: 330 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...330
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:389:
ATOCTTAGAA
GCTCAAGAAA
TACAACAATA
CGTGATATTT
AAAATCTATG
CAAAATGGTA
ATCAATTTCG
ATACCCACAC
AAATGTATAT
TTAGGACTAG
TTAGGGGGAT
ACATTTTCCA
TATCGTGTTT
TTTGGGTAAG
TGACAGGAAA
GGCGGATGTG
TGAGAGCCGT
CCATGACGCT
GTCTCTTGTA
GTAACCACTA
GAGCTCCAAC
AATGTGGCCA
CTCTTAAGGG
TTGTCGCTAG
AGGGTGAAAG
AACGCCAAAG
GTGGGGGCTT
TAACGATAGA
CAGTTTGCAA
GACTTTTGAA
CAACCAAATC
AATGGCGCAA
TGGCGTCGCT
120 180 240 300 330
INFORMATION FOR SEQ ID NO:390:
SEQUENCE CHARACTERISTICS:
LENGTH: 270 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...270
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:390:
GTGATGAAAA
GCGCAGGCCA
AAAATCATCA
TCTGGGCAAT
TTAAGGCCTT
AAATCGTTGT GAGTTTATGC GTGGCGTTAG ATAAAGCGAT CAGTGATGCG GATTTGATTG GCGCACAAAA CACGGAGATT AACCAATTAA TAGGGGACAT GCGTAAGGAT ATATTAAGCA ATATTTATAA TTGGCGCTAG GTTTTTTAAG CGCGGATCCA AAGAGATAAG GGACTTGAAA GAAAAGTGCA AGAAGTCTTA CTAGAGATTA CTGTATCAGC
120 180 240 270
INFORMATION FOR SEQ ID NO:391:
SEQUENCE CHARACTERISTICS:
LENGTH: 477 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...477
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:391:
TTGAATAAAT
AAAGCTCCTG
TTTTCGGCTA
ATAGATAAGG
TTAGTGGGCA
AGTTTTATCT
AAAGAAGTTA
GGATTGAGCA
TTGGTTGGCG
GAACGATAGG
ACACTTTGTT
AAGAAGAAGA
TGTGGCTTGC
TTTTTAGGAT
AAGGGGGCTT
CGTTATTAGC
TGCGTGTTTT
GAGTTTAGTG
TTTAGGGGCG
GACTAAGAGG
GATGGCGATT
CTATGATATT
AGGGGTTGTG
CATCAATATT
CTAACCCTTT
GCGTTGTTAC
ATTTTTGTTG
CATGACAGCT
AGCGGATTAT
ACTAAACCCT
GCTGATGACG
TTAGGATTTT
TTTTTAGCGG
TGGGCTTACC
GGCTTATCGC
CTTACATTGT
CGTTAGTGGG
CACTCATTGG
CTTTAGCCGG
TTAACATTAA
GTATTCTAAA
TATTTTAATT
TATTACTCAA
GATAGACGAA
TGTGATCTTG
CAAAATAGAC
GGTTTTAGCC
GTTTTAG
120 180 240 300 360 420 477
INFORMATION FOR SEQ ID NO:392:
SEQUENCE CHARACTERISTICS:
LENGTH: 402 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...402
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:392:
ATGGAGTTAA
CATTCAAACG
TTTGAGATTA
AAAACCAAAT
AGAGATTTTT
GGTGATGTTA
GAGTTGGAAG
TTAAGAAATT
AGCTTTTTAA
TGTTTAAAGC
TTGATGGCGA
TTAATGGGAT
AATGCGAAGA
AAACGATAAA
GGAAAAAGAA
AATGTTGATT
ATGGGTTGAA
AATGATTGGC
TTTTAAATCC
TTTTAATGCC
TCCTAATAAA
AGCGAAGTTT
ATTGATAATG
ATCGTAAAAA
TACACAGAAG
AAAGTAATAC
CTAAGAAGTT
ATCCCATTTT
TAAAGAAAGA
AAGATTTGTT
TGATGTTTGA
AACTTTTAAC
CTAAAATGCC
TAGTTTATCT
AA
TTTACAACAA
TAAAGAGCAA
ATTAACCAAA
CTTTTTAGTT
TATTTTTTGC
TTCTGTGCTT
120 180 240 300 360 402
INFORMATION FOR SEQ ID NO:393:
SEQUENCE CHARACTERISTICS:
LENGTH: 402 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...402
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:393:
ATGGAGTTAA
TTAAGAAATT
CATTCAAACG
AGCTTTTTAA
TTTGAGATTA
TGTTTAAAGC
AAAACCAAAT
TTGATGGCGA
AGAGATTTTT
TTAATGGGAT
GGTGATGTTA
AATGCGAAGA
GAGTTGGAAG
AAACGATAAA
GGAAAAAGAA
AATGTTGATT
ATGGGTTGAA
AATGATTGGC
TTTTAAATCC
TTTTAATGCC
TCCTAATAAA
AGCGAAGTTT
TAAAGAAAGA
ATTGATAATG
AAGATTTGTT
ATCGTAAAAA
TGATGTTTGA
TACACAGAAG
AACTTTTAAC
AAAGTAATAC
CTAAAATGCC
CTAAGAAGTT
TAGTTTATCT
ATCCCATTTT AA
TTTACAACAA
TAAAGAGCAA
ATTAACCAAA
CTTTTTAGTT
TATTTTTTGC
TTCTGTGCTT
120 180 240 300 360 402
INFORMATION FOR SEQ ID NO:394:
SEQUENCE CHARACTERISTICS:
LENGTH: 378 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...378
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:394:
ATGAAAAAGT TAGCCGCTTT ATTTTTAGTA AGCGCGTTGG GGGTTATGAG TTTGAACGCA
TGGGAGCAAA CCCTAAAAGC TAATGATTTG GAAGTGAAAA TCAAATCCGT GGGCAACCCC 120
ATTAAAGGCG ATAACACTTT TGTGCTTAGC CCCACTTTAA AAGGTAAGGC TTTAGAAAAA 180
GCTATCGTTA GGGTGCAGTT TATGATGCCT GAAATGCCCG GCATGCCAGC GATGAAAGAA 240
ATGGCGCAAG TGAGTGAAAA AAACGGCCTT TATGAAGCTA AAACCAATCT TTCCATGAAC 300
GGGACATGGC AGGTTAGGGT GGATATTAAA TCCAAAGAAG GCCAAGTTTA TCGCACTAAA 360
ACAAGCCTGG ATTTATGA 378
INFORMATION FOR SEQ ID NO:395:
SEQUENCE CHARACTERISTICS:
LENGTH: 1026 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1026
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:395:
ATGAAACGGC TTTTATTGTT AGCCTTGGCC CTCTTTTTTA GCCTCTCATG CACTAACGCT
CAAGAAATTA AAGAAACTCA AGAGACTAAA AAAACTAAAG AAACTAAAAG CCAAACCCGT 120
TTTAACATTT CCACCACTAA GGTCATAGAA AAAGAATTTT CTCAAAGCCG GCGCTATTAC 180
GCGCTTTTAG AGCCTAATGA AGCGCTGATT TTTTCTCAAA CCCTGCGTTT TGATGGCTAT 240
GTGGAAAAGC TTTATGCGAA TAAAACCTAT ACCCCCATTA AAAAGGGCGA TAGGCTATTG 300
AGCGTGTATIT
AACCAGCAAG
ATTGAAAAAA
AAC-GGCGTTA
CAAGAGCTTT
GAGGATTTAG
GGCAAGCAAG
CTAGAAGCGC
CAAGTAGAAA
ATTA.AGGGGG
ATTAAAGCCG
GAAGAAGTCG
TATTGA
CCCCTGAATT
TGGGAGCGAT
TCATCAGCAG
T'TTTTAAAAA
TCAAA.ATCAT
AATTTTTAAA
CAATCACGCT
CCTTCAATGT
TCTTTCACAA.
GGAAAGCTAT
TCCGCTTGAG
CTAATAACGC
AGCGGGCGTT
TAAAGAAAAA
CCATAAAGTC
AAGCCCGGAT
AGATTTAAGC
AAACACGCAT
TGAAAACATC
GCCTAATCTT
ACCCCAAAA-A
CGTGTTTAAA
CGATGGGAGT
TTTATTCGTG
CAAAGCGAC
TTAAAACTAC
CAAAATGAAA
CTCAATGAGG
CGATTGTGGG
CAAGCGATTT
AACCCCATCA
AAATTGCTTT
ATGAAGATCT
AAAGATGATT
TATGAAATTT
CTAGACGCTG
TGTTATCGTC
TAGGGCTAGA
TCACTATTTA
GGAGTTTCAT
CGCTTGTTAA
TGTTCGTAGA
TAAATGCGCA
ATTACCCTAA
TGCCTAAAGA
TTGGCTTAAG
TAGAGGGTTT
ACGCTCAAAA
ATTGAAATTC
AAACTTTAGC
CTCTCGTTTC
TAAAAAGGG
AGTCAATCAA
GGGGGTTAAA
AGATAAATG
CATGTTCGCT
AGCGGTTTTG
CCCGTTAGAA
AAAAGCGGGC
CALATGGGGAT
360 420 480 540 600 660 720 780 840 900 960 1020 1026
INFORMATION FOR SEQ ID NO:396:
SEQUENCE CHARACTERISTICS:
LENGTH: 363 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...363
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:396:
ATGGGCGTAG
TTTGGTTTAG
GCGAGCATCA
CAAAGAGCGT
ACCTTTAATT
GATGTTCTTG
TAA
CGGTTGTTTT
CTAGCCCCAA
GCGTTTATAC
TTTTAAGGGG
TAGTGAGCGG
TGGATTTGGA
ATTTTTAACG
ACAAAAGATT
TTATAAACAA
GGAAACCTTG
GACTTTGAGC
TTCTTGTCAG
CTAATTTTAT
TTAGCTTTTT
AACCAACAAA
TTGTGTAAAG
TTTTTAGGCA
ACGCTTCAAA.
TGTTTTTAGT
TAATCGTAGG
ACCAACAAGA
GCATCAAAGT
AAAAACAAAC
AAGATCCCTT
TTTAAGGGAT
GATTATAGGA
AA.TCGCTTTG
CAATAACCAA
CCCTATGAAA
AATCCAACCT
120 180 240 300 360 363
INFORMATION FOR SEQ ID NO:397:
SEQUENCE CHARACTERISTICS:
LENGTH: 999 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...999
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:397:
ATGCGTTTTT
TTGGATTTAA
CGTTATATTA
CAAAATAAAA
AAAAGCCCTG
ATGTTAGAAA
ATCAGAGATT
GAGGCTTACC
TCTTTGTTTA
AAAAAGCTCC
GAAGATTACC
CATTTTAAAG
ATTCTAGGAA
CGCTCAGAAG
TATTTAGCCT
CTCTATAGTC
TCATATCCAA
TTATTTTATT
AGGATTTAGA
GCGATAAAAA
ACAGCGCCCT
ATGCCAAAAT
CA.ACAGACGC
TTGATAAAAT
CCATTCTTTA
AGGCTAACGC
AAATTTPTGA
CGGCGTTTA.A
ACGCTCTCGC
TCAATGAAAT
CGGTTGTCAA
CTAAAAAGAA
TTTATGCGAG
AATTTAAGCC
TTTTATGGGT
AAAAAAGCCC
AACGAGTTTA
ACAAAAAGCC
GCCTGAAGAT
TTTCCAAGCG
CCCCATTCAA
TGAAGAATTA
GCAAGTGTTT
AAAGCATATC
CCGCTTGATC
TAAAAGTAAC
CTTGCACAAA
GGACGATGAT
AAAAACTTTG
CCGCAAACCT
AAGAGAACCC
ATGCTTGGCG
GCCGGGATCG
GAAAACGCTA
ATGCAAGAAA
ATTTATTGCA
AGTTGTATCG
ACCTTTAAAC
GAAATCTTGC
AGCGCGCTCT
CCGATTAAAG
TATCAAGTCA
GCCACCCAAA
AAAACTTCTA
TTTTCAAAAG
GAGCACCTTT
CAAAACCACG
TCCTTTTAA
TTGGTTTTTC
TTAGGGATTA
AAAAAGCCTA
AAGGCGTAGA
AGCAAATTAC
CTATCGCTTT
CTTTACAAGA
AAAGTAAGAA
TTAATCATTT
AGTTAAACCG
TCTTAGATCC
GCAACGCGCA
AAGCGCTCAA
ACAGAGCGAT
CACAAAGCCC
CCCAGTTATC
TCAAACCGAG
TTATTTATGG
TGAATTGACT
AAATTCAGAT
CTTAGAA.AGC
AAAATCAAAA
AAAAATCAAA
CGTGAGCGCT
GAGTTATGAA
CCTTTTAGAT
TAAATTGGAT
AACCTTTTTT
GTATTTTGAA
TTTTTGGCAG
AGCCCTAAAC
GCATCATTTC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 999
INFORMATION FOR SEQ ID NO:398:
SEQUENCE CHARACTERISTICS:
LENGTH: 474 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...474
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:398:
GTGCATCTAG
GATGGGGGGA
ATTTTAGATT
AAGGACGCTC
ACTTTAAGGC
CCCACTACCT CAAAAGGAAT TTTTCTTTTT TCCCTAGCGT CTATGGCTCA GCAACTCATT CCTTTAATCA CTTCGTATGA GCGTGAGCGC TAAAGGCGTT GAAATTGGCT CAGTTTATGC CTAAAGAAAT CACATGGGCT GGGAGTGCGC ATGAAGTGGA TCACGGAGTT TTTAGGGGAT TTGCCTAAAA CTTTTATCGT
GGATATTGTG
AAAGGTTTTG
CTTTGATTTT
AATGCTGCAC
GGGGCTTGTG
120 180 240 300
CCTTTTGTGA TAGGGAGCGA GACCACTTTC AAGCTTTCAA GCGAAATGTT AAACGCTTTA
GAAACAGCCC TAAAAGCCAT AGAAACCCAA CTCAACGCAT GGGGAGTTAA
AATGCAACGC
ACCGATCATA TTGCCCTAGA TTGTATCGCT GAACTCTCTT ATAAGGGTTT TTGA
360 420 474
INFORMATION FOR SEQ ID NO:399:
SEQUENCE CHARACTERISTICS:
LENGTH: 924 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...924
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:399:
ATGAAAAAAG
AGGAACGGAT
AGCATCGGCG
AATTCATTAT
AAAGATTCAA
AATGAACTCA
CACTCTCTTT
ATAGCTTTTT
CGAGCGTCTC
ATGACCGGCA
AAATCTTCAG
AATCTCACCC
GGCGTTCGTT
GGGGGTTCTA
GATGATTACG
TATAACTTTA
TCCTCTTACT
TTTATTTAGG
AAAAAGCTTC
TCCCTAACAC
ACAAAATCGC
GCTTGGGGTA
TTTTCGGTTA.
TACCCTATGG
AAGAATACGT
GAACGCTAGA
GGCTTGTGAT
CTTTTAATCA
TTAGTAGTGA
GTGTTGAATT
GGGATAAATT
AAAACAAACA
AACTCTCTCT
TTTAAATTTT
AGCAGAAA.AC
AAAAGCCATA
TAACCGATTC
TAAATATTTT
CCAACTTGGT
TTTCAATACG
TGAAAGAAGG
CGCTGATACA
TGGCATGGAA
AGTCAAGAGT
TGAATACGAT
AGGGGTTAAA
GGATTATAAA
TTAA
CTCTCTCTCT
GCAGAAGGAA
GCCTTAAATC
AGAGATGTGC
GCAGGAAATG
TTGGGTAAAA
GGCGTTGGTT
GATTTGCTCA
GTAAAAGGGC
TTAAAAAGAG
CTAGGGGCTA
CGCACGATTT
ATTGATCGCT
GTGCCAGCGT
AGAGTGGTGA
CGTTTTGGCT
GCTATATTCA
AAGCGATCAA
AAAACGCCTT
GTGGATCGGG
AAAGGATTAT
CTGTTCCTGG
TTAATTGGAC
TCTCTATATT
CATCAAGGCA
GCACTTGGTT
TTCAGTTACA
ATGGCGATGA
TTAAAGTCAA
GCGTTTATCT
CCACGCTGAA
AGGACAAGGT
TAACGCAAA
AAATGCAGTG
CGGTATTTTC
AGGGTTTAGG
CAGCGGTTTA
TAACGATAAG
TTACAAAGAT
CATAATCAGA
TGCAAGTAAC
AGGGAAATTT
AAACTATCTT
TTATTATAGC
TAACTATACG
120 180 240 300 360 420 480 600 660 720 780 840 900 924
INFORMATION FOR SEQ ID NO:400:
SEQUENCE CHARACTERISTICS:
LENGTH: 1737 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1737
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:400:
ATGTCAAAAP
GTGATCGTAG
GGGCTAGAAZ
ATTTGTGGGG
TTAGGAATCA
CTTTTTCACG
TTGAGCACTT
TATCCCATTA
GCTAAAAGCG
TTAAACCCAG
AGGGAAGCGG
ACGGTGGGGG
AAGAGCAAGT
ATGGCAGGCG
TTTATCGCTT
GTGCTTGATG
GTTACGCATT
GGGCAAACTA
AAAATAATCC
AGATACGATA
GCGAAAAA.CC
TTAGAGGCGT
ATCGCTGATA
AGCACTTGTG
CAAGTGCCAA.
TATCAAGCGG
GGGGCTTATG
ATCATTAGGA
TTTAAAGGGC
AAATCGTAGT CGATCCTATC ACTAGGATTG
AGGGGCATTT
ATGATGATAM
CGATCATTAA~
TATGCACTTA
CTCCCCCATT
ATCATGTGGT
TAAAA.GCTGA
ATACCGGTGC
GATCTTTAGG
AACAAAATTT
CGAAAATGAC
GTGTTACGAG
TTGAAGTGGT
AAATGTTCGC
ATGAAGAAGT
GGGATATTTC
CTTGGTATCA
ACCCGCATTA
CTGCTAAAGT
GTAAGCCCAT
CTTATGTTAC
TGTTTTCAAC
ATGGTCTTTT
CTCCTTATCA
GGGGCATGTT
TAGTGCCTTC
AAATGAGCTT
CTATCCATTC
AGTCTTTG.
TGTGATCACC
GGOCAGAGAC
TTCGCATTAT
AAACGCGCAA~
GCATTTCTAT
TCCCATTCAA
GGGCGAATTA
GCCTTTTAGC
AATCGTCTTA
CGCTATTTTT
TGTCATGGAT
GGCTAATTTT
TAACGAACCA
GTTGCTTGGG
TAAATTACAC
ATATGAAGAC
TACCGGTTTA
GCTTGACACT
GGAAGTAGGG
TGAAGTGGCT
GCTTGGGCGC
GGCGTTTGAC
TATTGATAAA
AAGCCATTGG
CACTTGGAAT
GATTGGCACT
TTTTGATCCA
CGAGTTCAAA
GATGCGTTTT
CCACGAGATG
AAAGCCGGTG
TTGGTGCGAT
ACTTTGCATG
GCAGCAAAAC
AAAGCGGTTC
AATGGCTATT
AGCCACTACC
GGGGCCAAAC
ATATTGGATC
ATCAACCATG
TCTGTTATTA.
AAGGATAAAT
CCCATTGATG
ACTAAAGAAG
AAAGACGGCG
AAGGATAAAT
CCTTTAAGTT
ACTAAATTTT
ACAGCTGCAA
GCGTTAGTGG
AATCAAGAAT
GTGCGTATTA
GCAGGGCCTA
AAAATCGCTG
TGCATCGCAT
GTAGAGCCTA
CTTCTTCTAC
CAGGCTTTAT
TTACTGCGGT
CTTTGATGAA
GGCTTGATTG
TTTCTTTCAA
AAAAACGCTT
ATGGGCATAA
TCAAGCTTTT
AACCTCACCC
CAACAAGGTT
CTTACTACCC
AAGGCTGTGG
ACCTTTTGAG
AAAGTTTGAT
TGCAACTCCA
AGAGCGTGGG
ATTCTTGGAT
CCGTAGTGGT
TAAAAGACAC
GGTGTATTGA
AAAATCTAA-A
ATAAAGGGCG
AAAACGGCGT
GAGATTCTAA
ATTTAACCCA
GCTCAGTGCA
ATTTCGCTAA
AAGGATTGAG
GCTTTTTAGG
CGCTCAAAGG
AGAAAACGCT
CATGGCGTTG
GTGCGATATT
ATACAGCCCT
GAATGATTTC
AACCTATCGT
AGAAATCCAJA
ACAAAGCCTA
GGCTGAATGG
TGATTTGGTG
TTTAAGGAAT
TAGTGGGGTG
TAAAGAAGAG
CCCTTATGAC
GATTGAAT
AAAATCGCCC
AGGTTTAGCG
GAAACTGCCT
AGCTAAGACT
GAGCGATCAA
CTACATTGGT
GGTGGAAAAT
.AATCAAAGG
GCCTTTAGAA
rGTGATGGAT
ATTCTAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1737
INFORMATION FOR SEQ ID NO:401:
SEQUENCE CHARACTERISTICS:
LENGTH: 207 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...207
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:401:
ATGAATAGTT CTAACCTCAA AAATTGGCTA TTCCCCACTA TTTGCTTTTT TTTATTTTGT
TATATTTTAA TTTTTTTGAT ATTTTTTATG TTTAAAACT TGCAATCGCA ATCTTTTGGC 120
TCTGTGGCAG AAACCGGAAA AAAACCCATC ACCACCACCA AGAAATTTGG TAAGGAATTG 180
CAAAAACAGA TTTCAAAAAT CCATTAA 207
INFORMATION FOR SEQ ID NO:402:
SEQUENCE CHARACTERISTICS:
LENGTH: 162 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...162
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:402:
GTGGTCTTGC CTAAAGAAAC CTTATCTTCT ATCGCTAAAC GCTATCAAGT CAGCATTTCC
AGTATCCAAT TAGCCAATAA CCTCAAAGAT TCTAATATCT TTATCCACCA GCGCTTAATC 120
ATCCCCACTA ACAAAAAATT ACTCGCTACA AGGGAATTTT AA 162
INFORMATION FOR SEQ ID NO:403:
SEQUENCE CHARACTERISTICS:
LENGTH: 1059 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1059
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:403:
TTGTATTTCC
CTAATCGCTC
TCGTTATTAG
GAGCAAACCA
CCTGCCATGT.
GATATTGTTA
GTGGCGGCGT
GTGGGTAACC
CAAGAAAAAA
GTTGACGCTT
TTGAAAAATG
GGCCATCAAG
AAATACGTCA
GAGATTATTT
AAATTTTCCA
ATTGGCGGAC
GCGTTTAAGG
TTGAATGACG
TTAATGGTGG
GCTTTAAAA
GCGTGGCTAA
TAAAGCTTCC
TCAATACTTG
AAGCCACTCT
TGAATGTGGA
CTAAAGCGGT
CCATTGTAGA
CTAAAAAATT
TCAAAAkAGAA
CCCTTGATTC
AGTTTGGGCG
TTATTTGGTG
CTATCAAAGC
CTAGAGCCCC
GCGTGGATAT
CGGAAGTTGA
TTATAATCGG
AGCTTTAATT
CGCTTCTAAT
TGTTTCTAAA
GGATAGGGTC
CAAAGATCCT
ATTATTAAAA
AGAGCATGCG
GGTCATGGAA
GGCTAAAATG
AAAGGGGGTG
AGATATTTTA
CGCTGACATT
GATAAGCCCG
CATTAAAAAC
ACTCATTAGT
TAATGCGATA
ACCCTTTTTG
CTTCATAAAT
TCCTATTCTT
CAAGAGATCC
ATAATCTACT
GTGGGCATTT
GAACGCATTA
AAGCTTAGCC
AAAAAATTTG
GATATTGACG
CAAGAAACTT
GAGCTTTTCC
GAAAAAGGGG
AGTGTGGA.AA
CTTAGCCCTG
AAGCAAGTCT
CTTTTTATCG
ATCAAAGATT
TGGCACTGA
CATACGAAAG
TAGGGGTTCT
AAGTCAAGGA
TGGGCAGCTT
CGGATTACGC
AACCCATGAG
CTGATCTTGT
GGATTTCATT
CTCAAGCTAA
TGGATTTTAT
ATAAAGCCAA
GCATAGACAJ\
AAATCGTTAA
AAGACGTGTT
ATAAGCTCCC
CTTTAA.AAGC
ACTATAAAGT
GATTGTCATG
TCTTGTTTCA
TTATTTTGGG
TGCAGA.AGTG
TTTTAAATCT
CAGTGATCAT
GGTAACCTTT
CCTTTCTTTC
GGCCTTAGAA
CAAAGAGCGT
TAAAATCAGC
TTTTGGCTTG
AGAAAACCCT
GAACAACCCT
CACAATGGAT
CCACCCTGAA
GGTCTTTGAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1059
INFORMATION FOR SEQ ID NO:404:
SEQUENCE CHARACTERISTICS:
LENGTH: 594 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...594
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:404:
ATGAAGTGCT
ATCACGATAG
GAAAATATCC
CTTGATCTGT
GATTCTAAAA
AAACGGCAGC
TCCAGTTTTT
GGCCGTTTCA
CATOCATTAG
AAACAAAGCC
CAAGCTTTAC
GGCTAGTGTT
CTAAAATTGA
CCATAGAGGG
TCAAGCGCTA
AGGATTTGTA
GGAGTGA.C
TTCTCACTTC
CTATTATTGA
AGAAAGAAAA
CTCTAATAGC
TTTCTTTTTG
ATTAGAAAAT
CAAAAAAGCC
TGATGAAGAC
TTTCTTCCCT
AGGGATTTAT
AAAGGACAGC
AGCTCAAAGC
GAAAAAATTC
GTCTTAAACT
CGTTCCCAAC
TTTAAAGCGT
CTACAATACG
ACCATTGAAA
AACGGGGTAA
AACCATA.AGG
AAGGTTGAAG
ATTCAAGCCA
CCCACTTTCA
TTTTTGTAGT
CAACTAGCGT
TTCAAATCAA
ATGATCATGA
GC!GTTGAGTC
CTTATAAAAG
AGCAAAATTT
GGCTTGACAT
ATTTGTTTTT
AAGGAGGGTT
TTTGTCTTTC
AGTTTCTAAA
CGATAAAATC
AATCTTTTTT
CCCTGAAGCC
AAGCGATGAT
TAAAGGCAAG
TTCTTATTCG
AGATGALATC
TTAA
120 180 240 300 360 420 480 540 594
INFORMATION FOR SEQ ID NO:405:
SEQUENCE CHARACTERISTICS:
LENGTH: 558 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...558
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:405:
ATGCGTTGGT
AAATTAGAGA
TTTGTA.GCGA
GGTAAAGACC
GAACGCTATG
ATCAGCGGGA
CAAAATGCGG
TTAA.ACAJA.A
GTGTTTGATA
GCTAAGGAAA
GGTGTTTTTT
ATAAAGGCTT
ACGACAAAAC
GGTTGTTTGC
AAGCCACAGG
GCACTGACAA
TGGTTAGAGA
CCAAGGGTTA
TGGAAGACAT
AACCATGA
GGTGTGTTGT
GAAAAAAGAA
CAAAACCGCC
GGATAAAGTG
GAACACGCAT
GCTCATTTAT
AGTGGGAAAA
TGCTGATGTG
TAATGAAGAA
TTGAGCGTTT
AGAGAGCTTT
GTTATTCAAG
AGCGTGTTTT
TTTAACATCT
AACGCGCTGA
TCTAATGTCA
TTGGGGAGCG
AATCGTAAGG
TAAGCGTGAT
TAGAAATTAC
GCAATGTGCA
TAAACGATAA
TTACAGAAGA
ATGGGGAATA
TCACCGGCGA
CGAAGCGGCC
CTAAATTGAA
GGACGCTAA
CGGCAACCpJA
GATCAAAAA
ACGAAAGCCA
TAATCGTGAA
CAAATTATTG
TGAAATCATT
CGCTAAATTT
GAAAAAAGGC
120 180 240 300 360 420 480 540 558
INFORMATION FOR SEQ ID NO:406:
SEQUENCE CHARACTERISTICS:
LENGTH: 636 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...636
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:406:
TTGATCCAAC AAAATTTTAA
AACACTTAAG
CCCGATTTTA
TCTAAAAATT
GTATGCCCTA
TTTAATGTGA
CCTGTGGAAA
AGCATTTCTA
TTGATTGATA
AATGCAGATG
GTATGCCCAG
GCAGAGTATC
AAGCGCCTGC
TAGGCAAAAJA
CAGAAATCAT
TTGGCGTGTC
AAGGCGGTAT
GAGACTATGA
AAAACATGAA
AAATGCTTCG
CAGGTTGGAG
TTAAAGAAAA
CGTTTTAGGA
TGGTGTGATC
TGCGTTTGAC
TATTGACAGC
CGGTCAAGTG
TGTGCTGTTT
AGTAAGACAC
CATGGTAGAC
AAAAGGCGAT
TTCCATTAAG
GAGTTGTATA
AACAATGAGG
CTTTTCTTTT
AAAAGAGTGA
GA.ACAAGTGC
TCTTTCCCTA
GAAGAAGCGA
GCAGTGATCA
GCTCTCTTAC
AAAGGCATGA
CTTTAA
TGTTAGTTAC
TTGATGAACA
GGCCAAAAGA
AAGACTTCCA
ATTTCGCATG
TGGTGGCTGA
TCGCTTTGAG
ATGACTTGCC
ACTTTGAAGA
AAGCAACTCA
AAAACTTGCC
CTTTGAGCTT
TTTTACTTTT
CGAAAAAGGC
GAAAAACACC
TATCACTAAG
AGGTGCTTTT
ATTAGGTAGG
ACATGGTGAA
CCAAGGCGTT
120 180 240 300 360 420 480 540 600 636
INFORMATION FOR SEQ ID NO:407:
SEQUENCE CHARACTERISTICS:
LENGTH: 591 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...591
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:407:
GTGCCGTTGT
GGGCCAAAGC
TTTTGCATCG
GTAAGCTCTA
TTGAGGGAGC
TTTGGGGAAG
AAAGAAAAAT
GAATTGAAA
CGCTCCATTT
TTAGGGGCGC
GGATTATGGT
AGTAGGGGCA
TCATTAAAAC
GGTGGGGTCT
CGCTTTCAGC
GGTCATCACC
CGCATATTGT
GGTGGGCGCG
AATCCAGAAG
GCGTTTTGCT
ATTTAGAAGA
AATTGAAGGC
CGCTCATGCT
AGAGAGCTTG
AGAAGAA
AAAGTCGCTT
TGAAAAAGAT
TGTTACCGCT
TTCTTTTTGT
AGCTCTTGGT
GCTGGGATCG
GAAATCACAG
GTGCTTTTAG
GTGTTTGGGG
AGAATCAGAG
TTTTTAGAGC
AAAAAAAGCA
AA.AAAAGTGT
TGGTTGGTAA
TTTATAGAAA
CTTTAGGCTT
AATTAGACAA
CCTCTCAATT
TGGGCTTTTT
ACAACATTGT
GCTTTGATAA
AGAACACCGC
ATAAAGAAGA
CCGTGCCGGT
AGTATTTCTA
GAGCTTGTAT
AATGCAAGCT
AGGCTTACCG
AAGGGAGCGT
AGCGGCGCAC
AGCCAATTTG
TATCGCTTTG
AGTGATCAAA
TTCTGCGCTT
G
120 180 240 300 360 420 480 540 591
INFORMATION FOR SEQ ID NO:408:
SEQUENCE CHARACTERISTICS:
LENGTH: 525 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...525
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:408:
ATGTCAAATA
TTAGGGCTAA
TTTTTAGAGG
TTTGTTAAAA
GAATTAGAAG
GATGAAGTGA
GCGGCAAAjAT
GACAATGGCG
AATACAACGA
GCATGTTGGA
TCGTGCTTTT
CTAGGGAATA
AAGGCGATCG
CCAAGCTCGC
AAAGAGGCTC
CCCAAGCGAA
TGGCGAGTTT
GAGCGCGGCT
TAAAAATAAA
TTATTTGGCT
CAGCGTGAGC
CATTAAAAAG
TCAAGCTGAA~
AAGAGATGAA
TTTGGCTAAA.
GCAAAAGCGC
TACCAAAAGT
GCGATTCTTA
TATCGCCCTA
TCTAAAGTCC
GGCGATTTAG
GCCGGGCATA
ACGATCAATT
GAGACTTATA
GATGAAGCCT
ATAAAATGGC
CAGGGGGTGG
AGGCTGAAGT
CTGGCCGCAT
TTTTTAGCAT
AAGCCGCTA.A
CTGCGAGGGA
AGCGCGTTCA
ATGCGGCTAT
TTTAG
GGCTTTATTG
GTTGCAAGGG
TGAAAAGGTG
TTCTAGCCCT
AGCCGTTAGC
CGTTTGGCAA
AGATTTGTAT
GAAAGCACCA
120 180 240 300 360 420 480 525
INFORMATION FOR SEQ ID NO:409:
SEQUENCE CHARACTERISTICS:
LENGTH: 447 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...447
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:409:
ATGGAATTTT
TCGTGGATGG
CATAAAAAAJG
GCTTCGCCGG
GAAATGTTTA
ATCTATCATT
AACGCAAGGT
ATCTTAGTGG
TGAGCGGGTA
CAGCGTTATT
AGTTTGTAGG
CGATGGGTTT
AAAGCGGGGG
TTTATTGCAA
TTTATCGTGT
TTGTCAAGCC
TTTTTTATGG
TTATTTGCCA
CGTGGTTCAA
CACGCTTATT
TTGGTTGCAT
AAAATGCATG
GTTTAATGAA
TTTTTAA
GTTAAAGCTT
CGCCTTTTTG
ATCCAAGAGA
ACAGGGATTT
GCTAAATTGG
CGCGAGCTGG
ATACCCACGA
TCCATGTGAT
TCTATCATGC
AAAAGCTCTA
TGATGTTGTT
CTTTAGTGGT
AAAAAGACCC
TTTTA.ATGAT
AGCGGTCATT
AGAGAACGCG
TTCCTTTATC
GATCGCCCCT
ATTGCTTTTA
TACAGGGAAA
CCTTATTGTG
120 180 240 300 360 420 447
INFORMATION FOR SEQ ID NO:410:
SEQUENCE CHARACTERISTICS:
LENGTH: 438 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...438
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:410:
ATGAAACCA
GACGCTTCTT
AAAGAGCAGG
ATTGTCCCCC
AAAGTGAGCG
TCTTGCTCGC
TATGAAATTT
CGCACCTTTC
TTTTTTTGTT
CAAACCTTTT
AGCAAAAAC
AAGAAACGCC
CTGTAGTGGA
TCTACCCCAC
TATGCGAA
AGTTTTAA
AATCTTTTTG
TGATTTGATC
GCGCTTGAAA
CTATTTAGAA
GAGCGTGGTT
GCCTAAAAGC
CCAGCCGCTA
TTGTTAGCGA
GCCTTATTGC
GATAAGGGGA
TTAACAGAGA
CTGGCTCAAA
GCCCTTTAGT
TGGCAAGGGG
CTAGGGAGTC
ATTTTAAAGA
TTGACATCAA
GTTTCTTTAG,
TGAGGAATCA
TGGATAGpjAG TTAGCACCAA
TAGAGAKA.AG
ACAAGAATTG
GGCGTTAGAJA
GTATTATTTG
TCAGGGGCGT
AAGCGTAGCC
TTTAGGCAAA
120 180 240 300 360 420 438
INFORMATION FOR SEQ ID NO:411:
SEQUENCE CHARACTERISTICS:
LENGTH: 774 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...774
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:411:
ATGGCTAAGA AAAACAAACC CACCGAATGC CCCGCCGGCG AAAAATGGGC
OGTTCCTTAT
GCGGACTTTT TGTCGTTGTT GCTCGCGCTT TTTATCGCTC TTTATGCCAT
TTCAGCGGTC
AATAAATCCA AAGTGGAJAGC CTTAAAAACC GAATTTATTA AGATTTTTAA
TTACGCCCCA
AAGCCAGAGG CGATGCAGCC GGTTGTAGTG ATCCCCCCTG ATTCAGGGA.A
AGAAGAAGAG
CAAATGGCGA GCGAAAGCTC CAAGCCGGCT TCGCAAAATA CCGAAACAAA
AGCCACTATC
120 180 240 300
GCTCGCAAGG
CCCTCTAGTT
TATATTGAAC
AGAGGTTTTA
GCCGCCATC
CAATTGTCTT
AACAGAATGA
AAGATCCATT
GCGAAGGCAG
TGCTGTTTGA
GGATCGCTA
CGGATAACAC
GCGCTTATAG
TTTCTTCTTA
AAAACAATCG
CTATTTTAGA
TGTTTTAGAG
AAACGCCACT
AATCATTCA-A
GCCTTTAAAT
GGTGATGAAjA
CGGCTCTACC
TGTGGAAATC
TGAAGAATTC
CAAATTGATC
TCAGACGCTA
AAACTCCCTA
AAAACCCGTT
GTCCTTATAC
AACCCTATTG
TTTTTTTCA
AATCCCCATA
AAGGCTCTGT
TCAATCAAGA
AAAGGGTGCA
TTAAAAGCCA
AATACGGCGT
CGCCCAATGA
CCGATGCGAA
AACA~cAAGA
TTTAAAGCTC
CATGATGCTT
TATTAATGTG
CTATGAATTA
GGATCCTAAC
TTCCCTAGA
CGATTTGAGC
ATGA
360 420 480 540 600 660 720 774
INFORMATION FOR SEQ ID NO:412:
SEQUENCE CHARACTERISTICS:
LENGTH: 2397 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2397
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:412:
ATGAGAAAGG
TCGGTCGTTT
TTTTTAAAAA
CAAAGCGAAG
AAAAAAACGG
AGAGACGCTA
GGTAACGGGC
CCGTATTCCA
GATGTGATTA
AATGTCATCA
TTTTGGGGGC
GCCCAAACTT
GGTAAATATA
AATAGCCCTA
AATACTTTTA
AGCGCGCAAG
GGGCGAGCCA
GTGGGGGGCG
TCCAACCAAT
AAGGGAGAAA
AGCCCTTGCT
AAACTCAATC
TTTTTA.ACTG
AATGGGAGTG
TATGcCAGCG
TTATCATAAT
TGACTTTTTG
AAGTTACAAC
AGGTGCGTAA
GGAATTTGAA
CAGGCACGGG
ATAGCAACAC
ATATTGAACT
AGGGGGGAAC
CTAAAGAAAT
GCTCCTCTAA
TAGGAAACCA
TAGGCATTAG
CAAAGGTGCA
AAGCTTATTA
ATTACGCCTA
AGCGCTTTGG
ATTTTAAATT
ACCAAAGCGT
TCAGCGcAA
GGCAATTTTT
TTATCGTCA.
AAGATTTATA
GGTTTGATGC
ATGAAATCAA
GAATGGTTAT TTGAGGGTAA
AAACCCCGTA
3GACTTTTAAT
-CACCGAGCAA
TTCCACAAGC
TATTGAAAAC
CGTGCTGCCT
CAACATGATT
GGCGATTTTC
GAGCGTCCAA
CCCCAAAGAG
TGGGAATTTT
AATGCTGTTT
CGCTCAAGGC
AAACTACTTG
CCAGTATTAT
TAACCGCTTC
GATCGTGTAT
CACTTATTTC
GTATATGAGC
AAACCCTAAT
TGACAATATC
TACTGGTAAA
CCGCCGATCC i AGGAACTTCA
C
TTTCAATAAC C
TCTTTTATGA
AAATTCAGTT
TCTCGCACGG
GCCTTGCAAA
AAAATTTCGG
TTAGTCAATG
CCTGTAACTT
TACGGCCCTA
TGGGAAAATC
GTCGATCCCA
AACACTTATG
AATTGGATTA
TTGGATGCGA
CAATACAACT
ATCAATGAGC
CAAAACTACT
ACGCATGACA
GGTCAAAATA
rGCGGTCTGT
:GCCGATCCGC
,TCAAACAAA
~CCACCAGGA
~TCAATAATT I2 ;GCATGCTAA
C
GCGCGAAAGA
CTAGTGCCCC
TGATTTCCAA
ATGTGCCAGG
TGCGCGGTTT
GTATCCCCAT
TCCAGTCAGT
ACACTTTTGG
AAGCGGCTGA
AAGAAAAAGG
GGCGAACGGC
ATGGGCA.AGG
TTTATAAGAT
CTTACCATCC
GCCCTGACAA
ITGGCGATCC
rGAGTAGGGAC kiGATTTTACC kTTCTTATAGC rGGTGAATGC
C
rTTTTAACAT
C
LAAACCCTAG
C
~CAACAATTAI
~GATCACGCC
G
TTTTTTAGCG
TAAGCACCAT
AATTTCATGG
CAAAGAACTC
GATTCAAATC
TGGTGGGGGC
TTATGGCGCG
GGATAGGATT
AGGCGTGGTG
AAGGATCACT
CAAGCCCTTA
TGGGATGTTG
TTTCAGGCA.A
CAATGCGACC
AGGCACTTTG
rCAAGATGGA 3GATAGGAAA rTTTGGGTTT
'TTTAAAGGC
~GACACGAAT
~TTTGAGCCA
;GGAATGCGC
~ATGCCTAAT
'ACCGCTGTG
GGCTTGAGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500
TACACTTTTT TAAATTACGA AAAAAAAGAC GCTCCTCCCT
TTAAGGTGGG
AAAACCACGA
ATTAAAGAAT
AATATCGGTA
GGCGGCTCAA~
TTTGCGAATC
TCGCAAGGCG
GCTTACACTT
CCTAAAGGGC
TTCATTTTAG
TATAGCCGCG
ATTAAAAATG
AATTTGCAAA
AAGAGCGTTA
TATTGTTTTA
ATTTTGTAGG
GATATTATTT
ATTATTTTAC
TGGAGCTGGA
TTATAGACGC
CTAAAAAAGA
ACGCGAGTTA
CTTATAGCGA
GGGCTATCGT
TTTCTACCAC
TAACCAATGG
TTTCAATTAC
CACAAGCACG
TAACCATCAA
CGGGCGCTAT
ACTCTACTAC
TAATATCACA
TATTTTTGGC
CACTTACGCT
TGTGTTAAAC
TACCAAAACA
TTTGTGGGALA
CATGAAATAT
CAGGAGCATC
AATCCGGCAG
CA.AAGAAGCT
GATTATTTTC
GTGAGTTTTA
GGGGACAACA
ACGCCTATTA
AGCCACACGA
AAAAAGCTCC
AAAACCACGA
ACCGTGCCTT
GCGGGCATGA
CGCAAAAATC
TGGTTTAGCG
ACAGCGTATG
TCAATGTCGG
ACATTCCGCC
AAATCTTTAA
ACGCGAACTA
GAGAGCCGGT
GGGGGCTTAAk
TGGTTACTAJ\
CTTTTGTCAG
TTGGGTTAAG
TTACAGAATA
CGCCGTACTA
AAAGCGTGAA
GGATAGGCAC
TGAGTTATAA
TCAAACCCCA
CTATAAACCC
CCAATTCAGC
TGTCATGGAA
TTTTGTCATT
CAATGCGAGA
TTTCCATGCG
CCCTGCTA-AT
CCCGCACCAJA
CTCTTTCTTT
CGCGCCCACG
TTGGGTCTGG
TGCGAGCTTG
TAGCCCTAAC
TTTTTA.A
1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2397 CAAATCAATA ACATTTTTAA GGGAAAGAAG
CTGCGCCTCC
INFORMATION FOR SEQ ID NO:413:
SEQUENCE CHARACTERISTICS:
LENGTH: 2397 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2397
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:413:
ATGAGAAAGG
TCGGTCGTTT
TTTTTAAAAA
CAAAGCGAAG
AAAAAAACGG
AGAGACGCTA
GGTAACGGGC
CCGTATTCCA
GATGTGATTA
AATGTCATCA
TTTTGGGGGC
GCCCAAACTT
GGTAAATATA
AATAGCCCTA
AATACTTTTA
AGCGCGCAAG
GGGCGAGCCA
GTGGGGGGCG
TTATCATAAT
TGACTTTTTG
AAGTTACAAC
AGGTGCGTAA
GGAATTTGAA
CAGGCACGGG
ATAGCAACAC
ATATTGAACT
AGGGGGGAAC
CTAAAGAAAT
GCTCCTCTAA
TAGGAAACCA
TAGGCATTAG
CAAAGGTGCA
AAGCTTATTA
ATTACGCCTA
AGCGCTTTGG
ATTTTAAATT
GAATGGTTAT
GACTTTTAAT
CACCGAGCAA
TTCCACAAGC
TATTGAAAAC
CGTGCTGCCT
CAACATGATT
GGCGATTTTC
GAGCGTCCAA
CCCCAAAGAG
TGGGAATTTT
AATGCTGTTT
CGCTCAAGGC
AAACTACTTG
CCAGTATTAT
TAACCGCTTC
GATCGTGTAT
CACTTATTTC
TTGAGGGTAA
TCTTTTATGA
AAATTCAGTT
TCTCGCACGG
GCCTTGCAAA
AAAATTTCGG
TTAGTCAATG
CCTGTAACTT
TACGGCCCTA
TGGGAAAATC
GTCGATCCCA
AACACTTATG
AATTGGATTA
TTGGATGCGA
CAATACAACT
ATCAATGAGC
CAAAACTACT
ACGCATGACA
AAACCCCGTA
GCGCGAAAGA
CTAGTGCCCC
TGATTTCCAA
ATGTGCCAGG
TGCGCGGTTT
GTATCCCCAT
TCCAGTCAGT
ACACTTTTGG
AAGCGGCTGA
AAGAAAAAGG
GGCGAACGGC
ATGGGCAAGG
TTTATAAGAT
CTTACCATCC
GCCCTGACAA
TTGGCGATCC
TGAGTAGGGA
TTTTTTAGCG
TAAGCACCAT
AATTTCATGG
CAAAGAACTC
GATTCAAATC
TGGTGGGGGC
TTATGGCGCG
GGATAGGATT
AGGCGTGGTG
AAGGATCACT
CAAGCCCTTA
TGGGATGTTG
TTTCAGGCAA
CAATGCGACC
AGGCACTTTG
TCAAGATGGA
GGATAGGAAA
TTTTGGGTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080
TCCAACCAA-I
AAGGGAGAP
AGCCCTTGCT
AAACTCAATC
TTTTTAACTG
AATGGGAGTG
TATGCCAGCG
TACACTTTTT
AAAACCACGA
ATTAAAGAT
AATATCGGTA
GGCGGCTCAA
TTTGCGAATC
TCGCAAGGCG
GCTTACACTT
CCTAAAGGGC
TTCATTTTAG
TATAGCCGCG
ATTAAAAATG
AATTTGCAAA
CAAATCAATA
GGGAAAGAAG
ACCAAAGCGI
TCAGCGCA
GGCAATTTTT
TTATCGTCAA
AAGATTTATA
GGTTTGATGC
ATGAAATCAA
TAAATTACGA
AAGAGCGTTA
TATTGTTTTA
ATTTTGTAGG
GATATTATTT
ATTATTTTAC
TGGAGCTGGA
TTATAGACGC
CTAAAAAAGA
ACGCGAGTTA
CTTATAGCGA
GGGCTATCGT
TTTCTACCAC
ACATTTTTA
CTGCGCCTCC
GTATATGAGC GGTCAAAATA AGATTTTACC
CTTTAAAGGC
AAACCCTAAT
TGACAATATC
TACTGTA
CCGCCGATCC
AGGAACTTCA
TTTCAATAAC
AAAAAAAGAC
TAACCAATGG
TTTCAATTAC
CACAAGCACG
TAACCATCAA
CGGGCGCTAT
ACTCTACTAC
TAATATCACA
TATTTTTGGC
CACTTACGCT
TGTGTTAAAC
TACCAAAACA
TTTGTGGGAA
CATGAAATAT
TGCGGTCTGT
CGCCGATCCG
GTCAAACAAA
ACCACCAGGA
CTCAATAATT
GGCATGCTAJA
GCTCCTCCCT
AATCCGGCAG
CAAAGAAGCT
GATTATTTTC
GTGAGTTTTA
GGGGACAACA
ACGCCTATTA
AGCCACACGA
AAAAAGCTCC
AAAACCACGA
ACCGTGCCTT
GCGGGCATGA
CGCAAAAATC
ATTCTTATAG
TGGTGAATGC
CTTTTAACAT
AAAACCCTAG
TCAACAATTA
CGATCACGCC
TTAAGGTGGG
TCAATGTCGG
ACATTCCGCC
AAATCTTTAA
ACGCGAACTA
GAGAGCCGGT
GGGGGCTTAA
TGGTTACTAA~
CTTTTGTCAG
TTGGGTTAAG
TTACAGAATA
CGCCGTACTA
AAAGCGTGAA
CGACACGAAT
CTTTGAGCCA
GGGAATGCGC
CATGCCTAAT
TACCGCTGTG
GGGCTTGAGA
TCAAACCCCA
CTATAAACCC
CCAATTCAGC
TGTCATGGAA
TTTTGTGATT
CAATGCGAGA
TTTCCATGCG
CCCTGCTAAT
CCCGCACCAA
CTCTTTCTTT
CGCGCCCACG
TTGGGTGTGG
TGCGAGCTTG
rAGCCCTA~C
TTTTTAA
1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2397 CAGGAGCATC ACAGCGTATG
TGAGTTATAA
INFORMATION FOR SEQ ID NO:414:
SEQUENCE CHARACTERISTICS:
LENGTH: 834 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...834
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:414:
ATGAAAACA\
TTTAACGCAT
AGGGGGGTTT
AAAGGGAACT
GGCGATGAAA
AAAGCCAATA
GTCGTGGATT
GTCATTAAAA
GCGGATTTTT
ACAGAGACTT
TTGTTGCTCG
ACGGGCTTTT
GCTCTGATAG
TAAAGGTGGG
ATCAAGGCTA
ATAAGATTGA
AAGTGGATAT
TCGCTAATCC
ATATAGAAGA
ATTTCACTAA
TTTTAGCCCT
CTTGGGTGAA
TAAAATGTGG
CCATAAAGAA
GGTTTTTAGC
TGATGTAGTC
GTTTATCCCT
TATCATGGCT
GTATATGAAA
GCTGAAAGAT
AAATTACCCC
TTTAAACAAT
ACAACACCCT
GGGCTATTTT TAGTTTTA3AT AAGAAGGACG
CTTTAGAAGT
GATAAGCCTC
CTTTTGGCTC
ATCGCTAAAC
GCATGGCCCT
GTAGAAGCCT
CAGCTAGGGT
AATTTCACGC
GCACTAAAGA
GTCGCTTTGG
GGGTGATTTC
AAAGAGTTGA
TCGTGAATAA
AATATCAAAC
TTTTGAAATT
AAGGCCATCG
CATTAGCCCA
GAATTTAAAT TAGGCATTAC
CGCTTTAGTC
CATTAAACA
TGTGGATTCT
TGATTTATTG
GGAATTTTTA
AAGAGAAAAA
TAAAGATGGG
AGGCACGACA
TGAACAAAAC
TGATAACACT
AAGCCTTGGC
120 180 240 300 360 420 480 540 600 660
GATAAGGATG TGATCGCTCC AGCGATTAAJ\ AAAGGCAACC CTAAGCTTTT AGAGTGGTTG
AATAACGAGA TTGATTCCCT CATTTCTAGC GACTTTTTAA AAGAAGCTTA TCAAGAGACT
TTAGAGCCTA TTTATGGCGA TGGAATCAAA CCGGAAGAAA TTATTTTTGA ATGA
720 780 834
INFORMATION FOR SEQ ID NO:415:
SEQUENCE CHARACTERISTICS:
LENGTH: 624 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...624
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:415:
TTGGATAATT
ATTCGCATTT
GTGGGGATCT
CTTGCCGCAA
GGGATCAATC
ATTTACTTGC
TGCGGATCCC
TATATCAATT
TTTAATAATG
TCATCTCACA
ATGGCTATGA
CGCGCACTTG
GCACCGGTAA
TTGCATGGCT
CTGATTATGG
AATTGGGGTT
GGAGCCCTAA
TTTTCAACAA
ATACGCGCAA
AGTGGGTGGG,
GCCCACCACT
TGTTTTTATC
TAGCCGCTTT
GACAGCTGAA
AGTGCATATG
AACCAAAAGC
ATTCTATGTG
TATTAAAAGA
GTAG
CGCTTGATCG
ACTTTTAGAT
AGGGGGTATT
ATTAAATTGG
GTTACCGGTT
AAACCCAAAA
CACAAACCGG
CACTGGAAGG
GATCAAAcAA
GCGATCGCAG
TAACCAATTT
ACAAGGGCGT
TAATGGCGGA
ATTACACCAT
TGGATTATTG
CCCCTACCGA
GCTATGGCGT
GTTTGCTGCT
GCCCCTATGT
AGCCTGGGTT
GAGTCAAGCG
GAGTTTGAAT
CAGCTATGAG
TCCAAAAACA
CGGGTTTGAT
TTTAGCCAAA
GAGTAATTTT
TTCAGCCGTG
CATGAGCCCG
TAATGCGCGT
120 180 240 300 360 420 480 540 600 624 GATATGCCAG
GCACTGACGC
TTTGAAGTGG
CTTACAAATG
INFORMATION FOR SEQ ID NO:416:
SEQUENCE CHARACTERISTICS:
LENGTH: 735 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...735
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:416:
ATGAAAG
GCCAAAGATC
AAAAATCAAA
AAAGATATTA
GAAAAcCCTT
CGCGATTTTA
ATCAATTCTA
CTTGATGTGC
TTATCAGGTG
GCGCAAJ\IAA
GGAGACTATG
CAATTTAACC
CGGGTTTTCT
CGAATGTGTT
ATCCTTCACC
ACGGTCCGTT
TGTTGCTTGA
TGAGTAAAAA
AAAAAGGGCG
GCAAACAGCT
GCTTGAACGG
CCCCGTTTGT
GCTTGGCGGT
CATTGACTTT
TTTTTTGGCG
GCGTAAOATT
ATGCATAGA\
GCAATATTTG
TCCTTCTACG
ATACGGAAAA
ATCGCAAAAC
GGATAATAAT
GCATAAATAT
CATGCTTGCT
GGTGCAACAG
AAATCGCGCT
GCGATGGCTA
GTTTTTGAGA
GTCAAACCCG
TTGATGCCAA
CCTAACTTTT
CCCATTCCTG
CATTTTCACA
CTAAAAAATA
TTGGCCCGTC
AAAGAAGTGC
AGCGATAACT
TCAGCCGAAG
TCATTOTTGT
AATGTTTGCC
ACGCCGGCTA
CGACTCACAT
TTTACTTGTC
ATTATGCGAT
TCCATATTTC
TCAACAGCCG
GGGTAACAGA
CTAACGCGCA
CCTTTGTCTT
AGATTCAAGA
GACTTTAAAC
TAATTATGAG
TGTGGTTTTA
TAGTGGCATT
ATGGCAAGCG
CTCTTTGACG
TTGCATTAGC
TTGGTCGCCA
GAGCGAATTA
CAAACGCATG
GTTAGCGACA
TCATGAATGC
120 180 240 300 360 420 480 540 600 660 720 GCGATTTTGC
GTTAA
INFORMATION FOR SEQ ID NO:417:
SEQUENCE CHARACTERISTICS:
LENGTH: 687 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...687
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:417:
ATGGTATTTG
GTGGGGATCG
GAGTGGGTGC
CAAGGGAGCA
ACCACTCCTA
GGGGTGTGGT
TTAGTGGCGC
TTCCCTAGCG
TGCTATTCTA
ATTTTTTTAA
GGAGGGTTTT
TTGAAACGCC
ACAGAACAAC AAGCGTAAGA
GAAAAAAAG
TCTTTTTTAT
AACAATTGGA
CGTGGCTTTC
TAGCCTTACT
TTTTCTTTAG
GCCCACGGCC
GGCATGCTTT
ACGCAAACAA
TGGCGTATGA
TATTAGGGAT
CTTATAAAGC
TTTGTTTGGC
TTTATTTTTT
TTTCGTGTTT
CATTGGCTTG
CATCTTATTA
TGTAACCAAT
AGCTTCAGCG
TCGCATTAAA
TAGGGTTTAT
TGCTTGGTCG
CGCTTAA
ATCGTGATAA
ATAGACTTGA
TTTAGCACTT
TGGTTTGGGT
GGTGAATTCA
GGCGAATTGG
CTTTTTTACG
ACTATTGCTO
TTAGGGGTGC
TGCTGCTCTT
CGGCTAAGGC
GCGGGGTGGC
TCCGCAACCC
GGTTTGCGCA
TTCAAAAACG
CCTTAAAATC
TTTTTGCGCA
GCTCTTTGGC
CTATTATTTT
ATTACCCTAG
TAGCGCTTTA
GCTTGGGATT
TTCTCAAAAA
TGCCCCCATT
AAGCAAGCTC
CATCGCTTTG
CCTTAAGCTT
TGGCTTTAGT
GTTGTTGTTA
ACTTTTTTGG
CGATGTTTTA
TTTGGGGTTT
120 180 240 300 360 420 480 540 600 660 687
INFORMATION FOR SEQ ID NO:418:
SEQUENCE CHARACTERISTICS:
LENGTH: 474 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...474
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:418:
ATGAAAAAAT
ATCATTCTAG
TTGAATCTGA
TATGTTTTAG
TATAAAAGAT
TTTGTTTTTT
CAAAGTGCTG
GTGCTGGTGT
TTGGTTTAGG
GAGCGATAGT
CTCCCTTTGA
GCGCGATCGG
CGTTAGTGTA
ATTACACGCC
AATTTGCCCG
GTGCTTTGTT
GGTGTATTTG
CGCGCCCATT
GAGTGGGAAA
TTTTGTGGTG
TTTGATCCTT
TTATATTTTA
CTCGCACGCT
TTTTTGGCGT
CTTCTTTTAG
GTTTTCAAAG
CTCATGGCGC
TTACTTTATG
GGCGTGGCGA
AACGCTCAAA
CAAAGCGAAT
TTGTTTGGGA
GTATTTTGGG
CTTCAAGCAT
AAATCTTTGT
AAATCATTTC
TAGGGGCGTT
AGGTGGGTGA
GGCTGTTTAA
AAAATGCGCT
CGGCTCTTTG
TTTACCTGAA
GCGTTTCAAT
GTTTATCTAC
GTGTTTGCTC
AGTTGCGCTT
GGAATTGTTT
TTAG
120 180 240 300 360 420 474
INFORMATION FOR SEQ ID NO:419:
SEQUENCE CHARACTERISTICS:
LENGTH: 333 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...333
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:419:
TTGATCAATT TATTGCCCCA CCATTACCAT AAATTCCCCC CCAATATCAA CCCCTCTCTC
ATCTCTCTAA AAGATCGCTT TTTACCCCAT GAAAAGCACA GCCAAAAGGT CAAAAAGAA
TGCGTCAACT TGTTTGAAGT TTTATCGCCT TTGCATAAAA TAGATGAAAA ATACCTTTTC
CATTTAAAGA TTGCGGGGGA ATTGGCGAGC ATGGGTAAGA TTTTAAGTGT ATATTTAGCC
CACAAGCACA GCGCGTATTT CATTTTAAAC GCTTTGAGTT ACGGCTTTAG CCACCAGGAT
AGGGCGATCA TTTGCTTATT GGGCGCAATT TAG
INFORMATION FOR SEQ ID NO:420:
SEQUENCE CHARACTERISTICS:
LENGTH: 648 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...648
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:420:
ATGAAATCTT
TTGTATGCTT
AGTCCAAATG
GAAGCCAACG
TTAGACACAG
GAGATTAAAc
GAGCAAGAAA
AAAACCCCCA
GGCGTGAACG
AATAAAAGCG
GAAACAAAGG
TTTTAAAACT
TAGCGCATGC
TAGAAAAAAG
CAACCACAAC
CCACACAAAA
AAGAGATTAA
AAGAAAATAA
CAACCCCCTT
TGCGCGCTTT
TGAAGGTTTT
GATATGTGTT
TTTTGCACAG
TGTGCTTGGT
CGAGACAGAG
CGCCACAGAA
ACAAGAGATT
ACAAGAGATT
GCCTAAACAA
AATGGGAA~A
TCCTAGCACA
AGAAATCCA
TTTAAAACTT
CCTTTGTTGG,
TTTTACGTGA
CGCCAAAACA
CAAAACCCCA
AAACAAGAGA
AAACAAGAGA
AACAGTGTCT
AAACCTCTAG
AAAGGCAAAA
AACGATTGGG
TTAAAAAAGG
TGGTTTTAGC
AAAAAGACAG
GCACGTTTTC
CCAAAGACAC
TTAAACAAGA
TTAAACAAGA
CGCCCGTTCA
AGTATAAAGT
TCATTGGCTC
CTGAAATTGA
CTGAATGA
GTTCATGTTG
CGCTCCAATG
GCCTAAAGAA
AGTGCCACCT
GATTAAACAA
AACTAAACAA
AAACGATCAA
CGCAGTCAGT
GCTTATAAAA
ATTTTCTCAC
120 180 240 300 360 420 480 540 600 648
INFORMATION FOR SEQ ID NO:421:
SEQUENCE CHARACTERISTICS:
LENGTH: 240 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...240
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:421:
ATGTGGCCTG TGGCTCTCAA GCAGCCTAAT AGGGTGTCCC ATGCTTTTTA TTTTATTTGA TGTAGAAATC GTTTTCATGT AAAAAGTTAG GCTTGTTTGG GCTCGTTGAA ATGCTAGGCT OGTTTTATTT ACGCTTTAAA GCGAAACGCT TTGAGCTGGC ATCATTTTTA TATCATGGCC TCCCTTGGGC GATTGATTTT TTGTCTTCTT TTTGGCAATC AAAAATTAGA GGTGAAATAA
120 180 240
INFORMATION FOR SEQ ID NO:422:
SEQUENCE CHARACTERISTICS:
LENGTH: 1251 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1251
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:422:
ATGTGTGGGA
GAAGCTGTCG
GCTGGAAAAC
GCGCAAGCCC
ATCCCTACAG
CGTCGTGGCA
GTAGGACAAA
CAGATACAGC
GAATTGGGCA
AGCTTOCAAA
AATTACTACC
CGTAACCCCT
GGGATCGGTA
AGGTATTACG
GCTTCTGACG
AAAGCCACCA
GCATTAGCCG
GTCTATAACG
ATGAATTTAG
TTAGGGCTTA
AAATACCGAA
TGTTTAAGAA
CGCAAGCCAA
CATTCAACCC
AAGCGGAGAT
CCTTTGTAAA
CCAATCCGGG
CGATAACAAA
AAGCCGAAAA
ACACTTATAA
ATGCGGTGAG
TCAATCAAAA~
TTAGGAAAGT
TTCAGGTGGG
GCTTTTTTGA
TGTGGACTTA.
ATTTCTTAGG
GGACTTCATG
CTAAAATGAA
CCAGGCCTAA
AAATCCCCAC
GGCTCTATAG
TGAAATCAGC
AATCGTTAGT
CTACACAGAC
TTTAAACCAA
TGACTCTTTA
TCAGACGACT
TCTTAAAAAC
CATCGCTGAC
CAGCATCACC
TAAAAAGAAT
CTCTTACAAC
GGGGATTGTT
TTACAAACAA
TTACAACCAT
TGGCTTTGGA
CAAAAACAAC
GCTTAATTCT
CGTGGCGAAT
GAAAAAAGAC
CATCAACACG
CGTGTATTTG
GCTATCCAAG
GAAAACACGC
GCTAGTTTTG
GCCGAACAAG
GGGGTGTGTT
TCTAACACTT
AGCATCGCCC
ACTCTGGTGA
ACTGCGCTCT
AACCCCTATA
CAAATCCAAA
AGTTCTCAAA
TTCTTTGGCC
GCGTTCATTA
GCGGACGCTC
AAGCTTTCTG
GAGTATGTGA
TTCCAATTCT
AGCGATCATG
AACTACTACT
AATTACGTGT
GCATGATCGC
AAAATCAAAA
CTGAAAGCAT
TGGTGAAAAA
ATGAAGTGCA
GGGGGGCAGG
ATTTTGGCAC
ATTTCAAATC
CTAATATCCC
GCCCGCAAGG
CCATCAACCA
CCAATAATGG
AAAAAAGAAA
AATCCAGCTT
TTTATAATTT
TGGGGCTTTT
ATTTAGCCAC
TATTCAACAT
CGGCTCAGCA
CCTTTATGGG
TCGCTTACTA
TAACGCGCA).
CAGCCTAGAC
GCTCAAAAAC
CTTTGAAAAA
AGGAGGTGAG
CTGTGCGTAT
TCAAGAGCAG
TAGATACAGC
TAACGCGCAA
CATAGACACC
AGAACTCGGG
CGCGATGAAT
ATGGGGCGCT
CTTCAACTCG
CATCAACGAT
TGGGGGTATT
CATGAATAAC
GGGAGTGAGG
TGGGATTGAG
GGCTGAACTC
A
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1251
INFORMATION FOR SEQ ID NO:423:
SEQUENCE CHARACTERISTICS:
LENGTH: 345 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...345
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:423:
ATGGCAACCA TTCAACCATT CAACCATTCA ACCATTCAAC CAACCATTCA ATCATTCAAT CATTCAATCA TTCAATCATT CCTTATTTTT ATAACTATTT ATCTTTTTAC AAAAACCTAT ATTATTCCCC CTTTTATTAA CCCTTTTATT AACCCTTTTA TTTATTAACC CTTTTATTAA CCCTTTTATT AACCCTTTTA TTTATTAACC CTTTTATTAA CCCTTTTATT AACCCTTTCA CATTCAACCA TTCAACCATT CAACCATTCA AGCAACGCTA TTAAAAATCC CCTATTTTTT TTAACCCTTT TATTAACCCT TTAACCCTTT TATTAACCCT
TATAA
120 180 240 300 345
INFORMATION FOR SEQ ID NO:424:
SEQUENCE CHARACTERISTICS:
LENGTH: 381 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...381
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:424:
TTGGTTTCAG AATACTGGTT AAAATTCTTC ACGCGCTCTT TTTCTAAATC AACCATGCTC TTTAAAACCC TCTTGCGGTC TTTTTTCACA TTCCCTGTGG AGCTTTCAGA AAACATAACA TTGGGATCAA CTGTAGTGTT AATAGTAGCA GAAGCCGTTT CTGCTCTCAA CAAAAAAGTC GTAGCCGCTA AAAAAAATAA AATTCGCTTC ACACCAAACT CCTATATCCA TAACCGCAAC AAAAACAGAC GCTATTCTTC ATTAAGCCCT CTTTTAAAAT CTTCTTCTAT TTATAAAAAC CCACCTAGAA TTCAAGCTAT TTTGATTATC TTAAAATACA GACTTTCAAA GGGTATTTAT CATTTGGGTA TGATACTATA A
120 180 240 300 360 381
INFORMATION FOR SEQ ID NO:425:
SEQUENCE CHARACTERISTICS:
LENGTH: 861 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...861
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:425:
ATGAAGAAAT
GAGACCCCTA
GCTGCTTTTA
TCCCATGGGA
CCTACAGCGT
AAAGGCTTAA
CATTTTTTTA
AGCTCTTATT
TCTCAAAGTT
TTCAATCGTG
TGGGGTCCAA
CCATCAAATT
TTTGAAATTG
AATGTGCCTG
CGTTACATTG
CTGTTATAGT
AGCAAGAAAA
TAGGGATTGA
ATTGCAATGG
CAAACCCAAC
GCAACCAACA
AGAAAGCCCC
ATAAGTACTA
TCATGTTTGG
AGAACTTGCA
CAAACTATTA
TCCAGGTCTT
GCTTGAAAAT
AAGGGACTAC
TAAGCTTTTA
AGGTGCTATC
GGCTATTAAG
TTACCAGTTG
TAATCAAAGT
AGGAGGCCTT
ATACGCTATC
ACAATTTGGA
CACTTATAAT
CTATGGGGCT
TTTGGGGTTT
TTTTAAGGAC
AGTTAATGGC
CCAAACCATT
TTATAGATTC
TCTCTAGCAA
ACTAGCCCTA
GGTATGCTCA
GGGGCTTACG
ACTCATGGCG
AATGGTTTTG
ATGCGTTATT
GATTATGGCA
GGCACAGATG
TTCTTGGGCG
TTAGCTGAAG
GGGATCCGCT
CGCAATAATT
ACTTTCCACC
TGACAAGCTT
CCAAAAAAGG
GCACTACCGC
GCTCTAATAC
CTCTAGGGAC
GGTTTGTTGT
ACGGATTCTT
TGAGAGACGC
TGTTGTTTAA
TTGCGATTGG
AGTATAGAGG
TAGGCACTAA
ACTATACCGC
GCCCTTACGC
ATTGTCAGCA
TGAAAGAAAT
TCAAAATTGT
GCCTAACATG
TCGTGGGTAT
AGGGTATAAG
TGATTTTGCA
TCGCAAGGGT
CCCAGCTATT
TGGCACCTCT
GAGTTTCCAC
ACACCAAGGT
TAGCGCGGAT
CTTTTATTGG
120 180 240 300 360 420 480 540 600 660 720 780 840 861
INFORMATION FOR SEQ ID NO:426:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 858 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...858
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:426:
GTGAAGCGAA
TCTGCTACTA
AATGTGAAAA
GTGAAGAATT
GGAGCTTTCT
CATAGGAATG
CTTAAAGACT
GATTTTGGGG
ATCAGCATGG
AAAAAAATGG
GGGGAGCTTG
TCTTCTAACC
AAAGAGCCAG
AAGAAAGAAA
AGTAAGTCTG
TTTTATTTTT
TTAACACTAC
AAGACCGCAA
TTAACCAGTA
TTAAGGGGAG
GCAAGGTTTC
TAGGGACAGA
CTTTGCATGA
TTAGGGTGAG
AAGAACAAGC
AAAGCCATAC
CAGATCTAGA
AGACTTCTTC
AGCAACAACA
AAAAATAG
TTTAGCGGCT
AGTTGATCCC
GAGGGTTTTA
TTCTGAAACC
TTTGGAAGAT
TTTTGTGGTG
GCTTTCACTC
GCAGTTTGGG
CCAAAAAGAA
TGAGAAAGAC
TGATAGCCCT
CCCTATGACT
AAAAAAGGAA
GGCCTTACAA
ACGACTTTTT
AATGTTATGT
AAGAGCATGG
AAGATGAGTA
TGCGTGGAGC
AATGACAGAG
CCCTTGTTCA
GACATGTATG
AAGGCTAGAA
ACTAAGGCAG
GAATTTATAA
AACGCTAACA
AAAAAGCCCA
CAAGAGTTTG
TGTTGAGAGC
TTTCTGAAAG
TTGATTTAGA
AGGGCGATTT
AAAAGATTTG
AAAAGTTTTA
ACTGGCTTTA
ATGGGTATAT
AAGTGGATGC
CATTCCAAAA
GCTCTTCTAA.
CGCTCAAAGA
AGAAAAAACG
AAAAGCAAAT
AGAAACGGCT
CTCCACAGGG
AAAAGAGCGC
ATCCGCTTTT
TTACTATGAG
TAAGCATGTG
CAAAGGCTCA
CAAATACTTG
AATCGTTCTT
GAGGAGCAGT
GACACAGAAT
AACAGCTTCA
ACGCCTTTCA
TAGCGACTCT
120 180 240 300 360 420 480 540 600 660 720 780 840 858
INFORMATION FOR SEQ ID NO:427:
SEQUENCE CHARACTERISTICS:
LENGTH: 228 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...228
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:427:
ATGCCAAAAG AAAACACCAC CCACGAAGAC GCTTTAGCGA AATGCTTTGC GCCTGCAATT
TTTAAAGCCA TCATGCTGAA TAAATTGATA TTGGCGGTTG TGGCTGCTCC CATTATAAAG
ATGCTTAAAT TGAGTAAAGA GATTTGGTGG AAAAAAATGG GCAAAAAACA TGCGATAGAT
TTTAAAATGA GGATAAAGAT ATTGGCTTCT TTAGCCCCTA GCTTTTGA
120 180 228
INFORMATION FOR SEQ ID NO:428:
SEQUENCE CHARACTERISTICS:
LENGTH: 405 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...405
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:428:
ATGTGCCAAA TCCAATGCTT GCTTATTTTG CTTTCTATCA ATATAGTTAG CGCGATTATC
GTTTATTTTT TCCAAGCATT TCAAGGGGTT TTGAATTTTG AAGGGGGGTT TTTAGGGTTT
TTTATCGTGG CGTTGTCTTC GTATTACGGC GTTAAAAAGC GTTTGGATTT GAGGAAACAA
GATTCAAAAG AAAAAGAAGA AAAGCAAAAA TTCCAAAAAT TCGCCCTAGG TTTGGAAATG
TCTTTCAATG TGTGGCGTTT AGGGGGGTAT GGGGTTTTAT TAGGTATTTT AGGAGTGCTT
TTATTCTTGC ATCTTTTTAA CGGATTGCCG TTTCTTATCG GCGTGTTTGT GAGCTCGCTC
TCTAGCGCGT TATTACGATT CTTAAACAAT AATGGTAAGT TTTGA
120 180 240 300 360 405
INFORMATION FOR SEQ ID NO:429:
SEQUENCE CHARACTERISTICS:
LENGTH: 279 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...279
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:429:
ATGGCGTGTG AATTTTTGAA AAAGCCAAAG TATTACAAGT TTATAGAGGG GGCGAATTAT
TTGAGCTTAG GGCTTTCTAT GGTGGTGGCG ATCCTTATGG GCGTGGCTAT AGGCTATGGG
CTTAAAAAAC TCACTCACAT TTCGTGGCTT TTTTGGCTTG GGGTTATTTG GGGCGTGTTA
GCGAGCTTTC TCAATGTCTA TAAGGCTTAT AAAAACATGC AAAAAGACTA TGAAGAACTA
GCCAAAGACC CTAAATACAC GCAAAATAAA ACAAAATAA
120 180 240 279
INFORMATION FOR SEQ ID NO:430:
SEQUENCE CHARACTERISTICS:
LENGTH: 240 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...240
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:430:
ATGGAAGACT TTTTGTATAA CACCTTATAT TTCATAGAGG ATTATAAGTT GGTTGTTATT
TTTAGTTTCA TAGGGTTAAT AGCGTTATTT TTCCTCTACA AATTCATAAA AACTCAAAAA
AAGGTTTTTA AAGATAAAGC TAACCAGCCT CAAAAGAAAA AAAGCTTTAA AGAAATCATT
ATAGATGGGC TGAAAGAAAG GGTTAAAACC TTTGGCTTTA GGTTGCAAGC TATACTATTA
120 180 240
INFORMATION FOR SEQ ID NO:431:
SEQUENCE CHARACTERISTICS:
LENGTH: 1092 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1092
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:431:
ATGGCCATTG
CAGCCCACCA
TTGAAAAATC
CAACTCACGC
GCGATGAAAT
GACACCATGG
AGGGAAGTAA
AGTGGGGCGA
ATTGACGCTT
AAAACGATTC
ATTTAGCAGA
TCGCTAACGG
AAGACCCCAC
AAGTGGAAAT
CCAATAAAGA
AAAACCTGAA
GCGCCCTTAA
ATTTTGACGG
CTAAAGGAGT
CTTTAAAAGA
AGTTACAGGA
GTTGGATAAA
CGCTCCTATG
GCAAGAAGAA
AACTAACGAA
CAAAGGCATG
TTCTGTGAGC
CAACAACAAG
GCCAGCGATT
TTATAACGGG
GCTAAAGCCG CGCAAGAAAG AACGCTTTCA
TGAAACTCTT
GAAACGGATA
AAATCATCAC
AACAAAAAGA CCATGCAAGA TCTTTAAAAG ACTTTCAAGG GACGATAGCT
TAAAGGCTAA
ATGATAGGCA AAATCGCTGA CTTTCTTTTT
CGCTCTTTTT
CAAATCTTAA
ACGAAAACAA
CAAAAGGGGT
ATATCAATTT
GAAAAAAGAA
TTTAGAGCAA
CCAAACCGCG
AGTCGCCAGT
CGCTTTAAAA
CAACGCTTTA
AACCGATGTG
TGATGAAAAA
TGAATTGGTC
TGAATGGGAC
120 180 240 300 360 420 480 540 600
GGCACAAACC.
AATTTAGACT
AGCGTGATTT
GACAGCGCGA
CAAAAACCTC
CCGCAAGAC
AAAATCTCTG
CTTGAACAGA
GAGACAGCAT
AAAAGGGCGA
CTCAAAGCAJA
TTGATAAAGG
TAGAGTTTTA
AAAAACTCTC
CCCTTGATCA
AACAGAAACC
AAATCTCTGA
GA
AAAAGTCCCT
GCAGTATTTG
CAAACCCATG
CAAACCGGAT
TGAACAAA_
AAAACTCTCG
GCTAGATCAA
ACAGAAACCG
AAAGGCAATT
CAAACGCGCA
TTAAGAATGG
CAAAAAC CC
GCCCTTGAAC
GATCAAAAAC
AAACTCTCGG
CTAGATCAAjA
ACAAAATCAA
TTGGTAGGGG
GCGXAATGAT
TTGATCAAA
AGAAAATCTC
CACAGAAACC
ATCAAAAJACC
AACCCCAAAC
GGCTGAATAC
CGAAGTGGAA
TTTACCCATA
ACTCTCGGAT
TGAACAA
GCTTGAACAG
ACAGAAAcCG
CCCCCCTAAA
660 720 780 840 900 960 1020 1080 1092
INFORMATION FOR SEQ ID NO:432:
SEQUENCE CHARACTERISTICS:
LENGTH: 654 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...654
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:432:
TTGAAACATT
TTAAGCATGT
AAGCCCGCTA
AATTGCGATA
CAATTCGGAT
TGCGCAGGGA
ATCCCTAGCG
GGGGAATTGC
TATTGGAAMA
AGTCGTTGGG
CAAGATAGAA~
TAGCCCCACT
TCTTAAGTTT
AAGGGGTTAA
ACCTTAAAGA
CTAAAGAAAA
CTTATAAAAT
TTTTAAAAAG
TCATTAAAGA
CGCGCTACCA
AAAAAAACGA
CATTCATATC
GAATTTGAAC
AAACAAGCCC
CTTTAACGCT
TTTAGGCTAT
CAATTTTTCG
CTATGGGCAT
CGATGCGTTT
TGACAACCTA
GAAAGCTAAC
CCTTTTAAAG
GCTGAAGAAA
AAATCGCCTG
AAGCAAAAAG
GAAATGGCGG
AATCCGAGTG
AATAATAGCC
GCTTCTGAAG
AAAGACATGA
CCTTATGGCT
ACCCTACTAA
TTACTAATGT
AAGTTTTAAA
GCATTGCGTG
CGGGCATTTA
CCTTTTTGCG
TGGCCCTAAA
TCAAGTCTTA
AGGCACAGCC
AACAGAGCCT
CATGATGACC
AGCCGCTTAT
GAAAGAATCA
CCATGCGTAT
TAACGTGATG
AGAATTGCTA
CAATAAGGGC
TGAAGAGATA
CTAG
120 180 240 300 360 420 480 540 600 654 TCAGGCGTTT GAAAGAATCT AAAATCTTTG
AACTCGCAGT
INFORMATION FOR SEQ ID NO:433:
SEQUENCE CHARACTERISTICS:
LENGTH: 1101 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1101
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:433:
ATGCAATTTC
GAAGAAAATG
CACAATAI.CC
ATCTATAAAC
ATCAACAACG
TACTACCTGC
GCGTCTAACC
TTAGAATTGG
ATGCTTTCTT
CCCAGCTCTT
A.AGCATTTCT
GGTTACACTA
CACCTCTATG
TCGAGCGTGG
TTAGGCATGT
AAAATGCACA
AGGCACAATG
ACGCATGGCA
AGTTATGTTT
AAAAAACCTT
GGGCGTATGC
CTTTTTTGAA.
TCAATCAAGT
CTTTAAAAAA
AATCTACCCT
CTAAATTAGC
TAGAAAACTT
CTTTGTCTTC
ATTCTAAAAA
TTACCAAGAA
ATTTTGGTTT
GGCTTGGCAT
GGTTTTATGT
GGGTGAGTCA
CGAGTTTTTT
GCTTTGAAAT
AAGGGTTAAA
ATAGTTTTTA
ACTTTCTTTA
GAGCGTGGGT
TCAAGAACGC
CAAkAATGAA
CAATGCTAAA
TCAAAACATT
CCAAGCGTTA
AAAAAATTTA
TCAGATCGCT
CGTTTCAAGC
AAAAAATCAA
TGTGGGTAAT
AGATTATCTT
AGGCTTTGCT
AATGGATTTC
CCAAATCCCT
GGGCTTAAAG
CGCTTCCCTC
TCTTTATTAT
TTTGAATATT
ATCCAAACGA
ATCACAAACA
TTAACCCCTA
GAAAAAATAG
GAAAAAATGC
GAATTACAAT
CAAATTTCAA
ATGTATGGGG
GGGTTTCGTT
GGCTTTGATG
TTCAATTTCA
TTAGCGGGGA
ATCAACAACT
TTGAATTTTG
ATCCCTTTAG
TTTTTCAAAC
TTTTATCTTA
CCATCAGTCA
TTTCTAACGC
TGCCCAACAC
CTGAAAAGCA
TCATGCTTAG
AAGAACCCAT
TCAGCCAATC
ATTCTTTGAA.
TAGGTTTGAG
ATTACTTATT
GTTTAGGCAA
TTGATAATGC
GTTCGTGGGT
ATTTGACGGA
GGGTTCGTGT
CGGTCAATTC
GCCTTGTCAT
TTGTATCGCT
TGCCGTTGA
TCAAAACAAAp
CTTTAACTAC
AGCCGAAACC
CGGGGGCGTT
TACTAACCCT
TCAAAACAGC
CGCGCTTGAT
CGTAGGGTAT
CTATGACTAT
AATGAATAAC
GCAAAAACAT
AGGGAGTGGT
TTATCGGGCT
GAATGTGGAT
CTTTTATGAA
GTTTAACGTG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1101
INFORMATION FOR SEQ ID NO:434:
SEQUENCE CHARACTERISTICS:
LENGTH: 1101 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1101
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:434:
ATGCAATTTC
GAAGAAAATG
CACAATAACC
AAAAAACCTT ACTTTCTTTA TCTTTATTAT TTTTATCTTA TTGTATCGCT GGGCGTATGC GAGCGTGGGT TTTGAATATT CCATCAGTCA TGCCGTTGAA CTTTTTTGAA TCAAGAACGC ATCCAAACGA TTTCTAACGC TCAAAACAAA
120 180
ATCTATAC
ATCAACAACG
TACTACCTGC
GCGTCTAAICC
TTAGAATTGG
ATGCTTTCTT
CCCAGCTCTT
AAGCATTTCT
GGTTACACTA
CACCTCTATG
TCGAGCGTGG
TTAGGCATGT
AAAATGCACA
AGGCACAATG
ACGCATGGCA
AGTTATGTTT
TCAATCAAGT
CTTTAAAAA
AATCTACCCT
CTAAATTAGC
TAGAAAACTT
CTTTGTCTTC
ATTCTAkAAAA
TTACCAAGAA
ATTTTGGTTT
GGCTTGGCAT
GGTTTTATGT
GGGTGAGTCA
CGAGTTTTTT
GCTTTGAAAT
AAGGGTTAAA
ATAGTTTTTA
CAAAAATGA
CAATGCTAA
TCAAAACATT
CCAAGCGTTA
AAAAAATTTA
TCAGATCGCT
CGTTTCAAGC
AAAAAATCAA
TGTGGGTAAT
AGATTATCTT
AGGCTTTGCT
AATGGATTTC
CCAAATCCCT
GGGCTTAAAG
CGCTTCCCTC
ATCACAAACA
TTAACCCCTA
GAAAAAATAG
GAAAAAATGC
GAATTACAAT
CAAATTTCAA
ATGTATOGGGO
GGGTTTCGTT
GGCTTTGATG
TTCAATTTCA
TTAGCGGGGA
ATCAACAACT
TTGAATTTTG
ATCCCTTTAG
TTTTTCAAAC
TGCCCAACAC
CTGAAAAGCA
TCATGCTTAG
AAGAACCCAT
TCAGCCAATC
ATTCTTTGAA
TAGGTTTGAG
ATTACTTATT
GTTTAGGCAA
TTGATAATGC
GTTCGTGGGT
ATTTGACGGA
GGGTTCGTGT
CGGTCAATTC
GCCTTGTCAT
CTTTAACTAC
AGCCGAAACC
CGGGGGCGTT
TACTAACCCT
TCAAAACAGC
CGCGCTTGAT
CGTAGGGTAT
CTATGACTAT
AATGAATAAC
GCAAAACAT
AGGGAGTGGT
TTATCGGGCT
GAATGTGGAT
CTTTTATGAA
GTTTAACGTG
240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1101
INFORMATION FOR SEQ ID NO:435:
SEQUENCE CHARACTERISTICS:
LENGTH: 525 base pairs
TYPE: nucleic acid
(C) STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
(A) ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...525
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:435:
TTGAATACAA
GATCCTTTGC
GTTCAGCTGA
GCAAAAACCC
AGCGATTTTG
CAAAAACTCT
CTTTCTTCTT
AGGATTGGCT
GCGCCGATTT
TGAATAGCGT
ACAAGGCTCA
TTGTCTTGCC
GTTTTAAGGA
AAATCAAGCA
ACCGCCCTAA
GGACTAACGC
ATGAAGAAAT
CTTCTAGCGC
GTTAGAATGT
TTTAGCCATT
CGCTTATCAA
ATTAGAAAGA
AGAAAGGGCT
AACGCTTTAT
TAAAGAGCTT
TCAATTTAAG
GATTAGGGCT
AAAGAATTAG
ATTGAGCAAA
AACCCTTTCA
GCTTTAAAGG
GTGCCTACGA
TTAGTCATAG
TTAAAAAGGG
GGGCGTTATC
CGCTCTATGG
CTTTAGAATT
AAAAGCCATG
GAATGCCTAG
TAGAAAGCGT
GAGCGGATTG
TGGAATTAGT
ACCCTTTAAA
AGGGAGTTTT
ATTGCCATTC
TTTTTTGGAC
GGTGCTGTTG
TCTTCATTTC
TTTGAGGCAT
GGTTTTTGAA
AGGCATTOAC
120 180 240 300 360 420 480 525
AGTTTAGGGG TTTAA
INFORMATION FOR SEQ ID NO:436:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1131 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1131
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:436:
ATGAGATTTT
ACTTTTGATT
CTTTACAA.jc
CAACTTAAAG
GCTAGCCAGA
GAAAAATTAC
AAAGAATCCC
TTAGAAGAGG
AAAGAGCCTA
TTATGCGATC
GAATACAACG
ATTAAAAACA
CCCCTAGAAA
AAATCATGCG
GAATTAAAAA
CACACTTTAJA
TTGAATGAGA
AAAAAGCCCT
CGCTTAGATC
TTTGCTTCTT
CCCAAACTAA
TCAATGAAAG
AAATCAAAAA
TCCGCCTTAT
AAGCTTTAGA
AAGCGCTTTT
CATCAAACGC
AACTCTTTCA
AGGTTTTAGA
CCCTTATGAA
AGCTTCAAAG
AACGCCTAGA
CTAACCAATT
ACGCTAAAAA
AAACCTTGAA~
CCATGGCGTT
TCAATGTGAG
CGCATGGATT
TTTATTTTTT
CGCCAAACTT
TTTAAGAATC
GGCTAACAGC
GGACACTGAT
AAAACACTTA
TTTGCAAGAG
TTTAGAAGTC
TTTGCTCTCA
AAATGAAATA
CCATGATTTT
CCAAATCCAA
AACTTTAAAA
ACAGCAACGC
CAACAAAGAxA
TATAGAATTT
AAACGCCCAA
CGATGGTTTG
CCCTAGCTTT
CTAACCTTTT
TCGCGCTCCA
TATCAAAJACG
ACCCTAACTA
GCACTATTGA
AAAGAGAGCA
CATTGCCCTT
CAAGAAAAAA
CGATTGGATT
CAGGATCAAC
CAAGCCTATA
GCTAAAGAAG
TCGCGCTTTT
TACCAAAACG
AAGCATGCTC
TTAAGCGAAT
GTTTTAGCCC
AGCGGTGGGA
AAAAATTTTA
CAAACGCGCA
ACGAACAGCT
TGCTTTCTA
GCCAAAGGCG
AACAAAGCGC
TGGAGCATGA
ATTTGAGCGG
ATAACGCCCT
TGATGAGCGC
AATCCCATAA
AAGCCATGCG
ACGCCCTAAA
TATGCGACAA
CCCTAATAGA
TAATCTTAGC
TGAGTGAGCA
rTTTAGCCCA
PAGCCCTTAT
kGCAAGAGTG
GATAATGATG
TTCTGA.CATG
CAACCAGGAT
TTTTTTTAAC
TTTGGAATTA.
ACGCTTAATC
CGTTAAGAAT
TTTCTTACTC
TTTAAACGCC
CAAAACCCTA
TTTGAAAAAA
AACCTTTTTA
AGAAAATCTC
GCGAGATAAG
CAATTACAA
AATGGCGTTT
ACAACAGACT
TAAAAATATC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1131
INFORMATION FOR SEQ ID NO:437:
SEQUENCE CHARACTERISTICS:
LENGTH: 690 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...690
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:437:
GTGAAAGGCG
CAAAGCGTTA
AAAACCGACT
TTCGGGGAGA
GTGTTTGGAG
TOO GCGAC CA
ATTGACACTT
GCTCAAATCG
CCTTATAAGC
GGGATTCGCA
ATCAATGTTT
CTTTATGTAG
AGAAAAACGC TTGGTATCTG GGGATTAGCT
ATCAAGTCGG
AAAACCCCCC
ATCTGGCCGT
AAAGATGGTT
CGAACGCTTT
AAGTAGGGAC
TATACAATGT
CGGGTAACTC
ACACCTCCTA
CCCATATTGG
ATTATTTTAA
GGTATCGTTA
CAAAAGCAGT
TATGCAAGGC
TGGTGCGCGC
GACATCAGAT
TATGGGCAAT
CATTAATAAG
TTGGGGTAAT
TAGCCTTGAT
TCAGCATCAA
CCATGGGAAC
CAATTTCTGA
GAATTTAACT
TTAGGGCTTA
TATTACGGCT
AATGGTGGGG
CTGTCTGACA
GAAGACGCGA
ACGACAGGGG
CCGGCGATTT
GAATT'PGACT
TTGAGCTTCA
ATCCTAAGTT
CTGTGGGTTA
TTATGGATTA
TGTGCAAACT
TGTTCACTTA
GTTTTGGTTT
CCTTTTTGGA
TCCAGTTCCT
TTGGCGTGAA
CTTACCGCCG
TCAGGCTTC'
CCCTGTGGGT
TAAGCAGTTT
TGGGCATGCC
CAATGAGCCA
TGGTGTGGGT
CTTTTTTGGG
AACTAAAA\CC
TTTTAATTTA
GATTCCTACT
TCAATACAGC
120 180 240 300 360 420 480 540 600 660 690
INFORMATION FOR SEQ ID NO:438:
SEQUENCE CHARACTERISTICS:
LENGTH: 1440 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1440
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:438:
TTGAAAAACC
TGTAACGCTG
CAGGAATTGA
GGGGCTGTGA
AATAAAACAA
CAACCCAACC
GATCTATCCC
AAAGAACAAA
TTAGGGCAAG
AACATCATGA
AGTGTGGGCG
GCTAATTCTA
GAAACTTTGG
TTTAGTGGGA
GCGGTATTAG
AACCTCTTTT
GGTTATAAAC
GATTATGGCT
TATGGGGTAG
ACTCCTTTA
AAGAAGATGG
AAAATCCAGG
GGTTGCAAAC
AAACTCTTTT
AAGCTTTAGT
AACAATACGC
TCGGTATCAC
TGGATTTGAG
AAGTCATGCT
ATATCGCCAC
CGGTGAGCGA
GOTTAGOGAG
ATCAGCTTAT
GTCAGTTTGA
CGCTCAAAGA
AATTTTTCAC
ATGCGAATTT
GAACGGATTT
AAAAACGATC
GGCGTTTTTT
CTTCACTCAA
TTCTGCCATT
GAGTGAAAGC
CAATTTAGAG
TTCAGAAGGT
GGATAGCATG
CAAAATCCAA
TTTAGGCACA
AGGCATGCAA.
GCTCAACGCT
TTTTATTGAA
CTATAAGAAA
ATCTTCGGCT
TTACCAGTCA
CCATAAGAAA
TGGCGATACG
TTTATACAAT
GCTCTTTCCT
GTCATTGATT
GCGCAAGAAT
CCCTTATCCT
CTAAAAAACA
CAATCTCTAG
GTCATTAAGC
CTCTTGGTGG
CAAAACAATG
GGCGGGACTA
AATTTTCCTT
TTGATTAAGA
AAAAATATCT
GGGCTAGACA
AGTTCTCTTT
GCGAGCATGA
AATATCGGCT
AATTTAAAAG
GTGTATGAAC
TGCTGGCGAG
ACCAAACGAG
TAAAGCAACT
ACTACTTGGA
ATCCGCAACA
GGATTTTAGG
CTTTGGTGGT
CTCAAAACAT
GTAACCAACA
ATGGAGCGTA
CGCAAACGGG
GCGGGATTTC
GCAGCGGTGC
GAACCATAkL.
ATAAGATTTC
ACGGCTTTGG
TAAGGTATTA
TGGGAGCGAA
GCTCTAGAAG
CATGTCTTTG
TTTGGCCAGA
CATTAGAGAT
TATTTTAGGG
ACCAAACGGG
AAAACTACTA
GGATGTAGGG
CGTTTTAGCT
GCTATACGAA
TAATGGCGTG
CTTGATTGGG
TTTAGATCGT
ATCGTCTTGT
CATCATTAAT
TTATATCCCT
GGCTAAAATG
TGGGTTTTTG
TCTTOTTACT
GAGAGAAAGG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 WO 97/37044 PCTIUS97/05223 ACTACAATCG GTCTTTTCTT TGGCGCTCAA ATTGCAGGGC AAACTTGGAG
CACTAATGTA
ACGAACTTAT TGAGCGGGCA AAGGCCTGAT GTCAAGTCCA GTTCGTTCCA
ATTCTTATTT
GATTTGGGCG TGCGCACCAA CTTTGCAA ACCAATTTCA ATAAGCACAG
ATTAGACCAA
GGGATAGAAT TTGGGGTGAA AATCCCTGTT ATCGCTCATA AATATTTTGC
AACCCAAGGC
TCAAGCGCGA GCTATATGAG GAATTTTAGC TTCTATGTGG GCTATTCAGT
CGGTTTTTAA
1200 1260 1320 1380 1440
INFORMATION FOR SEQ ID NO:439:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 540 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...540
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:439:
ATGGCGTCTC
GCGACCTTAA
CTTTTGGCTC
GGCGCGTTTT
ATAGAAACGC
GTGGGAGTCA
CTTAAAAACC
TATGAAAAA
ATTGACGACT
TTGCCTTTGT
TTAGCGTTTT
TGCTTTTAAG
TAAGCATGCC
GCATCTTGCA
AAAACATTTC
CTCACAATTT
CCTTTAAAGA
ACCCTTATTC
TCAAGCTTTT
AATAGCGAGC
AAACCGCTGG
TTTTGTTTTG
CGCTAATCCC
CAAATTCAGT
TGTGGAAGAG
AAAGATTTTG
AAAAACGGCC
TTGGAGTCTT
GTTTTAATCC
GCTAGTTATA
AACGTTTTAC
TTAAGTTATA
CTAAACAAAT
CGAGCTTTTA
CCTGAAGAAT
CCTTATCAAG
TTAAGGGATT
TTTTTTGCGC
TAACAACAGC
TCACTCAAGC
GCAACGCCTT
GCGTTTTACG
AATGGTTTGT
CCAAGGTCTT
TTTCTTTGTT
TTTAAGCCA
GATTTTGCTC
AGCTTTTTTG
GATTTACCCC
TTCTTTGCAA
CCTAGAAGTC
CAAAAAAAGC
TTCATTCTTT
TTGTTTGTAG
120 180 240 300 360 420 480 540
INFORMATION FOR SEQ ID NO:440:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 390 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...390
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:440:
TTGGGGGAGT
OTTTTTTATA
TTTTTACTCT
CACCCAGCCC
GAAGACACGA
TTAGGGCGTT
CGTATTTTAG
TGCTGCAAGC
AAAACGAATA
ACACGAACGG
CATTCCCAAG
TTTTTGATCC
TTAGCGTGGG
AGAGTTTGTC
TAGCGCACCT
CAAACGAAAA
GCTTTGGAAT
GGAATTACCC
CTTTAGCGGC
TTTAGAGATT
GTTAGTGTGA
TATGCGATCG
AAACAAACTT
TTTAGCGGTG
AGGCGTTGCA
TCTGGCACGA
GAAAAAGAAT
CTCCTGTGGA
CTACGATGAG
AATCCAAAAA
TCCAATTGTT
CTATTTTAGA
ATTGCGAGTT
GTTGATCGTT
TAGAGAGGAA
GCGCCTAAAG
TTCTTTTTTG
GGCCAACGCT
GTCTAAAAAG
120 180 240 300 360 390
INFORMATION FOR SEQ ID NO:441:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 984 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...984
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:441:
GTGCTTAAGG
TTTAATGTGG
AAAGAGAAAT
ATTCAAGCCT
GCGATGAGTA
GAAAAITCAAG
AAGGATTTTC
CCCTTTAGCC
CTGCTTTTAG
GCCAAATCCG
TCTATGGG.
GACAGCCTTG
GAGGACGCTT
CAAGAGGGAT
GTTTTAGACA
ATAGAAGTGG
GAGAATGAAA
GGTTAAAA AGCGTTTAAG
GAGAGGTTTT
ATCACAATCT
TTTTTAAAAC
TAAAAATCGC
AAGCTAAAGA
ATTATCAGGC
TAAAGGATTT
TTATTTATGA
AGCGTTCAAG
TGTTTTTAGA
TGGATAATGA
AAGAAGCGAT
ATAGTTTGAT
TGCGTGGCGT
GCTGTCAAAT
ATAGGCTCGA
AGCATGCGTT
TTTATCCGCT
TTTTGAGACT
CAGAACTTAT
GGTTTTATGC
TTGTGAAGTC
CAAGCGTTTT
TTTTGTGCGC
GTTTTATTTT
AGAGCAACCT
AGCTGTGGAT
AGGCCTAGAC
TGAAGGCATG
CTATCATTCT
CCACCACAAG
TTTTTCTTTA
TTAG
CAAGTTTTAA
AAAGTGGAAA
AGCCAAAAAT
GAAAAGCAGG
AATCAAAAGT
AAAATCCAGG
GATAATTTAG
TTGATTGCGG
GAAGAGTTTA
TTGTTTTTGA
AGCAGCAAGG
ACGAATATCC
AGAGAGATAG
GCGTTAATGC
GTGGAGCGGT
GCTCTCAAGT
GGGTTAAAAA
CTAAAAATGG
ACCCCTACAC
CGTTTGAACA
ATTGCGTTTA
ATGTGGATTT
AAAACAAGCC
ATAAAAAAGA
TAGAGAGCA
GTGAAATCCA
ATAATAGCGA
CCTTGATTGC
ACTTTGTAGA
ATTTGCAAGA
TGAATGTTTT
GTATATCTCT
CCACCGCATC
TGAAGTCCCC
CTATTTTAGC
AATCAAACAA
TGTGGAATCT
TTTGTTTTCG
GTTGTTGTAT
GATTTTTTTA
AGAAGAAGAT
AGAAGATATT
AAAAATAACA
AGATGTTTTG
AAAAGTGGTT
AACTTTGATG
AGCGCGCATG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 984
INFORMATION FOR SEQ ID NO:442:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1296 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1296
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:442:
ATGAATATTC
TGCCTTAAGG
CATAGAGGGG
AAAGTGGTGT
GCTATGCATT
AATAAGATTT
TTACACAAGA
TATTTGACTC
GTGCATGATG
AAAGAGGGTT
TGCCAAAAAG
TTCGCTCGTT
TTAATCTTGT
AGGCGTTTGT
GCTTCTTTA
CATTATGA
ATTCAAAAAT
AAAGAAGACG
GACATGGATG
TCAGTGCTTT
GCTAAAAAAA
GAACACAATA
AAACAAAGAA
CTGAAACCCT
ATTTTGCCAC
ATGCTAAAGA
TAGCCATGCT
TAGTGGATGG
TTCGTAAGGA
AAAAGCGTTT
AAGACAGCCT
TGGATCTGTT
CGCTCAACAC
TGTATGAAAA
TAAAAGAGTT
TGCTAGACTT
TCTATCAAGA
GCTTGAGCGC
TAGAGCAAGC
CGCAAGACGC
TTAAAAGGGG
ATTTGGATTC
TCTTTTCTAG
AAATCATTCA
AAGATTTTTA
TTCAGAAGAC
CGCTCAAAAA
GGCGGCCATT
TTATCAAA
CTATGCACAA
AGAAAACC
GGATAAGGCT
AGAAAAACTC
GCAATCTCAT
CTTCACGCAA
AAACCCTATT
TGATAAGGCC
ATACACCGCG
AAGAAAAGAC
GAATAAGAAA
CACCAAAGAG
TTTCTTTTAT
CATGGATCTT
TTTAGCATGG
CATCGCTAAA
AGAATGCAAG
GCAAATTTAT
CATCAAATCC
GGCTATATGA
TCAGCGGCGA
ATCACCAATA
ATGGGGCAGA
ATAGCCACAG
TTCCCGTTGT
ATTACGATCT
ATAGACAGGT
TTTAACGAGC
GTTCAAAACG
CAGCAAATCG
CAAAAAAAAT
CCTAAATTCT
AAGCTCACCA
CGCCAAGCAT
AATTTTTTAG
GTGAGGAAAG
GGTTATTACA
GAGCTTATCC
AAATAG
TGCTTTTTAG
TGTTGAGTTC
ATCTCTATA
GTTTAGGGGA
ATCGTAACGA
TTGATAAGGC
ACAATGTGTT
TGAATAAGTT
ATTTTTTACA
ATGGTTGCTC
TTGATTTGGC
CCCAATTTTA
CAGAATTATT
TTGATCAAGC
TAGGCTTAGA
AAGAAGAGAT
GGCTTGCT.
GGTATTCCTT
CCTTAGCGTT
AATTAGGGAIA
AAAACGAACC
CCTGTTTTCT
AGACGCTTTC
GCAAACCAAT
CATTAAAACC
TGTTTCTATC
GATTGAACTA
AGGGACTTTG
TTATAACCA
AAACCGCAAA
AGAGCAATTG
TAAAACGACT
CATAGGGGTA
CCCCTTTGAC
TTCCAAACAG
GGCGATTTAT
GTTGCCCATC
AACTAAAGAT
AATAGATTAT
AGATTCTAAC
TTGTTTGGAG
CGAATTGMAJA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1296
INFORMATION FOR SEQ ID NO:443:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1245 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1245
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:443:
TTGATTCMA
GGGAAAATTA
GAAAAAGAA
TTTGAATATG
GCAAAAGCTG
GCTGAAGTTA
ATCACCAAAC
TTAAGCTACG
CCCATTTATG
AAAGGACAAG
GAACATTTTA
GCTAAAGATT
TTTTTAGAAC
TATCAAAATA
TCGCAAGATA
GAAAAAITTAG
TTATTATTAT
ACAACAGCGC
TTGTGCCAA
AGAACAAGAG
TCTTGCATAA
GCATCCAAAC
AAGACGCTCA
TGTTAAAACA
AAATCATTGC
AAGCCACAAG
CTAAAAATAA
CTAAAGCATT
CTCAAGCCCA
ATTTTTTAAA
TATCCACGCC
CCTTAAAAAG
GACTTATTTT
ATTGTTTGAG
GCTTAAAGGG
GCTCCACCCC
AAGCGGTTAT
AAGAAGAAAA
CTTATCAATC
AGTTTTTATC
AATCTATATA
TGCTGACTAT
AAGTGGTTGT
CAATGAAATT
ATCTAACGCT
AGCGTTCGCT
AAAAGATTTA
ACTTATTTTA
AAATTATAAT
TTCAAACCCG
CAGAGGGTGG
TAAGAATAAT
CGTOTTAGAT
GCTTTTTAAA
TAAAGATTCT
AGAAGTTAAT
AATTAAAAAT
TTCAGACATC
AAAAACAAAA
TTTTTAATCT
ATGGAAGCTT
GACCCCCCTT
GAAAAATGGA
ATTTTTATTT
TTTGGAACGC
AAACACATCA
CCTGGTTTTA
ATGCGAACGA
AAAGAACAAA
TTAGTGGATG
CGCAGTGTAG
AGCAGCGATG
CGCCCTTATG
TTTTATAGCC
ACGCCAAAAC
ATTATTTTAG
AAGGATTATT
AACCCGCAAG
ATGCTGTTGC
TCTATTTTAT
CAAATTCGGT
TGGAGTGTTT
ATAACACCAA
TTGAAGAACA
CTATGGACGA
GCAATTTTTT
ATATTACTCA
AAATCTTACG
TTAAGAATGT
TCAMI.GAGTT
AAAAAGGTGA
CGATACAAGA
AAAAGCTTAA
AAAAATATTA
GACAAGGCAC
CTGTAGCATT
ATTTTTTTGC
ATTTGAATTG
CTGTTAGCAT
GTTTAGAAAA
TTTAA
CGTTTTTCTT
GAAAAGAATA
GAGTTCTAAC
TTTGATTTTA
TAATAAAATG
AGGCACTTTT
TGAATATGTT
AACGCTTTTG
TTTTAAACA
ATCCCAAAA
GATCTATTTC
AATCAATCTT
GGAGTTATAT
CCTAAAAGA
AAAAGATTTA
GATTAAATAT
AGGCAGCGGG
GTCTTTTTAT
TTTAAAAAAC
GATCATCAAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1245
AAGGGGTATA
AAAACACGAT
AGAAGTGAAT
ACGAAATTTT
INFORMATION FOR SEQ ID NO:444:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 501 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...501
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:444:
TTGGAGATTG
GTGATACCCG
GGCAGTAGCG
ATAAAACAGA ATGCTCTACT CTTTTAGCAJA GCATTCAAAA
ACAACAGCTT
TTGTGGGGAA TTTTAGCGCA GGCAAAAGCA CGCTATTAAA
CCGCTTTTTA
TTTTGCCTAC CGGTATCACG CCAGAGACTT CTTTAGCCAC
TGAATTGCAC
120 180
TATAGCGCTA
GAACTGAATG
AAGGTTTATT
CCCGGGTTTG
GGCGTGCATT
AGGGAGTTAA
ATGAACGCAT
AGCAAAGTTT
TGAATAATGA
ATAGCCCCAT
TTGTCATTCT
AAACCTTTTA
AGAGGCTTTT
TGAGGCGATT
AGCCTTAAAA
TTCAAGCCAC
CACAAGCGTA
TCAAGCAATG
AAAGAGAATG
GATAGTGCGC
ACCCATGCCA
GA2sAGAGGGTA ACGAAAAAAc
CGGCGAAGTA
CTTTAGTGTT
TTTTGGAATA
ACCTCACTA
AGAGAGTTTT
TTCCTACCTT
TGTGGATATG
TTTGGAAAGG
ACGCATGCTT
240 300 360 420 480 501
INFORMATION FOR SEQ ID NO:445:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 756 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...756
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:445:
ATGAAACTCC
AAGCCCAGTC
AAAACCTTAT
AAATTCAAAA
AATTTAGCCC
TATTTTAATA
TTATTTGTAG
CATTTCACCC
CTTAAAGCCT
GAGAGTCAAA
TTAAAAAATA
AATCTAAAAA
CATATCCGCT
CGGTCGTTGA
TTTTTTTGCG
TCAATGATGA
CATCTTGGGA
CTAATTATAA
ACCCTATTTT
AGAGTAATGG
TAAGCGTCAA
TACAAAATAT
ACGCCGCTCA
ATGAAATCTT
CCCTAGCCCC
TGTGGGACAA
GAGCTTTTTT
CTTAGGGGGG
AATCCTAACA
TTATTACAAT
GGATTTTGAT
ATTGAGCGTT
CTCTATTTTT
ACTCTCTTTT
CTTAAATAAC
TTCTATCGCA
TTTAATGCCC
CCTAGCCATA
TAAGAAAGGG
TCCTTACAGG
TGTAACCTTT
GGTTGCGATA
GAGCCTAAGC
TTCATTCTTA
TTAGAGCATT
TTTGAATTTA
TCTTTGGAGC
GCTAAAAGCG
GAAATTCAA.
CTAGGCACAA
GAGCATGGTT
TTTTAA
GTGAAGGAAA
CATGCAAGGG
GTTTGTATGC
CCTTGATTGA
CAGGCGGGGA
TTTACCATAA
GCCCTATTTT
AAGAAAGCAA
TGCATTTTAA
GCCTTTTGAA
CTAACAACGA
TTAGGCTGAG
AAGGATAGGC
CTTTAATTGT
GGTGCATCCT
ACGATTAGTT
GCCAAGCTTG
AAAAATCCCT
AAAAGAATTG
GCGGATCAAC
ATTCGTATTG
ACAGCTCTCC
GCTAGACAAA
CGATAGGCTT
120 180 240 300 360 420 480 540 600 660 720 756
INFORMATION FOR SEQ ID NO:446:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1146 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1146
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:446:
ATGAATTTAA
TTCAATTCTA
ACGGGCGAAC
CAAATTTTTT
GATAAAAACG
GAAAAAAATA
CACAATAAAA
ATGACCCCCC
ATGCCTAATT
TTAAAAATCT
TCAAGCTATC
ATCATTAGCG
GTCCATTCAA
GGCGATGTGA
GAGAGCGAAA
GCTCTTTATG
AGTGGCCATG
ACTTCTCAAT
AGTTATGCGA
TTTTGA
ACTTTATGCC
GCGCTAGGGA
ATGCCOTTAT
CTCAAATTTT
CGCTGACGAC
CGAGCAACTT
TCAAATTAGG
TAAACGCTCA
ATTTTGGCTC
TGCAAAATGA
AAAGTTATTT
CTTTTAGCGT
ACGCCCTAAA
TGCGCCATTA
GGTTTTTGAA
CAAAAAATTT
CTAAAACGCT
ACATCAAAGA
GCGCGCTGCT
CCTATTGCAT
TTTTTGCGTG
TCAAGTGAGA
AGGGGTAAAA
TCAATTCATC
TCAAGAAAGA
GCATTTAAAG
AAAAACGGAG
GCAACGCTTC
AACGAAATTC
GTTTAATTCG
CAAAGAAAGT
AGCCCTTAAA
CCCTTATGGG
CAAAGAAGCT
GAGTTTAGAA
AGGATCCAGG
AAAAGCGCAA
CAAAGAAATC
GCTTATAACC
CATGAAGTGC
AAAAGCGGTT
ATCGCTGAAT
TCACTCCCTA
AACCTTAAAA
GGGAATCGCT
CAAGTTTTAG
GGGAAGTTCA
GCCCACCAAA
CTTTTAAGCA
TTAGAATTTT
AACCAAGCCC
AAGTTTTTTG
GTGCCTACGG
ATTGAAAAAG
CGGTTTTTTT
TTTGAATTGG
AAGCATGAGA
ATGTAAGCAT
CTTTGTATGA
TAAGCACTTT
TGGGTTATGC
AAAAATACGC
TCCTGTCTTT
TTTTTATGCG
AACAAATCGC
ATGACAACCA
AATTAAACGC
AGCGATTAGA
TTAAACAAAA
ACCCCTTTAA
ACGCTTTGGA
GGTTACTGGA
GATTCCAGCA
GGGTGTTTGT
AATTTTACTT
AAGGAGAAAA
TGATTTTCAT
ATTTAGCAAC
AGAAATGCTT
GGGCTTGAAA
CCCTTTATTA
GAATTACCAT
TTTCAAAAAA
GCAGTTTGGC
TAAAGAGGGG
TTTTTTAATT
AATCAGTAAA
AAATTTAAGC
AATCTTAGAA
ATTAGAAAAA
CGGTAAAAAA
TAACCTTTTA
GGAAAATATA
GCCTAAAGGG
TAATGACGAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1146
INFORMATION FOR SEQ ID NO:447:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2094 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2094
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:447:
TTGCAAAACT
TTTCTTAACC
TCTTTGGCCG
TTGTTTTTAA TAAAAAATGG CTCATCTATT CTAGCCTACT CCCCTTATTT CTTTAATGGC AGAAGACGAT GGGTTTTTTA TGGGGGTGAG TTATCAAACT TTCAAAGGGT GGATAACTCA GGGCTTAACG CCAGTCAAGA CGCATCCACT TATATCCGCC
AAAACGCTAT
GAAGCGATGG
GCCAACAAAC
AGATGTTTGC
TCTATGCAGC
AACAACCCCC CAAGAGGCAAP TTAAACAAGC TCACCCAACq AACTCCAAAG
TCTTTAACGT
CGTCTAGCCA
ATACCATGGA
TGGTATAACC
AAACCTTAAC
TTTAGCCCCC
AAGTCTTGCA
CAAACCATTT
GCAGCACTCA
CAAAACGCCC AAAACATCTT AATGAAAAGC
AATTTGGCTT
GGCTATCAA\J
GCTTTAGCGG
CCCTTAAAAG CATTACCcGC CACCCCAGCT
CGGCAGTCTA
TCTATGATTT
TTTCAGGCAT
TCAAGCTATA
ACCAGATOCA
GTAGCGTATG
GGGATTTCAT
GGTTTGAATT
TCTTGCCTAA
CAATTAACCC
CACAACAAGC
GCCACAAACG
CCCCCACCCC
ACGGCAAA
CTTTAATGGC
ATTGATGAAA
TGCALAACTAJA
TTGGAGCATA
ACTCTAATTC
CAATTTTTCG
GCAAAAAGCG
TACGCTCAAT
TTGGCACAGA
GGCACAGACT
TTCTTTATAA
GGTTTTTTTG
CCGGTATCCA
GTGGATGGCA ACCATCTTAjA ATAAGGACCA
ATTTTTCA
GAATTTGGCC
TTAAAATACC
GCGAAGTATA
GAAGAGCCTT
CGCTCTAGA-4
GAGAGTCTTA
GGGCTATCA-P.
TGTCAATGCC
CATCGGCGAG
CAAATTTGGC
CGCTTTAAAC
GAATAAATCT
ACACCTTTTA
AAACCAATGC
CCAGGCTTTA
CACTTACAAC
CCCGGGTTAT
TGGAGCGACA
TTATTTAGCC
GCAAAATTTC
AGATGTGATC
CACCAATTGG
TTATGGGGGG
CCAACA.AGA.A
CGCGCAAATA
TTATGGGCTC
AGTGAATCAA
TAATAACATG
CATGTTTGGG
ATCTTCTTTA
TGTTTTTAC
ACTTGCAGGG
ACCTAAGGAC
AATCGCTCAT
GGTGCTTTAT
TCTGCGGCGC
ATGCAAATGC
AACGGACAPIz
ACCTTTGATI
ACTTTAATCC
AATCAAIAGCP
AATGACATCP
TTTAGCACCC
CAAGACGGCT
ACCGCCACTA
ATGCAAGCAG
AAAGCCCCCA
TACACCAAAA
ATTGGATCAG
GATAGCATCA
GCCAATAAAG
AATTACGGGG
GTCGCCCCCT
CAATTGAATG
CAAAAAGTGA
AACAGGATTT
TATCGCTCTA
GTCTATCAA
AACGGCTTTG
CTTAGGTATT
GTGAAAGCCA
CGAAAAAGAG
CAAACCTGGA
ACTTCTTTCC
CAAAAAAGAT
3TGCCTTTGCc
TCTGCCCTGP
ATAATA-ACGCG
TGCAATCTCI
*GTAACCCTGA
CTGTTATTGC
CCAACGCTTT
*CTAGTAACAC
TAGCCACAGC
*ATGAAGCTA.A
GGATTTTAGG
ATGGCAGCGA
ACGACAACAC
GCAATGGCCA
TCGCTAATGG
CCGCTAAACT
AAAGCTTGCT
ATTTGGATTT
GCGCTAATp.A
TCATGAACCA
TAGCCAACCC
AAGCAGTGAT
TGGGCTTTGC
GCGTGAAk.AT
ATGGTTTTTA
CCCTCTCTAG
GGACTGAAGC
AAACGAATTT
AATTCCTTTT
CTCGTTTTTC
ACCAATCAGA
ACATAGGCTT
CTATTATTTA
TCCTTCTAAA
CGATACAGGC
AGTCA.ATAAT
AAATCTTCCT
ATTGCCTGAG
AACCACGCTC
TTCTGTGAAT
AAATAATAAT
ATCTATCGCT
GGGCTTAGCC
TTCCCAACAA
CACGCAAGCG
ATACACCTAC
CATCACCGCT
GATAGGCACT
TAGTAACACC
AAACAATAAA
TCAAACCCCA
ATTAGAGCAA
CTATTCCCCC
TGGCGGAGTG
TAGGAATTTT
GGGCTATAAG
TGATTTTGGT
CTATGGAGCG
GATAGATATA
TTTAGATCA
TGATTTGGGC
TCAAGGGATA
AGGCGTTACA
TTGA
240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2094
TAGTTTTTAT GTGGGCTACA
INFORMATION FOR SEQ ID NO:448:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2094 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2094
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:448:
TTGCAAAACT TTGTTTTTAA TAAAAAATGG CTCATCTATT CTAGCCTACT
CCCCTTATTT
TTTCTTACC
CTTTAATGGC
TCTTTGGCCG
TTCAAAGGGT
TATATCCGCC
AAAACGCTAT
GAAGCGATGG GCCAACAAAC AGATGTTTGC
TCTATGCAGG
AACAACCCCC CAAGAGGCAA TTAAACAAGC
TCACCCAACT
AACTCCAAAG TCTTTAACGT GGTCTAGCCA
ATACCATGGA
TGGTATAACC
AAACCTTAAC
TTTAGCCCCC
AAGTCTTGCA
CAAACCATTT GCAGCACTCA CAAAACGCCC AAAACATCTT AATGAAAAGC
AATTTGGCTT
GGCTATCAAAj
GCTTTAGCGG
CCCTTAAAAG
CATTACCCGC
CACCCCAGCT CGGCAGTCTA TCTATGATTT
TTTCAGGCAT
TCAAGCTATA
ACCAGATGCA
GTAGCGTATG
GGGATTTCAT
GGTTTGAATT
TCTTGCCTAA
CAATTAACCC
CACAACAAGC
GCCACAAACG CCCCCACCCC ACGGCAAAAA
CTTTAATGGC
ATTGATGAAA
TGCAAACTAA
TTGGAGCATA
ACTCTAATTC
CAATTTTTCG
GCAAAAAGCG
TACGCTCAAT
TTGGCACAGA
GGCACAGACT TTCTTTATAA GGTTTTTTTG
CCGGTATCCA
GTGGATGGCA ACCATCTTAA ATAAGGACCA
ATTTTTCCAA
GAATTTGGCC TTAAAATACC GCGAAGTATA
GAAGAGCCTT
AGAAGACGAT
GGATAACTCA
CGCTCTAGAA
GAGAGTCTTA
GGGCTATCAA
TGTCAATGCC
CATCGGCGAG
CAAATTTGGC
CGCTTTAAAC
GAATAAATCT
ACACCTTTTA
AAACCAATGC
CCAGGCTTTA
CACTTACAAC
CCCGGGTTAT
TGGAGCGACA
TTATTTAGCC
GCAAAATTTC
AGATGTGATC
CACCAATTGG
TTATGGGGGG
CCAACAAGAA
CGCGCAAATA
TTATGGGCTC
AGTGAATCAA
TAATAACATG
CATGTTTGGG
ATCTTCTTTA
TGTTTTTACC
ACTTGCAGGG
ACCTAAGGAC
AATCGCTCAT
GGTGCTTTAT
TAGTTTTTAT
GGGTTTTTTA
GGGCTTAACG
TCTGCGGCGG
ATGCAAATGC
AACGGACAALA
ACCTTTGATA
ACTTTAATCC
AATCAAAGCA
AATGACATCA
TTTAGCACCC
CAAGACGGCT
ACCGCCACTA
ATGCAAGCAG
AAAGCCCCCA
TACACCAAAA
ATTGGATCAG
GATAGCATCA
GCCAATAAAG
AATTACGGGG
GTCGCCCCCT
CAATTGAATG
CAAAAAGTGA
AACAGGATTT
TATCGCTCTA
GTCTATCAAA
AACGGCTTTG
CTTAGGTATT
GTGAAAGCCA
CGAAAAAGAG
CAAACCTGGA
ACTTCTTTCC
CAAAAAAGAT
CACACCTACT
GTGGGCTACA
TGGGGGTGAG
CCAGTCAAGA
TGCCTTTAGC
TCTGCCCTGA
ATAATAJACGG
TGCAATCTCT
GTAACCCTGA
CTGTTATTGC
CCAACGCTTT
CTAGTAACAC
TAGCCACAGC
ATGAAGCTAA
GGATTTTAGG
ATGGCAGCGA
ACGACAACAC
GCAATGGCCA
TCGCTAATGG
CCGCTAAACT
AAAGCTTGCT
ATTTGGATTT
GCGCTAATAA
TCATGALACCA
TAGCCAACCC
AAGCAGTGAT
TGGGCTTTGC
GCGTGAAAAT
ATGGTTTTTA
CCCTCTCTAG
GGACTGAAGC
AAACGAATTT
AATTCCTTTT
CTCGTTTTTC
ACCAATCAGA
ACATAGGCTT
TTATCAAACT
CGCATCCACT
CTATTATTTA
TCCTTCTAAA
CGATACAGGC
AGTCAATAAT
AAATCTTCCT
ATTGCCTGAG
AACCACGCTC
TTCTGTGAAT
AAATAATAAT
ATCTATCGCT
GGGCTTAGCC
TTCCCAACAA
CACGCAAGCG
ATACACCTAC
CATCACCGCT
GATAGGCACT
TAGTAACACC
AAACAATAAA
TCAAACCCCA
ATTAGAGCAA
CTATTCCCCC
TGGCGGAGTG
TAGGAATTTT
GGGCTATAAG
TGATTTTGGT
CTATGGAGCG
GATAGATATA
TTTAGATCAA
TGATTTGGGC
TCAAGGGATA
AGGCGTTACA
TTGA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2094
INFORMATION FOR SEQ ID NO:449:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 3714 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...3714
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:449:
ATGATAAApJA AAGCTAAAA,4 GACAATGGCT GGTATATGTC AACAAACAAC TTTTAGAAAA ATTGCAGGGC CTACTACCGG GGCTATGGCG TGAGTAACAC CAAATTGGCA AAAGAAAAGA ATCATAGGGC TTAAAGGAAG AALACTCCTTT CCAACACCCA ATTAGTGCAG TCAATAGCCT CAAAACACCC CGCAATCCAT ACCACTAGCA CCACTTACGC TCTAGCAATA ATACCACTTA GGGGTTTTCC CCACCACAAC TTCTATccAA CTAATTCCCT TACAACAACA
CCCTTTTAAT
AACCCCAATG GTTGCGCCAA CCTTTAGCCG CAACCCCCAC CAAAAACTTC AAAGCGTTGC TATAATTTAA ACAACTTGCA CAATACA.ATA ACGCTTTAAA CTCAAAAACA CTTCCAATAJ' ATCAGCGCCT ATGATTGCAC ATTTCATGCT
CAGCCACAAG
GCTACCTCCA ALAGTCCAAAC CTTGTCTCTC AAGTGTGGAG AAAAACGCCA AAATATTATG TCTTCAGGGG GTTTGAGCAT AATGGGACTA CCACTAATAC ATGGTGAATA ATGAAGAAGA ACACALATCTT
CTAACAGCAC
AATTTCCAAC AAAGCATTCA GCGAACGCAC
TTTATA.ACAC
AACAATAACC AAGATTTACG ATTAACCAGC AAGTGCCTAC CAAACAAGCG GATCAGCAAG GGCAACTGGT
GCTACCAGCA
GCTTTAGGGT
ATCAAACACA
ACCTACAATG
TCCAACAAA~T
AACCTTAAGA GCGTTAATGG ATCAACACAG
CCTACCWAT
AGTAGC'AATA
GTACCAATAG
AGCACCAACG
GGAGCAATGG
AACGCCACCA CCGCAACCAC CAAAAAATAG CCAATATTAT AAACAATTCT TTGAAGCGTT GGTAGTAGCG
GTAGTAGCTC
CCCACAAACG GAGTGAGCGA ACCGCCGGCT TTATCCAAAA CAAGCCATTA CGACCGCTAT AATGCGATTT TGACCTTGCT TCGCAAACCT TACGGCAGCT ATTGATGCGA TGATTAACGC TACGOCTCTC
AACCCGTTTT
AGCAATGGCT
TAGGGGTTGG
GGCCTTAGGC ATTATTTTTT AGCGTGAAAG CGAATATCTT ATTTATACCA CTCTTTTTAA
TGTAGGCTAT
TCAAAATATC
CCTAATCACT
CGTGGGTAAC
CTTTTATTCT
CTCTGATCCC
AAGCGCGTTT
AAACCCTAGC
GACAGAATTG
ACAATCCTTA
TGTGAGCGCT
CTCAACGCAT
TTTAGGCTCT
GAACACCTTA
TCAAATCCAG
TTCAACTAAC
TATCAACGCT
CAACGCTTTG
GCAAATTTCG
CTACCAAATC
AAGCGCTACC
CTCCACAA.AT
CATCAACGGC
CGTTTATAAC
CAACAATGGA
CAGCGGGAAC
TCAAGCTAAA
AGCCAAAACG
GGTGATGGGA
AAGCGCTTTT
TAGTAACCCT
CATCCAATTA
AGACATGAAC
CACCACGAAC
GTGGTCCGAT
AGCGACAACT
CACGCTCACT
GGGCAGTAAT
GCTCACAGAC
TAGCAATAGT
GACGAGTGGG
CACGACCGAC
CGCCAGCTCT
AAAAAGTAAT
TACTTGCTCC
TACGAATAAT
TAATGATAGT
TTCTCAAGGG
CCAAGAAATC
TTTAGGGGAT
CAGAAATCAG
AAGCCAGTAT
CATAGGCTAT
CTTTGATTAC
TGCTTATGGG
CAAATCGGTG
ATCAATAGCA
TTAAGCTCTC
CAATTAGAAG
AGCCGTCAAA
TTAAAAGCCC
GATCAGGGCA
AACAACTCCC
TTGCAACAAA
CTCTCCAATC
CTTGTTAACG
GTGGTGCTAA
ACTTCTTCAA
CAAGGGGAAT
TGTTTAGAGC
CAGGCCAACC
TTAGACAACA
AATTTCCAAG
TGGATTAGTT
GGCACGGTTA
GGAAGCCTTT
AACACAAATA
AAAGAGCAGA
TCTTTAAAAA
TCGCAATCTG
GCCCAATTGC
AGCAACGCTT
ACCAATTTCA
GCTTTAAACA
CAAAACCAAG
AATGGGAATC
AGGGCGAATT
GCTTTAATTA
AACGCATGCG
TCTAAGGCTT
CAAAA~TGGGA
AGCGGTGGTT
GGGGGAAGCA
GCTAGCGATG
GGCAATAATA
AGCAATTGTT
AGCAATTTAC
GGGAACAATA
AGCAGCAGTC
GGTGGGCTTA
TTAATTAATC
AATGTATCTA
TTTCAAGCCT
ACTTCTAACA
MAACCTTCT
GTTCAAAACG
GCGGCCGGTA
AAATACTTCT
GGCTTTAGTG
GTAGGCACGG
TTGGCTCCCT
GCACGCAGCA
TCACCCAAjAG
AAAGCGTTAT
GCATTTCTAA
TCTCTAGCAT
ATTCTTCACA
TCGCTCTAAG
A.AGAAGTCAA
TTGAACACAG
TGACCGATGC
CTTTAAACAC
ACCCACCGGG
ACAGCAATAA
TAAGCACTAA
AATTCATCCA
AGCAAGTCCA
ATGCGATCAA
CCTATCAAAG
TTAGCGAGCC
CCAACGATCA
CTAGCGATGC
GTTTTGACAA
TCGGCGTGAA
CTTCAGAAGA
GGACAAGCCC
AAAATATTTT
CCAAACTAAA
ATCAAAGCAG
CCGTATTGCA
AAAATAATAT
AATCGCAAAA
TTTACCAGCT
ATCAAAGCCA
CGAGCGGAAT
ATTACAGOGG
GCAGTGGTGG
TGCTCAATCA
GTGGGAATGG
GGAAATTAGG
ACGGCTATAC
ATGAACCCAA
AAAAAGTCTA
AAGGCGTTGA
TTAGTAATTT
TCAACCTTTT
TGCTCACTGA
CTAGTCTTAC
rGCAAAACGA
CCACCACCAT
rTATGGTGCA CGCAAAATCA2 .AAGCACCCA2 rTGGTAAGGC ~AkATAGGCCT2 kTTTTTTATG
CTTAGCTGAA
ATTCATCAAT
CGCGATCAAT
TGACGCTTTA
TATCCTGAAT
TTCCCAGCAA
AATCACACCC
CTCTAATATC
AGCCCAGCTC
CATCACTAA
GGTGAATGCA
TTTAGGGGTG
ACAAGTCGTA
CCAACAACAA
CAATCAAAAT
AAATTTAACC
AGCCATCGCT
CAACACCACC
CACGATAGAA
TAAAAACTTG
AGGGCAAAAT
TTCTAGTGGG
TTCTTTAGTC
TTCTTTTAAT
AAATTTGCAA
ATGCAATAGC
AAGCCCTACT
AGCGATGGTA
TGGGCCAACC
AAATGTCAGC
CCAAGCTTGG
TTTAACCACT
CATCAATACC
ACAAACCCAG
GGGGAGTAGT
GTTGCAAAGC
GAGCALATATC
AATTATCACA
CACTAGTCAA
GACTTATAAT
GCCATGCAAT
CAAACAACAA
rALATGACGcC
A.AACGGCTTA
A.TGTGGTAAT
kGGGGCAATC
N.TTCATTAAA
k.AGCGCTTTT
TATTAGCCCT
rCAGTCATTC kCAAAAGCTC
%GCCAATAAC
%CACGGCATG
E'AGGAAATTA
kGCCALATCAA 3AATCTATTC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360
AGGAGGACTT ACAACACTAA AGCGTTGAAT
TTTGGGCTAT
GGTGCAACTT GGCTTAGTTC CTTAAGGCAA
CAAATCATTG
GACATCCATT CAACGAATTT TCAAGTGGCG
CTGAATTTTG
GAGTTTAAGC GTTTTGCTAA GAAATTCCAC AATCAAGGGG GAATTTGGGA TCAAGGTCCC TCTCATCAAT CAAGCGTATT GTGAGCTACA GGAGGCTTTA TACTTTCTAT
ATCAATTACA
TTGCCGGGGT
ACAACTGGGG
GGGTGCGCAC
TCATCAGCCA
TGAATAGTGC
TCATGGGGTT
CCAACTGGGC
GAACGCTAAT
CAATTTCGCG
AAAGAGCGTG
TGGGGCTGAT
TTAA
3420 3480 3540 3600 3660 3714
INFORMATION FOR SEQ ID NO:450:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 516 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...516
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:450:
ATGGAAAAAT
CTAACCACCC
GAGTTAGAAA
AAAAATTACG
GCTTTTGAAG
GATTTTGAAG
GGGAATTTGC
AATTTATTCT
GAAAATTTTG
TACCTA.AA ACGAGTTTCT AAAACCAAAT CACAAAAACT AAAAAAACAG AGCCTTTCTC CAAAAAACCA GCGCCAATGA AAGGGGCGTT TAAAAAAAAT GAAGCCTATT TTATTTCTGA TTTTAGTGCC GGACAATGTG ATCTCTCTTT TGGCAGAAAA CCAGGCTTAG GGCGGAATTA GAAAGGGATA TTATCACCCA ACGTGCGCGA AGTTTCTTTG CAATTGTTAG AAAATTTACG CCAATATCAA CACCTTAAAT TTTGTCAAAC AAATCAAAAA TTAATTTTGA CAACATGTTC AAACAACCCC CTTTTAATGA ACAATAGCGA TGAGGAAAAT TTTTAA
TATCCATAGC
AATGCTTTTA
TGAAGAAGAT
CGCCAGAAAG
AGCGCCGATT
CCAAAAAGAC
AGAACACCCT
GAATAATTTT
120 180 240 300 360 420 480 516
INFORMATION FOR SEQ ID NO:451:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1308 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1308
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:451:
ATGAAAAAA
GCGAGTGCTT
AACCATTTGT
GATAAAAACC
GAGCTTGGCG
GATACCAGCG
GCCATAATC
AAAGTCAAAA
GCTAAACTGA
GGCACTAAAG
GTTTTTGAAC
CTCAI\TCGTT
TATTTTGAAG
TTCGTGTATT
AAAGTCATGA
GTTCAAGAGG
TTTGCGAGCG
AAAAAAATAA
AAAAAGTTTT
ACCACTTACA
CAAATCCAAA
AACGACTTGA
TTTTAATCAC
TOACACACCA
TACCCATGGG
AGTTGGGTTT
CGGTGGGGTT
CAGAAGATTT
GCTTAAAAGA
CCAGAATGTT
CTTTAAAACA
AGAGTCTTCA
TCAATAAACT
TAAATAACGC
CGAGCGATAA
TTGGCGTGCC
TOTTTGTGCT
GCTTGGCTTA
GGTATTTGCA
TTAAAGAATT
TACTAGGCTC
ATTATTTTTA
AAATGAGTTT
CCTTTGCTAT
TTTAATCACT
AGAAATCAAT
GTTTATCCAT
GGCGAAATTG
TGCGCAACTT
GCAAATCACT
GCTTTTAAAA
AGCCCAACTT
AGAGCTTTTT
AAAAATCAAG
CGTGGTGGTG
CCTCAATTTC
AAAAAGTGAA
CTTTAAAATC
TGGGGGGGGG
TAGCGTGTAT
AACCAAGCTC
TATAGAAAAA
TGAGCCTTTA
TTTGGGTTTG
GAAAGAGATC
TGTGAGCAAT
TTATTATTAO
CAAGCTAAAG
TTAGCCTTTA
TTCGCGCAAG
TTAGAGCAAA
TTAGAATTTT
TCCCCTAATT
TTACAAAAAG
GCTAACACCC
CTAGACGATT
CTTGGGGGCG
TTGCCGCAAG
AAAGTCCTCT
AAGGATCTAA
TTTGGCTCTC
ATCCGCTCCA
AGCACTCAAO
GGCATGACGC
AGGAATGAAA
CCTTTAGATT
AATGATTTCA
AAAAAGAAGG
GAGTTTTTAT
TCCCTCTGAT
GAGGGGGTGG
TTTTAAACGA
AAGCGATCAG
TAAAAGAATA
TCACGCAAA
AAAGCGACTT
CTTTAGCTAA
TGAAACAGCA
ATTTGAAAGT
GTAAAGCCTA
ATAAAGACAC
AACAGGATTT
GTTTGATGGA
ACTTTTCTAA
CTAAAAGCGT
AACAAGAATT
CGATCTCAAG
TCAACCAAAC
TTAAAGAGCA
ACAAATAA
GGGGTTACAA
TTATGAAGAA
GAGCTTGAGC
AGGCACTAAA
TTTGAATGTG
CGAAGATGAA
CGCTTTAGAA
TGATTATTTG
TGCGGCCTTA
ATTTGCTAAG
CAATCAAACG
TGAAGAGCCT
TGAGCAGGCT
AGCGAAATCT
AAAAATCAGG
AGTGGCGCAT
TGCCTTAGTT
AGACGACGCT
CCGCTTGAAC
GCTACTCAAT
TACCGAAATC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1308
INFORMATION FOR SEQ ID NO:452:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1845 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1845
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:452:
TTGGCTTCTC CAAAAGAAAC CCCAAAAGAG GCTCAAAAAA ATGAAGCTCA TCTCAATCCA ATCAAACGCC TAAAGAAATG AAAGTCAAGT CCATTTCTTA TCTTACATGT CTGACATGCT CGCTAATGAA ATTGCAAAGA TTCGCGTGGG GATTCTAAAA AAATAGACAC CGCTGTTTTA GCTTTGTTCA ACCAAGGGTA GTTTATGCCA CTTTTGAAAA CGGCATTTTA GAGTTTCATT TTGATGAAAA
AAACGAAACC
TGTCGGGCTT
CGATATGGTG
TTTTAAAGAC
AGCCAGGATT
OCOGGOCTAG
AAATCAAGGC
ATGGGGATCA
AAAAGGGCGP
TTAAAAACGG
CTTTAGAG
AAGGTCAGTG
AGGGAGCGTW
ATCAAACA.AT
CCATTTATGA
AGCGCGAACA
AGCAGCGCGA
CGCTTAGATC
AATTAGAATA
TACTTAGACG
CTCATATTTC
AAGCTCCATT ATAAGGTCpjA ATTGACAACC
CGGTAGTCCC
GATGTCTTTA
ATATTGAGCA
GATAAGGGCT
ATGCGTTTGC
CTTGTGAAAG
TCATTTATCG
ATTTCAGGGA
ACCAGCGCAC
AAAGATAAAT
ACA.ACTTGAC
TTTTTCTCTA
AAGTCAAGAT
GTGAGCGTAG
AAGAGGGGCG
GGAGGGCTCA
TGCTTAATGG
ATGAGCTTGT
ATGCTAACAT
GGGGCGGGGC
GTATGTTTGC
TGGTATAGCT
CTACGATCAA
CAAGGCGGGG
GCTTTGGGGT
TTAGGGTATA
ACTTGAATGT
TACTATTCCT
CTGTTAATGA
ATTATCAACC
GCTTATCAGG
GGAGCGATCA
CCATCTTCAC
'TTATGGGAT
LCACCTTTGAT
GCAGGGCTAT
ATTGATCGTG
GGGAAGCGAT
TTTCATGGGC
CGATTCTTTG
TTCGCCTTTT
AGAGGGGATC
CTTAAAAACC
TTTA.AGAGCG
GGTGGTGAAG
TATTGALAGTG
GAGCGATAGG
CAAACTGAGA
TGAAGAAAALA
CACCGGGCAG
GAGCGTGAGC
TGCCACAGGG
CGGGAATTTG
TCTTTATGCG
GAATGTCGGG
TACCAAACTC
AGTGGCCTCT
CGGTAGAACT
CAGAAATAAA I
GAAAAGGAAA
GAG CAAAAAT
TATGGGAGCG
TTTGATGTGA
AAATTAAAAC
TGGATGTGGG
CGTATCCAAG
TTGAAAACGG
CA-ATACAGGA
TTAGAAAAAG
GATGCGCAJA
CCAGACTTGG
GGCGATATGG
ATCATTAGGA
AATTCCGAAA
AGGGTCAATA
TTGCAATTCG
GAAAGGAATC
GGGGGTAGAT
1GCTTGACTA
GATTACAGGA
CGCATGCTGG
CTTGGTTTCA
'CAAGGCAAT
'CATTGGTTC
C
~GGTATTTGGC
ALAGACGGCTT
TAGAGCATGC
TGGTGGAGGT
ATAGGGGGGA
GCCGTGTGAT
GCTTGAATGA
ATGTGTATAT
ATTTTTCCAC
TTTCAGATAT
CGCTTA.AAGT
TTTTAAAAAC
ATAAAGACGA
TGCATATCAA
GGGAATTGTT
ATTCTTTGAG
GCTCATTGAT
GGTTGGGCTA
TTTTTGGCAC
CTTATCCGGG
kTCCAAGGAT rAAGCTACCA 3TAATAGAAcC 'CAGCCCCTT 7 'TTCCACACC
C
TGAAAGCTG
'ATAG
AAAATCCCAJA
TAAAACGGCT
GCGCACAGA
CAGTATTTAT
TGAATCTTTG
CGGGAAATTG
GCGTAGGGGT
CCATGACGCT
TTTAATAGAG
TAAA-AGGAJA
CGAAATCGCC
AAAAAACGGG
TGATGTCATC
ACTAGGGCCT
GCGTTTAGGG
GGATTTGTTA
rGGCTCTTAT tGGGCAAAGC
CATGCCAAAA
PTTTGACAGC
kTACATCCAA
~CATGTGAGC
~TACAACCGC
~GCATCGGTG
'TCTAGTCCT
360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1845
INFORMATION FOR SEQ ID NO:453:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1131 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1131
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:453:
TTGTGTGAGG
AGAAACTGGA
GCTTGGGACG
AACGAAAATG
ATGAACAATT
GTGCTTACCT
TATGGAGGAG
GCCCTAACGT
ATAACTGGCG
GGCGCTTTTT
CGTCTAACTG
GGATGCCCTC
ACTTCAACTA
CGGAGTATTT
GATGAGCGGT
CAACAATTAC
CATCGTGCCT
GATGCAAATC
AACCAACATT
CCAACGCTCT
TACCCAACAC
TGGAAACCCC
ACATCAAGCA
ACAGCGGTCT
ATTTGAGCGA
GGTTTGCGCT
ACGCTTTTGT
CCTGAGAAGG
ATTTAAGAAA
GGCTTTATCC
CTGTGCAAGG
TTCGTCCCGC
CTCAATTAGA
TTTGACACGG TGGAAGCAGG
TGGCGTTTAT
TCGCATTGAA
GCAATACAAC
AATCAAGCAC
CGATCACAAT
CGTTTTGAGC
AGCGCGCTAC
120 180 240 300 360 420
ACCTATAAAG ATAAATTCAG
CTTCAATGCG
GCCACCGGC
AkATGGCTATT
TTCCATGCCG
TTGAACGGGO
TTCCAATTCA
TATTTTTATA
ATGCAATACT
GAAGCTTGGT
CAAGTGAGCC
AATAACATCT
CAACCTGCGC
AGTATTCAGT
CTCAAGGCGT
CTTTCAACTA
ATGTGCTAAA
TTTTTGACGC
GCCGTGCTTA
ATAGTGGGGG
GTATGACCCA
AAATTTTCTG
TCAACATGAA
CTGGAAGATC
CTATACGAGC
GGAGCTGGIA
CATTGACACT
AGGGACTAGC
TCGCTACAAT
TAGCGGGATT
GAACAACTAT
ACATGAAGGG
GGAAAACGGA
GTATTATTTT
GGTTACAGCG
GACTACTTTA
GGGCCCATGA
TTGTATTACA
CGTGTAACTA
TATAACAAGC
TGGCGTAAAA
AGCAACAGCG
GAAAGCGTTC
CTCTTGCCTT
AGACACAGAG
ACAGGGATTG
TATTTGAACT
GGATTTGGGC
AAGGTAATGT
GGCCCATTAG
GCCATGGCCC
ATTTCCCTTT
CCACCATTGG
CAGCAGGAGG
TTAATAGCG
GGTATTGT
TTACAGGAAG
GCTCTAGCCC
ACACTTTCTA
GCGCGATTTT
GCGCCCCATT
AGGGTTGCAA
TTTAACCGAC
TGTAAGCCCT
TATTTCTAGC
CTATTATGGG
TTATCA-TGC
GTGGAATATC
CTTACAAATC
TGCAGGCTTG
A
480 540 600 660 720 780 840 900 960 1020 1080 1131
INFORMATION FOR SEQ ID NO:454:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1866 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1866
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:454:
TTGCATGCTG
CAAATGGTCA
CAATCTTTAG
GCTGTCAACA
GAAACATCGC
AGTCTTTATG
TCTCCTTTAG
AGCAAAGAAA
AATCTCTGTG
ACTACAGCTC
ATGGTGTGGA
ATCACATCCA
CCTATCTTGC
CAAGCTCGAG
GCTCAAAACC
ATTCCTAAAG
AAAACCCCTA
CAAGACAATG
GTTTATAACC
TTGAGCGAAG
TCGCCTAAAG
AAGACAACGG
AAAACACCGG
CCCAACTGGC
ATGCTTTAAG
CCATCTACAA
CAGGGAACGC
GAAGAATCCA
CTTATGATAA
CCTTATCAGA
TTCAAACCGC
AAAATATCGT
CTGGTCATGT
AACAAGCGCT
CTATGGGATC
AAAAGCAAAT
ACCAACTTAA
CTAACCCTTA
TAGCTAATTA
TAAAATCCAA
AGATTTCTAA
ATTCTACAGC
CTTTTTTGTG
CGAATTGAAA
TTCGTTAAAA
CGATTTAAAA
CACCGCGCAA
TCTCAGTTTT
TAGAGATGGG
AATGAAGACA
ATGCTCTAGC
GCAACAGCTC
CATCGCAGGT
AACCGACTAT
TACGCTTTCT
TCAAACAAAT
CCTTTCTAAC
GTATTTGGAG
CAGACAGAAT
TGGTAATCGT
TCAAACAGAG
ACTTCCCTAT
GGGCCAATAC
AGCGCGGGCT
AACTTGAACG
AAAAGCATTC
AGCTTTGCGA
GCTGTTATCA
CATGTGACCG
AACTGCACAG
CTTGCCGAA.
AATCAATCAA
ATGGACTTAA
GTTACAAACA
GCGGTGTTTA
CAAAGTAACC
CGTGAATTCG
GCTTCAAGTA
AACGCTTACT
GTGAATTTGA
TTGGATTCGG
ATCGTAACCA
AACCAAGTCA
CAAATCAACC
ATCAAATCGG
ACAAATACGA
AAACGGCGAA
GTAACAACCA
CTTCAGTATT
GTTTGAATGA
GATTACAACA
ACCTCCAAAA
ATGGAGGCAA
TCGAACAGAC
AACCCAATGG
ACAACATCAA
ACACCCTATC
CTAAAGACAT
TCTTCAATCT
TGAAAGTGCC
ATAAAGAAAT
CTTTAAGCGT
CTTATAACGA
ATGTAACA
CAGAGCAGCA
CGAAGCGGTG
GCAGTTAAGC
CAACATTCAG
CACAAACAAA
GGCTTTTTGG
TGGATCTAAT
ATGTTTTATG
AGCTCAAGGC
AACTTCCATG
CAAGGTTTCT
TGCTGGCGCT
GGCGATGCTA
CACTCAGTTG
CTACGCTTTA
CTTTAATTCC
ACATTTGGGT
TAATGCGGTT
GGCTAAAGAT
TGCTAAGAAT
CATCGTTATG
ATCCAATCTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260
AACCAAGCTT
CAAAACAATA
GGCGAAAGCA
ATCAAATCCA
TTGTTAGTGA
CTTTTTGGTG
ACAGCGTTCA
AATCTCGGCT
CAACATGGCG
CTAGGCACTA
TATTAA
TAGCGGCGAT
ACGGCGCTTT
AAAGATGGGG
GCTTTTTTAA
ATTTTATCAA
GTATCCAACT
ATAACCCTTA
TGAGGACGAA
TTGAACTGGG
AGCTAGAATA
GAGCAATAAC
GAACGGGCTT
GTTAAGGTAT
TTCTTCTTCT
CGATAGCATC
AGCAGGGACT
CAGCGCGAAA
TCTCGCTACA
CATTAAAATC
CCGAAGGCTT
CCCTTTAAAA
GGCGTGCAAG
TATGGTTTCT
GATATATGGA
ACAAGAAAGA
ACATGGCTTA
GTCAATGCTT
GCTAAGAAAA
CCTACCATTA
TATAGCGTGT
AAGTGGGCAT
TGGGTTATAA
TTGATTACAA
CTTATGGCGG
ACAACAAGCT
ATTCTCAATA
CCAATTTCCA
AAGACAGCGA
ACACCAATTA
ATCTCAATTA
GATCAGCTCT
ACAATTCTTT
CCACGGCTAT
TGGGAGCGAT
TTCTGTGGGT
CATGAATTTA
ATTTTTGTTC
ACGTTCCGCG
TTATTCTTTT
TGTGTTTGCT
1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1866
INFORMATION FOR SEQ ID NO:455:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 492 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...492
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:455:
ATGATGCGAG
TTTTCTAACA
AAAAGTTGTT
ATTTATATCA
ATCCTTAAA
CGTTTTGGGC
ATTATCGCTT
TGGAGTGAAA
GAAGAGTGGT
AGATCCTTAC
GAGTGGTTTT
TTTTATATTT
ATCCGCCTTT
CATCTTTAGG
GTGCAAGGAA
TCAATCAAGT
GAGACTCGGT
GA
TAACCGCTTT
AGGGCTGGGA
TAAAAATCAT
TGGTTACACT
TTTGCGCCAT
GCGCGATTTT
CATTTTGAGG
GTTAGTGCCT
TTCCCAAGCC
TCTAATCTTA
AGTAAAATCG
AATCAACCTA
TTTTTTGCTC
AAAGATGCTC
CAGAATGATT
TTAACTTTAC
TTTTTAAAAA
AAAACCCTTT
GGAAAATTTT
ACTTTTATAA
TAGTGTTTTA
CAAGAACTTT
TGACTTTACC
AACAAATCCT
AAGGCTTGAT
AAAAATATTA
TTCTTCACCC
CGCTACGATT
TATAGAAAGG
AGATATTGAT
TCACCCTAAA
TTTTAAAAGA
INFORMATION FOR SEQ ID NO:456:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 858 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...858
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:456:
ATGCAAGATT
GGTAAGGCTC
ATCAGCACTC
GAATTACTGG
GGGGCGAGCA
AATATTTTTG
TTCACCACCA
GCGATGGTGT
ACTTCAGCGT
AAAAGCGCTA
AATATCAGCA
ATGATTTTAA
GTGAATGACC
ATCGTGGATG
GAGCAACTGA
TTATTAAGAT TTTTATTCAA CAAGCGTAGG ATTAGAAAAA CTTATGCAAG GGTTAAGATT CTCCGGTAGA TTTAGTTACC AGGAAGAAAT GGATAATGAC GTOCGATCGC TACAAGCTTG CGAACGCTGA AATCGCTAAA TTTCTTTTAA AATGGAAGCC TTGAGGGCCA ATTTGAAAAA CTGAAGAGAC TAAAACCCAC TGCTTTTAGA CGTGAAATTG AAGATGTGGT CTCTATGGAT CTTTAGAAAT CCTCGTAGAT GGAATTTTGG CATTCAAATC
AAAATTAA
GAGGTTGTCT
GAAGTTTCTA
AGCGCGATTG
GCTTTGAGCG
GATTTAGACG
AAGTCTCAAG
GAGCTTCCTA
ATCAAAGAAA
ACGCATAAAG
GACGCGTCTT
AACGTTAAGG
ATAGGGAGCG
GACAAGGTGA
ACGGATATTG
CTACTTTAGA AGGGTTAGTG ACAATGAAGA AGCGAGTTTA AAAAAAATGA AAGTCCTATT ATTTGATGCT AGGAGGTGAG CTTTTAAAGA AATGGCTTCT AATTGCTCCC TAAACTCAAT AAAAAGAAGA TTACGCTAAG GCCAAATCGT TTTATTGATC AAGAAAAAGA AGAAACGACA TAGAAAATAT AGAAATCCGC TGCGTATCGG TCAAAAAAAG TGGTGGAGTT GGATCAATTG TCGCTAAGGG CGAAGTGGTG GCACTAAAAA GGAACGATTA
INFORMATION FOR SEQ ID NO:457:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 513 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...513
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:457:
ATGAAAATTT TAGTGATTCA CTTTATGGCA TGGTAACCTT GGCAATTTAG ATGTGGAATT AAAATCCAAG AGAGCGTGGG TCGCACACTT CTATTGCGAT GTGCATCTCA CTAACATTCA GCTTGTGGAG GCGTGATCAT
AGGGCCTAAT
AGACCAAATC
AGAGTTTTTT
CAGCGATTAT
TGCGGATGCG
AGCCAGAGAA
GGGATTTGGC
TTAAACATGT
CATGAAATCA
CAAACCAATT
GAGGGGATTA
ATCATGCTAG
GAATTCAGGA
CCGCTTGGCT
TAGGACACAG AGACCCAAGA TGCAAACTTT CGTGAAGCAA TTGAGGGCGA AATCATTGAT TCATTAACCC TGGAGCGTTT CGGGCAAACC TGTCATTGAA AAAATTCTTA CACCGGAGCG ACAACATGGC TTTAATGGCG
ATGGTCATA TTTTAGCCGA AATGAAAGCG TTCCAAGAAG CCCAAAAAA
CAACCCTAAT
AACCCCAATA ACCCGATCAX CAATCAAPA
TAA
INFORMATION FOR SEQ ID NO:458:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1074 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1074
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:458:
ATGAGAGGAT
TGCGCTTATA
ACTGATTCTC
GTGGAAGTGA
TCGGTTAAAA
GATTCAGCGG
CGCATCATTA
GAAGCTTTTG
AGCGAGCGGT
CTGAGTTTTG
AGTGCGAAGG
GAACGCTATT
AAAAGGGAGC
GCGCAAGAAA
TTGGCTAGGG
CATGGCATTG
TTAGAAGAGG
GTGCGCATTG
TAGAAAGAGA
GTTGCGATAA
GTTACACCCA
TAGAGTCTAG
AGCTCTTTTT
TTGGGAATAA
AAAACGAGCA
AAAATTTTGC
ATTTGCAGCA
AGCCTATTTT
ATTTTTTAAA
GCTCTGATAG
AGAGTTTCAA
AGGCTATTTC
GAGTGATTAG
ACTTAGACAT
GCATGGTGTT
AGGATTTAGT
ATCGCATTTC
CGCTTTGTTT
AGAAGCTAA
CGATTTAGTC
TGACCCCAAT
GGTTGTTTTA
TGAGATCCAA
CGAGTATGTG
TAAGGTTAAA
AGCCTTGAAT
AGCGGATCAT
GACTCGCACG
GGATAAAGAG
AGGCATTAGA
CGATTATGGT
TCATGAGCTT
TTCTGTAGAG
GGTGATTAAA
ACGCTCAATG
TTGCAATTAG
GAAAGCCTTC
CAAAGCGCGA
CAAGTGAATT
GAGGGCGTGC
CTCCTCAMA
AAAAAGATTT
GACTTTTTGA
GCGAACGCGA
AGCATTCTTT
GCTTTTTTTG
CGTCAAAAGA
GCGGGCATGA
TATGGGCAGT
CCCTATATTT
CCTGGGATTT
AATTCTAGGG
AAAACGCGAT
ACGATCGCTC
AGCCTAAAA
TTGATTTAAT
TGCAAACCTA
CCAGTTACCA
AATCTCAAGC
TTGATGAAA
CTAAAGAGGG
GCAAACCCCA
TGGATATGGG
ACCCTAPAkGA
TTTATGACAT
CCGGTAAAGA
ATTTCACTCA
CATCGCGCAG
ATATCCCTGG
CCGAGCTTTT
GTTTTTTGAG
GTTCTTTATC
GGGCGTTTTA
TACAAAGAGT
CAAGCGTTTG
CCGCCAAAAA
GTTGAATGTT
AGAGTCATTG
GGTTTATGAT
TGCCTTGCCT
GATCAAATAC
CTTTGTGTTC
TGTGAAAGAA
AGCGGACAGC
CAGCACCGGG
TGAAACCATT
GTTTTTTGGG
ATGA
INFORMATION FOR SEQ ID NO:459:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1158 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1158
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:459:
ATGAAAGATA
GCCGATCAGA
GTCGCATGCG
TCTGTTTATG
GACGCTCTTT
AGCCCTGATA
GGGCTTATGT
TTAGCGCACC
TTAAGGCCTG
GTTGATACGA
GAAGCGGTGA
ATCAAGTTTT
GGTTTGACAG
GGAGCGTTTA
TATGTGGCTA
TATGCGATTG
CATTCAAGCG
ATCATTGAAA
TTTGGGCGCG
GCGTTCTTTA
GTTTTCTTTT
TCAGCGATGC
AGACTTTAGT
CCCCGATGCA
ATGGCTTTGA
TTAATCAAGG
TTGGTTATGC
AGCTCGCTTT
ATGGCAAGTC
TTGTCATTTC
TTGAAGAGAT
TTATAAACCC
GTAGAAAAAT
GCGGGAAAGA
AAAATTTGGT
GGGTGATAGA
CGGAGTTGGA
GCTTGGATTT
AGTTAGAAGA
AGCGTTAA
CACTTCCGAA
GGTTTTAGAT
TTCTAATGGT
AGAGATCGCA
TTACAGAAGC
CGTGGATAGA
ATGCAAAGAG
CGCTCTAGCT
TCAAGTGAGC
CACCCAACAT
CGTGTATAAG
TACAGGAAAA
CATCGTGGAT
CCCTAGCAAG
AGCGAGCGGG
GCCTGTGTCT
AAAATGCGTG
GTTAAGACCC
ATTCACTTGG
TCAGTAACCG
TACATTATTG
TTTTGCATGA
AGAGAAGTGG
GCGGCGGTTT
GAAGATGGCG
ACTGAAACGC
CAAAAAAGAA
GTGCGTTATG
TCCCCAGAAG
GTTTTACCCA
TTCGTCATCG
ACTTATGGAG
GTGGATAGGA
GTTTGCGATA
ATTTATGTGA
AAATCGGTTT
ATTTATTCGC
GAAAAGACTA
AAGGGCATCC
AGCGGGATAA
TCACTGGCGA
TTAAAAAAAT
TAAATGGCAT
AGATTGGGGC
TCATGCCTTT
AAGACAACAC
AAAACAACAA
TTTCACAAAA
AAGAATATTT
GTGGGCCTCA
GGTTTTGCCC
GCGCGGCTTA
AAGCGACCGT
ACACGCATAA
TCAAACTCAC
TCACTTCAGC
ACAAGGTTGA
TGATAAAATG
AAAAGCCAAA
GTTAAAAACT
TGGCTATACA
TGGCGAGCAA
GGGGGATCAA
ACCCATCCAT
CTTGCCTTTT
GCCTGTAAGC
GCATTTAAAA
GCATGACAAT
AGGCGATGCG
GCATGGAGGG
TGCGGCCCGC
GCAGCTTGCT
CACGAGCAAG
GCCAAAAGGC
TTATGGGCAT
AGAGATTAAA
INFORMATION FOR SEQ ID NO:460:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 657 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...657
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:460:
GTGTTAGATA GTTTTGAGAT TTTAAAGGCT TTAAAGAGCT TGGATTTATT GAAAAACGCC CCTGCTTGGT GGTGGCCTAA CGCTTTGAAA TTTGAAGCTC TACTAGGAGC GGTTTTAACG
CAAAATACTA
TTAGAAAATG
GCAGAGTGTG
AOCAAGAATA
GAGTGGCTTT
CTGTGCGCCA
GGCATAGAGA
AATTTAAATT
AGATTCCATG
AATTTGAAGC
ATGATGAGAT
TCCGCCCTAG
TTTTAAAAGA
TAGACCAAAA
AAGAAGTGAT
TAGAAGATTA
CCGCCTTAGC
GAAAGATCGT
CGTTTTGAAA
CAATCTTAAA
CGGGTTTTAT
CTTTCAAAGT
GGGCATTGGC
GGTGGTGGAT
TGATGAATTG
GCTTTATGAA
GGAATTTTCC
TCTTTAGAAA
AAAATCGCTT
AACCAAAAAG
TTTGAAATT
AAAGAAAGCG
AAATACAGCT
CAGCATTTTT
AACACCATTT
AAACAAAAAT
ATCTAAAAAA
ATATAGAGTT
CCAAACGACT
TTAAACAAGA
CGGATGCGAT
ATCTTTTTTT
TTGAAAA-GG
CTTTAGCGCA
TGGAATTGAA
CGCTTTCATT
TTCAAAGCTT
GATTGATTTG
AGTAACCAAA
TTTATGCTAT
AAAAAAATTA
CGTTCAAGAG
ACTTTATGCG
GCTTTGA
INFORMATION FOR SEQ ID NO:461:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 399 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...399
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:461:
ATGAAACAAC
CTTATCGCTA
GCTGAGAGCG
GAATTAGTGC
AAAGGGATCA
AAGGAATTGA
GAAAACACCG
TATTTTTGAT
AGAATAACAG
CTAAAAAGAC
CTTTAGAAAT
TTTTAATTGA
ACGCTCCAA.
CTAAAGAAAG
TATTGGAGCC
CGAAACAATC
TGAGCGAGGC
TGTGGTAGAA
TGGCTATCCC
TGAAGTGATT
GGTTTTAGGG
CCAGGGAGTG
GCTCATTTTT
TTATTGATTG
ACGATCCTTT
AGGAGCGTGG
TTAAAAAGCG
CAGTTTTAG
GTAAAACCAC
CTACTGGGGA
AGAAATTCAC
CAGCGATTAA
AGCAAATGCA
TGATTGAAGT
TGACGCAGAG
TTTACTCAGG
TTCTCAAGGC
AAGCTCTAGT
AGCTTTGGAT
AGAAGTGAGC
INFORMATION FOR SEQ ID NO:462:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 462 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...462
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:462:
ATGCATTATT
CGCCTTTCCG
CATTTTTTGA
GACACCACCA
TTAAAAACCA
CTTAAAGTGT
AAAACAAGCG
TTCTTTTGGG
CTTATGAAGC
GTATCCCAGA
GTTTGCACTG
AGCGACAAAA
TTTTAGTGGT
TGCAAGACAA
CGAAATACAA
AAGTGGATTT
CTTTTTAAAA
AGCCCTTGTG
GAATTTAAGG
CGCCCTAAA
AGATGAAATC
ACACCCTGAT
AGCCGATGCG
GAAAAACTTG
GACAGCCTGG
TGCGTGATGC
GAAGTTTATG
ATTGAAAATA
GTAGATAGCG
AAAAAGTTTT
TTTTTAAAAG
AAAAGCCACT
AGTTAGCTAA
GAGGGGGCAT
GCATCAATGC
CCCCCACGAT
GTAATTCTTT
ATAGCGCGAG
ACGCTCCTGA
GA
ACALAGTGGAG
GACTTTAGTG
GATTTCTTAT
CAAAGATCAT
GGAAGCGGTA
TTTGTTCCAA
ATGGATTGAT
INFORMATION FOR SEQ ID NO:463:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 708 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...708
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:463:
TTGTCTGAAC
AAAATCCGCC
GATTCTTTGT
GTTACCAATC
TTGCAAGATC
AATTCATTTA
AAAACAAGCC
GGGAGCGCGA
GGCGATAAAT
TTTAAGGTGG
GTTACGGCGA
GAAAGGCAAT
CCATAGATAG
AACAAAGGGT
ATCGTGTGGG
AAAACCGCCA
TCTATAAAGG
ATTTTAGAGA
TAGCGATAAA
AGCGTTTGAA
TCGGGCGTAA
TTTTTAGCCC
GTTTTGGTTT
GGAATGAAAG
ATTCACACGC
TTTAATCTGT
GATAGGGCAA
GATTGGATCA
CATCCAAGCT
TTATGATTAC
ATGTCAAAAT
TCCTAAACAC
ATTTAGGGAT
TGAAGTCCCG
GCAAATAGCG
ATTACGAAGA
ATAAGGTGGT
GGCGTGGGGG
ATCACTATCA
GAAAGGATAG
TTAAATTGTC
ATTTTAGATT
TTCGCTTACG
ATCCAAGTGG
TTTTTAAAAA
CATTGCATAG
AGTGAAGTCG
CGAATTGGAA
TGTTTAAAAA
GCGTTGGAGG
TTGATAAAGA
GAGAATCTAA
GCATAGATGA
GCATGGACGA
GGAAGTTTAT
GGAGCGTGTG
AACGCCGTTT
AGCTTGGAAG
TGCAAGATAT
GATTTTGA
CGATTTTGAA
CTTTGCACTA
CATGTTTGAT
AGTGTTGGTG
AGCGTTTTTA
TTTGCCTATT
CAGCTCTATG
GGAAAGCTAT
TAAGGGGGAT
CTTTAATGCG
TATCAAAAAT
INFORMATION FOR SEQ ID NO:464:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1320 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1320
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:464:
ATGCAAGTTA
GTGGATTCAG
AAAGCCGATG
ATCCAAACCA
GGGTGCTTGA
TTTACCGGCG
TTCAGCGAGC
GTGCATGCTT
CCTAGCTTTA
GATCTAGCCC
TTATACGATA
CAAGCCTTAA
ATTGGCGCGA
ATCAGCGACT
CTTTTAAACG
CATCCAGAAG
TTTGACAGAT
GAGAAAGTGC
AAACACCAAA
CATAAAGAGG
GGGGAAATTT
ATTGTGCCTA
AAGAAAACAA
AGGTGATGTT
TGATCCTTAT
TTCTCAACGC
GCGAGCGCTA
TGGGGGATTA
AGGTGTTTTT
ACGTTAAAAT
AGGGAAAATT
TTAAAGGCTA
AGGGGCAAAA
AGAGCGCGCG
TTGAAGACTC
CCATGCTTAA
CCATGAAGCA
AAAATGAGAG
TGAATATTTT
CTAAAAAAAT
ACCATTCTTT
GCGAATATTT
TAATCAATGA
GCGCCTTTAA
ACAACTCTGC TTGATCTCAT TAGGTTGTTC AGGCAAGCTT TATAATTACA CGCTCACTAA CAACACTTGC GGTTTTATTG AAAGCGCTAA AGCCAAAGAC AAAAAAGAGG GAGCGATTTT TAAAGATGAA ATCAAAGAAT TGATCCCTGA TGACAAGATT GATATTTTAA TCGCTAAAAA AAGCGAGCAT TATAACGCGC GCATTATCAC TTCTGAGGGC TGTAACCAAA AATGCTCTTT ACAAAGCAGG GkATTAGACT CCATTTTAAA TAAAGATATG ACTTTTATCG CTCAAGATTC AGACGGCTTG ATCCAGCTCA TTAGAGCGAT TATTTTATAC CTCTACCCCT CTAGCACCAC GCCTATTTTT CAAAATTATT TTGACATGCC AAAAATGCGG CGCAACTCCA GCCAAGCACA GGTTAAAGAA AGCTTTATAA GAAGCACGAT CGAATTTGAA GAATTGAGCG CGTTTTTAGA TGCTTTCAGT GCTGAAGAAA ACACGCATGC CATCAACGCC CGCATTAAAG CCTTGAATAA TAAGGCTTTG TTGAATAAGC CCATTAAGGC TTACAAAGCA AGGGATTTGA GATGGGCGCC CAGCGAACTA ACCACCCCTT TAAAACCCGG AGACAATATT TTACTCGCTA AGGTTTTAAG
TAAAAATTTA
TGACGCTAAA
ACAAGAGAGT
AATTGCGAGC
AGTGGATATT
ACAAAACCAA
GGGATCGAGC
TTGCGCTATC
AGAAGTAGAA
TAGCTCATTT
TGACAAACAG
TTTAGAGCTG
CATCCAGCAT
CCATTTAAAG
CATTGTAGGG
CGAGTTCCGG
CTATTCTTTA
AATCGCTTTA
GTTAGTGGAA
TGAAGTGGAT
GCATTACACG
CCCTTTTTAA
INFORMATION FOR SEQ ID NO:465:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2004 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2004
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:465:
AAAAAACCCT
TTTTTTATCA
GAATTGAkA
AACCTCAATC
AATTTAAAAG
GCGGTGTATT
CAATGCGGTC
AATTCAAGTT
TCCATTGAGA
CAAGATAGCG
ACGCAAACTA
GCTCAAACCC
TGGGTCAATC
AATGTGTGTC
CAAGAAATCG
GATTTCAATC
CAAGCGCAAG
CCAAGCCAAT
GATGCGGGGG
ACGGCTTTAA
GAGTTGTTGG
TATAACAGCA
ATAAGCCAAT
CAAAGCGCTT
CGCGTTGGAT
GTGGGCTACA
TTTGACTATA
ACTTATGGGG
AGCAAGATTT
TCCCAGTATG
AATTTCCAAT
GCGAGCGATC
ACGAATTACT
TTGAATTATG
TTTACTCTCT
GCGCGGGCTA
AACTTTCAGA
AGGCGGTAAC
CAAACACGCA
TGGCGCTCAA
CTGGTAACAG
CCATTAATTG
ATTTTAAAAA
GATTTCCTGT
ATGGAGCTAA
TTTTGCAAGA
ACAATCAAGG
AGGTTTTTGC
TAACGCAAGC
CTTACACCTC
CCAAGATACT
TTATCACAAA
TTACTAACAA
ACAACAGCCT
CGCGCACCAT
TCACCACGAC
CCACTAACC
ATTCGCAATT
TAATCAGCTC
AACAATTTTT
ACCATGCTTA
TAGGGACAGA
CTTTTGGGGT
TGAATTTAGC
TCTTGTTCAA
ATGCGGCTCA
ATTCTTTGCT
TGTTCGCTTA
CTCTCTCGCT
TCAAATCGGT
CACTTATGAG
GAACGCGAGC
AGGGCTAATT
TGCGGCGGTA
TGGACAACAA
CAATTTAACC
GCTTAATCAG
TTTGGATAGT
TAAAAGTGAA
AGCCAGTAAA
ACAAAACGGG
CACGGAATTT
TCAAAGCCTT
TGCTGATAGG
TGAGCTAGCC
TTACTTGGCA
CACTTGGGGG
TGCGCATTTT
ACTTGATTTT
CGCTTCAAAC
TAATAACCCC
ATTAAGCGCC
TCAAACCAAC
TGGTGAAAAG
TATCAAATCT
TGTCCTCTAT
GTTTGGGGGG
GACCTTCAAT
TTTAGGCTTG
GCATGGCGTG
AGGCACTCAA
CTAA
TCATCGCTTT
GAAGCCGCTC
AATTTGAGCA
AGCCCTTCAG
GGCGAAAAAA
GGGCTGTGGA
AGCGTAACCT
GGTTATAACA
GCTTATCAAA
GCAGGAAAAC
ACTACTACTA
ATGATAAGCG
GGCGCGCCGT
AGCGCCGTTA
AACCAGCAAA
GCTTTCGCTC
GATCAAATGA
GCTTGCCACA
GCCGGTTGCG
GGCACTCAAG
AGAGGCAGCC
ACGCCTAATT
GGGGGCTTAC
ACGCAAGAAT
AATGGTGCCA
AGAAGGTGGG
AGCTTTTTCA
AACTTTATCA
ATTGCGTTAG
AATTTCTATA
AGAATGAACC
GAATTAGGCG
CTCCAATACC
TAAACGCTGA
AAATGGTGAA
ACCTTTTAAC
AAATCAATGC
CCAATTCCCC
ATGTCATCGC
TTGAGGGCCA
ACGGGGTTAG
CTATCCAACA
AAGTAACTAT
CTACTACTAC
TCCTCACTAC
GCGGTTTAGA
CTAGCATGAT
ACAATCAAAA
AAAACATGCT
AAAAAGACCT
ATGGGGGTGG
CGTATGTGGA
CTGAGCAAAT
TTAGTAATTT
CCCCATTCCT
AGGCCGTTTA
TAGGGCATAA
TGAATGGGAT
GGTTAAGGTA
ATTCGGCTTC
ATGATAAAAC
CTGGCACTTC
GCGCTAAAAT
TCGCTAAGAA
TGAAGATCCC
GAAGATTGTA
AGACAACGGC
AAACACCGGC
CAATTTTAAC
TGCGATCGAT
GGCGTATCAA
CTATAATGTC
ACCAGGACAT
CGGCCCTTTA
AGCTTTAAA
AACAATAACA
TACTAATGAC
AAACTGCCCA
TACGGCAGGG
CAAAAACGCC
CGCGCCGCAA
CAATCACGCG
TAACACTATC
GACATTACCT
AGAGACGATA
CAAGCAATCT
AAACAACACT
TAAAAATTTG
TCAAGTCAAC
CCCTTTCAGA
CGGTGTGCAA
TTACGGCTTT
TGATGTGTTC
CACCAAAAAC
ATGGCTGAAT
GAATGTGGCG
TAAGAAAAAA
CACGATCAAC
TAGCGTGTAT
INFORMATION FOR SEQ ID NO:466:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 264 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...264
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:466:
ATGTTTATAC TTTTTAAGTT TGGTAGGGTT TTGGGTAAGG CTTATAGCTT
ATACTTATAT
ATATATGAAA GCTTGATTTG TCAAGCTTTT GGGTTGTCAT TGAGTTGCAA
TAACTCTATG
CTGTTTTCTA CTTTTTTGAT AAATCTACCA TTACCACACA ATGAGTCCTT
ATGCTGTTGT
AGGGATATTT TAGCGTATTC TAACTCTTCA TCGCTAAAGA CATATTCATT
GGAGTCAAAT
TTTTCTTTCA ATTCTTTATT
TTGA
INFORMATION FOR SEQ ID NO:467:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 405 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...405
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:467:
ATGTTTATGG
CGTCAATACG
TGCGGAGACA
CGACGCCGGC
ACAGGAAAAT
TAAACGCTAA
ATTAGAACCA
ATATTTTTGA
CCTAATTACT
TTTTCAAAGG
AATGGGAACC
CGACTACTAT
CGCCGTCAGT ATTCTATGTA GACGCTGACG CACATGACTA ACGCTGATGG GAGTTGTAAT GTGGGGATTA
ACCCTAACAG
AGTGAATCAC ACGATTTTCC AATTTTTAGT ACACCATGGC ATTGAGTTTG GTATCAAAAT TTCTACTACT ATAAGAGCGA AAAAACAAGG CACCGGAGCA GAAACCAATT TCAGCTTAAC TTTGCGTTAT GTTTATACTT
TTTAA
TACGATCACA
CGTCTATACC
GAATGTGGGC
CCCCACGCTC
CCCACTAGAG
CCAAACCTTA
INFORMATION FOR SEQ ID NO:468:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 837 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...837
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:468:
TTGAACAAAC
GATGATCTGG
GCCAAGAAGT
AAGAGGAAAC
ATGTCATGGC
TTTGCTGAAG
GGCTTCCGTT
GCATTAGGCG
CAAAGAAAAG
AAACGCGCTT
GAAACAAAGA
TTTCAAGTGA
ATAGAATTTG
ACCCTTCATT
TGCTTAAAAA
TTACTTATAC
GTTTAAGGGG
CTTTTAATAT
TACAGAGTAG
TCAGTTTGAT
TTTATGTCTC
ATAGTTTGAG
AGACTTTTAT
TTGGGACGCT
TTTTCAAACA
TGTTGAATGT
GCGCGCGCAT
TCAAACGCAA
GGGGTTTTTA
GATCGCCAAA
TAAAACCCAC
AGATAAAAGC
GGAAAAATTT
TTATGGCTAT
TTTGGATTAC
GGCGAGTTCG
TAACGCTATT
GATTTTAGGG
ATGGGCTAAA
GGGGTATCGC
CCCCTTTTTA
TATTTCTGTC
GCGTTCTTTT
GAAGAAGATC
CCCCCATGTT
TCCCATTATT
GAAAACCATT
AAACAATTTT
GCTTATGGCT
CAGATCCCTA
TTTTATGGCG
GTGAATTTCG
GACTCTCTAA
TACCGCTTTT
ATTAATGATT
TATCTCACTT
TAAGCGTGTA
TAGGGTACCA
TTACCGCGTC
ATGGCACGAG
CAAAATACCG
TTCCTAAAAA
TTTTTCTTAA
AAAGCTATAG
TGGGGGCTGA
TGGGAGAAAC
ATACTTACCG
CAAGGTATAA
ATTTTAAAAC
CAACTTATGA
TTTAAGGGCT
GCGGTTTTTA
TAAAAAGCCC
CGTGGTGCAA
AAACATTCCT
AGAGCACTAC
AAATAAAGGC
AGAAAAATTA
CTTTTTATAC
CTGGTTTTAT
CCCCAACATG
GAATTGGGCG
CCCTTTATAC
TTTTTAG
INFORMATION FOR SEQ ID NO:469:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1497 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1497
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:469:
ATGAAATTAA
TTGTTCACTA
TTCTCCAAAG
GAAACTTTTG
ACCGTGAATG
TGGGCAAAGG
TTGAGCCTTT
GACCCTAGGG
AAAAACGAAA
CGGGTTCATT
TGGGTTTTAA
TTAACCTTAC
TAGGCGGTGT
ATTTTACCCC
GTATGAATGC
GTATTGGCTA
AGTTGCGGCT
AGGGGCGGTT
CCGTTCGCCT
GGGTAAGCTA
TTTGGGCGGA
CCCAAGCTAT
GACTAAAATG
TATGTATATG
ACATTGCTAA
ACTTATGAAG
ATTAACCCTG
GAGGGGTCTG
CAAGTTTATG
TGGGATAAAA
TGGCAACAGC
GGTGAGTGGA
AGCGTTTGAC
TGCATGGGGA
TTAAAGGTAT
TGCATTTAGG
ATAACACTAG
CTTCTTGCGG
AAGGGCCAGG
ACGGCTTGTT
CTTGCCACTA
TTTTATCAAC
CTATCCTACA
TAGGGGATGG
GTATGATAGG
CACTGATTCT
TGGTATCATT
CCCTAATTAC
TATCCGGCTA
CTTACCTATG
GAGCAAATG
AACAACATGA
TTGTTCCCTA
CCTACAAAGA
CCCGGTGCTA
AAAACGACTT
GCACCCGCTC
CAAGGTCCTG
GTGGTTGGTG
AACCCTGTGG
GCGGGGATTG
AAGCATGGTA
GAATATGGTA
AAACTCGTGT
GGGCCAAACG
ACGCCTACTT
ACAGCGACAG
ATTGGATTTA
AATTCTTGCT
TCTATCGTGA
ATCTAATGAT
AAATAGAATA
TCTATGTGTT
GTTATAACAC
CTCGTGCGAC
CTCCTTACCT
CTCTTGATCG
ACAACATTAC
AGTTCAGTTG
TTCGTATGTA
GGTTAGAGTT
GTCAGCCGCT
GCCCGGGCAT
ACTCCATATG
CCAATTGTTC
CTTTAGCTCT
AAAGCCTTGG
CCACCCTTAT
CGATACCAAT
GTATGACTAT
TTGGGATCCG
GCTCTATTTG
CAACATCGGT
TATCGAACAA
CGATGCTGAT
GAGCGTTTAT
TCTAGACTAT
CCAAATCCGT
CAACTTGAAT
TCAAGGCGCT
GTAATGGGGC
CAAGGTTTTT
TGGGGTCGTG
GGTATTCATA
GTGTATCTCA
CCTGAGTTTA
CGTTGGAATA
TTCTTGGATA
CACCACCACA
AACCCTAACA
TGGGTCGCTG
GCGTTCACTG
CAGCGCTTCA
CAGTTCAGCA
GCGGGTTACA
AATGGTTTGT
ATCAAGTTTA
GCTTTGATGT
ATGGGACTTT
GTATCGCTGA
AGGCGGGTAT
TCCCAATGGT
GCGGTAGAGG
ACGCTGAATA
ATGGTAAGTG
TAGACATTAA
TGAACTTAGG
GCATCTACAG
AGTATGTTAA
CTACCGCACC
AGCATGTTAA
ACCCTGGAAC
TTGAATCTTC
TAAAGCGAAT
TACCGAGCAG
CAAGCTTACT
TGGTCAATGG
TATTTATCC
AGGTACATTG
TATAAGGAAT
CGGCCGTTAC
GCGTGGCTTG
CAACTACTTT
TACTTGGGGT
CTTAGGCTTT
AGGTGGAGGT
AAGGGCTTTG
AGCGGGTCTC
CGGTTTCCTT
GGCGTTC
INFORMATION FOR SEQ ID NO:470:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 207 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...207
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:470:
TTGAAAGAAA
GAAGATTTAA
TTAGAAAATA
ATTGGCACTA
AATTTGATTT TTTTAAAGGC AAAAATTTTA AAATTGTCTA TTGTATTGGC CGACCAGAGA AAAGGGTTTT AGGGCTGTAA AGGAATTTTT AAGCGAGCAA TTGATCTTAA TTATTCCAAT TTAATTGTGG CTTATGAGCC TATTTGGGCG AAAAAGCGCG CTTTTAG
INFORMATION FOR SEQ ID NO:471:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 792 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...792
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:471:
ATGAGTAAGA GCGCGATTTT TGTTCTTTCT TTATTATATG GTTTGTTGTT AGAAAGGCAT TTAAATAAAA AGGACGAACA AGCCATTGAC AAGAATGAAA AAATTGAAAA AGTAACGGAA GAACCCAAAG AAGAGCCTGA AGAAAGCCTT CAAGAAAAGA CAGACAAAAA CGCTCAAAAA AGGCGTTTAA GAGAGCAACA ACGCTTGAAA GGTTTGCAAC AAAATTTGAA TCAATTCACG TTAGATTTGC AAATCCCTAA ACAAGATGGG GCTCAAATCT ATCAAATTTT ATATAAAGGC GTGAGCGTGC TGATTATGAT CACTAAAGAT TCTGATTTTA AGGATTATAA CAAGAGCGTG GATTTCCCCC CATATCCTGG AGGAAACATG
GGCTTTTTAG
AATAAAGAAG
TTGAATTTAG
ALACAGGGCG
GAAGATATTT
GACGAGCAAA
CAAAACCAAG
CAAAAATTAG
GTTGATGAAA
TGGAGGGGCG
GGAGAGTTTG
ATGACCCTTT
ATTTCTATTA
CGTTCTTGCT CTATGCTTTG CAGAGAAAAT CCTTTTAGAT AAGATCTGCC AAGCGAGAAA ATTTTTTAGA GCCTAAAGAA TTTCTTCACT CAATGATTTT AAAATGAACA AGAAGAACAA AAAACCAAGA GATGTTGAAA AAAGCGTTAA AAACAAGACT AGGCTTATCA AGAGTGGTAC TTTTTTATCA CAAGGCTTCA ATTATACCAT TCTTAGCTAC TAGATGATTT AAAGAAAGTG AAGTTAATTT TACGACTAAG
GAAGAACAAT GA
INFORMATION FOR SEQ ID NO:472:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2355 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2355
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:472:
ATGGAGCAGC
TTTAGAAGCT
ACAGGGCTTT
ATGCCTTTTA
GGCGAATACA
GCCTTAGAAT
GATGATGTGA
CAGTCATTAA AGAAGGGACT TTAGCTTTAA ATTACATGAG CGCTAAAAAT AAGCCTTTAA TAACGGGGCT TGTGGGCATG GTTAAAAAAT TCGTGTTCGC CCTAGAAAGC CAGACTAAAA AACAAAATCG TAAAGACGCC CCTAAAGAGA GGTTGCAAAA AATGGGTTTT ACTTGCGTGG TCGCAAGTTT AGCCACGCTA AGCCCTTATA TTGATACCTT TGCGTATTTG CCAACGATAA GGGCTTTCCT TTTATAAAGA CAGAAAAAAC CTAAAAGAGC TGAAAAACTG TGCTTTTACA AATCCCTATC AGGTGGGAGG GTTTGAAGCC AAACCCGCAT TTATTCTAAA
GATAAGGATT
TTTTTGGCGA
CAGGGCATTG
AACGCTAAGG
TTGGCGAAAA
GCCTTTTTAA
TTAAGTTGCG
TATGGTTTCA
CCCATATTAA
AAATCGCGCA
GAAAACCCTA
GCCCTAGCGT
TCGCCCTTTT
ATCATTGGGC
TTGGAAAATA
GTGGGGTTTG
ATCAAAGATT
AACGCCTTAA
ACTTTAGCTA
GGCTTTAAGA
AACGTTTTAG
AAGCAACTTG
ACCGATGAAA
GAATACAGGG
GACAAAGACG
AGCTCGCATT
CGTAAGGGTT
ATTGAATTAC
AAGGGGCGAG
GAAAAACGAT
AAATTGAGCG
TTCAAGCGAT
ACTTCTAAAG
TTAACCAGCT
AAGATTGCGT
TGGGGGATAG
AATTGTTGCA
ATTTACTCAC
GCAAAGAATT
CTTTTCCGAG
TTTCTACCTT
ACAGCACGCC
TGATTGTTTT
ACGCTAGGGT
TTTTATTACA
CTTTGGAATT
ATGATTTAAA
TCCGCATCCA
ATGAGGTTTT
TTAAGACAAA
AGCGTTTGTG
GAGACATTGA
TTGATGCGCC
AGCGCCAAAT
GCGAAGTCTT
AAAACTTGTT
AATTGAATAA
ATAAAATCCA
CGCCTAATTT
TTATCGCTAG
GCTTGTTGGC
ACATCCATTT
CCATCGCTAA
AAACTTTAAG
TCCCTAGCAT
CTTTT
TTTGAGCGAT
CGAAAAATAC
CAGCGATAAT
GCGATTAGGG
CCCTAAAATG
AGCCACTTTA
CGAAAACCCC
AAGGGATTTA
TATATTAGAC
AGAAAGCGCT
TTTTATGCGT
AGATCAAGGC
TTTACAAAAC
GCCTTTATTA
AGACACTCAG
AAAGGAATAT
AAGTAAGGCG
CGAATACTTT
AACGCCGTTT
TTATTTCAAG
TTTGGATCTA
GTATGATAAA
AAAAATCCTA
GCTTTTTAAC
TACCACTTTC
GCAAAATATC
CTCTAAAGAA
CCATTTCAGC
AGAAACTTCT
AAGCATTALAT
CATCCCTTTA
TAAAGATTAT
AAAATCGCGC
GGGATTTTGC
TACAAGGGGG
AGCTTGGAAA
TATCAAGCCT
CAAAGAGGAT
TTGTTGAAAA
GAAAATTCCC
AATACCCCCG
GAGCCTTTGA
TTGGTTTTGG
TATTTTTTAC
GCTTTTTCTC
AGCTTTTTAA
ATTTTAGCGT
TTAAAAGAAG
GAAAAATCAG
GAAAAAGGGG
GTGAAAGTTT
CGCTTAGAGC
ATCGGCGTGG
TTAGGGCTTC
GACAAGCACC
ACTTACACGA
ATCCAAACCG
CCGGTGCGAT
TATTGTTTAT
CAGGATAAGG
AAGGCGTTGT
TTTGGGCTGG
AGTGAGGCTA
TTGAATGGCA
TTTTTGATGG
CGAGTCAATT
TTAAAGGCAT
AAATCTATGA
TGATACAAGA
GCATTAAAGA
TCAAAGATGA
CTTTCATTGT
CCTTAGACAA
GCATGTTTTT
ATAAAGACAA
CTTTAGAAGA
AAATGTTACA
AAGCCAAATA
TTTTAAAGAA
ATTTAATCCC
AACTTTTAAG
GGCTAGAAGA
TAATGGGCAT
AGGAGTTTAA
ATTTCAACCT
CTAAAAATAA
CAAGCATCCC
CCCCCTTATT
GCACAGCTAC
CGCCTAAAGG
TAGGGGTGGA
ATTTAATGGA
TTGGAGAAGA
TGTATGGCAT
AAAGTTACAT
TGCGAGAAGA
CAAAACGGAC
CACGGATTAT
TGGGAGCAAG
AAATTTAGAC
CAAAGGAAGC
ATTTGATTTT
ATTGAAAGAA
AGAAAACGTG
CGCCCCTAAA
AGAAAAATTA
AAAAATTCTA
GGCGTTGTTT
GCATGCGTGT
TCAGGTGCCT
TCCGGAAAJA
GCATGAAAAG
CATGGAATTA
GGATTTGCTC
GGAATTTCAA
GAATGAATTA
CAATTCGCCC
AAGCCACTCT
TTTGATTTTA
GCGCTTAAAA
CGGGCGTTTA
CTTACTCATT
TTATTCGCAG
AGCGTTTTTA
TTTAGCCAAA
GGGGAGTAAG
AGAAGCGTAT
GATTTTAAAA
INFORMATION FOR SEQ ID NO:473:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 168 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...168
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:473:
GTGAGCGAGA GCGAAAAACA AGAAATTGAA AACAAGATAG AAAAAAAG ACGTGCCAAA GAACAAAAAG ATTTTTTAAA AGCCGATAGC ATCACAGAAG AGCTTTTACA ACAAAAA-ATC GCTTTGATGG ACACOCCACA AGGCACGATT TGGGAGALAGC TTTTTTAA
INFORMATION FOR SEQ ID NO:474:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1716 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1716
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:474:
TTGCCGAGCG TTCAAATATG GGGGAGCTAC
AGTAATAGCA
ACACACOG
TGGCCCTATT
ATCACAGCGA
TTTAATGGGG
GCTTATTCAA
GGCCTAAATT
GACGGGAAAT
GGGAGTTACC
GGTTCGTTTG
ACTTCTTTTA
GCTAATTCAA
CAAAATACCG
GATAACGTGG
AACATCACTT
AGCGCTCATT
GTAAGCTCTT
ATCAACTACC
GTCTATGATG
CCCAACAGCA
GAAAA-AT CG
AGCTATAACA
GACTTTTATG
GGGCAAAACA
CCGCAATCTA
AGTGGGCATT
AAGATTTACA
GCCTTAATTT
ATCTGGTGAT
TGGCTTTTGG
ACCAATGCAC
ATCTGCGTTC
TAGATAGTAT
GCTCTATGAC
CTAACGGCAA
TCATTTTCAA
AATTCAGCGG
AGATTGGCGC
ATTTCAATAA
ATTTGCAAAT
CTAATTTTAA
TATTTAACAG
TAAAAAACTT
CGGTGATTAA
CTAAAGCAAT
AAGGGCATGG
TGGTGTATTC
TTTCTATCCG
ATCGTTTGTA
ATAATTTAGG
CGAGCGGGAA
GCGCGATCGT
ATGTGATCAT
GCTGGAATTT
TCACCGGCAG
TAACGGGCTT
TGGGTATAAC
GGACCCTTCA
AGGCAGGACT
TGGCAATCGT
CAATATCGCT
TTTTTCCACG
GCTTTTGGTG
TGCAGGGCAA
CGATAGCTTG
AAAAAATACT
TTCTAGCGCG
CGCTGGGAAC
TAATACCGGC
CCCTACGAAC
GAACGCTCCT
TATTGGTGAA
TGAATACAAC
GGCTAGCAGT
TCTCAACAAC
GCGTTTGGGC
TTATCAAAAC
GAATTTAAAC
AACTCTATTC
TTTTGGGGCT
TCGCTTTGGG
GCAATGCATA
TATTGAAAGC
CAAGGCATTC
ACTAACTCGT
GCAACATGGA
GGGAGCGCTC
AACGGCACTT
ATAGGCACCO
AACGCTACCA
CAAAACATGG
TATGGCACAA
GCGACTTTTG
AATTTTTCAA
ATTTTTAATA
ACCACTTCGT
GCTGTTTTTG
TCTGTTAATA
ACGAGCGTGA
TTGTCTTTTG
GCTATCACAA
GACGCTTTCA
GAAAAGCTCG
CAAACCTACA
GTTGGCATGG
GCTCTCGGTT
AACACCATTT
ACTAAAGCGG
AAAAATATAT
GACAATAAGG
GGCTTTATCA
GGGAACCGCA
TTTTAA
TTAAACAAAC
CTGATCCCAA
TTAATGGGCA
ATAGCGCTTA
GTGGGGCAGC
TCACGCAACA
ACAATTCGCA
CTTTCACTAA
AAAACACCAA
ATAACAACCA
ACGCTAATTT
TTGTGGGGGA
GGAACTCTAC
TTGCAGGGAA
AAGGGAk.AGT
GCGATGGGAC
ATGGCAACCC
GTAAAAATCT
TTTCTAGTGC
ATTTCCAAGA
TGTTTGATTA
TTATGACCTA
ACTATTACGA
AATTTTCTCA
GGACGAGCGT
GAGCAGGGAG
CAGGGCATTA
TTTCTAGCGG
CTTTAGCGCA
TGTTTCTTCT
TTGCGGGCCT
TCATGTCTAT
CAATCTAATC
TAACGCCGGG
GAATTTGAAT
CCAAGCCAAA
CTTTAATGGA
GTTCAATAGC
TAACAACAGC
TTTCACTAAC
TAATGGCTCT
TGCAACCTTT
TACTCTCAAT
GATTGTTTTT
TATCACCCTT
ATGGCAGCTC
GGGTAATGGC
GGTTTTTTCA
TGTGGATATG
CATGCCTAAT
CAACAGCATT
AACGTTCACT
AAGCGATGCG
TAATGATGCG
TGAAGCGCAA
TGGGGCGCGA
INFORMATION FOR SEQ ID NO:475:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1107 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1107
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:475:
ATGGTTGTTT
TTTGGTGTAA
ATCAAAGTTT
OAGCC.TTTGG
TGCGTTTCCA
AAAAGCTTAC
GGGCATTTAT
GCGTTGTTGC
GCTTTTAGGG
CACCCTAATT
CTTTTTGAAA
ATTGAAAGAA
CATTTAGCGA
TCTTTGAGTG
CCCATTAGCG
AAGCTTGGCG
GAAATCGCTT
AAATCTTTCA
CGTTTTGGAA
TAGGAAGCAC
AAATAGAAGC
TCAAACCCAA
GCGCGGAAGT
ATTTAGTCAT
AAAGGAACAA
TAGACATTTC
AAAACAAGAC
ACACGCCCTT
GGAGCATGGG
TTTTAGAAAC
GTTCTATCGT
GTGCGGACAT
CTTCCATTAA
TGGAGCGCTA
TGGTGCTGAA
TTGGTGGCTT
AGCTCTCTAG
GTGTAGCGAn
CGGCTCTATC
CTTAAGCTGT
AAAAGTAGCG
GTTTGTGGGG
TAACGCCATT
AAAACTAGCC
ACAAATCACG
TTTAAAGCCT
AGATCTTATC
GGATAAAATC
CTATTGGCTT
GCATGCTTTG
GCAATTACCC
ACCCTTAGAT
CACTTTGTGG
TGCGAGCAAT
TATCCAAATC
TTTAGATOAA
AGTGTAG
GGGAAAAACG
GGGAAAAATA
ATCTTAGATC
TTAGACGGCA
GTGGGCGTAG
CTAGCGAATA
CCCGTTGATA
AAATCTTTAA
GCTATTCAAA
ACCATTGATT
TTTGGTGCGT
GTGGAGTTTG
ATAAGCTATG
TTATACGCTT
CGTTATAAAG
GAAGTGGCGA
ATCTCTCAAG
OTGCTACT
CCCTTAAAAT
TCGCTTTAAT
CTAACGATTT
TTGATGCGAT
CCGGATTGAA
AAGAAAGTTT
GCGAGCATTT
TCATTAGCGC
ATGCGCAAAA
CACAGGAT
CTTTAAAGAT
AAGACAACTC
CCATTAACCC
TAAGCGCGAT
ATTTGTTGTT
TGAAGAAGTT
CTTTAGAATT
TAGATAAGGA
CGCAAAAAAA
CAATGAACAA
GAATAATTTA
GATAGAGGAG
AGCGAGCTTT
AGTGAGCGCG
TGGTTTGTGG
GAGTGGGGGG
CGCGCTCAAG
GGTCAATAAJA
TGACGCGCTG
TGTCATTGCG
TAAATTAGCT
TAAATTCGAA
GGAAAACCCA
CTTGAATCAA
GTATGCTAAA
AGTTAGGGAG
INFORMATION FOR SEQ ID NO:476:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 261 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...261
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:476:
ATGAAAACGA
CGCAAACCAA
GAGCAATACC
ATCAAACGCT
TTTAACTTGT
CAGAAAACAC AGATGAAACT AAGCAGACGC TAATAAAAAA AAAAACTAGA GAAAATGGCT ATATTTTGAA GGCTTTAAGA
TTTTAATTTA A
CACTTAAGGG
ACTCGCGCGG
AACGAAGAAG
AAAATAGAAC
AAACTAAAAA TAAACTAGGA TAAGCTTGTA TTTTTCTGAT AAGAAAGCGT GGGATCTTAT AAGGTGGTGG CTTTATAGCT
INFORMATION FOR SEQ ID NO:477:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2412 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...2412
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:477:
TTGTGGCTTA
AACGCTTTAA
ACCTTACGAA
ATTGAAGCGT
ATTCCTAAAG
CGCCAGCAAC
TCTTTTGATA
AAGATGTGGC
GCTCAAAAG
GAGCTGGATG
TTAGAAGCTA
CGCGCGTTTA
ATCGCATTAG
ATTAAAAATT
TTAGATGAAA
TACAAAAACT
GGCTCTGATT
AAAGAGAGCG
TTTAATAACG
AGTCAAAATT
GCCTCACGCT
ACGATAAGGC
CTTTATCTAT
AGGGCTTTAC
AATTCATTTT
GGGATTATAA
AAATCATAGG
GCTTGAATTT
TGAATAACAA
AAAAACAAAT
AAAATTACCC
GTAAATTAGG
ACCCGACTGA
ACAACAATTA
CCCGCTACGC
TGAGCAACGC
CGAGTGAAAT
CTAAATATCT
CTTTCTTTTA
CACGCAAGGC
GTTTTCTTGC
TATACACGCT
CCCTTTAGAA
ACACATCAAA
AAAAGCGATC
CTATGATCAA
CCCCATTATC
ACCCCTACTC
GGATTCGCAA
TCAAACGATT
CATTAAAAAA
TCCTAATATC
CAAACAAGCC
TCCCTTAGCC
TAATATGCTT
CGCGCTTAAT
CATTGATAAG
ATGGGCGTGG
AAAGAAGAAG
TCTTATGCTA
AAACGCCCCA
AACGCTTTTT
CCCAAAGTGA
CCCCTTTTTG
AAAATCCCTT
ATTAAAGACG
ACCACAAAGG
GCCTATTTTG
TTTAAAAAAG
TCCTTACTCA
CCTGAAGCGC
GTGCGCTATT
CAAATGCGTT
TTTAAAGAAG
TGGGCTGAAG
GTGGTTCAAT
GTTTGCTCTC
GAGAAGATTT
ATGAAAAACC
TAGAATGCGT
TCAATATCAC
TGCGAAGGCT
TGGAAAACGA
TTTTGAGCGA
CTCAAACCCC
GCTATGATTT
ACGCCTTACG
ATCTGTATTT
TAGATATTGG
TATACTATGT
ACAAACGAAT
TAGCCATTGA
CTTTTTCTAA
CAGAGATAAA
CCAACCCTGA
CCATTCACTC
TTCGGTTTTA
GCCGAGCGGG
GATAGATTCT
CTATTCCATG
CACCCTTTTT
CGCTAAAGCC
AAAAGACAAC
TATCATTCAA
AAACGCCTAT
CACGATCAGC
ATTAGAAATT
CACCCAATGG
CGCCAAAGCT
CCTTTTAGAA
AGCGGCTGAA
CGCCAAAGAC
CTATCAAAAC
TTATCTTTCT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140
ATGCACAGCG
AGTGCGATTG
CAAGCGCTCT
CTTTACAATC
GCACGCGATG
TACGATAAAA
AAAGCCCAAC
TTGCCCAAAG
GAGGATCATC
AACCCCCAAG
GCGCAAATTA
TGGCTTTATC
GCTTCTAAAG
GCTTTTGTGT
TATGCGTTTT
TTATTAGAGA
CTCCAAGACG
GCTTACAGAA
CGCAGGCTTT
TTAACCAATC
AAAGATCAAA
AATTAGCCCT
AGATCGCTCA
ATGATTTAGG
TGCAATATTT
AAAAAGCCCT
TCATTCAAAA
TCTTGTTTGA
ATTCTCCCTT
GTTGCAAAGA
AAGAAATCCA
TTGCCCTAAA
GTTTGGGGCG
ACGCCTTAAT
TATTTTCAGA
TAGAAAAGCA
ATGAAAAAGA
CTTATAAGGA
CCACCAAAGA
CTTTAGAAGA
AAAAAGCAAA
CAAACGCATG
AGAATTGCTC
CTTGCTCCTC
GGCGTTGTAT
GCAAGACCAT
TTTTTCAATG
TTTCCCTAAT
CAATAAGCAC
GATCCAAAA
AGCTTTAAAA
AGCCTTTGAT
CGCTTTAAAA
CAATTACTAC
CCTCGCTCAA
TTACATGCAA
TTTCAAAGAC
CCCTAAAAGC
TTATTCTTAC
TTATTCAAAA
TCACCAAAAA
ATCTAAAGCC
GCAAAATTTA
AAGTTATTGA
AATCPACATG
GCAAGGATCA
GCGGAATTGG
GAAGGGAACA
TCTAATGAAG
TATGCTGAAG
ACGCTCAATA
TACTTATCCC
TGCTTGTATT
GCGGCTAAAA
CGCTTAGGGG
AGCTTAAATA
AACAATGAAA
GATAAACGCA
GTTAAAATTT
ACGCCCTTTG
GCGTTAGAAA
GCCTTATACT
AGTTTAGAAA
TGCGAACAGG
AAAAAAACCA
ATGACTTGAA
AGGATTTTAA
AGCGGGCTTC
CGCAAGAAAA
CCCAAAAGGC
TTTTAGGCAT
TCCTTGCTAA
AGATCACCGC
TCGCATCGCT
CCCCTAGCGA
ATTTTAAAAA
AAAAAGAGTT
AAGGATTGGC
TGGCGTTGGT
ATGCCACAAG
GCGAATTTGC
CGCTAGACAA
TGCAATCCAG
AATGCGTTCA
GTTTAAATTT
AATGAATGAA
AGCTAAAGAG
AAACGCCCAC
TGTCGTTAGA
AATCGCCCAC
TTTAGAGTTG
GCAAAAAAAT
AACCCCATTA
CTTTGCATTC
CAAGGAAAAA
OAAATTAACA
TTCCACTCTG
TTATGACATT
CCTCAATTTG
TTATTTTAAA
CTTACTCAAA
CCTTATTGAC
ACTCTTAAAC
CTTATTGGAT
ATTAAAGCAA
ATTCAAAAAC
1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2412 AAGGAGTCAT GA
INFORMATION FOR SEQ ID NO:478: SEQUENCE CHARACTERISTICS: LENGTH: 798 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...798 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:478:
ATGATATTAA
AAACATTCAG
AAACAAACCA
TTAAAAATGG
AAGGATGGTA
GTGAAATTAG
AGCGCGAAAT
TCTACTAACG
CATTGCCAAA
GTTGTAGTGG
ATGGCGAAAG
GAGCGAGCGT
TTTCAGCTAG
ATAAAAAGGT
TCGTTATTGA
ATTCAGTCAT
TTGCAGAAAC
TGAACGCTAT
CTGAAAATAA
AAGAGCTCAC
GGTGGCTTGG
CTAGGGCTAG
GTTGAGCGCG
TGACAAACGG
GCGTATTTTA
AGATCCGGAC
AGGGCTTAGC
CAATCAAAAA
TTTTAATGAA
GGATAAAATC
TAAACTCAGG
AGTCAATTCG
GGGAGCGAGC
TTACTTCTTG
ATGCAGGATA
GAAATCAAAC
ACTAAATACA
AATATCTTTT
ATCCAAGCCC
ATACCGGCTG
CTTTACATTG
GATCACTTAA
GCTAAAAAAG
GTGGAAGATA
TAGGCTTAGG
ATTTAGTGAG
CTTTAAAATC
ATATCCCGCT
TTAGCAATAA
TTAACGCTAC
ATTATGCGAT
TTTCTGATCC
AAGAAAACAC
CGGCTTTGAT
AAATCTCTAT
GGCAGCCCCT
CGTGATTGAA
CAGCCAGGAT
TGTAGTGAGT
AAGCGATGAT
CCAGCAAAAC
AGAGTTGCCC
CATGTGCCCG
TGTGAGAATG
CCAAGAAGAA
TCTTGAAAAG
120 180 240 300 360 420 480 540 600 660
ATTTATTCCA CCCAATACGA TATTAACGCT CAAAAAGAGC CTGAAGATTT ACGCACTAAA GTGGAAAATA CCACTAAAAA GATTTTTGAA TCTGGCGTGA TTAAGGGTGT GCCTTTCTTG TATCATTATA AGGCATGA
INFORMATION FOR SEQ ID NO:479: SEQUENCE CHARACTERISTICS: LENGTH: 684 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...684 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:479:
TTGGATTTTA
TTAAGCGATT
AATAAAAAAC
CACCGCAAGA
GAGGTTTTTA
GAAACTGGCT
TATCCTAGTG
GCGTTAGAAA
AAATTAAAAC
ATCATCGCCC
GTGGCACTGC
AAGGGGTTTG
AAGCGATAGA
TTGCGAAGTC
TTTACGATAG
TCTATTTGTG
CGCTGGATTT
TTGGCGTGGG
CGTTTGGCAA
ATGGCTGTTT
AAACGCTAGA
AAGCCACCAA
AACGCCAAAA
GAAAACTCAC
OCATOGAGAG GAAACCCTAA AAAACGAGAC TAGCGATACG CATATCGTGG CATGCAGTAT CGCTTATATC ATTCCACCAA AAGGAAAAAT GGGCGATGAA AAATCGCGCT TCAAAAGGGG TGGGGATTTT AGCGCGAAAG TGGGTTTGCA TGCGAACCTG CTTGTGTTAC AAGGGGCTGA AGCTAGGGCT TATAATTGGG ATTTGTTGAG TGTGTGCGCT TGCAATCATA GTGGGGAAGA ATTTGCCGGT GATTCAAGAA TCATCGCACC GCTTAATGAA GTCATTATCG CCGAAATGGA AATCCCTTAT TTACAAGATT TTGACACCAA
TTAA
TTTAAGAGCG
TGAAAAAAAT
CGTTGGCAAA
TAAAAAATAC
AATTTGCTAT
GGTTTTAATC
CAAAGCTAGA
AACTAACGCT
CAATGGGAAA
TTTAAACGAA
ACTCACCAAA
INFORMATION FOR SEQ ID NO:480: SEQUENCE CHARACTERISTICS: LENGTH: 882 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...882 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:480:
ATGGCAAAAG
OOGAOCTATG
GTGGGGATCC
GCGCCGGGGC
CATGAAGTGG
GAAGAAGACG
ACAGGCTATG
CACGGCTTCA
GTGGGGGATA
TTAATCCGTG
TTACCGCCGA
GATATAGGGC
GTGTTTAGCA
GAAAAAATCA
ATCGCTGATG
AAATTTTAGT
OTGGOAGGA
CACGGCTTTT
ATTCTATTGA
GTGCGCATGG
TTTTGTTAAA
TGGCGCCGTG
AATACGACCA
GTTGGAGCAA
GGGTGGAAAC
TGATGTTTAT
AAATGTGGAT
TGACAATCCA
TTGAGCATAT
ATTTCTTAAA
GGCTTATGGC
TTCGCCTGAT
GAAATTGTTT
AACTTTTCCT
GTATTCACAC
AAGCGTTGAA
GTGGGAGTTT
TTCGCTCATG
GATTGATTAT
AAATTTAGTG
CAAAAAATCC
CGATCAATTT
CCCTGATGTG
CAACCAGCAC
ACGAAACCCT
GTGGATATTG
GATATTTCGC
AAAAAATACC
GAACCAATGA
GAAAACCCTA
TTGATTAAAG
TCTAATATCA
CACAATGATT
AGTTTGGAGG
GAAATCCCTG
CCCAATAGCT
GATTGGGTTT
AGCGCCCGTC
GAGGGCGTGC
AGGAAAAAAT
ATGCGGTGGC
GTGGGCTTTT
ATCTCCCTGC
AAATGATCGT
TCGCTATGAC
ATCTCACCGG
CTAATGAATT
TCACGCCCTA
CCAAGGATTG
CGAACTGGTA
TTGGTTTTGT
ATCGTGAGAT
CGCAAGTGTT
GTTGGGTAAC
AG
TGGTTGGCTG
TGCGGGGGAA
GACTTGGTTT
GGATGCAGGG
GGCCAAGCAA
CAAAGCCCCC
GCTTTTAAAA
TTTCGTGCGC
GATGAAGCCT
TTTGGACGAT
AAGCCCGCGC
GGATTATGCG
GCTCATGCAT
GTTCAATGAA
120 180 240 300 360 420 480 540 600 660 720 780 840 882
INFORMATION FOR SEQ ID NO:481: SEQUENCE CHARACTERISTICS: LENGTH: 411 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...411 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:481:
GTGTGCGGCG
GACTGCTCAC
AGCTCTCAAG
TGCTCTAAGC
GGCAGTTGCA
TTGAGTTTTA
CTAACCGGTT
TATTATCGTT
TATTCGCTAC
GGGGCGTGTG
CCCTAGACTG
TOGTCAGCTC
AAAATTGCTT
TTTTAGGGGG
ACAAGCGCGT
AACTTGCACC
CGGCGTATTA
CTCACTATTC
TCAAGGGGGC
TTTAGTGAAA
GTTTTCTTTT
TTGAGCTTTG
CCGCAAAACC
TCGTTACAAG
GCCACAACTT
GTGCGCGGCG
TCGCTCAAAC
TTGACAACCG
AAAATTGCTC
CGATCGGCAG
CGCGTTTGAG
GCACCCCGCA
TATTATCGTT
CCTCGCAATT
GTTTTTGGTA
TAAGCCCCTA
TTGCATGGTC
CTTTGAAAAT
AAACCCGATC
ACAAGCGCGT
GACATTGATG
120 180 240 300 360 411
INFORMATION FOR SEQ ID NO:482: SEQUENCE CHARACTERISTICS: LENGTH: 1083 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1083 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:482:
ATGAATGGTT
ATAGCGGTAT
CAAGGGCATG
TTATACATTA
GATTTTCAAA
GGGCAAACCC
GATCACGCTT
GAAATTGACA
TTTTTATACG
CTAAAAAAGG
ACTTATAAGG
AAGCATGTTG
ACGCATAAAG
GCGTTAGAGC
AAAAAAGAAG
TTTAAAGATG
GTGAGCTTAA
AAAGGGCAAG
TAA
TTTGCGCTAG
TACTCAGTGG
AATTAGTGGG
AAAACGCTCA
AGGATTTTAA
CTAACCCATG
TAAAATTAGG
AGGTAAGTTA
CTTTAGAGCA
ATATTAAGCC
AATCTCAAGA
AAGTGGAAAA
GCTATATGCA
CGCATTTTGT
ATCTCGCCAC
GCGAATACTT
AAGATGGAAT
CCTTGGTCGT
ACTACGAGCC
GGGGGTGGAT
GATTTATTTA
AAAAGCGTGC
AAGCGCGGTT
CGCGTTGTGC
GTGTGAAAAG
TATTCAAGAG
TGAAGTTATC
TTTAGCCTTG
AATCTGCTTT
AGAGGGCGTG
ATACACGATT
GGTGGGGATT
GCATTCGCTT
CATCAAGGCC
GATTGAAGTG
TTATAAAGAT
ATAACATTTA
AGCTCTTATA
AAACTCCATG
GAGTTTTTAG
TATGATGAAT
AACCCTTTAA
ATCGCTACCG
GCTTTGGATA
GCTAAATTGG
AATGCGATGC
GTGGAAAAAJA
GTGAAAAACC
GGCAAACGCA
GACGCTAAAA
AAGGCTAAA\
CGTTACAGGA
GAATTTAAAG
GATATTTTGC
ATGAAAGACT
GCGCTTATAG
CGAGTGAAAA
GCATTCCTTT
TTATCAACGC
TGAAGTTTGG
GGCATTATGC
AAACTAAAGA
TGTTCCCTTT
CCTTTTTAGG
GCTACATTGA
TACAAGGCGA
AAGGCTTTAG
AGAACGAGTT
ACAAATCTTT
GCGTGCCTAC
ACCCTTTTTA
TTGGCGGGGG
AAAAATGAAA
CTTAAAAGAG
AAAGCATGAT
AGAGGTGTTG
CTATGAAGAA
GCTAGCTTTA
GAGGGTCAAA
TCAGAGCTAT
AGGGGATTTG
CACTTTAGAA
CACTTTAAAA
AGTCATTGGC
TGTTAAAGGC
GATTGTGGGC
AACGAAAGAT
TAAAGCGTTT
CGGCGTGGCT
CGTGATTGTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1083
INFORMATION FOR SEQ ID NO:483: SEQUENCE CHARACTERISTICS: LENGTH: 1074 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1074 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:483:
ATGAAAGCTA
TTTAGGGCTA
ATGCAAAATA
ACGATAGAAA
TCTTTAAGGG
GATGAAGAAA
ATCGGCTGTC
CTCAAAGCGA
ATTGAAAAAG
GAGGTGTGTA
ACCATTTCCA
GTGCAATTAG
TTGAATAAAA
CAGCGAAAAA
TGCGCTAAAA
TTCAACCCGC
GCGGATTTTT
ATTGAAGCGG
GCATTTATGA
AACAGCTTTA
ATTTTTCAAA
TCGCGCATGT
ACAACCACAC
CGAACGCCGT
AAGTGGGTTG
GCGAGATTAT
CGCTCAACAT
AAGCGGTTGA
CGAGCGGCGT
CCATATCCTT
AATACAACAT
GAGTGATGTT
AACTTTTAAA
ATGAAGGCTC
TAAACGCCAA
CTTGCGGGCA
TTTCACTTTA
TTTGTGGCTC
AGACTTTATC
GAGGGAGAGT
TTTTGAAGCG
TTTAGAGGGG
CTCGTTTTGT
CCAGCAAGCC
TGTTTTTATG
CATTTTTAAT
GGCCGATAAA
ACACGCCGTA
TGAATGCGTT
TGAATACCTT
ACTTTTAAAC
TAAGTTTGAG
AGGCTTATTA
ATTGAGGGAG
GATGAATTGA
TATGCGAAGT
GCTTATTTGG
GTTGATGGCT
GTGTTGTTGA
GAAAAATACA
TTCACTCAAA
CTACTCATTA
GGAATGGGCG
ACCGGCATGC
ATCCCCATTT
GATGATAAAA
TTGAATGAAG
TTAATCAAAG
GGCATTAAAT
CGCCCTAGCT
TGCACCATTA
AAAAAGCTCC
GCCAACTTTT
ATAAAACAAG
AGCAAGAATT
CTAAkAAATA
AAATGAAAGA
CCGTGTGCGT
AAGGCGGTTT
AAGAGGCCAA
AGCCTTTGAA
AAATTTCACC
TAGCGGGCAA
CGCGCTCGTC
TGAGGAAATG
ATTTAAACGA
CCAAAGTGAA
TAGAGAGCGC
GAGAATCCAA
AGCAGAAAAT
AAAACCAAGT
CTTTAAAGAC
TACTTTGCGC
CCTTTTTAAA
CAAAAAGATT
GTCTTGTCAA
TGTAAGGGAT
TAACCTCCCC
CAATTTAGAT
TAAAAGGATC
AAACTTAGGC
TTTAATGCCC
GCCTTTAGAG
TAGCCTGGAT
TTTGATCTTA
CAGAATGTTT
AGCCTTGGAT
TTAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1074
INFORMATION FOR SEQ ID NO:484: SEQUENCE CHARACTERISTICS: LENGTH: 1407 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1407 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:484:
ATGGCTGAAT
TTTAAAAGAT
GCGCTAAAAA
GAATTAAAAA
GGTGCTTCTG
GTAGCCGCTG
GTGGAGCTAT
CTTATGAATC
CAGGAATTTA
CCAACTCCAG
GGAAAACGGA
CCCTGCAAGA
TCATAGTGGA
AAGCCAAAAG
TCGTCTCGTG
AAGCTGTTTT
TAGAACGAAT
AAGCTTGGGA
TCAGGCGTTT
GCGTTTTATT
CACAGAAGAA
AGAAAAATGC
GCGCAGAAAA
CAATGAAGAT
GATTTGGCCT
GAAATTTATG
GCTTGAAATC
GGGCATTAAA
GAAACAAGCG
GGAATACGAT
GTCAAAAAGO
AGTCCATTTA
ACTGAAATGC
GACGCTAAAG
CCTGCTAGAA
AAAGAAGACA
TACTCGAATC
AAGAGGTTGC
AGCGATGCGA
TTTGAACGGC
TTGTTGGAAG
TCAAAGACCT
AATTGGAAAA
TTGCTTTAAG
TAGCTGCTAC
CAGAAAAATG
AAGCCAAAGC
ATTTTTATAC
TAGATAACGA
CTGCTATCTC
ATGCAGGGAA
TGATAGTTAC
AGCCATAGGA
AGTCTTGCAA
GGCAGCCATT
CAAAAGGAAT
GAGCGCGAAT
TGATAAGCAC
ATACAACTTT
TTATACCCCT
AAAAAAAGCG
TACGCTGATT
ATGATAGCTT
TCTTATGATG
TTGGAAGA
AGCTTGACTT
ACAGAAAAAA
TTAGAAACAG
GAACGGAATG
AAAGAGAGCT
TGGCGTTGGG
AAAACCGCAT
TGGGCTTTGA
TGTGGCATTT
TTTTTAATGA
TGAAAGACAA
TTAGAGAGCA
AAAACCCTAA
GTTTAGAGGA
TAGCCAAAkA
TCAAAGAATC
AGTTAGAGCG
AAAATTATAC
TTATTGTAGA
CGGAATTTGA
GCGCTCATCA
TGCGACAGAG
TGACGCGAAT
GCGCTTGAAG
GATCCATCAT
AGAATTTGAA
CGATGAATTA
TTTGATGCCA
TCGTTGCGTT
CCCTAACGAT
CGCTACAGAA
GCAAAAAGCG
TGAAAAAAAC
TAGCGTTTTT
TGCCCTAAAG
TTGGAACAGA
GTTTTAG
GATTTGAGGG
AACGCCTTGT
AAGAGTTTAG
GATCGCATGG
AGCGTTTTAG
AAAAACTTTA
TCAAACGCTA
AATTTGAGTC
TTGGAGTATA
CCCTATCCGG
AGCGCCATTG
GCTTTACAAG
TCGCAAAAGG
AAAATTTTAG
CTAACGATGA
AGGATTGGAT
CAATATCTAA
GCGTGCCTTC
AGGAAGCTTT
TCAATGAAGC
AAAAAATCGA
GGGAGTTTTT
AAGAAGTCAG
TGCCTTTAGA
CTACGCTAAA
ATTCATCCCT
CGCTTCTTTA
TTTAGMACGC
GGGTGCATAT
AGAGCAGGAG
TTATAACGAA
AGAAGGTTTT
CTTTGATAAT
TCCCGTTTTA
AGAAAGCCGT
CTTTAATGAA
AGATTTAAAT
GACAACGATT
AGGGGGTATT
660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1407
INFORMATION FOR SEQ ID NO:485: SEQUENCE CHARACTERISTICS: LENGTH: 807 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...807 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:485:
ATGAAACAAA
ATTTTGGATA
GAAGAAATGG
TTCTTAGAAG
ACAGAACACC
TCATTTCTCA
GATCCCCAAA
TCGTTTAAAG
GAAACTATCA
ATAAGCGAGG
TACTATGTTC
ATGGACTCCA
GAGCTTTTTG
GAAAATCAAA
GTTTGCGCGA
GTTTTTCTAA
AAGGTGAAkAT
ACCATGTCAA
TTTATTCAGG
GTAATCAGGA
AAACATTATC
CCAATAATTC
GGCAGTCGTA
ACATGCAGAG
AGAACAAAAT
TTATTCCTAA
AAAAATATTT
ACAAAAACAG
ACAAAAATTA
TTACCTTTTT
CACTGAACAA
TGTGTTTTAT
TCTCATAGAT
TTTAGACTTC
AAGATTGATT
TTTTGTTTCA
TAGGATTCTC
CGATATAGAG
ATACCCAACC
TTGGATTCAG
TCAAAACATT
CGATTGA
TTGAAAATTT
GAACTGAGAG
AACCTTACCG
GAGAATGTTT
AGTCTTAACG
CGCTTTTTTA
CCTCTTCAAA
TTAGTTTATG
AGATTACTAG
AATTTTTTTG
AATCATGCCT
ATTGATATGA
GATGAAGTAA
TGGAAAATGA
AAGAATTGGA
CTCTTTATGA
TGAATATAGA
CTAATCTTCA
AAGAAATAAA
GTGGGAAGAA
TTTATGCTTA
AAAAACCTAT
TTCAAGCAAA
ATGACTTCAC
GCGTTGAAGC
CAAACAAAAT
TGTCTTGACG
CTTCATAGAA
TTTTTCTAAT
TGATGTCAAA
CTTTGTCAAG
CGATGGGCAA
TGATGCAAGC
CTTCATGCTA
CAATAACAAC
TTTTTTAGAA
GCATTTGATC
TAAAAAGAAA
GCTCGATCAA
INFORMATION FOR SEQ ID NO:486: SEQUENCE CHARACTERISTICS: LENGTH: 777 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...777 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:486:
TTGCTCTCTA
GTTATTTTCC
CTTTTTTTTA
CAAAAACGCC
TTAGTGGAAA
TATGAAGCCG
GCGAGCATGG
AACGCTTTTT
GTGGGGGATA
GATTTTTTAA
AAAATTAAAG
TTAGTCAATA
AAAAAAGCTT
ATAAAACGCT
TGACCACTCA
TCACTAGGGG
ACCAATTTGA
AATACCCCCT
ATGATATTAT
ATAAGGATAT
TTAATGTGAG
AAGGGGATAA
ACGAAGACGC
AAGATTTGAG
TGCACCAGAT
TTTTCCCTAA
CCCTAAAAGA
GATTTTATCT
CAGGGAGAGT
TGAAGATTTT
AAAAAAAGGG
CTCTTTTTAC
TTTGTATTCC
CCAAAAAGAA
TATTAAGGGG
TAAAGAGCAT
CGATAGCGAA
GACCCACCAT
AAAACCCCAA
TTCGCCACTT
ATCATGCGCA
TTCCGCTACC
AGATCCCTTT
GCTAAAATCC
AAAAAGAAAG
AATAGAGGCT
GCCCATTTTT
GTTAAAGGGA
GAACTATGGG
GCCAAAGAAA
GGCGTGATCA
AGACCTGATT
ATAGCTTGCA
AGACCCGTTG
AGCTGTGCGA
TAAAAGCTCT
AGGGCGAACA
ACCCCAACAA
CTCATTTCAA
TTGCTTATTA
TTGGCGGCTT
AGCAGATCAT
AGGCTCTATT
AACTATGGGA
TTAAAAGAAT
AGAAGTGGGC
CTCTAAAACG
TCATTACAAA
CAAAATCGCT
TTGTTTTGAA
TTATGTGATA
TCTCAAAACG
CCAATGCGTT
CAACTATAAA
TCAAGCGTTC
AAACATGCGT
GCCTGAGTTT
TTCTTAA
INFORMATION FOR SEQ ID NO:487: SEQUENCE CHARACTERISTICS: LENGTH: 771 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...771 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:487:
TTGTGTTTGA AACTCTTAAT TTGGAATTTT AAGGAGCATT CTTTGAAAGT CAATTTCTTT
GCCACTTGTC
CGCAAGGAAA
TACAACTCAG
TCCAATAACG
GATTATTTGG
AGGGTGTATG
GGCGAACCTC
ATTGACTCGG
AAAGAAGAAG
GCGGTTATGG
TCAGCGGATG
ACAAAACCCA
TAGGAGCAGC
ATTTGGAAGT
GATACTATGA
ACTACCCTAT
AATTGTTTGA
AATTGAGCGA
TCAAAATCAC
CGAAAAATCT
AATGCTGCGG
TCAAAGAAAA
CTGGGTGCTT
TGCATTTTTA
TATTTATAGC
GGTTTTTAA.
AGAGACTkAAA
TATTTTGCCT
AGGGCATGCG
GTTTTTGGAT
ATGGCATTCT
CATCAGACAG
GTTTGGGGGG
GATTAAAGAC
GATGAATATC
TGACTTTTTA
AACGCATCGC
AAAGACCAGA
AAAGTCGTTT
AGCGGTTCAT
GAATTTAACA
AAAAAATTC
AATTGCCATG
CTTAAAAATG
ACTTTTTCAG
ATAGAGAGCC
AGCACCGCTA
GCCTCAAGGC
TTAACGCTAT
CATGTTGTGG
TATACAATAT
GCACAGGGAT
TGGTTAAAGA
AAGTCAAATA
CCTTAAGGGT
TGGAACTCAT
TTAAAGAGCC
GTCATGTGGA
TGCAAAAAAT
TTGGACTTTA
CAAATTACTC
CCAGCCAAGC
CAAACTTTAT
GATGCGGCAT
TTTTTGCTCT
CGAAGATAAG
GGCTAAAGTG
TGAATTGGAA
TGAAATTTCA
TGTGATCGTT
GGGCTCTTTG
A
120 180 240 300 360 420 480 540 600 660 720 771
INFORMATION FOR SEQ ID NO:488: SEQUENCE CHARACTERISTICS: LENGTH: 1410 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1410 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:488:
TTGAAAATAT
GGCTATCGTA
GCTTATGAAT
CAAAACGGCT
CAAGCCGAGC
GCCTACCA
TTAATCAATT
GTGCAATTCA
GTGCAAAAAG
TCTAGGGCAG
GCTTATCAAG
AATGGGGATA
CCTTTCGTGG
ATCATCATTG
TTTATGATTG
GCGTTTTATC
GCCTCAATAG
AGGGCTATTT
GGGTTTGTTG
AGAAACACGA
GAAAGTTATG
TCGTTCTGTT
TGGACTTAGA
ATTCCAAACA
TGAGCGCTTT
AACGCTTTGA
TGATTAATGA
ATTACAAAGC
ACCGCGCGAA
CCATTAAAGA
AAGTGAGCGA
GCTTGCTTAA
AGAATAAGGG
CTCAAGACTT
AAGATGGTAA
ATTCGGTTTA
AAAATTTCAC
ATGCGGTGGT
TATCGGCCAC
GCGGGTTAGT
GCATTTTTGC
AAGTTCGAGC
GATGTCGGTA
ACATTTTAAC
ATTCACGAAG
ATACGCAAGG
TAAAACCCAA
TAATGTACGC
GATAGACTAC
TGAACGCCAA
AATCGATTCT
GATTTTAAAC
CCCGGCGGTT
ATTAGGCTAT
GGTTTTTTTC
AGAGCCGCAA
TAACGTGAGC
CCTCAAGGAC
CGCTAGCGAA
TTTTAAAGTG
AACTTCGTTG
CCATAAAATC
CGATTCCATT
ATTTTAGGAA
ACGCTCTATT
AAAAAAAAGA
GATTATAAGA
AGCGCTTTCA
GCTTATGGGG
ATGCTTTTAA
CGCAGGGCTA
AGCAAAAAGC
AACACCTATT
TCGTATCTTT
CTTAATGAAG
AAAAACCCTA
AAAAGCGAAT
GTGGCCTTGC
GGAGAAAAAG
TTCAGGAAGC
GGCATGCAAG
TATTCAGGTG
TACCTCATGC
GACGCTTTTT
TATCATTAAC
ATGAAGAAAG
ACGCTCTTTT
CTTCTTTAGG
CAAGAGGGGC
GGAATATTTA
ACGATAGCGC
AAGAATTTTA
ACAATATTAA
CTAATTTAGA
CAGGGTTGTT
CCTATGGGAT
ACAGGAGCCA
TTAAAATTGA
CCAAGCTAGA
TAACACCCTT
AATTGCCCTA
CGGTGGCGAA
TGAGCACCTT
GCATCAA
CGTTTTCATT
AGGTTGCATA
CCCTAAAAAA
ATGGGACTTG
GGTATTGGAT
TGGTTATGTG
TGAGGGCGTT
GAAAGCTAGG
TTATGAGGAA
TATGGAACGC
CAAATACGAA
TTACGCTTTA
CAGTCAAAGC
TTTCACTTGG
TGTGCCTATT
AAAAGGGGAA
TGACACTTTA
CATTATCACT
CTATTATTTA
TGCAGACACC
CAAAGCCTTT
AAAGCCTTGT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260
AAAAGATCGC TTGAAAGCCC TAAAATCATT GACGCTAGGG AATTGCTTTC
AGGGTTTGTA
ACAGCCCCAC AAGTCTTTTG CTCTAACCGC CATAATATTT TATATGTGCG
CAGCTTTAAA
AACGGGTTTG TTTTGAGTCA
TTTAAAATGA
1320 1380 1410
INFORMATION FOR SEQ ID NO:489: SEQUENCE CHARACTERISTICS: LENGTH: 1053 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1053 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:489:
ATGAAACAAA
AAATTACTCA
GGGAATAAAG
TTGGTGTGCG
TATGGGAAGT
TATAAAATGC
CCTAAATCCC
GAAGAGGCTC
TTTTTAGACG
GAAAAAAAGG
AGGCAATTGC
TGCTTGCTTA
ATGGTGTTTG
GCTCGTTTGG
TTAATGGATA
GCCAGCCAAG
AAAAATCAAG
AAAGAGAACG
TAAAAGCTTT
TTGATCAAGG
ATAAAAGAGA
ATATTAGAGA
ATTTCAACCC
TTGAATTGGA
AAAGGAAAGA
GTTTTGATCT
AATTGCTCTT
GTTGGGTTGA
AAATGATCAA
CGGACATTCA
AAGATGGCGT
TGGTGGCAAG
AGATTGAATT
AAGATAAAGA
CTTACCTCAT
CTAGGGAGTA
AGCCTTATTT
CATTGAAGTA
GTATTTTGAA
GCAGTTCTTT
TTGCATTGAT
TGCGGATTTT
AGCGCTCAAT
TATTTTAGAC
AAGGCCCATG
TAGGGAAAAG
AGATTATGGC
AGTGAGTAAT
GATTGTCAAA
GAATGAAGAA
GCTAGGCTGT
ATTGGCCGGT
TCAAATAGGG
TTTGTTCGCT
AGTGGGGGGT
ACCGCTTTAC
AACGCCACCG
AACGATGTGT
TGCCATGCCA
GTTTTGAGCG
CAGGTGAGGA
CGAACGCAAG
AGCGCTAAAC
CTTTTAGATG
TTGAAATATT
AAGATTAAGA
AACGGGCGTT
GAAAACCATA
AAAGGTCCTT
CGTATCGCTT
AATGAAAAGC
TGA
TGGATAGTTT
ACTTCAATAT
CGCAAATTGG
TGTTCAAGCC
ACATGTTTAG
GGGAAGTGCT
AATTAGTCAG
CAAGCGGTGA
TCTTAGAGCC
TGAGTGGTAG
ATGAAAAGCC
ATTTGAAAGA
ATTTTGTCCT
AATTAGACAT
TGAGTTTGGT
TAGGCTATGC
GCGAACTTTA
GTTGTCCATG
AGGGTTTGGA
GGCTAAACTC
CAAATACGGC
GAACGCTTTT
AGGGCAACGC
AGAAGTGGGC
AAAACCGCAA
TACTTTTATG
GGGGCGTGCT
AGGTGGGGGG
ATACAGAGAA
ACCCCATAAC
TGAACACCCC
GGATAAAAAC
TAAGACTTTA
CCCTTTGGAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1053
INFORMATION FOR SEQ ID NO:490: SEQUENCE CHARACTERISTICS: LENGTH: 1218 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1218 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:490:
ATGAAAA3AT ACAGCGCTAT CCCCACCCCT
TGCTATCGC'
AAAAACGCCA
TTAAAAGGGT
TGCGCGAGCG
CATAAGGkAAA
CCCCTAGCGA
ATTTTAGACA
AGGATTAACC
AGCCGGTTAG
GGGGTGAGCG
ACTTTAGAGC
TTTGGTGGGG
ATTAAGGATT
ATAGGGTGGC
GAAATTGCGA
TATCGCCCTA
AAGGGCGAAA
GATTTTATGG
CAAGACATGC
AGCCTGGCTA
TATAAAAACA
AGATTTTAGA
ATGCGTTTTG
GTCTTTATGA
TTTGCGTTTA
CAAGCATCAT
AAAACAAGCA
CCCTTTATAG
GGATTACGCC
GGTTGCATTT
ATGTGGAAAG
GGCATCATAT
TTAAAGAACG
AATGCGGGTT
TTTTAGACGC
GTATTCTTAA
ATCAAGGGGC
GGAGTTTTAG
TCCATTATAC
AAATAGATTC
GGAATTAA
AATCGTGCGC
GCGTGAGTTT
GGCTAAGCTC
TAGCCCGGCC
TTTCAACTCT
GTTAGAAAAC
CGAAGTAACC
TAGCGGATTT
CCATACGCAT
GCATTTCAAG
CACCAGGAGC
CTACCACAAT
TTTAATCGCA
TTCTTTTAGC
GGTTTCTGTA
GTTTTCTTAC
CTTTGAAACG
GATCGTGAAA
GCAAGGTTTT
CAGCAAAGCG
GGGATTTTGA
GCTTTTGAAG
TTTAAAGAGG
TTTCACCAAT
TTGGGCTTA
CCAGCGATCT
GAAAAAGGAG
TGCGAGCAAA
CCTTATTTAG
GATTATGATG
ACAGAAGTGA
AGCGTGATAG
GCTCACATGC
GAAAACGATG
TTTTTAGGCG
CCTTTAAAJ.
AACAACTCGT
AAGATCCTCA
TAGAGAGCGA
GGGCAAAGGT
GGCAAAAATT
AATTTGGGGG
CTGAAATGAG
ACGCCACCTA
GCCCCATTAA
ATAACCCATG
TCAAAGAGCA
ACGCCGACGC
AAAACATGGC
TGAATTTGCT
TCTTAGAGCC
ATATCGTTCA
CCGATTGCCT
AGGAAATTAT
GCCCCACTTG
GGGGCGATAA
TTAATGGCGT
AAAGCTTTTC
ACGCTTAGAA
TTTGCTCGCT
GAACGGGTGT
GCGAGAGAGC
CGCGATTTTA
CAAAGACAGG
AATGGGTTTG
CTCTAAAATG
TGGCTTAGAG
TTTGTGCCGG
GTGGGTGAAT
CATTCAAACG
TGGGGAAGCC
AAACGATCAA
AGAAATGCCT
TGAAGTTGAA
TTTGGCGGGG
AATCGTGTTT
GCCACTGCCA
TTATGAAGAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1218
INFORMATION FOR SEQ ID NO:491: SEQUENCE CHARACTERISTICS: LENGTH: 966 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...966 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:491:
ATGGCTGATA
CAATTGTTGT
ATCCGTGAAG
CTCGTTGAGG
TGGTTTTATT
GAAAAAGGCG
AGGATCTATG
GGGCTAGGGG
CGCTTGGTGC
GATGAJAAC
CTCGCTGATG
GTCAAGCACA
ACCGATGTGA
TTTGAAGTGA
AATTCTTCTA
TTCATCTCCA
GCATGA
GTTTAGCGGG
GTTTCAGGCT
TGGTGATA
GCTTGATCAT
ATGACAGCCA
AAGACGATAT
AAGCGGATAG
GATCTGCTGG
AAGTGGTGGA
ACAACGATTT
ATTCCCCAAG
TTGACTTTAT
GTAATATTGG
TCAkACAGGT
TGAGCGGCAG
AGTCTAACCC
CATTGATCAA
GGGTAAAAAC
CCATGGCAAT
TATAAGAGAG
AAACAAGAGC
TGTCATGATT
GATTTTGAGC
GAATAACAAA
TATTGAAAAA
AGAGACGCTT
CGTCTTAAAA
CAATGGTAAA
CCTGATTATT
TAAAAACAAT
CTCTAATGAA
TAAAGACATC
GTTACGAGTT
AAGGATTTGT
CTCACTATCA
CTCACCATTC
AAGGATTTAC
TGTGAGTTTT
AAGAAATGGA
CTCGTGAGCC
ATGCTTATAG
TCTAAAATCC
ACCATCAAj.j
ACCTTATTAG
ACCGATTTAG
CCTTTGACTT
GACATGGCCA
CAGCGGGTGG
TACATAAA
ATGCGGTCAA
TTAGCCACGA
CTTTGATTGA
GCCCTTACAG
CTCGCTGGAC
CTGAAATGGA
GCACGCGCTA
ACGTGTTCCC
ATTCTAACCA
TGATTTTAGA
AGCATTTGTT
AAATGCCAGA
CAAAAATCCC
GGAGTTTGAA
TTAAGCAATT
TAACGAGTTG
TGTCTTTAAG
AAACAATTCG
CATGAAAAAA
GATAGAAAAA
CATAGCGGTT
GCAAAGCGCT
TTTTGACGGG
TTGGATTGAA
ATGCGTTTTG
CAAGCTGGGC
CAACCCAACA
AGCGAGCGGT
CATTGTGGTC
GGCCGATGAT
TTTGGAATTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960
INFORMATION FOR SEQ ID NO:492: SEQUENCE CHARACTERISTICS: LENGTH: 96 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: (A) ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...96 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:492:
Glu 1 Cys Ala Arg Ile Lys Val Ala Leu Ile Ser Asp Arg 5 Arg Ala Val Met Lys Asp Lys Ile Val Val Ser Leu Gly Phe Leu Ser Ala 25 Glu Pro Ala Gln Ala Asp Leu Glu Ile Arg Ile Ile Ser Ala Glu Val Leu Ser Thr Arg Asp Tyr Ala Asn Lys Leu Lys Lys Lys Val Gln Gln Asn Gly Gln 70 Thr Glu 55 Leu Gly Ile Asn Asp Met Gln Leu Arg Lys Asp Ile Leu Ser Arg Cys Ile Ser Leu Arg Pro Tyr Ile Tyr Asn
INFORMATION FOR SEQ ID NO:493: SEQUENCE CHARACTERISTICS: LENGTH: 301 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...301 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:493:
Asn Ala Trp Asn Asp Phe Val Lys Ser Gly Ser Leu 1 Asn Leu Ala Ser Thr Ile Ala Ala Gly 145 Ser Thr Tyr Ile Ser 225 Val Thr Thr Asp Gly Ile Ala Leu Arg Asn Asn Tyr 130 Val Leu Lys Thr Pro 210 Pro Val Lys Leu Asn 290 Tyr Val Lys Thr Leu His Glu 115 Glu Val Ile Glu Gly 195 Ala Arg Val Phe Gly 275 Gly Tyr Leu Met Val Ala Ala 100 Pro Glu Leu Lys Val 180 Leu Lys Tyr Gly Leu 260 Arg Leu 5 Glu Ser Thr Gly Glu Tyr Ser Val Asp Glu 165 Gln Lys Val Asp Leu 245 Lys Thr Leu His His Ala Gly 70 Trp Tyr Val Leu Gly 150 Glu Leu Asp Leu Ser 230 Ala Asp Ala Ala Lys Tyr Ile 55 Val Lys Pro Ile Leu 135 Asp Val His Gly Asp 215 Lys Ala Thr Ala Phe 295 Thr Leu 40 Phe Thr Ser Asp Lys 120 Gly Ile Thr Pro Glu 200 Thr Pro Lys Lys Arg 280 Asp Ile 25 Lys Gly Ser Lys Leu 105 Gly Lys Ser His Tyr 185 Ser Lys Met Asn Leu 265 Cys Ala 10 Arg Leu Ala Val Phe 90 Val Cys Asp Lys Ser 170 Asp Val Asp Glu Pro 250 Pro Ile Leu Leu Leu Lys Met 75 Glu Met Gly Lys Leu 155 Trp Gly Gly Lys Val 235 Tyr Leu Glu Val Asn Glu Gln Asp Val Ala Leu Tyr 140 His Tyr Gln Ile Tyr 220 Gly Val Glu Ala Glu 300 Gly Pro Ile Pro Ile Val Gly Arg 125 Leu
Pro Gln Thr Glu 205 Ser Pro Thr Ala Lys 285 Ile Pro Glu Gln His Leu Ala Glu 110 Asn Leu Ile Tyr Asn 190 Asn Trp Leu Glu Leu 270 Thr Phe Gln Arg Pro Asp Asn Met Phe Ser Asp Glu 175 Pro Lys Ile Ser Val 255 Phe Ile Ser Asn Glu Gln Pro Phe Phe Ile Ser Glu 160 Asp His Ile Lys Ser 240 Ala Ser Ala
INFORMATION FOR SEQ ID NO:494: SEQUENCE CHARACTERISTICS: LENGTH: 169 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...169 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:494:
Gly Ile Asn Met Ser Lys Lys Ile Val Val Asp Pro Ile Thr Arg Ile 1 5 10 Glu Gly His Leu Arg Ile Glu Val Ile Val Asp Asp Asp Asn Val Ile 25 Thr Asp Ala Phe Ser Ser Ser Thr Leu Phe Arg Gly Leu Glu Thr Ile 40 Ile Lys Gly Arg Asp Pro Arg Asp Ala Gly Phe Ile Ala Gln Arg Ile 55 Cys Gly Val Cys Thr Tyr Ser His Tyr Lys Ala Gly Val Thr Ala Val 70 75 Glu Asn Ala Leu Gly Ile Thr Pro Pro Leu Asn Ala Gln Leu Val Arg 90 Ser Leu Met Asn Met Ala Leu Leu Phe His Asp His Val Val His Phe 100 105 110 Tyr Thr Leu His Gly Leu Asp Trp Cys Asp Ile Leu Ser Thr Leu Lys 115 120 125 Ala Asp Pro Ile Gln Ala Ala Lys Leu Ser Phe Lys Tyr Ser Pro Tyr 130 135 140 Pro Ile Asn Thr Gly Ala Gly Arg Ile Lys Ser Gly Ser Lys Thr Leu 145 150 155 160 Gly Met Ile Ser Leu Lys Ala Asp Leu 165
INFORMATION FOR SEQ ID NO:495: SEQUENCE CHARACTERISTICS: LENGTH: 185 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...185 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:495:
Trp Leu Trp Leu His Gln Arg Met Ser Gln Lys Ile Leu Ile Leu Gly 1 5 10 Ile Gly Asn Ile Leu Phe Gly Asp Glu Gly Ile Gly Val His Leu Ala 25 His Tyr Leu Lys Arg Asn Phe Ser Phe Phe Pro Ser Val Asp Ile Val 40 Asp Gly Gly Thr Met Ala Gln Gln Leu Ile Pro Leu Ile Thr Ser Tyr 55 Glu Lys Val Leu Ile Leu Asp Cys Val Ser Ala Lys Gly Val Glu Ile 70 75 Gly Ser Val Tyr Ala Phe Asp Phe Lys Asp Ala Pro Lys Glu Ile Thr 90 Trp Ala Gly Ser Ala His Glu Val Glu Met Leu His Thr Leu Arg Leu 100 105 110 Thr Glu Phe Leu Gly Asp Leu Pro Lys Thr Phe Ile Val Gly Leu Val 115 120 125 Pro Phe Val Ile Gly Ser Glu Thr Thr Phe Lys Leu Ser Ser Glu Met 130 135 140 Leu Asn Ala Leu Glu Thr Ala Leu Lys Ala Ile Glu Thr Gln Leu Asn 145 150 155 160 Ala Trp Gly Val Lys Met Gln Arg Thr Asp His Ile Ala Leu Asp Cys 165 170 175 Ile Ala Glu Leu Ser Tyr Lys Gly Phe 180 185
INFORMATION FOR SEQ ID NO:496: SEQUENCE CHARACTERISTICS: LENGTH: 108 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...108 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:496:
Tyr Gln Ile Leu Asn Gln Arg Lys Thr Met Lys Lys Val Leu Leu Leu 1 5 10 Thr Leu Ser Leu Ser Leu Ser Phe Trp Leu His Ala Glu Arg Asn Gly 25 Phe Tyr Leu Gly Leu Asn Phe Ala Glu Gly Ser Tyr Ile Gln Gly Gln 40 Gly Ser Ile Gly Glu Lys Ala Ser Ala Glu Asn Ala Leu Asn Gln Ala 55 Ile Asn Asn Ala Gln Asn Ser Leu Phe Pro Asn Thr Gln Ala Ile Arg 70 75 Asp Val Gln Asn Ala Leu Asn Ala Val Lys Asp Ser Asn Lys Ile Ala 90 Asn Arg Phe Ala Gly Asn Gly Gly Ser Gly Gly Ile 100 105
INFORMATION FOR SEQ ID NO:497: SEQUENCE CHARACTERISTICS: LENGTH: 128 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...128 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:497:
Arg Val Ser Gly Asn Leu Lys Ser Asp Gln Ser Thr Cys Ala Pro Tyr 1 5 10 His Ile Asp Lys Asn Gln Glu Tyr Lys Gly Arg Tyr Ile Gly Gln Val 25 Pro Arg Gly Met Leu Ser His Trp Val Arg Ile Lys Asn Gly Val Val 40 Glu Asn Tyr Gln Ala Val Val Pro Ser Thr Trp Asn Ala Gly Pro Arg 55 Asp Ser Lys Asn Gln Arg Gly Ala Tyr Glu Met Ser Leu Ile Gly Thr 70 75 Lys Ile Ala Asp Leu Thr Gln Pro Leu Glu Ile Ile Arg Thr Ile His 90 Ser Phe Asp Pro Cys Ile Ala Cys Ser Val His Val Met Asp Phe Lys 100 105 110 Gly Gln Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala Lys Phe 115 120 125
INFORMATION FOR SEQ ID NO:498: SEQUENCE CHARACTERISTICS: LENGTH: 86 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...86 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:498:
Leu Tyr Gln Asn Asp Lys Gly Phe Lys Thr Glu Leu Arg Ile Leu Ser 1 5 10 Val Phe Ile Val Glu Phe Leu Val Asn Ile Leu Gly Phe Met Leu Ala
25 His Met Leu His Phe Trp Phe Leu Arg Cys Val Lys Ala Leu Ala Trp 40 Leu Met Lys Thr Phe Asp Arg Arg Arg Tyr Phe Asp Ala Lys Ala Asn 55 Leu Asp Phe Val Phe Gly Asp Ser Lys Ser Glu Glu Glu Lys Lys Arg 70 75 Ile Ile Lys Lys Gly Val
INFORMATION FOR SEQ ID NO:499: SEQUENCE CHARACTERISTICS: LENGTH: 67 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...67 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:499:
Tyr Lys Lys Gly Asn Gln Ile Met Asn Ile Gln Thr Lys Lys Arg Phe 1 5 10 Leu Ala Asn Leu Leu Leu Phe Ser Leu Phe Ser Cys Leu Lys Ala Glu 25 Thr Leu Ser Glu Asp His Gln Ile Leu Leu Ser Ser Asp Ala Phe His 40 Arg Gly Asp Phe Ala Thr Ala Gln Lys Gly Tyr Met Asn Leu Tyr Arg 55 Ala Asn Gln
INFORMATION FOR SEQ ID NO:500: SEQUENCE CHARACTERISTICS: LENGTH: 256 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...256 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:500:
Leu Cys Leu Lys Leu Leu Ile Trp Asn Phe Lys Glu His Ser Leu 1 Val Ser Phe Tyr Ser Met Asn Leu Lys 145 Ile Ile Ser Lys Gly 225 Thr Asn Leu Lys Tyr Asn Met Met Asp 130 Ile Asp Glu Val Asp 210 Cys Lys Phe Asn Lys Glu Asn Arg Val 115 Lys Thr Ser Leu Lys 195 Ile Leu Pro Phe Ala Asp Glu Asp His 100 Lys Lys Trp Ala Glu 180 Glu Glu Met Met 5 Ala Ile Gln Thr Tyr Asp Asp Leu His Lys 165 Lys Pro Ser Asn His 245 Thr Cys Lys Leu Thr Cys 55 Lys Lys 70 Pro Ile Tyr Leu Phe Cys Gln Val 135 Ser Asn 150 Asn Leu Glu Glu Glu Ile Arg His 215 Ile Ser 230 Phe Tyr Leu Leu 40 Cys Val Ile Glu Ser 120 Lys Cys Ile Glu Ser 200 Val Thr Asp 10 Gly Ala 25 Arg Lys Gly Gln Val Leu Leu Pro 90 Leu Phe 105 Arg Val Tyr Glu His Ala Arg Gln 170 Cys Cys 185 Ala Val Asp Val Ala Met Phe Leu 250 Ala Glu Pro Tyr 75 Ser Glu Tyr Asp Leu 155 Leu Gly Met Ile Gln 235
Ala Ile Asn Ser Asn Gly Gly Glu Lys 140 Arg Lys Phe Val Val 220 Lys Ser Tyr Leu Tyr Ile Ser His Leu 125 Gly Val Asn Gly Lys 205 Ser Met Arg Ser Glu Asn Lys Cys Ala 110 Ser Glu Ala Val Gly 190 Glu Ala Gly Leu Asn Val Ser Leu Thr Glu Glu Pro Lys Glu 175 Thr Lys Asp Ser Gly 255 Lys Ala Val Gly Tyr Gly Phe Phe Leu Val 160 Leu Phe Ile Ala Leu 240 Leu INFORMATION FOR SEQ ID NO:501: SEQUENCE CHARACTERISTICS: LENGTH: 255 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: WO 97/37044 PCTIUS97/05223 453 ORGANISM: Helicobacter Pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .255 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:501: Met Ala Leu Pro Giu Val Phe Ser Asn Glu 145 Leu Gin Thr Ser Asn 225 Tyr Lys Lys Ile Phe Leu Gly Met Ala Leu Ala Phe Sen Val Ser Met Giu Giu Ilie Thr Asp Gly Phe Asn 130 Sen Giy Met Sen Lys 210 Gin Lys Lys Asn Asn Pro Al a Phe Val1 115 Phe Lys Gly Gin Tyr 195 His Phe Arg Ser Gin Thr Ser Met Ang 100 Giy Thn Glu Asp Ile 180 Phe Ser Tyr Asn Gly Asn Ser Val Ala Thr Ser Tyr Giy Ser 165 Cys Gin Gly Lys Phe 245 Ala Thr Met Ile 70 Giy Tyr Lys Gly Tyr 150 Phe Asn Met Ile Glu 230 Ser Phe Thr Phe 55 Asn Tyr Gly Leu Vai 135 Asn Ile Asn Pro Giu 215 Arg Ile Leu Arg 40 Gly Thr Lys Tyr Giy 120 Gly Thr Val Thr Val1 200 Val1 Gly Tyr Gly 25 Thr Asn Asn Trp Tyr 105 Ile Phe Ala Gin Ala 185 Giu Gly Val1 Phe 10 Gly Pro Asn Asn Phe 90 Ser Met Asp Gly Gly 170 Gly Phe Phe Asp Asn Gly Sen Gin Tyr 75 Phe Tyr Asp Ala Leu 155 Giu Cys Gly Lys Gly 235 Tyr *Phe Ala Ala Gly Gly Asn Gly Leu 140 Phe Sen Ser Phe Leu 220 Sen Met Gin Asn Ala Gin Lys His Ala 125 Tyr Val1 Tyr Ala Ang 205 Pro Val Ile Tyr Asn Pro Met Thr Ala 110 Ser Asn Gly Leu Sen 190 Ser Leu Asp Asn Sen Asn Ala Tyr Lys Asn Gin Phe Phe Lys 175 Met Asn Phe Val1 Leu Asn Thn Gin Gly Arg Leu Val Tyr Gly 160 Ser Asn Phe Thn Phe 240 INFORMATION FOR SEQ ID NO:502: SEQUENCE CHARACTERISTICS: LENGTH: 218 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: 
protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature WO 97/37044 PCT/US97/05223 454 LOCATION 1...218 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:502: Ala Cys Glu Ile Arg Ala Trp Ala Phe Asp Arg Met 1 Ser Thr Lys Phe Gin Ser Ala Lys Cys 145 Ala Ser Ala Leu Val Thr Thr Val Ile Gly Ile Ala 130 Val Leu Leu Leu Asn 210 Gly Leu Ala Pro Ala Asn Lys 115 Val Pro Ser Phe Asn 195 Ala Val His Ser Pro Tyr Val 100 Glu Met Gin Val Pro 180 Lys Ile 5 Val Leu Thr Phe Phe Phe Arg Arg Thr Pro 165 Gin Ala Glu Phe Lys Leu Tyr 70 Leu Phe Leu Phe Ala 150 Ile Glu Tyr Asn Ile Tyr Thr 55 Gin Lys Ser Lys Leu 135 Cys Asp Asp Tyr Ala 215 Leu Lys 40 Pro Lys Asp Phe Lys 120 Asn Pro Tyr Lys Ser 200 Ala Leu 25 Asp Pro Glu Lys Glu 105 Thr Leu Phe Ala Ser 185 Leu Trp 10 Gly Tyr Lys Phe Ser 90 Glu Ile Gin Asp Asn 170 Tyr Met Leu Cys Pro Ile Lys 75 Ala Ser Glu Ala Thr 155 Arg His Glu Gin Lys Phe Lys Leu Pro Pro Ser 140 Leu Leu Asn Gly Leu Phe Asn Phe Ala Thr Lys Asn 125 Leu Leu Gly Ala Leu 205 Arg Phe Ser Asn Leu Phe Asp 110 Thr Ile Ile Asp Leu 190 Glu Val Asn Pro Ala Ala Asn Leu Asp Leu Pro Asn 175 Ile Lys Leu Lys Leu His Gin Ile Lys Pro Glu Thr 160 Pro Lys Arg INFORMATION FOR SEQ ID NO:503: SEQUENCE CHARACTERISTICS: LENGTH: 124 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...124 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:503: Thr Ile Gin Glu Arg Leu Tyr Arg His Glu Ile Ser Arg Leu Gin Val 1 5 10 Lys Thr Asp Glu Thr Leu Lys Leu Ile Lys Glu Ala Lys Lys Arg Leu 25 WO 97/37044 PCT/US97/05223 455 Asn Tyr Asn Asp Asp Ile Arg Asp Val Leu Gin Gly Leu Leu Asn 40 Val Pro Asp Leu Ile Thr Ile Asn Ser Ile Glu Ile Asp Gin Gin 55 Val Val Val Ser Gly Lys Thr Pro Ser Lys Glu Ala Phe Tyr Phe 70 75 Phe Gin Asn Lys Leu Asn Pro Met Phe Asp Tyr Ser Arg Ala Glu 90 Phe Pro Leu Ser Asp Gly Trp Phe Asn Phe Val Ser Thr Asn Phe 100 105 110 Asn Ser Leu Leu 
Ile Lys Asn Pro Glu Ser Ile Lys 115 120 INFORMATION FOR SEQ ID NO:504: SEQUENCE CHARACTERISTICS: LENGTH: 378 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...378 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:504: Ile Ser Leu Phe Ser Ile Ser Ile Glu Gin Thr Asn Lys Val Val Tyr Ala 1 Ile Met Lys Ile Asp Ala Ser Glu Glu 145 Leu Ser Leu Ile Glu Asn Phe Leu Gly 130 Gin Asp Ala Tyr Leu Leu Val Pro Glu 115 Leu Leu Leu Ala Gin Val Leu Leu Leu 100 Lys Val Cys Ala 5 Ser Lys Asp His Gly Leu Leu Leu Gin Lys 165 Leu Ile Gly Lys 70 Thr Asn Ile Leu Lys 150 Thr Gly Thr Tyr Ile Leu Lys Thr Gin 135 Ala Thr Asp Asn 40 Ala Arg Tyr Phe Ile 120 Ser Leu Phe Ile 25 Asn Gin Lys Leu Tyr 105 Tyr His Asn Ala Lys Arg Met Glu Thr Asn Phe Ile Thr Arg 170 Thr Asn Gly Glu 75 Gin Gin Leu Asp Phe 155 Leu Lys Met Val Ile Thr Arg His Asn 125 Tyr Gin Glu Glu His Ser Asp Ile Leu Asp 110 Arg Gly Phe Lys Leu Ala Ala Asn Ala Thr Lys Asp Lys Ser Glu 160 Pro Lys Ile Val Gin Asn Ala Gin Phe Tyr Ile Gly Val Leu Ile 180 185 WO 97/37044 PCT/US97/05223 456 Glu Phe Asp Lys Ala Gin Gin Ile Ala Glu Leu Phe Pro Phe Asp Arg 195 200 205 Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gin Lys Lys Phe Asp Gin Ala 210 215 220 Ser Lys Gin Ala Ser Leu Ile Tyr Gin Glu Arg Lys Asp Pro Lys Phe 225 230 235 240 Leu Gly Leu Glu Ala Ile Tyr His Tyr Glu Ser Leu Ser Ala Asn Lys 245 250 255 Lys Lys Leu Thr Lys Glu Glu Met Leu Pro Ile Ile Gin Lys Leu Glu 260 265 270 Gin Ala Thr Lys Glu Arg Gin Ala Trp Leu Ala Lys Thr Lys Asp Lys 275 280 285 Glu Asp Ala Gin Asp Ala Phe Phe Tyr Asn Phe Leu Gly Tyr Ser Leu 290 295 300 Ile Asp Tyr Asp Met Asp Val Lys Arg Gly Met Asp Leu Val Arg Lys 305 310 315 320 Ala Leu Ala Leu Asp Ser Asn Ser Val Leu Tyr Leu Asp Ser Leu Ala 325 330 335 Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu Ala Lys Lys Ile Phe 340 345 350 Ser Ser Ile Ala Lys Glu Leu Ile Gin Asn Glu Pro Glu Leu Lys Glu 
355 360 365 His Asn Lys Ile Ile Gln Glu Cys Lys Lys 370 375
INFORMATION FOR SEQ ID NO:505: SEQUENCE CHARACTERISTICS: LENGTH: 156 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...156 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:505: Phe Tyr Thr Gly Lys Lys Gly Ala Cys Met Arg Lys Ile Leu Leu Met 1 5 10 Gly Leu Ile Leu Gln Ala Leu Phe Gly Glu Glu Thr Ala Gln Glu Leu 25 Leu Gln Cys Ser Ala Ile Phe Glu Ser Lys Lys Ala Glu Leu Lys Glu 40 Asp Leu Arg Gln Leu Ser Glu Lys Glu Gln Ser Leu Arg Ile Leu Gln 55 Thr Glu Asn Ala Arg Leu Leu Asp Glu Lys Ser Asp Leu Leu Asn Lys 70 75 Lys Glu Lys Glu Ile Asp Glu Lys Leu Lys Asn Leu Ala Ala Lys Glu 90 Glu Ala Phe Lys Thr Leu Gln Thr Glu Glu Lys Lys Arg Leu Lys Asn 100 105 110 Leu Ile Glu Glu Asn Glu Gly Ile Leu Arg Glu Ile Lys Gln Ala Lys 115 120 125 Asp Ser Lys Ile Gly Glu Thr Tyr Ser Lys Met Lys Asp Ser Lys Ser 130 135 140 Ala Leu Ile Leu Glu Asn Leu Pro Thr Gln Asn Ala 145 150 155
INFORMATION FOR SEQ ID NO:506: SEQUENCE CHARACTERISTICS: LENGTH: 141 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...141 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:506: Lys Arg Leu Cys Asp Arg Leu Trp Ser Gly 1 Glu Arg Arg Asn Ile Leu Leu Phe Gin Lys Ile Leu Ser Val Tyr Thr Leu Gin Arg Ala Lys Val Asn Asn 100 Leu Gly Lys Lys 115 Ser Cys Gin Thr 130
5 Val Leu Ala Tyr Phe Gln Gln Leu Met Val Phe Lys 70 Leu Thr Thr Gln Gly Leu Leu 55 Gln Arg Phe Pro Lys 135 Val Arg 40 Ile Asn Gly Asn Met 120 Asp Ala 25 Asp Val Gln Glu Leu 105 Lys Pro 10 Val Phe Gly Gln Thr 90 Val Asp Leu Gly Val Gly Ile Asn 75 Leu Ser Val Ile Tyr Tyr Leu Phe Leu Ala Ile Gly Gln Gln Leu Cys Gly Thr Leu Val 125 Gln Pro 140 Leu Leu Ser Ala Glu Lys Leu 110 Asp Ser Thr Pro Ser Ile Gly Ser Leu Val Leu Lys Ile Ala Ile Phe Asp
INFORMATION FOR SEQ ID NO:507: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:507: Ala Lys Asn Thr Ile Phe Asn Asn Ala Asn Phe Asn Asn Ser Thr Ser 1 5 10 Phe Asn Phe Asn Asn Ser Ser Ala Thr Thr Ser Phe Val Gly Asp Phe 25 Thr Asn Ala Asn Ser Asn Leu Gin Ile Ala Gly Asn Ala Val Phe Gly 40 Asn Ser Thr Asn Gly Ser Gin Asn Thr Ala Asn Phe Asn Asn Thr Gly 55 Ser Val Asn Ile Ala Gly Asn Ala Thr Phe Asp Asn Val Val Phe Asn 70 75 Ser Pro Thr Asn Thr Gly Val Lys Gly Lys Val Thr Leu Asn Asn Ile 90 Thr Leu Lys Thr 100 INFORMATION FOR SEQ ID NO:508: SEQUENCE CHARACTERISTICS: LENGTH: 123 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...123 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:508: Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu Leu Ala Phe Met Asp 1 5 10 Asn Arg Gin Ala Glu Ile Lys Asp Ser Leu Ala Lys Ile Lys Thr Asp 25 Asn Thr Gin Ser Val Glu Ile Gly His Gin Ile Glu Thr Leu Leu Lys 40 Glu Ala Ala Glu Lys Arg Arg Glu Met Leu Ala Glu Ala Ile Gin Lys 55 Ala Thr Glu Ser Tyr Asp Ala Val Ile Lys Gin Lys Glu Asn Glu Leu 70 75 Asn Gin Glu Phe Glu Ala Phe Ala Lys Gin Leu Gin Asn Glu Lys Gin WO 97/37044 PCT/US97/05223 459 90 Ile Leu Lys Glu Gin Leu Gin Ala Gin Met Thr Val Phe Glu Asp Glu 100 105 110 Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser 115 120 INFORMATION FOR SEQ ID NO:509: SEQUENCE CHARACTERISTICS: LENGTH: 103 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...103 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:509: Lys Gly Gly Phe Ile Ser Ala Lys Ser Cys Leu Ser Met Ala Ser Gly 1 5 10 Leu Phe Glu Asn Asp Glu Ile Lys Asn Asn Lys Ala Arg Asp Phe Phe 25 Tyr Ser His Ser Ser Leu Ile Val Phe Phe Leu Leu Leu Leu Gly Phe 40 Gly Tyr Tyr Leu Gly 
Lys Leu Leu Phe Gly Gly Ser Ser Leu Glu Val 55 Tyr Leu Asp Leu Arg Asp Lys His Glu Arg Leu Gln Gln Glu Ile Thr 70 75 Glu Leu Gln Ser Lys Asn Val Arg Leu Gln Lys Arg Leu Phe Glu Leu 90 Arg Glu Leu Arg Pro Arg Asp 100
INFORMATION FOR SEQ ID NO:510: SEQUENCE CHARACTERISTICS: LENGTH: 332 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature WO 97/37044 WO 9737044PCTIUS97/05223 460 LOCATION .332 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:510: Leu Tyr Tyr Asn Ser Gin Leu Asn Trp Gly Phe Thr 1 Cys Thr His Ala Ser Tyr Tyr Leu Lys 145 Ser Gin Asn Lys Gly 225 Tyr Leu Val Lys Ser 305 Asp Giu Lys Leu Leu Cys Glu Tyr Gly 130 Ala Cys Asn Tyr Gly 210 Cys Giu Tyr Al a Ala 2 9-0 Cys Asp Leu Lys Val Lys Asn Tyr Arg 115 Ser Ile Gly Tyr Giy 195 Val His Ala Lys Val 275 Thr Lys Ala Trp Trp Ala Arg Leu Gly 100 Arg Met Tyr Ser Ala 180 Ile Glu Leu Gly Lys 260 Met Ser Val Gin 5 Leu Phe Thr Gly Arg Asp Gly Tyr Tyr Leu 165 Lys Ser Lys Lys Met 245 Gly Tyr Tyr Leu Asn 325 Ser Leu Thr Asp 70 Met Gly Cys Giu Tyr 150 Gly Ala Cys Asp Asp 230 Asp Cys Tyr Tyr Giu 310 Asp Phe Ile Gly 55 Tyr Gly Val Asn Asp 135 Arg Phe Leu Asn Leu 215 Gly Val1 Ser Thr Lys 295 Val1 Thr Val1 Leu 40 Glu His Val Asp Leu 120 Gly Arg Met Ser Phe 200 Lys Ala Lys Leu Gly 280 Lys Ile Gln Arg 25 Phe Lys Arg Gly Gin 105 Arg Asp Gly Tyr Phe 185 Val1 Lys Ser Gin Lys 265 Lys Gly Gly Asp 10 Asn Leu Tyr Al a Cys 90 Asn Asn Gly Cys Phe 170 Ser Gly Ala Cys Asn 250 Giu Giy Cys Lys Giu Met Phe Val1 75 Thr Ile Tyr Val His 155 Asn Lys Tyr Leu Val 235 Glu Gly Ala Ala Giu 315 Met Ala Lys Ala Ser Se r Leu Gin 140 Leu Gly Tyr Met Ala 220 Ser Giu Ser Pro Leu 300 Ser Asp Ile Ser Met Phe Leu Lys Al a 125 Lys Lys Thr Ala Tyr 205 Asn Leu Gin Gly Lys 285 Gly Asp Leu Lys Cys Ala Tyr Gly Ala 110 Cys Asp Gly Gly Cys 190 Lys Phe Gly Ala Cys 270 Asp Phe Asn Ser Ser Phe Asn Lys Ser Val Ala Phe Gly Val 175 Ser Ser Lys Tyr Leu 255 His Leu Ser Leu Leu Trp Giy Gin Arg Met Phe Ser Pro Val1 160 Lys Leu Ala Arg Leu 240 Asn Asn Giu Gly Gin 320 Ser Val Gin 330 INFORMATION FOR SEQ ID NO:511: SEQUENCE CHARACTERISTICS: LENGTH: 94 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: 
NAME/KEY: misc_feature LOCATION 1...94 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:511: Ile Lys Val Arg Ser Val Lys Lys Thr Glu Arg Gly Leu 1 5 10 Lys Phe Thr Ser Gin Gly Glu Leu Val Pro Leu Glu Ile 25 Thr Ile Leu Ser Glu Ile Lys Ser Ser Ser Lys Gly Ile 40 Asp Gly Tyr Pro Arg Ser Val Glu Gin Met Gin Ala Leu 55 Leu Asn Ala Pro Asn Glu Val Ile Leu Lys Ser Val Ile 70 75 Val Ser Glu Asn Thr Ala Lys Glu Arg Val Leu Gly Gin INFORMATION FOR SEQ ID NO:512: SEQUENCE CHARACTERISTICS: LENGTH: 113 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:512: His Phe Val Phe Arg Gly Asp Phe Cys Trp Ala Tyr Arg 1 5 10 Ile Asp Lys Glu Arg Glu Thr Gin Arg His Asp Ser Ser 25 Ile Asp Glu Leu Val Gly Met Cys Leu Ala Met Ala Ile 40 Ser Leu Val Gly Val Ile Leu Ser Phe Ile Phe Leu Gly 55 Ile Thr Lys Pro Ser Leu Ile Gly Lys Ile Asp Lys Glu 70 75 Gly Leu Gly Val Val Ala Asp Asp Ala Leu Ala Gly Val 90 Leu Ser Thr Leu Leu Ala Ile Asn Ile Leu Gly Phe Phe 100 105 Leu Ile Glu Val Val Glu Ile Leu Ile Asp Lys Glu Glu Val Glu Phe Tyr Tyr Asn Cys Val Leu Asn 110 Tyr Ile Gly Tyr Lys Ala Ile Ser Val Leu Asp Gly Gly Lys WO 97/37044 PCT/US97/05223 462 Phe INFORMATION FOR SEQ ID NO:513: SEQUENCE CHARACTERISTICS: LENGTH: 79 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...79 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:513: Met Trp Pro Val Ala Leu Lys Gin Pro Asn Arg Val Ser His His Phe 1 5 10 Tyr Ile Met Ala Met Leu Phe Ile Leu Phe Asp Val Glu Ile Val Phe 25 Met Phe Pro Trp Ala Ile Asp Phe Lys Lys Leu Gly Leu Phe Gly Leu 40 Val Glu Met Leu Gly Phe Val Phe Phe Leu Ala Ile Gly Phe Ile Tyr 55 Ala Leu Lys Arg Asn Ala Leu Ser Trp G1n Lys Leu Glu Val Lys 70 INFORMATION FOR SEQ ID NO:514: 
SEQUENCE CHARACTERISTICS: LENGTH: 60 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:514: Phe Asp Leu Gly Val Arg Thr Asn Phe Ala Lys Thr Asn Phe Asn Lys 1 5 10 His Arg Leu Asp Gln Gly Ile Glu Phe Gly Val Lys Ile Pro Val Ile WO 97/37044 PCT/US97/05223 463 25 Ala His Lys Tyr Phe Ala Thr Gin Gly Ser Ser Ala Ser Tyr Met Arg 40 Asn Phe Ser Phe Ile Cys Gly Leu Phe Ser Arg Phe 55 INFORMATION FOR SEQ ID NO:515: SEQUENCE CHARACTERISTICS: LENGTH: 148 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...148 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:515: Phe Gly Ser Arg Ala Phe Arg Arg Ile Phe Thr Lys Trp Gly Glu Arg 1 5 10 Asp Phe Arg Lys Gly Ala Val Val Leu Met Arg Leu Leu Ile Ala Leu 25 Val Leu Phe Leu Trp Trp Leu Asn Leu Gly Ala Lys Glu Ala Asp Phe 40 Ile Ser Asp Trp Glu Tyr Gly Leu Ala Leu Tyr Lys Asn Pro Arg Gly 55 Val Ala Cys Ala Lys Cys His Gly Ile Lys Gly Glu Gin Gin Glu Ile 70 75 Thr Phe Tyr Tyr Glu Lys Gly Glu Lys Lys Ile Leu Tyr Ala Pro Lys 90 Ile Asn His Leu Asp Phe Lys Thr Phe Lys Asp Ala Leu Ser Leu Gly 100 105 110 Phe Cys Met Met Pro Thr Tyr Asn Leu Asn Leu Glu Glu Ile Gin Ala 115 120 125 Ile Tyr Leu Tyr Ile Thr Ser Leu Gly His Lys Asp Glu Arg Lys Asp 130 135 140 Pro Ser Lys Pro 145 INFORMATION FOR SEQ ID NO:516: SEQUENCE CHARACTERISTICS: LENGTH: 172 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 464 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...172 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:516: Lys Asn Arg His Lys Leu Leu Tyr Leu Leu Leu Lys Asn Leu Asn Phe 1 5 10 Leu Lys Thr Ile Ser Thr Ile Phe Thr Gin 
Lys Arg Val Ile Ile Arg 25 Ser Gin Gin Gly Phe Ser Cys Tyr Leu Asn Leu Lys Val Lys Thr Met 40 Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Ala Leu Gly Val Met Ser 55 Leu Asn Ala Trp Glu Gin Thr Leu Lys Ala Asn Asp Leu Glu Val Lys 70 75 Ile Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Val Leu 90 Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg Val 100 105 110 Gln Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu Met 115 120 125 Ala Gin Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn Leu 130 135 140 Ser Met Asn Gly Thr Trp Gin Val Arg Val Asp Ile Lys Ser Lys Glu 145 150 155 160 Gly Gin Val Tyr Arg Thr Lys Thr Ser Leu Asp Leu 165 170 INFORMATION FOR SEQ ID NO:517: SEQUENCE CHARACTERISTICS: LENGTH: 74 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...74 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:517: Cys Leu Phe Val Leu Thr Lys Ser Gin Arg Ile Arg Met Lys Arg Leu 1 5 10 Leu Leu Leu Ala Leu Ala Leu Phe Phe Ser Leu Ser Cys Thr Asn Ala 25 WO 97/37044 PCT/US97/05223 465 Gin Glu Ile Lys Glu Thr Gin Glu Thr Lys Lys Thr Lys Glu Thr Lys 40 Ser Gin Thr Arg Phe Asn Ile Ser Thr Thr Lys Val Ile Glu Lys Glu 55 Phe Ser Gin Ser Arg Arg Tyr Tyr Ala Leu INFORMATION FOR SEQ ID NO:518: SEQUENCE CHARACTERISTICS: LENGTH: 293 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:518: Ile Lys Phe Ala Lys Lys Ile Val Leu Ile Leu Lys Lys Gly Gly Phe 1 5 10 Met Lys Thr Asn Gly Leu Phe Lys Met Trp Gly Leu Phe Leu Val Leu 25 Ile Ala Leu Val Phe Asn Ala Cys Ser Asp Ser His Lys Glu Lys Lys 40 Asp Ala Leu Glu Val Ile Lys Gin Arg Gly Val Leu Lys Val Gly Val 55 Phe Ser Asp Lys Pro Pro Phe Gly Ser Val Asp Ser Lys Gly Asn Tyr 70 75 Gin Gly Tyr Asp Val Val 
Ile Ala Lys Arg Met Ala Leu Asp Leu Leu 90 Gly Asp Glu Asn Lys Ile Glu Phe Ile Pro Val Glu Ala Ser Ala Arg 100 105 110 Val Glu Phe Leu Lys Ala Asn Lys Val Asp Ile Ile Met Ala Asn Phe 115 120 125 Thr Arg Thr Lys Glu Arg Glu Lys Val Val Asp Phe Ala Asn Pro Tyr 130 135 140 Met Lys Val Ala Leu Gly Val Ile Ser Lys Asp Gly Val Ile Lys Asn 145 150 155 160 Ile Glu Glu Leu Lys Asp Lys Glu Leu Ile Val Asn Lys Gly Thr Thr 165 170 175 Ala Asp Phe Tyr Phe Thr Lys Asn Tyr Pro Asn Ile Lys Leu Leu Lys 180 185 190 Phe Glu Gin Asn Thr Glu Thr Phe Leu Ala Leu Leu Asn Asn Lys Ala 195 200 205 Ile Ala Leu Ala His Asp Asn Thr Leu Leu Leu Ala Trp Val Lys Gin 210 215 220 His Pro Glu Phe Lys Leu Gly Ile Thr Ser Leu Gly Asp Lys Asp Val 225 230 235 240 WO 97/37044 PTU9/52 PCTIUS97/05223 Ile Ala Pro Ala Ile Lys Lys Gly Asn Pro Lys Leu Leu Giu Trp Leu 245 250 255 Asn Asn Glu Ile Asp Ser Leu Ile Ser Ser Asp Phe Leu Lys Glu Ala 260 265 270 Tyr Gin Giu Thr Leu Giu Pro Ile Tyr Giy Asp Gly Ile Lys Pro Giu 275 280 285 Giu Ile Ile Phe 0Th 290 INFORMATION FOR SEQ ID NO:519: SEQUENCE CHARACTERISTICS: LENGTH: 676 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .676 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:5i9: Ala Phe Giu Glu Leu Giu Pro Leu Ser Phe Ser Phe 1 Gly Ile Lys Gly Leu Ile Ile Asp Lys 145 Phe Thr Ile Ala Ser Val1 Phe Asn Ser Ile Tyr 130 Ala Leu His Leu Cys Lys Ile Cys Lys Phe Gin 115 Met Ser Thr Phe Lys 195 Giu Ile Phe Thr Giu His 100 Ile Sen Ser Lys Asn 180 Giu 5 Ser Leu Gly Tyr Gin Phe Ala Giu Leu Pro 165 Tyr Ile Cys Asp Tyr Asn 70 Gin Lys Tyr Lys Ser 150 Ile Leu Leu Leu Pro Asn 55 Gly Asp Asn Asp Thr 135 Val1 Giu Asn Giu Gly 215 Giy Asn 40 Ang Ile Ala Sen Met 120 Cys Gin Giu Giu Ang 200 Leu 25 Thr Ser Asp Leu Pro 105 Phe Ser Val1 Val1 Gin 185 Val 10 Gly Pro Tyr Sen Leu 90 Leu Lys Sen Ala Tyr 170 Giu Phe Thr Leu Tyr Ala 75 Tyr Lys Giu Cys Gly 155 His 
Lys Phe Lys Asn Al a Leu Gly Arg Gin Asn 140 Leu Phe Lys Leu Thr 220 Asn Phe Gin Gin Cys Asn Pro Lys 125 Gly Lys Phe Ile Tyr 205 Sen Ser Gly Met Phe Gly Trp 110 Asp His Met Asn Ala 190 Asp Pro Leu Ala Phe Asn Thr Lys Leu Arg Ala Asp 175 Giu Val Lys Asp Ile Giu Giu Giu Gly Sen Leu Asp 160 Pro Pro Gly Leu Gly 210 Tyr Leu Thr Leu Arg Asp Ala Arg Ile Ser Gly Gly WO 97/37044 WO 9737044PCTIUS97/05223 Giu 225 rnly Asp Asn Asp Val1 305 Thr Giu Asn Cys Thr 385 Gin Val1 Pro Giu Phe 465 Ile Ser Lys Tyr Leu 545 Thr Ser Thr Ser Asp 625 Gly Ala Leu Ser Val1 Thr Thr Phe 290 Val1 Al a Pro Asn Ile 370 Leu Ser Ile Ala Gin 450 Asn Lys Cys Gly Glu 530 Ile Leu Lys Gly Leu 610 Ile Asp Gin Glu Gin Leu Leu Leu 275 Val1 Phe Leu Pro Ile 355 Thr Leu Leu Tyr Thr 435 Lys Val1 Ile Lys Lys 515 Phe Asp Ser Lys Leu 595 Val Ile Lys Asn Leu Arg Tyr Lys 260 Ile Val1 Ser Tyr Lys 340 Lys Gly Pro Asn Leu 420 Tyr Giu Lys Giu Gly 500 Ser Phe Val1 Gly Asp 580 His Ala Lys Gly Cys 660 Lys Ile Val1 245 Leu Val1 Asp Gly Leu 325 Giu Asn Val Thr Gly 405 Asp Thr Ala Gly Met 485 Ala Ile Ala Gly Gly 565 Thr Phe Leu Asn Gly 645 Glu Arg 230 Leu Ile Val1 Ile Ser 310 Asn Lys Leu Ser Ala 390 Val1 Gin Giy Lys Gly 470 His Lys Ala Lys Leu 550 Giu Giy Giu Gly Ala 630 Lys Lys Ile Asp Asn Giu Gly 295 Val1 Gly His Ser Gly 375 Gin Ciii Ala Val Ile 455 Arg Phe Tyr Asp Phe 535 G iy Aia Lys Asp Asn 615 Asp Val Thr Ala Giu Thr His 280 Pro Lys Thr Phe Val 360 Ser Thr Ile Pro Met 440 Leu Cys Leu Asn Vali 520 Pro Tyr Gin Thr Val1 600 Ser Tyr Ile Gin Ser Pro Leu 265 Asp Lys Asp Lys Leu 345 Gin Gly Leu Val1 Ile 425 Asp Gly Giu Pro Pro 505 Leu Lys Ile Arg Leu 585 Asn Met Ile Ala Ser 665 Gin Ser 250 Arg Lys Ala Leu Lys 330 Giu Ile Lys Leu Gly 410 ely Giu Tyr Lys Asp 490 Gin Asn Ile Thr Ile 570 Tyr His Leu Ile Ser 650 Tyr Ile 235 Ile Asn Giu Gly Leu 315 Ile Ile Pro Ser Asn 395 Leu Lys Ile Ser Cys 475 Val1 Thr Met Ala Leu 555 Lys Ile Leu Val Asp 635 Giy Thr Gly Gly Leu Thr Arg 300 Gin Ciii Lys Leu Her 380 His Ciii Thr Arg Thr 460 Gin Leu 
Leu Ser Val1 540 Gly Leu Leu Leu Ile 620 Met Thr Giy Ser Leu Gin Ile 285 His Asn Arg Asn Lys 365 Leu Ala Tyr Pro Ile 445 Ser Giy Val1 Glu Val1 525 Lys Gin Ala Asp Gin 605 Giu Gly Pro Lys Gly His Lys 270 Lys Gly Asn Pro Val1 350 Gin Ile Lys Leu Arg 430 Leu Arg Asp Gin Ile 510 Glu Leu Asn Lys Glu 590 Val1 His Pro Leu Phe 670 Leu Ciii 255 Lys His Gly His Lys 335 Asn Leu Leu Lys Asp 415 Ser Phe Phe Giy Cys 495 Lys Giu Lys Aia Ciii 575 Pro Leu Asn Asp CGi1u 655 Leu Thr 240 Lys Gly Ala Glu Her 320 Phe Ile Vali Gin Asn 400 Lys Asn Ala Ser Asp 480 Asp Val1 Ala Thr Thr 560 Leu Thr His Leu Giy 640 Val1 Ala WO 97/37044 PCT/US97/05223 468 675 INFORMATION FOR SEQ ID NO:520: SEQUENCE CHARACTERISTICS: LENGTH: 138 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:520: Ile Ile Ile Asp Tyr Ser Ser Gln Tyr Phe Ser Gly Arg Ala Ala Ala 1 5 10 Phe Tyr Gln Ala Leu Asp Asn Phe Ile Ser Arg Tyr Ala Gln Arg Leu 25 Ile Val Thr Asn Leu Ser Gln Ala Ile Arg Ile Tyr Gly Tyr Glu Val 40 Gly Gly Thr Phe Arg Tyr Lys Gly Val Ser Leu Asn Val Gly Ile Ser 55 Arg Thr Trp Pro Thr Thr Arg Gly Tyr Leu Met Ala Asp Ser Tyr Glu 70 75 Leu Ala Ala Ser Thr Gly Asn Val Phe Ile Ile Lys Leu Asp Tyr Thr 90 Ile Ser Lys Asn Arg Asp Gln Ser Cys Met Ala Leu Ala Ala Leu Leu 100 105 110 Pro Val Trp Ile Ile Ala Gly Leu Ile Phe Thr Cys Leu Ile Met Gly 115 120 125 Gln Leu Lys Asn Pro Lys Pro Leu Pro Ile 130 135
INFORMATION FOR SEQ ID NO:521: SEQUENCE CHARACTERISTICS: LENGTH: 173 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature WO 97/37044 PCT/US97/05223 469 LOCATION 1...173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:521: Gin Gly Met Gly Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe 1 5 10 Phe Cys Leu Gly Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu 25 Asp Leu Val Leu Glu Lys Lys Ile Gin Ala Ser Arg Lys Gly Glu Ile 40 Thr Gin Asp Asn Val Pro Ile Ile Thr Ala Ile Ala Thr His Leu Asn 55 Asp Val Asp Ser Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu 70 75 Ile Phe Thr Gin Asn Asn Asp Trp Ile Asp Asp Gly Tyr Ile Ser Tyr 90 Glu Leu Phe Gly Thr Lys Pro Thr Gly Ser Glu Pro Leu Trp Val Arg 100 105 110 Glu Ile Thr Arg Asp Glu Phe Asp Gly Ile Leu Glu Thr Thr Asn Arg 115 120 125 Trp Ser Arg Ala Phe Leu Ile Ala Phe Asp Lys Leu Asp Tyr Leu Ala 130 135 140 Val Gin Glu Ala Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys Ile 145 150 155 160 Val Phe Asn Phe Ala Tyr Gin Val Pro Leu Pro Gin Phe 165 170 INFORMATION FOR SEQ ID NO:522: SEQUENCE CHARACTERISTICS: LENGTH: 109 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...109 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:522: Gin Lys Glu Val Ser Asn Tyr Glu Glu Ser Ile Leu Val Asn Thr Pro 1 5 10 Tyr Ala Arg Val Asn Ile Leu Ala Ile Glu Lys Asn Glu Ser Pro Ile 25 Glu Leu Leu Ala Pro Val Asp Leu Val Thr Ala Leu Ser Asp Leu Met 40 Leu Gly Gly Glu Gly Ala Ser Lys Glu Glu Met Asp Asn Asp Asp Leu 55 Asp Ala Phe Lys Glu Met Ala Ser Asn Ile Phe Gly Ala Ile Ala Thr 70 75 WO 97/37044 PCT/US97/05223 470 Ser Leu Lys Ser Gin Glu Leu Leu Pro Lys Leu Asn Phe Thr Thr Thr 90 Asn Ala Glu Ile Ala Lys Glu Leu Pro Lys Lys Glu Asp 100 105 INFORMATION FOR SEQ ID NO:523: SEQUENCE CHARACTERISTICS: LENGTH: 182 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: 
misc_feature LOCATION 1...182 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:523: Ile Cys Leu Leu Lys Glu Lys Ile Met Ser Asn Ser Met Leu Asp Lys 1 5 10 Asn Lys Ala Ile Leu Thr Gly Gly Gly Ala Leu Leu Leu Gly Leu Ile 25 Val Leu Phe Tyr Leu Ala Tyr Arg Pro Lys Ala Glu Val Leu Gln Gly 40 Phe Leu Glu Ala Arg Glu Tyr Ser Val Ser Ser Lys Val Pro Gly Arg 55 Ile Glu Lys Val Phe Val Lys Lys Gly Asp Arg Ile Lys Lys Gly Asp 70 75 Leu Val Phe Ser Ile Ser Ser Pro Glu Leu Glu Ala Lys Leu Ala Gln 90 Ala Glu Ala Gly His Lys Ala Ala Lys Ala Val Ser Asp Glu Val Lys 100 105 110 Arg Gly Ser Arg Asp Glu Thr Ile Asn Ser Ala Arg Asp Val Trp Gln 115 120 125 Ala Ala Lys Ser Gln Ala Asn Leu Ala Lys Glu Thr Tyr Lys Arg Val 130 135 140 Gln Asp Leu Tyr Asp Asn Gly Val Ala Ser Leu Gln Lys Arg Asp Glu 145 150 155 160 Ala Tyr Ala Ala Met Lys Ala Pro Asn Thr Thr Arg Ala Arg Leu Thr 165 170 175 Lys Ser Ile Lys Trp Leu 180
INFORMATION FOR SEQ ID NO:524: SEQUENCE CHARACTERISTICS: LENGTH: 148 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...148 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:524: Met Glu Phe Leu Ser Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val 1 5 10 Ile Ala Val Ile Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu 25 Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val 40 Val Gln Ile Gln Glu Lys Lys Leu Tyr Ser Phe Ile Ala Ser Pro Ala 55 Met Gly Phe Thr Leu Ile Thr Gly Ile Leu Met Leu Leu Ile Ala Pro 70 75 Glu Met Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val 90 Val Leu Leu Leu Ile Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu 100 105 110 Leu Glu Lys Asp Pro Thr Gly Lys Asn Ala Arg Phe Tyr Arg Val Phe 115 120 125 Asn Glu Ile Pro Thr Ile Leu Met Ile Leu Ile Val Ile Leu Val Val 130 135 140 Val Lys Pro Phe 145
INFORMATION FOR SEQ ID NO:525: SEQUENCE CHARACTERISTICS: LENGTH: 291 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...291 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:525: Gly Leu Thr Ile Arg Gln Ala Phe Asn Lys Thr Arg Ser Thr His Ser 1 5 10 Arg Thr Leu Leu Leu Asp Ile Asp Cys Val Ile Pro Asn Ile Val Arg 25 Arg Leu Leu Ser Asn Lys Thr Leu Pro Lys Arg Phe Ala Thr Tyr Ser 40 Leu Gln Glu Val Gly Val Ile Phe Leu Thr Thr Gln Ile Leu Ser Ile 55 Met Arg Lys Thr Arg Cys Ser Lys Thr Leu Phe Phe Ile Thr Arg Gly 70 75 Arg Glu Ser Phe Arg Tyr Gln Leu Cys Asp His Tyr Lys Gln Lys Arg 90 His Gln Phe Asp Glu Asp Phe Arg Ser Leu Leu Lys Ala Leu Lys Ile 100 105 110 Ala Leu Val Glu Lys Tyr Pro Leu Lys Lys Gly Ala Lys Ile Gln Gly 115 120 125 Glu His Cys Phe Glu Tyr Glu Ala Asp Asp Ile Ile Ser Phe Tyr Lys 130 135 140 Lys Lys Asp Pro Asn Asn Tyr Val Ile Ala Ser Met Asp Lys Asp Ile 145 150 155 160 Leu Tyr Ser Asn Arg Gly Ser His Phe Asn Leu Lys Thr Asn Ala Phe 165 170 175 Phe Asn Val Ser Gln Lys Glu Ala His Phe Phe Ala Tyr Tyr Gln Cys 180 185 190 Val Val Gly Asp Lys Gly Asp Asn Ile Lys Gly Val Lys Gly Ile Gly 195 200 205 Gly Phe Asn Tyr Lys Asp Phe Leu Asn Glu Asp Ala Lys Glu His Glu 210 215 220 Leu Trp Glu Gln Ile Ile Gln Ala Phe Lys Ile Lys Glu Asp Leu Ser 225 230 235 240 Asp Ser Glu Ala Lys Glu Lys Ala Leu Leu Asn Met Arg Leu Val Asn 245 250 255 Met His Gln Met Thr His His Gly Val Ile Lys Leu Trp Glu Pro Glu 260 265 270 Phe Lys Lys Ala Phe Phe Pro Lys Lys Pro Gln Arg Pro Asp Phe Lys 275 280 285 Arg Ile Ser 290
INFORMATION FOR SEQ ID NO:526: SEQUENCE CHARACTERISTICS: LENGTH: 193 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...193 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:526: Gly Val Tyr His Asn Lys Glu Ile Ile Val Ala Tyr Arg Lys Ile Gly WO 97/37044 PCT/US97/05223 473 1 5 10 Lys Val His Ser Thr Leu Thr Thr Thr Ser Met Ile Leu Ala Phe Gly 25 Val Gin Lys Val Leu Phe Ser Gly Val Ala Gly Ser Leu Val Lys Asp 40 Leu Lys Ile Asn Asp Leu Leu Val Ala Thr Gin Leu Val Gin His Asp 55 Val Asp Leu Ser Ala Phe Asp His Pro Leu Gly Phe Ile Pro Glu Ser 70 75 Ala Ile Phe Ile Glu Thr Ser Gly Ser Leu Asn Ala Leu Ala Lys Lys 90 Ile Ala Asn Glu Gin His Ile Ala Leu Lys Glu Gly Val Ile Ala Ser 100 105 110 Gly Asp Gin Phe Val His Ser Lys Glu Arg Lys Glu Phe Leu Val Ser 115 120 125 Glu Phe Lys Ala Ser Ala Val Glu Met Glu Gly Ala Ser Val Ala Phe 130 135 140 Val Cys Gin Lys Phe Gly Val Pro Cys Cys Val Leu Arg Ser Ile Ser 145 150 155 160 Asp Asn Ala Asp Glu Lys Ala Gly Met Ser Phe Asp Glu Phe Leu Glu 165 170 175 Lys Ser Ala His Thr Ser Ala Lys Phe Leu Lys Ser Met Val Asp Glu 180 185 190 Leu INFORMATION FOR SEQ ID NO:527: SEQUENCE CHARACTERISTICS: LENGTH: 80 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:527: Lys Leu Ala Leu Phe Lys Gin Arg Gin Leu Lys Gin Asn Lys Asn Ala 1 5 10 Gin Ala Asn Pro Lys Ser Pro Phe Ile Thr His Val Val Leu Pro Lys 25 Glu Thr Leu Ser Ser Ile Ala Lys Arg Tyr Gin Val Ser Ile Ser Ser 40 Ile Gin Leu Ala Asn Asn Leu Lys Asp Ser Asn Ile Phe Ile His Gin 55 Arg Leu Ile Ile Pro Thr Asn Lys Lys Leu Leu Ala Thr Arg Glu Phe 70 75 WO 97/37044 PCT/US97/05223 474 INFORMATION FOR SEQ ID NO:528: SEQUENCE CHARACTERISTICS: LENGTH: 245 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature 
LOCATION 1...245 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:528: Gly His Arg Lys Arg Ile Phe Ser Lys Pro Ala Leu 1 Glu Tyr Gly Ser Lys Ile Phe Phe Leu 145 Asn Ala Met Pro Lys 225 Val Pro Val Asp Glu Glu Ile Asn Ile 130 Trp Thr Ile Leu Asn 210 Ile Phe Asn Glu Arg Leu Lys Ser Gly 115 Lys Ala His Thr Glu 195 Met Leu Lys Glu Lys Leu Leu Leu Ser 100 Val Lys Leu Gin Leu 180 Ala Phe Pro Lys 5 Ala Leu Leu Ser Lys His Ile Gly Val Ala 165 Glu Arg Ala Lys Asp 245 Leu Tyr Ser Ser 70 Leu Lys Phe Gin Lys 150 Ile Asn Phe Gin Glu 230 Ile Ala Val 55 Leu Leu Val Lys Glu 135 Val Leu Ile Asn Val 215 Phe Asn 40 Tyr Lys Gly Gin Lys 120 Leu Asn Phe Asn Val 200 Glu Ser 25 Lys Ser Phe Leu Asn 105 Ser Phe Gin Val Pro 185 Pro Ile 10 Gin Thr Pro Asn Glu 90 Glu Pro Lys Glu Glu 170 Ile Asn Phe Thr Tyr Glu Gin 75 Asn Ile Asp Ile Asp 155 Gly Ile Leu His Leu Thr Leu Gin Phe Thr Leu Ile 140 Leu Val Asn Lys Lys 220 Leu Arg Pro Ala Val Ser Ile Asn 125 Asp Glu Lys Ala Leu 205 Pro Arg Phe Ile Gly Gly Ile Tyr 110 Glu Leu Phe Gly Gin 190 Leu Gin Ala Asp Lys Val Ala Glu Ser Gly Ser Leu Lys 175 Asp Tyr Lys Leu Gly Lys Gin Ile Lys Arg Ser Arg Lys 160 Gin Lys Tyr Met Ala Val Leu Ile Lys Gly Gly Lys Ala Ile INFORMATION FOR SEQ ID NO:529: SEQUENCE CHARACTERISTICS: LENGTH: 152 amino acids WO 97/37044 PCT/US97/05223 475 TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...152 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:529: Ser Gly Glu His Thr Met Lys Lys Tyr Ile Leu Asn Leu Ala Leu Val 1 5 10 Gly Ala Leu Ser Ala Ser Phe Leu Met Ala Lys Pro Ala His Asn Ala 25 Asn Asn Ser Thr His Asn Thr Lys Glu Thr Thr Asp Ala Ser Ala Gly 40 Val Leu Ala Thr Val Asp Gly Arg Pro Ile Thr Lys Ser Asp Phe Asp 55 Met Ile Lys Gin Arg Asn Pro Asn Phe Asp Phe Asp Lys Leu Lys Glu 70 75 Lys Glu Lys Glu Ala Leu Ile Glu Gin Ala Ile Arg Thr Ala Leu Val 90 Glu Asn Glu Ala Lys Ala Glu Lys Leu Asn Gln Thr Pro Glu Phe Lys 100 
105 110 Ala Met Met Glu Ala Val Lys Lys Gln Ala Leu Val Glu Phe Trp Ala 115 120 125 Lys Lys Gin Ala Glu Glu Val Lys Lys Asp Pro Asn Pro Arg Lys Arg 130 135 140 Asn Ala Gly Phe Leu Gin Arg Gin 145 150 INFORMATION FOR SEQ ID NO:530: SEQUENCE CHARACTERISTICS: LENGTH: 184 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...184 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:530: WO 97/37044 PCT/US97/05223 476 Arg Ala Leu Lys Thr His Gly Ala Leu Ala Phe Val Gin Ala Phe Leu 1 5 10 Glu Ser Phe Lys Gly Phe Leu Ser Gin Ala Thr Leu Ile Ser Val Leu 25 Ile Ala Ser Val Leu Ile Leu Phe Cys Ala Ile Leu Leu Leu Leu Ala 40 Leu Leu Leu Arg Asn Arg Trp Ala Ser Tyr Ile Thr Thr Ala Ala Phe 55 Leu Gly Ala Phe Leu Ser Met Pro Phe Val Leu Asn Val Leu Leu Thr 70 75 Gin Ala Ile Tyr Pro Ile Glu Thr Arg Ile Leu His Ala Asn Pro Leu 90 Ser Tyr Ser Asn Ala Phe Ser Leu Gin Val Gly Val Lys Asn Ile Ser 100 105 110 Lys Phe Ser Leu Asn Lys Cys Val Leu Arg Leu Glu Val Leu Lys Asn 115 120 125 Pro His Asn Phe Val Glu Glu Arg Ala Phe Lys Trp Phe Val Lys Lys 130 135 140 Ser Tyr Glu Lys Thr Phe Lys Glu Lys Ile Leu Pro Glu Glu Ser Lys 145 150 155 160 Val Phe Ser Phe Phe Ile Asp Asp Tyr Pro Tyr Ser Lys Thr Ala Pro 165 170 175 Tyr Gin Val Ser Leu Phe Cys Leu 180 INFORMATION FOR SEQ ID NO:531: SEQUENCE CHARACTERISTICS: LENGTH: 115 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...115 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:531: Ala Met Asn Ile Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val 1 5 10 Arg Ile Glu Glu Asp Glu Lys Lys Glu Leu Phe Ala Glu Ala Thr Asn 25 Glu Asn Pro His Gly Leu Ser Leu Met Ala Leu Ile Gly Val Leu Val 40 Phe Gly Gly Ala Phe Leu Ala Leu Leu Val Pro Lys Ile Tyr Leu Ser 55 Asn Asn Ile Tyr Tyr Ile Ser Arg Lys 
Ile Asn Thr Leu Glu Asp Gln 70 75 Lys Arg Leu Leu Leu Glu Glu Gln Gln Ile Leu Lys Asn Glu Leu Glu 90 Lys Glu Arg Phe Lys Tyr Tyr Ile Glu Asn Ser Glu Asn Ile Gly Asp 100 105 110 Ile Ala Phe 115 INFORMATION FOR SEQ ID NO:532: SEQUENCE CHARACTERISTICS: LENGTH: 238 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...238 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:532: Leu Leu Leu Ile His Phe Leu Ile Leu Arg Lys Leu Leu Gin His Lys 1 5 10 Thr Ile Met Asp Lys Ile Ile Ile Gin Gly Ala Arg Glu Asn Asn Leu 25 Lys Asn Ile Phe Leu Glu Ile Pro Lys Asn Gin Phe Val Val Phe Thr 40 Gly Leu Ser Gly Ser Gly Lys Ser Thr Leu Ala Phe Asp Thr Leu Tyr 55 Ala Glu Gly Gin Arg Arg Tyr Leu Glu Ser Leu Ser Ser Tyr Ala Arg 70 75 Gin Phe Leu Asp Lys Val Gly Lys Pro Asn Val Asp Lys Ile Glu Gly 90 Leu Thr Pro Ala Ile Ala Ile Asp Gin Lys Thr Thr Ser Lys Asn Pro 100 105 110 Arg Ser Thr Val Gly Thr Ile Thr Glu Ile Tyr Asp Tyr Leu Arg Leu 115 120 125 Leu Leu Ala Arg Val Gly Glu Gin Phe Cys Pro Thr Cys Leu Glu Pro 130 135 140 Ile Ser Ser Met Ser Ala Ser Asp Ile Ile Ser Gin Ile Cys His Leu 145 150 155 160 Glu Glu Asn Ser Lys Ile Ile Ile Leu Ala Pro Ile Ile Lys Asp Lys 165 170 175 Lys Gly Ser Phe Asn Asp Lys Leu Glu Ser Leu Arg Leu Lys Gly Tyr 180 185 190 Val Arg Ala Phe Val Asp Gly Val Met Val Arg Leu Asp Glu Glu Ile 195 200 205 His Leu His Lys Thr Lys Lys His Thr Ile Glu Ala Val Val Asp Arg 210 215 220 Val Val Ile Asn Ser Glu Asn Ala Ser Arg Ile Ala Ser Ala 225 230 235 INFORMATION FOR SEQ ID NO:533: WO 97/37044 PCT/US97/05223 478 SEQUENCE CHARACTERISTICS: LENGTH: 361 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...361 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:533: His Gin Ser Phe Lys Arg Ala Phe Glu Pro Arg Arg Lys Gly Arg Val 1 5 10 Phe Arg Ile Met Gly Phe Glu Lys Ser Ile Leu Asp Asn Leu Asn Gly 25 Ala Gin Lys Ile Ala Ala Cys His Ile Gin Gly Pro Leu Leu Ile Leu 40 Ala Gly Ala Gly Ser Gly Lys Thr Lys Thr Leu Thr Ser Arg Leu Ala 55 Tyr Leu Ile Gly Ala Cys Gly Val Pro Ser Glu Asn Thr Leu Thr Leu 70 75 Thr Phe Thr Asn Lys Ala Ser Lys Glu Met Gin Glu Arg Ala Leu Lys 90 Leu Leu 
Lys Asn Gln Ala Leu Ile Pro Pro Leu Leu Cys Thr Phe His 100 105 110 Arg Phe Gly Leu Leu Phe Leu Arg Gln His Met Asn Leu Leu Lys Arg 115 120 125 Ala Cys Asp Phe Ser Val Leu Asp Ser Asp Glu Val Lys Thr Leu Cys 130 135 140 Lys Gln Leu Lys Ile Ser Asn Phe Arg Ala Ser Ile Ser Gln Ile Lys 145 150 155 160 Asn Gly Met Met Asp Leu Ser Val Gln Asp Ser Glu Cys Tyr Lys Ala 165 170 175 Tyr Glu Leu Tyr Gln Asn Ala Leu Lys Lys Asp Asn Leu Val Asp Phe 180 185 190 Asp Asp Leu Leu Cys Leu Ser Leu Lys Ile Leu Gln Asp Asn Glu Lys 195 200 205 Leu Ala Lys Glu Thr Ser Glu Arg Tyr His Tyr Ile Met Val Asp Glu 210 215 220 Tyr Gln Asp Thr Asn Ala Leu Gln Leu Glu Phe Leu Lys Gln Leu Ser 225 230 235 240 Phe Thr His His Asn Leu Cys Val Val Gly Asp Asp Asp Gln Ser Ile 245 250 255 Tyr Gly Phe Arg Gly Ala Asp Ile Ser Asn Ile Leu Asn Phe Ser Lys 260 265 270 His Phe Lys Gly Ala Lys Ile Val Lys Leu Glu Thr Asn Tyr Arg Ser 275 280 285 Ser Ala Glu Ile Leu Ala Cys Ala Asn Ser Leu Ile Ser His Asn Gln 290 295 300 His Arg His Ile Lys Thr Leu Gln Ser Phe Lys Gly Ser His Lys Ser 305 310 315 320 Val Ile Cys Lys Glu Tyr Pro Thr Gln Lys Glu Glu Ser Leu Asp Val 325 330 335 Ala Tyr Gln Ile Gln Ser Pro Phe Lys Glu Gly Arg Glu Phe Arg Lys 340 345 350 Tyr Arg Tyr Phe Val Ser Phe Lys Trp 355 360 INFORMATION FOR SEQ ID NO:534: SEQUENCE CHARACTERISTICS: LENGTH: 152 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...152 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:534: Met Arg Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu 1 5 10 Ser Ser Val His Ala Ser Asn Ala Leu Ile Leu Gin Thr Asp Phe Ser 25 Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val 40 Asp Ser Asn Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr 55 Asn Ile Trp Glu Gly Ala Tyr Arg Leu Tyr Gin Thr Ala Ser Tyr Trp 70 75 Pro Lys Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr 90 Asn Arg Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gin Tyr Phe Val 100 105 110 Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gin Thr Leu Gly Ile 115 120 125 Asp Ser Arg Ala Glu Ile Asp Glu Asn Ser Ile Arg Leu Lys Gly Ser 130 135 140 Lys Phe Leu Ser Ser Arg Pro Val 145 150 INFORMATION FOR SEQ ID NO:535: SEQUENCE CHARACTERISTICS: LENGTH: 243 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 PCT/US97/05223 480 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...243 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:535: Asp Ile Phe Asn Lys Glu Glu Ile Met Ala Ile Asp Leu Ala Glu Val 1 Thr Ala Leu Thr Lys Asn Asn Arg Glu 145 Phe Ala Leu Gly Lys 225 Ala Gly Asn Lys Gin Thr Glu Leu Glu 130 Thr Ser Ile Lys Thr 210 Ala His Ala Gly Asn Thr Met Ser Asn 115 Val Asp Leu Gin Asp 195 Asn Glu Trp Lys Leu Gin Ala Gin Leu 100 Lys Ser Val Phe Ile 180 Tyr Glu Tyr 5 Ala Asp Asp Gin Glu Lys Gly Ala Ser Phe 165 Leu Asn Lys Asn Ala Gin Glu Lys Asn Ala 40 Pro Thr Ala 55 Leu Thr Gin 70 Val Ala Ser Asp Phe Gin Met Asp Asp 120 Leu Asn Ser 135 Gly Ala Asn 150 Asp Glu Lys Asn Glu Asn Gly Gin Lys 200 Gly Glu Lys 215 Leu Gly Ser 230 10 Arg Lys 25 Phe Met Pro Met Val Glu Ala Met 90 Gly Ala 105 Ser Leu Val Ser Phe Asp Ile Asp 170 Asn Glu 185 Gly Tyr Val Pro Lys Lys Glu Met 75 Lys Leu Lys Met Gly 155 Ala Leu Ile Lys Glu Leu Thr Gin Ser Lys Ala Ile 140 Asn Ser Val 
Asn Gly 220 Gin Phe Asp Glu Asn Asp Asn 125 Gly Asn Lys Lys Phe 205 Asn Pro Thr Leu Glu Lys Ile Glu Asn Lys Glu Thr Met 110 Asn Ala Lys Ile Lys Leu Gly Val 175 Thr Ile 190 Glu Trp Tyr Lys Ile Gin Ile Lys Thr Glu Leu Ala Ser 160 Pro Pro Asp Ile Pro Lys Gin Ala Val Phe Ala Asn INFORMATION FOR SEQ ID NO:536: SEQUENCE CHARACTERISTICS: LENGTH: 253 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 481 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...253 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:536: Asn Ala His Ala Phe Thr His Pro Phe Ser 1 Leu Gly Val Lys Pro Gly Arg Gly Tyr 145 Asp Val Glu Ile Ala 225 Thr Gin Ser Thr Leu Glu Phe Ile 115 Phe Ala Ala Ile Glu 195 Glu Gly Lys Lys Gin Ile Lys Tyr Val 100 Ile Thr Arg Thr Tyr 180 Lys Ser His Val 5 Arg Val Val Glu Leu Ile Trp Gly Tyr Val 165 Val Cys Leu Phe Glu 245 Lys Ser Ile Ala 70 His Gly Asp Lys Val 150 Gin Asn Val Asp Gly 230 Glu Asp Val Ser Val Asp Gly Thr Asp 135 Ala Leu Thr Lys Leu 215 Arg Ile Asn Arg 40 Thr Ile Asn Pro Tyr 120 Pro Lys Ala His Ser 200 Leu Glu Lys Thr 25 Tyr Gin Glu Ile Gin 105 Gly Tyr Asn Tyr Asn 185 Val Arg Leu Ala 10 Leu Glu His Glu Lys 90 Gly Gly Lys Leu Ala 170 Thr Phe Pro Glu Phe 250 Ala Pro Ala Pro Phe Leu Asn Asn Lys Ser Pro Glu Ile Val Tyr Phe Phe Ile Asp Ala Gly Phe Cys Pro 125 Val Asp Met 140 Val Ala Ser 155 Ile Gly Val Ser Lys His Lys Leu Thr 205 Ile Tyr Ser 220 Glu Phe Thr 235 Phe Lys Arg Arg Arg Pro Val Lys Asn Leu 110 His Ser Gly Ile Ser 190 Pro Leu Trp Phe Pro Val Ser Val Pro Thr Gly Ala Val Glu 175 Ser Lys Thr Glu INFORMATION FOR SEQ ID NO:537: SEQUENCE CHARACTERISTICS: LENGTH: 65 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: WO 97/37044 PCT/US97/05223 482 NAME/KEY: misc_feature LOCATION 1...65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:537: Val Ile Ser Tyr 
His Phe Ile Ile Leu Lys Asp Val Phe Ile Met Arg 1 5 10 Ile Lys Ala Tyr Phe Leu Arg Leu Ser His Trp Phe Leu Ser Phe Gly 25 Trp Val Leu Ala Leu Val Lys Thr Leu Lys Asn Leu Lys Ile Leu Lys 40 Thr Ile Pro Pro Asn Lys Ile Ala Leu Lys Pro Thr Pro Tyr Gly Ser 55 Glu INFORMATION FOR SEQ ID NO:538: SEQUENCE CHARACTERISTICS: LENGTH: 122 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...122 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:538: Lys Gly Ile Ser Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr 1 5 10 Phe Ser Asn Pro Leu Gln Ala Leu Val Ile Glu Leu Leu Glu Glu Ile 25 Lys Thr Ser Pro His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser 40 Lys Glu Pro Arg Gln Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys 55 Lys Leu Thr Leu Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gln 70 75 Pro Leu Asp Glu Lys Leu Ser Leu Glu Thr Thr Leu Ser Pro Asn Arg 90 Pro Thr Ile Pro Arg Asn Thr Gln Ile Val Phe Ser Ser Lys Glu Leu 100 105 110 Lys Glu Pro His Ser Asn Pro Ile Pro Ser 115 120 INFORMATION FOR SEQ ID NO:539: SEQUENCE CHARACTERISTICS: LENGTH: 136 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...136 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:539: Lys Lys Phe His Thr Leu Ile Leu Pro Val Glu Trp Gly Thr Glu Ile 1 5 10 Val Leu Val Val Ser Gly Ser Leu Gln Asn Ile Val Arg Ser Phe Leu 25 Arg Arg Ile Ser Val Phe Phe Ile Ile Asp Phe Ile Asn Phe Ile Phe 40 Val Ile Asp Gln Ile Ile His Phe Phe Gly Asp Phe Phe Ile Phe Phe 55 Phe Met Leu Asn Leu Phe Tyr Ile Leu Ile Arg Leu Ile Asp Leu Ile 70 75 Val Leu Ser Gly Phe Ser Val Val Leu Val Arg Leu Gly Gln Phe Phe 90 Arg Leu Ile Phe Ala Cys Ala Gln His Ala Phe Asn Lys Val Asp Ile 100 105 110 Asp Lys Met Ser Asp Phe Arg Phe His Gln Leu Ser Phe Phe Lys Ser 115 120 125 Val Tyr Asn Leu Phe Ile Leu Ser 130 135 INFORMATION FOR SEQ ID NO:540: SEQUENCE CHARACTERISTICS: LENGTH: 69 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...69 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:540: Tyr Phe Leu Leu Ser Gly Glu Leu Arg Leu Ser Trp Trp Leu Val
Ala 1 5 10 WO 97/37044 PCT/US97/05223 484 Arg Cys Lys Thr Leu Phe Val Leu Phe Cys Glu Gly Ser Ala Phe Ser 25 Ser Leu Leu Ile Ser Ser Ile Leu Phe Ser Leu Leu Ile Arg Ser Phe 40 Thr Ser Leu Val Thr Phe Ser Ser Ser Ser Ser Cys Ser Ile Phe Phe 55 Ile Ser Leu Leu Gly INFORMATION FOR SEQ ID NO:541: SEQUENCE CHARACTERISTICS: LENGTH: 243 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...243 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:541: Thr Leu Phe Lys Ser Ser Asp Lys Val Cys Asn Leu Met Arg Val Ile 1 5 10 Ile Lys Phe Met Leu Ile Ser Leu Lys Thr Phe Leu Lys Ile Leu Leu 25 Glu Ile Phe Leu Lys Thr Phe Gin Lys Ile Trp Ile Val Cys Val Val 40 Ile Trp Gly Leu Gly Cys Ser Phe Leu Asn Ala Asn Ser Val Gin Leu 55 Glu Glu Thr Leu Arg Arg Asn Pro Lys Asn Leu Ile Trp Gin His Phe 70 75 Lys Lys Lys Phe Lys Lys Ser Asn Thr Ile Pro Tyr Ala Pro Asn Ser 90 Arg Trp Lys Tyr Leu Gly Thr Ser Ile Gly Ile Leu Gly Val Ser Leu 100 105 110 Val Ile Gly Ile Val Gly Leu Tyr Leu Met Pro Glu Ser Val Thr Asn 115 120 125 Trp Asp Lys Glu Lys Phe Gly Val Lys Ser Trp Phe Glu Asn Val Arg 130 135 140 Met Gly Pro Lys Leu Asp Asn Asp Ser Phe Ile Phe Asn Glu Ile Leu 145 150 155 160 His Pro Tyr Phe Gly Ala Met Tyr Tyr Met Gin Pro Arg Met Ala Gly 165 170 175 Phe Ser Trp Met Thr Ser Ala Phe Phe Ser Phe Ile Thr Ser Thr Leu 180 185 190 Phe Trp Glu Tyr Gly Leu Glu Pro Phe Val Glu Val Pro Ser Trp Gin 195 200 205 Asp Leu Val Ile Thr Pro Leu Leu Gly Phe Ile Leu Gly Glu Gly Phe 210 215 220 WO 97/37044 PCT/US97/05223 485 Tyr Gin Leu Thr Pro Ile Ser Asn Ala Thr Lys Ala Ser Cys Leu Ala 225 230 235 240 Leu Tyr Phe INFORMATION FOR SEQ ID NO:542: SEQUENCE CHARACTERISTICS: LENGTH: 486 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...486 (xi) 
SEQUENCE DESCRIPTION: SEQ ID NO:542: Lys Lys Arg Lys Pro Met Phe Arg Lys Leu Ala Thr Ser Val Ser Leu 1 5 10 Ile Ser Leu Leu Ile Ser Asn Ala Leu Tyr Ala Lys Glu Ile Ser Glu 25 Ala Asp Lys Val Ile Lys Ala Thr Lys Glu Thr Lys Glu Thr Lys Lys 40 Glu Ala Lys Arg Leu Lys Lys Glu Ala Lys Gin Arg Gin Gin Ile Pro 55 Asp Asn Lys Lys Pro Gin Tyr Val Ser Val Asp Asp Thr Lys Thr Gin 70 75 Ala Leu Phe Asp Ile Tyr Asp Thr Leu Asn Val Asn Asp Lys Ser Phe 90 Gly Asp Trp Phe Gly Asn Ser Ala Leu Lys Asn Lys Thr Tyr Leu Tyr 100 105 110 Ala Met Asp Leu Leu Asp Tyr Asn Asn Tyr Leu Ser Ile Glu Asn Pro 115 120 125 Ile IIe Lys Thr Arg Ala Met Gly Thr Tyr Ala Asp Leu Ile Ile Ile 130 135 140 Thr Gly Ser Leu Glu Gin Val Asn Gly Tyr Tyr Asn Ile Leu Lys Ala 145 150 155 160 Leu Asn Lys Arg Asn Ala Lys Phe Val Leu Lys Ile Asn Asp Lys Ile 165 170 175 Pro Tyr Ala Gin Ala Thr Phe Leu Arg Val Pro Lys Arg Ser Asp Pro 180 185 190 Asn Ala His Thr Leu Asp Lys Gly Ala Ser Ile Asp Glu Ser Lys Leu 195 200 205 Phe Glu Gin Gin Lys Lys Met Tyr Phe Asn Tyr Ala Asn Asp Val Ile 210 215 220 Cys Arg Pro Asn Asp Glu Val Cys Ser Pro Leu Arg Asp Glu Met Val 225 230 235 240 Ala Met Pro Thr Asn Asp Ser Val Thr Gin Lys Pro Asn Ile Ile Ala 245 250 255 WO 97/37044 PCT/US97/05223 Pro Tyr Ser'Leu Tyr Arg Leu Lys Glu Thr Asn Asn Ala Asn Glu Ala Gin Lys Glu 305 Glu Lys Val Met Lys 385 Asn Leu Tyr Lys Pro 465 Leu Pro Leu 290 Glu Leu Ala Val Arg 370 Glu Ser Glu Ala Glu 450 Leu Leu Ser 275 Ile Arg Ala Leu Glu 355 Val Thr Tyr Glu Asn 435 Gly Arg Lys 260 Pro Glu Glu Lys Glu 340 Val Ile Thr Ser Glu 420 Gly Met Asp Asp 265 Tyr Glu Lys Tyr 325 Ala Pro Lys Ile Lys 405 Ile Ile Leu Lys Leu 485 Ala Leu Lys 310 Lys Glu Ile Glu Lys 390 Lys Lys Asn Cys Leu 470 Lys Thr Ile 295 Leu Leu Leu Pro Lys 375 Arg Thr Ser Leu Gly 455 Gin 280 Ala Leu Lys Lys Pro 360 Glu Ser Pro Tyr Tyr 440 Tyr Thr Asn Ala Asp Lys 345 Lys Asn Tyr Ile Tyr 425 Val Glu Ala Ser Glu Leu 330 Lys Thr Tyr Lys Asn 410 Val Lys Ser Pro Gin Lys 315 Glu Asn Ser Asn Gly 395 Leu Lys 
Ile Val Glu Leu 300 Glu Asn Thr Asp Gly 380 Thr Glu Ser Lys Gin 460 Asn 285 Ile Lys Gin Lys Ser 365 Leu Leu Asp Asn Asn 445 Lys 270 Ser Ala Gin Lys Lys 350 Asp Leu Ile Leu Gly 430 Asp Leu Lys Asn Glu Lys 335 Pro Glu Val Ser Arg 415 Leu Pro Leu Glu Glu Thr 320 Leu Arg Thr Asp Glu 400 Ser Cys Tyr Ser Ala 480 Lys Tyr Asp Lys Gin Lys Leu Gin Lys 475 INFORMATION FOR SEQ ID NO:543: SEQUENCE CHARACTERISTICS: LENGTH: 101 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...101 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:543: Val Val Gly Thr Met Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe Ile 1 5 10 Glu Asp Tyr Lys Leu Val Val Ile Phe Ser Phe Ile Gly Leu Ile Ala 25 Leu Phe Phe Leu Tyr Lys Phe Ile Lys Thr Gln Lys Lys Val Phe Lys 40 Asp Lys Ala Asn Gln Pro Gln Lys Lys Lys Ser Phe Lys Glu Ile Ile 55 Ile Asp Gly Leu Lys Glu Arg Val Lys Thr Phe Gly Phe Trp Leu Pro 70 75 Ser Tyr Thr Ile Thr Ile Leu Phe Phe Tyr His Ile Arg Val Ile Phe 90 Leu Asp Ser Leu Arg 100 INFORMATION FOR SEQ ID NO:544: SEQUENCE CHARACTERISTICS: LENGTH: 293 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:544: Arg Met Ala Gin Ile Val Lys Gly Ser Asp Met Phe Glu Asp Phe Tyr 1 5 10 Arg Thr Thr Leu Ser Phe Leu Arg Ser Leu Leu Leu Leu Leu Gly Leu 25 Leu Leu Pro Phe Ser Leu Cys Ile Ala Asp Glu Tyr Ile Ser Ile Ser 40 Asp Asp Trp Asp Glu Arg Ala Arg Asn Gin Trp Asp Glu Thr Ala Arg 55 Asn His Lys Thr Tyr Tyr Phe Glu Asn Gly Leu Asp His Phe Asn Gin 70 75 Gly Gin Tyr Lys Gin Ala Phe Lys Asp Phe Lys Leu Ala Gin Glu Tyr 90 Ser Ile Gly Leu Gly Asn Val Tyr Leu Ala Lys Met Tyr Leu Glu Gly 100 105 110 Lys Gly Val Lys Val Asp Tyr Lys Lys Ala Gin Phe Tyr Ala Gin Asn 115 120 125 Ala Ile Lys Gly Tyr Gly Ser Gly Leu Leu Gly Gly Ala Leu Ile Leu 130 135 140 Gly Arg Met Gin Ala Glu Gly Leu Gly Met Lys Lys Asp Leu Lys Gin 145 150 155 160 Ala Leu Lys Thr Tyr Arg His Val Val Arg Met Phe Ser Asn Lys Ser 165 170 175 Ala Asn Phe Ala Asn Lys Phe Gly Ser Asn Leu Ala Glu Phe Thr Ser 180 185 190 Met Leu Ile Gly Ser Arg Phe Ile Asp Leu Ser Gly Leu Ser Ala Asn 195 200 205 Pro Ile Lys Phe Gly Asn Lys Phe Gly Ile Leu Val Lys Lys Ala Leu 210 215 220 WO 97/37044 PCT/US97/05223 Gin Ile Lys Asp Asn Thr Leu Ser Trp Glu Asp Ile Ala Glu Ile Ser 225 230 235 240 Ser Asn Ile Ile Leu Leu Lys Gin Gin Met Gly Glu Ile Leu Tyr Arg 245 250 255 Ile Gly Ile Ala Tyr Lys Glu Gly Leu Gly Thr Arg Lys Lys Lys Asp 260 265 270 Arg Ala Lys Lys Phe Leu Gin Lys Ser Ala Glu Phe Gly Tyr Glu Lys 275 280 285 Ala Met Glu Ala Leu 290 INFORMATION FOR SEQ ID NO:545: SEQUENCE CHARACTERISTICS: LENGTH: 81 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...81 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:545: Asp Phe His Asp Phe Cys Ala Asn Ile His His Arg Ile Arg Trp Arg 1 5 10 His Arg His Ile Ala Ser Leu Leu Ser Arg Ser Met Pro Lys Ile His 25 Ala Val Phe Lys Ala 
Phe Ile Pro Ile Pro Phe Ala Leu Phe Ala Ile 40 His Phe Val Val Leu Gly Ile Gly Ser Val Phe Asn Leu Asn Arg Ile 55 Lys Asp Lys Lys Phe Ile Leu Arg Ala Lys Ile Ser His Ile Ala Gin 70 75 Ala INFORMATION FOR SEQ ID NO:546: SEQUENCE CHARACTERISTICS: LENGTH: 104 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 489 (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...104 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:546: Lys Ala Met Leu Ser Ser Thr Asn Asp Ile Arg Tyr Gin Asn Pro Lys 1 5 10 Asn Lys Ala Leu Lys Gly Glu Ile Leu Ala Asn Leu Lys Phe Ile Lys 25 Lys Leu Leu Gly Ile Gly Cys Lys Asp Pro Ser Ala Tyr Val Gin Leu 40 Gly Val Ser Lys Ser Glu Lys Gin Glu Ile Glu Asn Lys Ile Glu Glu 55 Arg Lys Arg Ala Lys Glu Gin Lys Asp Phe Leu Lys Ala Asp Ser.Ile 70 75 Arg Glu Glu Leu Leu Gin Gin Lys Ile Ala Leu Met Asp Thr Pro Gin 90 Gly Thr Ile Trp Glu Lys Leu Phe 100 INFORMATION FOR SEQ ID NO:547: SEQUENCE CHARACTERISTICS: LENGTH: 529 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...529 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:547: Met Arg Lys Val Ile Ile Met Asn Gly Tyr Leu Arg Val Lys Thr Pro 1 5 10 Tyr Phe Leu Ala Ser Val Val Leu Thr Phe Trp Thr Phe Asn Ser Phe 25 Met Ser Ala Lys Asp Lys His His Phe Leu Lys Lys Val Thr Thr Thr 40 Glu Gin Lys Phe Ser Ser Ser Ala Pro Ile Ser Trp Gin Ser Glu Glu 55 Val Arg Asn Ser Thr Ser Ser Arg Thr Val Ile Ser Asn Lys Glu Leu 70 75 Lys Lys Thr Gly Asn Leu Asn Ile Glu Asn Ala Leu Gin Asn Val Pro 90 Gly Ile Gin Ile Arg Asp Ala Thr Gly Thr Gly Val Leu Pro Lys Ile 100 105 110 Ser Val Arg Gly Phe Gly Gly Gly Gly Asn Gly His Ser Asn Thr Asn WO 97/37044 PCT[UJS97/05223 115 Met Ile Leu Val Asn Gly Ii Ile 145 Asp Gly Asn Asn Gly 225 Gly Gly Ala Tyr Tyr 305 Gly Pro Asp Met Ser 385 Ser Ala Gln Arg 
Phe 2 465 Tyr I Pro C Pro I Gin 13( G01 Va G1 Glr Phe 210 Asr Lys Phe Ile Tyr 290 Ala Arg Asp Met Ser 370 Ala Pro Phe rhr Ser 150 1 sp ~la fly 'he 0 u Leu 1 Ile Val Ala 195 Val Gin Tyr Arg Tyr 275 Gin Tyr Ala Arg Ser 355 Gly Lys Cys Glu Phe 2 435 Thr '9 Ala C Ser A Leu A Lys V 515 AlE Lys Val 180 Ala Asp Met Ile Gin 260 Lys Tyr Asn Lys Lys 340 Arg 3ml %sn rrp ?ro 120 sn 'hr fly sp irg 00 'a1 I iE Gl~ 16! Asr Gli Prc Leu Gly 245 Asn Ile Asn Arg Arg 325 Val Asp Asn Pro Gin 405 Lys Met Arg Thr Glu 485 Tyr Gly e Phe 150 Gly Val Arg Lys Phe 230 Ile Ser Asn Ser Phe 310 Phe Gly Phe Lys Asn 390 Phe I Leu I Gly I Lys 1 4 Ser L 470 Ile A Thr P Gin T 13' Pr Thi
ILI
ILE
dli 215 Asr Ser Prc Ala Tyr 295 Ile Gly Gly Gly Ile 375 :ys ?he sn let Isn 155 leu ~sn 'he hr 120 e Pro o Val r Ser a Thr Thr 200 I Lys Thr Ala Thr Thr 280 His Asn Ile Asp Phe 360 Leu Gly Asp Leu Arg 440 Pro Asn 2 Phe I Leu 7 Pro I 520 Ile Tyr Gly Ala Tyr Ser Asn Th Va Lyl 18! PhE Gi Ty2 Glr Lys 265 Asn Prc Glu Va1 Phe 345 Ser Pro Leu ksn Ile 125 Phe 3er ~sn Isn sn i05 lys r Ph 1 GlI 17) s Gl~ e Trr i Lys GiY 1 Gly 250 Val Thr Gly Arg Tyr 330 Lys Asn Phe Tyr Ile 410 Val Leu Met Phe Asn 490 Tyr Thr e dln 155 Tyr a Ile Gly Pro Arg 235 Asn Gin Phe Thr Pro 315 Gin Phe Gin Lys Ser 395 Arg Asn Thr Pro 2 4 Asn 2 475 Gly D.
Glu L Thr L 14 Se Gl' Pr Ar Lei 22( Thi Trr Asr Lys Leu 300 Asp Asn Thr Tyr Gly 380 Tyr krg rhr flu ~sn 160 ~sn let ~ys ,ys 0 r Vai y' Pro 0 Lys j Ser 205 1 Ala Ala Ile Tyr Ala 285 Ser Asn Tyr Tyr Gin 365 Lys Ser Ser Gly Asp I 445 Asn Tyr '9 Leu 'I Lys I dlu P 525 Asj Asr dlL 19 Ser Gin Gly Asn Leu 270 Tyr Ala Gin Phe Phe 350 Ser ly Asp Ial Lys 130 eu fly 'hr 'hr ~sp Lrg Ar 1 Thl 17! TrE Asr Tha Met Gly 255 Leu Tyr Gin Asp Gly 335 Thr Val Glu Thr Val 415 Val Tyr Ser Ala Ile 495 Ala Tyr g Ile 160 r Phe Glu 1 Gly Leu Leu 240 Gin Asp Gin Asp Gly 320 Asp His Tyr Ile Asn 400 Asn Lys Arg Gly Val 480 Thr Pro Asn INFORMATION FOR SEQ ID NO:548: SEQUENCE CHARACTERISTICS: WO 97/37044 PCT/US97/05223 491 LENGTH: 194 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...194 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:548: Arg Val Gly Phe Arg Ser Phe Ser Gly Gly His Cys Ala Glu Ile Phe 1 5 10 Arg Phe Asn Pro His Pro Met Ala Phe Cys Ala Arg Gin Lys Pro Leu 25 Lys Glu Ile Met Arg Phe Phe Ile Leu Phe Phe Met Gly Met Leu Gly 40 Val Gly Phe Ser Gin Thr Glu Leu Asp Leu Lys Asp Leu Glu Lys Lys 55 Pro Ala Gly Ile Val Arg Asp Tyr Tyr Leu Trp Arg Tyr Ile Ser Asp 70 75 Lys Lys Thr Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu Leu Thr Gin 90 Asn Lys Asn Ser Ala Leu Gin Lys Ala Met Gin Glu Lys Gly Val Glu 100 105 110 Asn Ser Asp Lys Ser Pro Asp Ala Lys Met Pro Glu Asp Ile Tyr Cys 115 120 125 Lys Gin Ile Thr Leu Glu Ser Met Leu Glu Thr Thr Asp Ala Phe Gin 130 135 140 Ala Ser Cys Ile Ala Ile Ala Leu Lys Ser Lys Ile Arg Asp Phe Asp 145 150 155 160 Lys Ile Pro Ile Gln Thr Phe Lys Pro Leu Gin Glu Lys Ile Lys Glu 165 170 175 Ala Tyr Pro Ile Leu Tyr Glu Glu Leu Glu Ile Leu Gin Ser Lys Asn 180 185 190 Val Ser INFORMATION FOR SEQ ID NO:549: SEQUENCE CHARACTERISTICS: LENGTH: 206 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: 
YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 492 (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:549: His Ala Leu Tyr Ser Pro Ile Lys Ile Ala Phe Glu Lys Ala Leu Lys 1 5 10 Asn Ala Lys Glu Ser Val Phe Ile Ala Ser Ser Tyr Phe Ile Pro Gly 25 Lys Lys Ile Met Lys Ile Phe Lys Asn Gin Ile Ser Lys Gly Ile Glu 40 Leu Asn Ile Leu Thr Asn Ser Leu Ser Ser Thr Asp Ala Ile Val Val 55 Tyr Gly Ala Trp Glu Arg Tyr Arg Asn Lys Leu Val Arg Met Gly Ala 70 75 Asn Val Tyr Glu Ile Arg Asn Asp Phe Phe Asn Arg Gin Ile Lys Gly 90 Arg Phe Ser Thr Lys His Ser Leu His Gly Lys Thr Ile Val Phe Asp 100 105 110 Asp Ala Leu Thr Leu Leu Gly Ser Phe Asn Ile Asp Pro Arg Ser Ala 115 120 125 Tyr Ile Asn Thr Glu Ser Ala Val Leu Phe Asp Asn Pro Ser Phe Ala 130 135 140 Lys Arg Val Arg Leu Ser Leu Lys Asp His Ala Gin Gin Ser Trp His 145 150 155 160 Leu Val Leu Tyr Arg His Arg Val Ile Trp Glu Ala Thr Glu Glu Gly 165 170 175 Ile Leu Ile His Glu Lys Asn Ser Pro Asp Thr Ser Phe Phe Leu Arg 180 185 190 Leu Ile Lys Glu Trp Ser Lys Val Leu Pro Glu Arg Glu Leu 195 200 205 INFORMATION FOR SEQ ID NO:550: SEQUENCE CHARACTERISTICS: LENGTH: 442 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...442 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:550: Gly Lys Ile Lys Gly Val Met Met Lys Phe Phe Leu Leu Lys Lys Phe 1 5 10 Ser Glu Phe Leu Asn Thr Gin Thr His Phe Asn Leu Lys Arg Leu Asn WO 97/37044 PCT/US97/05223 Ala Ser Ser Phe Leu Leu Glu Thr Phe Val Glu Phe Arg Thr Ile Arg 145 Glu Glu Asn Lys Glu 225 Asn Cys Lys Tyr Gin 305 Phe Glu Lys Met Lys 385 Ile Gin Lys Val Ser Thr Ile Phe Leu 130 Val His Lys Gin Leu 210 Leu Arg Met Lys Leu 290 Ile Met Val Glu His 370 Asn Lys Arg Tyr Asp Val Lys Leu Ile 115 Asp Ala Gin Asp Ile 195 Glu Gin Arg Ile Phe 275 Glu Asn Pro Leu Asn 355 Val Thr Met Lys Arg 435 Leu Leu Asn Glu 
100 Leu Gin Lys Glu Phe 180 Ile Lys Thr Glu Glu 260 Thr Glu Tyr Val Tyr 340 Ile Arg Pro Gln Phe 420 Thr Ser Lys Ala Ile Arg Glu Asn Glu 165 Leu Lys Leu Gin Asn 245 Ile Leu Glu Val Lys 325 Tyr Lys Asp Lys Lys 405 Val Ile SAla Asn 70 Lys Lys Leu Lys Asp 150 Asp Ser Arg Glu Ala 230 Arg Asp Ser Asn Arg 310 Asn Lys Leu Ile Asp 390 Asp Lys Ser Pro 55 Thr Ile Gly Glu Cys 135 Ile Leu Tyr Leu Asp 215 Ser Val Lys Lys Leu 295 Asp Ser Asp Leu Pro 375 Glu Ala lie Leu 40 Tyr Leu Leu Ala SMet 120 Val Leu Asp Gin Asn 200 Pro Leu Ile Ser Lys 280 Lys Ala Lys Phe Gin 360 Gly Val Phe Ile Lys 440 Ile Ala Gin Lys 105 Ile Ile Gly Phe His 185 Ala Lys Leu Leu Met 265 Lys Glu Ala Ile Lys 345 Asp Ser Ile Asn Lys 425 Asp Ser Gly Leu Ala 90 Asp Pro Glu Ala Lys 170 Lys Gin Thr Leu Lys 250 Pro Lys Lys Glu Lys 330 Ile Ala His Met Gly 410 Gly Thr Lys Leu Asp 75 Asn Leu Lys Ala Leu 155 Gly Glu Lys Leu Thr 235 Asp Leu Gin Ile Glu 315 Arg Gly Arg Leu Glu 395 Tyr Ala Glu Ser Phe Val Ala Lys Phe 140 Pro Leu Leu Glu Gin 220 Tyr Phe Asn Lys Ala 300 Ser Pro Leu Ala Ile 380 Leu Glu His Lys Lys Cys Ile Tyr Ala 125 Arg Pro Leu Glu Arg 205 Leu Gin Glu Ala Ser 285 Phe Val Met Gly Asn 365 Val Ala Ile Val His Lys Leu Asp Lys 110 Asn Phe Asn Asp His 190 Leu Glu His Asp Phe 270 Gin Lys Leu Asn Lys 350 Asp Phe Lys Asp Ile 430 Ala Pro Asn Asn Ser Leu Asn Ile Ile 175 Lys Lys Ala Leu Lys 255 Ile Phe Glu Glu Gly 335 Asn Leu Cys Met Tyr 415 Tyr Phe Pro Lys Asp Glu Met Asp Tyr 160 Leu Lys Glu Lys Ile 240 Glu Asn Leu Asn Met 320 Tyr Gin Trp Gin Leu 400 Thr Ser INFORMATION FOR SEQ ID NO:551: SEQUENCE CHARACTERISTICS: WO 97/37044 PCT/US97/05223 494 LENGTH: 293 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...293 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:551: Gly Phe Met Met Ile Phe Ile Asp Ala Cys Phe Arg Lys Glu Thr Pro 1 5 10 Tyr Thr Pro Ile Trp Met Met Arg Gin Ala Gly Arg Tyr 
Leu Ser Glu 25 Tyr Gin Glu Ser Arg Lys Lys Ala Gly Ser Leu Leu Glu Leu Cys Lys 40 Asn Ser Asp Leu Ala Thr Glu Val Thr Leu Gin Pro Val Glu Ile Leu 55 Gly Val Asp Ala Ala Ile Leu Phe Ser Asp Ile Leu Val Val Pro Leu 70 75 Glu Met Gly Leu Asn Leu Glu Phe Ile Pro Lys Lys Gly Pro His Phe 90 Leu Glu Thr Ile Thr Asp Leu Lys Ser Val Glu Ser Leu Lys Val Gly 100 105 110 Val Tyr Lys Gin Leu Asn Tyr Val Tyr Asp Thr Ile Ser Gin Thr Arg 115 120 125 Gin Lys Leu Ser Lys Glu Lys Ala Leu Ile Gly Phe Cys Gly Ser Pro 130 135 140 Trp Thr Leu Ala Thr Tyr Met Ile Glu Gly Glu Gly Ser Lys Ser Tyr 145 150 155 160 Ala Lys Ser Lys Lys Met Leu Tyr Ser Glu Pro Glu Val Leu Lys Ala 165 170 175 Leu Leu Glu Lys Leu Ser Leu Glu Leu Ile Glu Tyr Leu Ser Leu Gin 180 185 190 Ile Gin Ala Gly Val Asn Ala Val Met Ile Phe Asp Ser Trp Ala Ser 195 200 205 Ala Leu Glu Lys Glu Ala Tyr Leu Glu Phe Ser Trp Asp Tyr Leu Lys 210 215 220 Lys Ile Ser Lys Glu Leu Lys Lys Arg Tyr Ala His Ile Pro Val Ile 225 230 235 240 Leu Phe Pro Lys Gly Ile Gly Ala Tyr Leu Asp Ser Ile Asp Gly Glu 245 250 255 Phe Asp Leu Phe Gly Leu Asp Gly Gly Thr Pro Leu Thr Ala Ala Lys 260 265 270 Lys Lys Leu Ala Val Ile Ile Phe Leu Gin Gly Ile Tyr Asn Pro Pro 275 280 285 Ala Phe Met Ile Lys 290 INFORMATION FOR SEQ ID NO:552: WO 97/37044 495 SEQUENCE CHARACTERISTICS: LENGTH: 192 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...192 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:552: Arg Val Phe Leu Thr Lys Lys Phe Met Ser Trp Met PCTIUS97/05223 1 Ala Leu Ser Val Arg Pro Glu Arg Asn Pro Leu His Ile Gin Ile Pro 130 Arg Leu 145 Asn Asp His Phe Ile Lys Asn Phe Gin Ser Ser 115 Leu Asn Asn Phe Cys Lys Ile Phe Asn Leu 100 Pro Lys Cys Leu Ser 180 5 Val Ser Glu Lys Ser Asp Ile Asn Ser Lys 165 Arg Leu Leu Gly Ile 70 Pro Lys Leu Leu Leu 150 Cys Gly Leu Thr Met 55 Ala Ile Ser Glu Asn 135 Thr Asp Phe Gly Val 25 Ala Tyr 40 Gly 
Ile Cys Val Met Asp Ser Leu 105 Gin Ser 120 Ala Leu Phe Asn Leu Thr Asn Gly 185 10 Phe Leu Ile Ser Phe 90 Thr Ile Leu Ala Asn 170 Gly Ile Asn Gly Lys 75 Lys Leu Gin Glu Leu 155 Ala Ala Phe Ala Val Glu Asn Ser Gin Lys 140 Asp Glu Arg Val Phe Tyr Pro Leu Leu Ile Lys 125 Phe Glu Asn Lys Val Thr Leu Phe Arg Glu His 110 Ile Lys Lys Leu Ser 190 Ile Ser Glu Lys Phe Ile Ser Ser Pro Thr Leu 175 Leu Gly Met Gin Cys Leu Lys Gln Gin Thr Leu 160 Leu Pro INFORMATION FOR SEQ ID NO:553: SEQUENCE CHARACTERISTICS: LENGTH: 370 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 496 (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1...370 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:553: Gin 1 Leu Val Phe Ile Thr Pro Asn Lys Leu 145 Ser Ser Ser Thr Gly 225 Lys Phe Phe Val Lys I 305 Vai Leu 1 Asp Gin Pro Met Gin Phe Gin Lys Thr Leu Leu Ser Leu Ser Leu Phe Gi4 Leu Tyr Phe Thr Ile Leu 130 Glu Gin Asn Ser Lys 210 Tyr Met Ile Ala Ser 290 vlet %sn kla Le PhE Asr Lys Asr Glu Glu 115 Ala Leu Asn Ser Met 195 Lys Thr Asn Asp Leu 275 Gin His Val Val i Ser Glu Gin Leu Tyr Lys 100 Lys Gin Val Ser Leu 180 Tyr Lys Asn Asn Asn 260 Ala Met Thr Asp 1 Asn 5 Tyi Tyr Glu Asn Ile Gin Ile Ala Glu Met 165 Asn Gly Asn Phe His 245 Ala ly ksp Ser krg 325 3er Cys Ser Arg Gin 70 Asn Ala Vai Leu Asn 150 Leu Ala Val Gin Gly 230 Leu Gin Ser Phe Phe 310 His Phe Ii( lE Ile 55 Val Asr Glu Met Glu 135 Leu Ser Leu Gly Gly 215 Phe Tyr Lys Ser Ile 295 Phe Asn Tyr Ala Ser 40 Gin Lys Ala Thr Leu 120 Lys Lys Ser Asp Leu 200 Phe Val Gly His Trp 280 Asn Gin Gly I Glu Val b 360 Glu 25 His Thr Asn Leu Tyr 105 Ser Met Asn Leu Pro 185 Ser Arg Gly Leu Ser 265 Val %sn Ile ?he rhr 345 det 10 I Glu Ala Ile Glu Lys 90 Tyr Gly Gin Leu Ser 170 Ser Vai Tyr Asn Gly 250 Ser Gly Tyr Pro Glu I 330 His Phe I Asn Val Ser Ile 75 Asn Leu Gly Glu Glu 155 Ser Ser Gly Tyr Gly 235 Ile Val Ser Leu Leu 315 Met ly %sn Gl Glu Asn Thr Asn Gin 
Val Pro 140 Leu Gin Tyr Tyr Leu 220 Phe Asp Gly Gly Thr 300 Asn Gly Lys Val Ala His Ala Asn Ala Ser Ala 125 Ile Gin Ile Ser Lys 205 Phe Asp Tyr Phe Leu 285 Asp Phe Leu Gly I Ser 365 Tyi Asr Gir Met Lys Thr 110 Ser Thr Phe Ala Lys 190 His Tyr Gly Leu Tyr 270 ly ryr ly ys .eu 350 ryr Ala Asn Asn Pro Leu Leu Asn Asn Ser Gin 175 Asn Phe Asp Leu Phe 255 Val Met Arg Vai 1 Ile I 335 Asn 1 Val '9 Ser Pro Lys Asn Thr Gin Pro Pro Gin 160 Ile Val Phe Tyr Gly 240 Asn Gly rrp kia %rg 320 ?ro kla ryr Ser Leu Phe Phe Lys Arg Leu 355 Ser Phe 370 INFORMATION FOR SEQ ID NO:554: WO 97/37044 PCT/US97/05223 497 SEQUENCE CHARACTERISTICS: LENGTH: 243 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...243 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:554: Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Glu 1 Thr Ala Thr Gly Pro Val Gly Val Ala 145 Ile Phe Leu Asn Phe 225 Val Arg Gin Ser Val His Gin Pro Glu 130 Tyr Ala Leu Asp Leu 210 Ala Pro Asp Val Tyr Asp Leu Phe Ile 115 Lys Gly Met Arg Leu 195 Gly Asn Thr Val Glu Leu Leu Leu Trp 100 Gin Ile Ser Phe Gly 180 Phe Gly Val 5 Gly Ile Leu Gly Val Ser Ile Pro 70 Ser Ser Asp Gly Ser Ser Asn Ser Lys Val 150 Glu Ala 165 Asn Pro Ile Ser Thr Met Gly Asp 230 Gly Lys Cys 55 Ser Pro Arg Phe Ile 135 Lys Thr Lys Lys Gin 215 Phe Ala Met 40 Asn Ala Thr Val Glu 120 Pro Ile Leu Ala Gly 200 Pro Lys Lys 25 Leu Thr Ile Val Thr 105 Met Gly Asp Ile Leu 185 Cys Phe Gly 10 Asn Tyr Cys Gly Tyr 90 His Gly Tyr Phe Thr 170 Ser Val Gly Ser Glu Phe Asp His Asn Ser Gin 75 Asn Ser Leu Asn Ala Asp Val Lys 140 Lys Leu 155 Pro Ser Lys Ala Ala Cys Val Val 220 Gin Ile Pro Leu Trp Val Glu Pro 125 Leu Ile Arg Glu His 205 Lys Ile Met Arg Gly Lys Phe Gin 110 Lys Phe Ala Tyr Lys 190 Asn Pro Lys Thr Ile Leu Lys Asn Ala Val Arg Asp Asp 175 Glu Gly Tyr Lys Ser Ser Gly Asn Asp Gin Val Lys Ser 160 Asp Gly Ile Lys Lys 240 Asp Lys Asn Gly Leu Val 235 
INFORMATION FOR SEQ ID NO:555: SEQUENCE CHARACTERISTICS: LENGTH: 234 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...234 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:555: Asp Phe Trp Arg Lys Glu Ile Thr Met Arg Ala Asn 1 Ile Ser Asn Val Ile Lys Glu Gly Leu 145 Asp Glu Gly Ser Glu 225 (2) Phe Ile Ala Arg Val Trp Ala Ala 130 Glu Ile Asn Ser Thr 210 Ala Ser Asn Ser Leu Phe Ile Gin 115 Pro Lys Ser Ile Gin 195 Gin Gin Phe Leu Ser Ile Lys Asp 100 Lys Thr Glu Val Ser 180 Asp Ala Ser 5 Ser Lys Phe Ile Ala Leu Ala Tyr Asn Lys 165 Ser Phe Met Val Ser Gin Glu Ile 70 Gin Pro Cys Ala Ser 150 Gly Leu Gin Gin Ser 230 Ala Met Met 55 Leu Asp Arg Leu Val 135 Thr Asp Glu Glu Glu 215 Pro Leu Leu 40 Thr Ser Gly Asn Gly 120 Arg Tyr Ser Asn Ser 200 Ala Leu Ala 25 Pro Gin Ala Gin Met 105 Val Phe Arg His Lys 185 Ala Ile Lys 10 Leu Glu Cys Asp Ile 90 Leu Ala Thr Ala Ser 170 Thr Ile Ser Lys Leu Ile Pro Leu 75 Thr Lys Leu Ile Glu 155 Gly Thr Gin Leu Leu Arg Lys Phe His Thr Pro Leu 140 Phe Val Lys Ser Ile 220 Thr Ala Gin Gly Thr Tyr Pro Leu Asn Thr Gly Lys Met Phe 110 Pro Tyr 125 Ser Phe Ala Leu Ile Ile Thr Ser 190 Leu Gin 205 Lys Lys Ile Lys Cys Leu Asp Leu Thr Glu Lys Glu His Gin Met Gin Gly Ala Ser Leu Gly Tyr 160 Lys His 175 Lys Asn His Val Ala Val INFORMATION FOR SEQ ID NO:556: SEQUENCE CHARACTERISTICS: LENGTH: 283 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 WO 9737044PCTIUS97/05223 499 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1 .283 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:556: Arg Gin Phe Ile Gin Phe Asn Thr Arg Arg Glu Ile Leu Glu 1 Val Met Lys Ala Ile Leu Vali Ser Leu 145 Gin Leu Asp Leu Leu 225 Met Asp Asn *Asn Vai Tyr s0 Thr Lys Met Ser His 130 Ile Vali Asp Asp Ile 210 Asp Phe Asn Pro *Tyr Giy Tyr Asn Val Ile Ser 115 Asn Phe Val1 Asp Leu 195 Asn Vai Thr Phe Tyr 275 Thr Phe Giu Ser Giy Lys 100 Arg Giu Lys Gin Giu Ile Asn Asp Pro Ja 1 260 Lys 5 Leu Ile Tyr Pro Phe Ser Giy Giu Giu Giu 165 
Asn Aia Aia Lys Leu 245 Giu Thr Ile Leu Val Ile 70 Aia Ser Phe Phe Gly 150 Val1 Val Asn Asn Arg 230 Ile Lys Ile Gly Trp Val1 55 Asn Lys Val1 Met Tyr 135 Leu Met Glu Leu Asn 215 Val1 Met Giy Phe Giy Leu 40 Tyr Tyr Asp Lys Gly 120 Gly Met Lys Lys Asp 200 Leu Lys Gin Ser Gly 280 Leu 25 Giy Thr Lys Lys Ile 105 Leu Ser Asp Ala Phe 185 Ser Val1 Gin la Alia 265 10 Phe His Asp Gly Val 90 Arg Lys Gly Arg Ile 170 Lys Arg Ser Giy Gl1 250 Leu Phe Leu Lys Ile 75 Gly Lys Phe Asp Leu 155 Arg His Lys Asn Gin 235 Leu Ile Leu Gly Asp Gin Val1 Asp Leu Lys 140 Ser Asn Ile Thr Vai 220 Tyr Ser Asp Cys Leu Leu Val Val1 Ser Ala 125 Gly Gly Val Leu Gin 205 Asn Asp Leu Lys Leu *Asp *Gly Gly Arg Lys 110 Leu Giu Asp Asn Ala 190 Phe Asn Phe Arg Phe 270 Arg Val Asp Gly Asn Leu Val Giu Arg Ala Arg 175 Ser Asp Val Lys A\sn 255 Asp His Cys Gly Ile Val1 Asp Ala Gin Ile Asn 160 Ile Val1 Ser Ala Ala 240 Ile Ala G1u Arg Lys INFORMVATION FOR SEQ ID NO:557: SEQUENCE CHARACTERISTICS: LENGTH: 150 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...150 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:557: Ser Ser Ser Gln Ser Ile Ala Leu Leu Ile Gly Leu Trp Phe Gly Phe 1 5 Gln Lys Arg Ile Ala Leu Gly Val Trp Phe Phe Phe Ser Ile Leu Leu 25 Gly Glu Phe Thr Leu Lys Ser Leu Lys Leu Leu Val Ala Arg Pro Arg 40 45 Pro Val Thr Asn Gly Glu Leu Val Phe Ala His Gly Phe Ser Phe Pro 55 60 Ser Gly His Ala Leu Ala Ser Ala Leu Phe Tyr Gly Ser Leu Ala Leu 70 75 Leu Leu Cys Tyr Ser Asn Ala Asn Asn Arg Ile Lys Thr Ile Gly Ala 90 Ile Ile Leu Leu Phe Trp Ile Phe Leu Met Ala Tyr Asp Arg Val Tyr 100 105 110 Leu Gly Val His Tyr Pro Ser Asp Val Leu Gly Gly Phe Leu Leu Gly 115 120 125 Ile Ala Trp Ser Cys Cys Ser Leu Ala Leu Tyr Leu Gly Phe Leu Lys 130 135 140 Arg Pro Tyr Lys Ala Ala 145 150 INFORMATION FOR SEQ ID NO:558: SEQUENCE
CHARACTERISTICS: LENGTH: 73 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...73 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:558: Met Met Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr 1 5 10 Ile Leu Phe Thr Leu Phe Leu Tyr Ala Tyr Ile Phe Ser Met Tyr Arg 25 Lys Gln Lys Lys Gly Ile Val Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala 40 Leu Asn Asp Ala Leu Glu Asp Glu Leu Ile Glu Pro Arg His Lys Glu 55 Val His Asp Lys Gly Ile Lys Glu Ser INFORMATION FOR SEQ ID NO:559: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:559: Gly Asn Leu Met Gly Gln Thr Lys Glu Ile Ile Thr Thr Leu Leu Pro 1 5 10 Leu Leu Val Leu Phe Leu Ile Phe Tyr Phe Leu Ile Val Arg Pro Gln 25 Arg Gln Gln Gln Lys Lys His Lys Glu Met Ile Glu Gly Leu Thr Lys 40 Gly Asp Lys Ile Val Thr Gln Gly Gly Leu Ile Val Glu Val Leu Lys 55 Ala Glu Ala Asn Phe Phe Ser Val Lys Leu Asn Asp Asp Thr Thr Ala 70 75 Lys Leu Ser Lys Asn Tyr Val Ala Phe Lys Leu Asp Glu Glu Thr Thr 90 Pro Asn Asn Asn 100 INFORMATION FOR SEQ ID NO:560: SEQUENCE CHARACTERISTICS: LENGTH: 159 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...159 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:560: Met Lys Leu Phe Asn Pro Arg Leu Ile Val Phe Ile Phe Ala Leu Leu 1 5 10 Leu Gly Val Gly Phe Ser Val Pro Ser Leu Leu Glu Thr Lys Gly Pro 25 Lys Ile Thr Leu Gly Leu Asp Leu Arg Gly Gly Leu Asn Met Leu Leu 40 Gly Val Gln Thr Asp Glu Ala Leu Lys Asn Lys Tyr Leu Ser Leu Ala 55 Ser Ala Leu Glu Tyr Asn Ala Lys Lys Gln Asn Ile Leu Leu Lys Asp 70 75 Ile Lys Ser Ser Leu Glu Gly Ile Ser Phe Glu Leu Leu Asp Glu Asp 90 Glu Ala Lys Lys Leu Asp Ala Leu Leu
Leu Glu Leu Gln Gly His Ser 100 105 110 Gln Phe Glu Ile Lys Lys Glu Ala Glu Phe Tyr Ser Val Lys Leu Thr 115 120 125 Pro Leu Glu Gln Glu Glu Leu Arg Lys Asn Thr Ile Leu Gln Val Ile 130 135 140 Gly Ile Ile Arg Ile Arg Leu Asp His Leu Ala Trp Gln Ser Leu 145 150 155 INFORMATION FOR SEQ ID NO:561: SEQUENCE CHARACTERISTICS: LENGTH: 88 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...88 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:561: Gly Asn Val Lys Val Asn Lys Lys His Arg Leu Ala Phe Leu Gly Leu 1 5 10 Ile Val Gly Val Leu Phe Phe Phe Ser Ala Cys Gln His Arg Leu His 25 Met Gly Tyr Tyr Ser Glu Val Thr Gly Asp Tyr Leu Phe Asn Tyr Asn 40 Ser Thr Ile Val Val Ala Tyr Asp Arg Ser Asp Ala Met Thr Ser Tyr 55 Tyr Ile Asn Val Ile Val Tyr Glu Leu Gln Lys Leu Gly Phe Tyr Asn 70 75 Val Phe Thr Gln Ala Asn Ser Arg INFORMATION FOR SEQ ID NO:562: SEQUENCE CHARACTERISTICS: LENGTH: 165 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:562: Lys Ala Leu Met Val Gly Met Lys Thr Gly Ala Trp Thr Gly Leu Lys 1 5 10 Leu Phe Ala Gln Pro Leu Leu Val Val Leu Ala Phe Met Leu Leu Tyr 25 Ala Leu Ala His Ala Val Leu Gly Phe Tyr Val Lys Lys Asp Ser Ala 40 Pro Met Ser Pro Asn Val Glu Lys Ser Glu Thr Glu Arg Gln Asn Ser 55 Thr Phe Ser Pro Lys Glu Glu Ala Asn Ala Thr Thr Thr Ala Thr Glu 70 75 Gln Asn Pro Thr Lys Asp Thr Val Pro Pro Leu Asp Thr Ala Thr Gln 90 Lys Gln Glu Ile Lys Gln Glu Ile Lys Gln Glu Ile Lys Gln Glu Ile 100 105 110 Lys Gln Glu Ile Lys Gln Glu Ile Lys Gln Glu Ile Lys Gln Glu Thr 115 120 125 Lys Gln Glu Gln Glu Lys Glu Asn Lys Pro Lys Gln Asn Ser Val Ser 130 135 140 Pro Val Gln Asn Asp Gln Lys Thr Pro Thr Thr Pro Leu Met Gly Lys 145 150 155 160 Lys Thr Ser Arg Val 165 INFORMATION FOR SEQ ID NO:563: SEQUENCE CHARACTERISTICS: LENGTH: 97 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: WO 97/37044 PCT/US97/05223 504 NAME/KEY: misc_feature LOCATION 1...97 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:563: Met Leu Tyr Ala Ser Lys Ala Arg Leu Phe Leu Gin Ile Lys Gly Lys 1 5 10 Phe Met Leu Arg Ile Leu Ile Pro Leu Leu Ile Ile Val Trp Val Leu 25 Trp Arg Leu Phe Leu Arg Gin Lys Pro His Lys Asp Asp His Arg Asp 40 Asn His Ser Tyr Thr Gin Gin Thr Pro Lys Glu Leu Glu Asp His Met 55 Ile Val Cys Ser Lys Cys Gin Thr Tyr Val Ser Ser Lys Asp Ala Ile 70 75 Tyr Ser Gly Ala Val Ala Tyr Cys Ser Glu Thr Cys Leu Lys Asp Lys 90 Gly INFORMATION FOR SEQ ID NO:564: SEQUENCE CHARACTERISTICS: LENGTH: 84 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...84 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:564: Lys Thr Ile Asn Lys Leu Arg Ile Ile Met Lys Pro Thr Asn Glu Pro 1 5 10 Lys Lys Pro Phe Phe Gin Ser Pro Ile Val Leu Ala Val Leu Gly Gly 25 Ile Leu Leu Ile Phe Phe Leu Arg Ser Phe Asn Ser Asp Gly Ser Phe 40 Ser Asp Asn Phe Leu Ala Ser Ser Thr Lys Asn Val Ser Tyr His Glu 55 Ile Lys Gin Leu Ile Ser Asn Asn Glu Val Glu Asn Val Ser Ile Gly 70 75 Gin Thr Leu Ile INFORMATION FOR SEQ ID NO:565: SEQUENCE CHARACTERISTICS: LENGTH: 230 amino acids WO 97/37044 WO 9737044PCTIUS97/05223 505 TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .230 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:565: Phe Lys Asn Asn Lys Thr Pro Leu Thr Phe Glu Arg 1 Leui Tyr Ala Phe Leu Tyr Thr Val1 Leu 145 Leu Thr Val1 Thr Ser 225 Asp Leu Giu Met Gin Tyr Leu Ala 130 Gin Val Tyr Leu Pro 210 His Gly Arg Asn Arg Gly Phe Gin 115 Tyr Leu Val1 Phe Tyr 195 Glu Ser Leu Ile Leu Pro Tyr Phe 100 Ala Ala Al a Asn Arg 180 His Arg Trp 5 Lys Leu Ser Leu Giu Ile Asn Lys Ala Thr 165 Ala Lys Arg His Lys I le 
Tyr Asn 70 Val Asp Met Asn Asn 150 Trp Ile Val1 Ser Phe 230 Glu Leu Met 55 Thr Asn Tyr Phe Pro 135 Thr Asp Gly Asp Leu 215 Arg Ser 40 Ser Asn Pro Gly Thr 120 Ile Trp Ser Lys Val1 200 Phe Gin 25 Met Ser Lys Lys Asn 105 Tyr Asn Ile Leu Phe 185 Giu Glu 10 Gly Ser Ser Leu Asn 90 Vai Gly Arg Leu Lys 170 Gly Ile Arg Phe Phe Tyr Leu 75 Asp Leu Val1 Trp Asn 155 Asp Val1 Gly Ser Tyr Leu Gin Gin Trp Phe Gly Ala 140 Asn Phe Gin Met Phe 220 Asp Lys Asn Ile Gly Ala Asn Gly 125 Phe Lys Asn Phe Lys 205 Leu Lys Gin Ile Gly Ala Tyr Asn 110 Asp Phe Val Phe Arg 190 Ile Phe Gly Val Leu Thr Ser Ser Asp Phe Phe Lys His 175 Thr Phe Phe Ser His Asn Val Ile Arg Ser Met Gly Asp 160 Asn Ile Leu Val1 INFORMATION FOR SEQ ID NO:566: Ci) SEQUENCE CHARACTERISTICS: LENGTH: 209 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 WO 9737044PCTIJS97/05223 506 (Vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1 .209 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:566: Gly Ile Cys Leu Lys Phe Gin Ile Val Ser Leu Leu 1 Leu Met Vai Pro Sen Vai Ser Tyr Aia 145 Leu Asp Asn Arg Aia Tyr Asp Lys Asn Gin Val Thr 130 Phe Sen Asn Leu Ser Ile Arg Ile Arg Ser Giu 115 Lys Giu Leu Asn Lys 195 Cys Ala Gly Lys Phe Phe 100 Thr Asn Val1 Ile Glu 180 Leu 5 Leu His Val1 Leu Met Lys Gin Gly Lys Lys 165 Lys Lys Pro Gin Val1 Lys 70 Leu Leu Lys Tyr Pro i50 Thr Giy Asn Pro Giy Ala 55 Asp Leu Glu Gly Asp 135 Asp Pro Ala Ala Lys Gin 40 Lys Pro Trp Pro Ile 120 Phe Ser Arg Asn Ser 200 Gly 25 Ser His Lys Lys Gly 105 Leu Lys Lys Gly Thr 185 Phe 10 His Vai Asn Gly Asn 90 Phe Gin Asn Ile Phe 170 Pro Lys His Ang Giu Pro 75 Arg Tyr Ser Asn Val 155 Leu Trp Asp Ser Thr Thr Leu Tyr Tyr Ala Arg 140 Leu Gly Ile Al a Leu Gly Tyr Leu Phe Thr Leu Pro 125 Pro Pro Val1 Giu Trp 205 Ala Leu Trp Lys Met Leu Asp 110 Gly Phe Ser Phe Gly 190 Glu Phe Val1 Arg Giu Leu Ala Ser Tyr Phe Val1 Leu 175 Sen Leu Leu Asn Lys Asn Gly Lys Phe Ser Leu Glu 160 Phe Leu Giu 
INFORMATION FOR SEQ ID NO:567: SEQUENCE CHARACTERISTICS: LENGTH: 170 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:567: Lys Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Lys Gln Gly Thr Gln Lys 1 5 10 Asn Pro His Ser Tyr Val Gly Ala Pro Ala Arg Leu Gly Val Asp Phe 25 Ser Tyr Ser Asn Gly Trp Ser Phe Gly Ile Gly Ala Ile Gly Ala Trp 40 Asn Ile Tyr Asn Lys Gln Arg Leu Ala Asn Leu Tyr Ile Ser Leu Gly 55 Asn Phe Phe Gly Asn Pro Asn Asn Val Lys Pro Tyr Leu Ser Ala Gly 70 75 Asp Val Ser Asp Ala Tyr Leu Gln Tyr Ala Asn Gln Arg Phe Lys Ile 90 Ala Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp Ile Gly 100 105 110 Gly Asn Ile Gln Gly Val Ser Val Ala Phe Lys Gln Asn Ser Met Arg 115 120 125 Tyr Phe Gly Ile Phe Met Asp Ser Met Leu Tyr Asn Gly His Gln Ile 130 135 140 Asn Lys Glu Gln Gly Asn Arg Ile Ala Thr Phe Leu Asn Ala Leu Ala 145 150 155 160 Leu Met Ile Leu Cys Leu Asn Ala Cys Met 165 170 INFORMATION FOR SEQ ID NO:568: SEQUENCE CHARACTERISTICS: LENGTH: 195 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...195 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:568: Leu Gly Ser Pro Met Gln Phe Gln Lys Thr Leu Ser Ser Leu Ser Leu 1 5 10 Phe Leu Ser Leu Ser Leu Phe Leu Ser Phe Ser Ile Ala Glu Glu Asn 25 Gly Ala Tyr Ala Ser Val Gly Phe Glu Tyr Ser Ile Ser His Ala Val 40 Glu His Asn Asn Pro Phe Leu Asn Gln Glu Arg Ile Gln Thr Ile Ser 55 Asn Ala Gln Asn Gln Ile Tyr Lys Leu Asn Gln Ile Glu Asn Glu Ile 70 75 Thr Asn Met Gln Asn Thr Phe Asn Tyr Thr Asn Asn Ala Leu Lys Asn 90 Asn Ala Lys Leu Thr Pro Thr Glu Met Gln Ala Glu Gln Tyr Tyr Leu 100 105 110 Gln Ser Thr Leu Gln Asn Ile Glu Lys Ile Val Met Leu Ser Gly
Gly 115 120 125 Val Ala Ser Asn Pro Lys Leu Val Gin Ala Leu Glu Lys Met Gin Glu 130 135 140 Pro Ile Thr Asn Pro Leu Glu Leu Val Glu Asn Leu Lys Asn Leu Glu 145 150 155 160 Leu Gin Phe Ser Gin Ser Gin Asn Ser Met Leu Ser Ser Leu Ser Ser 165 170 175 Gin Ile Ala Gin Ile Ser Asn Ser Leu Asn Ala Leu Asp Pro Ser Ser 180 185 190 Tyr Ser Lys 195 INFORMATION FOR SEQ ID NO:569: SEQUENCE CHARACTERISTICS: LENGTH: 72 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...72 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:569: Lys Gly Lys Val Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro 1 5 10 Thr Ile Cys Phe Phe Leu Phe Cys Tyr Ile Leu Ile Phe Leu Ile Phe 25 Phe Met Phe Lys Asn Leu Gin Ser Gin Ser Phe Gly Ser Val Ala Glu 40 Thr Gly Lys Lys Pro Ile Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu 55 Gin Lys Gin Ile Ser Lys Ile His INFORMATION FOR SEQ ID NO:570: SEQUENCE CHARACTERISTICS: LENGTH: 278 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 509 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylor (ix) FEATURE: NAME/KEY: misc-feature (B LOCATION .278 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:570: Leu Met Arg Lys Val Leu Tyr Ala Leu Met Gly Phe PCTIUS97/05223 1 Ser Pro Ser Asp Ala Asp Leu Ile Phe 145 Asn Asn Leu Ile Gin 225 Leu Ile Ala Ala Ala Phe Tyr Thr Lys Leu Asp 130 Thr Tyr Met Lys Ile 210 Lys Trp Phe Leu Leu Asn Phe Phe Leu Val1 Ile 115 Ser Ser Tyr Sen Giu 195 Gly Gly Ala Asn Ser 275 Lys Leu Asp Gin Leu Gly 100 Lys Giu Ser Thr Giu 180 Giu Asp Tyr Cys Asp 260 Lys 5 Ala Asn Lys Gly Phe Phe Pro Gly Lys Asn 165 Asn Lys Asn Lys Sen 245 Lys Ser Asp His Asn Gin 70 Phe Asp Leu Lys His 150 Ala Ala Giu Thr Ala 230 Lys Gin Arg Asp Pro Arg 55 Thr Ser Ala Gin Ile 135 Pro Phe Pro Giu Asn 215 Leu Lys Phe Phe Met 40 Sen Tyr Lys Lys Ile 120 Phe Asn Ile Lys Thr 200 Al a Lys Ser Thr Leu 25 Gin Lys Lys Pro Ile 
105 Gly Sen Leu Lys Asp 185 Lys Met Sen Lys Tyr 265 10 Giu Giu Ala Asp Met Ile Ile 90 Leu Val Phe Gin Pro 170 Ala Giu Lys Sen Leu 250 Phe Leu Sen Arg 75 Sen Giu Asp Tyr Val 155 Gin Gin Lys Ile Gin 235 Ser Lys Asn Asn Leu Asp Ser Ser Val1 140 Phe Lys Lys Giu Ile 220 Ang Leu Phe Leu Asn Al a Thr Arg Phe Asn Asn 125 Phe Ile Giu Asn Glu 205 Lys Lys Met Asp Leu Giu Ile Leu Tyr Val Asp 110 Ile Ser Glu Asn Asn 190 Giu Lys Trp Pro Lys 270 Val1 Thr Gin Asn Ala Leu Arg Sen Thr Asp Gin 175 Lys Thr Asp Tyr Lys 255 Arg Phe Ala Gly Ile Met Gly Ile Val Thr Lys 160 Glu Pro Ile Ile Cys 240 Giu Leu INFORMATION FOR SEQ ID NO:571: SEQUENCE CHARACTERISTICS: LENGTH: 60 amino acids TYPE: amino acid TOPOLOGY: linear MOLECULE TYPE: protein HYPOTHETICAL: YES (ii) (iii) WO 97/37044 PCT/US97/05223 510 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:571: Tyr Arg Glu Met Asp Tyr Ala Val Phe Ser Met Thr Ile His Pro Asp 1 5 10 Val Ser Ala Arg Pro Gin Val Leu Leu Met His Glu Lys Ile Ile Glu 25 His Ile Asn Gin His Glu Gly Val Ala Trp Val Thr Phe Asn Glu Ile 40 Ala Asp Asp Phe Leu Lys Arg Asn Pro Arg Lys Lys 55 INFORMATION FOR SEQ ID NO:572: SEQUENCE CHARACTERISTICS: LENGTH: 78 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...78 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:572: Leu Val Leu Gin Gly Ala Glu Val Leu Ile Tyr Pro Ser Ala Phe Gly 1 5 10 Lys Ala Arg Ala Tyr Asn Trp Asp Leu Leu Ser Lys Ala Arg Ala Leu 25 Glu Asn Gly Cys Phe Val Cys Ala Cys Asn His Ser Gly Glu Glu Thr 40 Asn Ala Lys Leu Lys Gin Thr Leu Glu Phe Ala Gly Asp Ser Arg Asn 55 His Arg Thr Gin Trp Glu Asn His Arg Pro Ser His Gin Ala 70 INFORMATION FOR SEQ ID NO:573: SEQUENCE CHARACTERISTICS: LENGTH: 61 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 
(iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...61 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:573: Asn Leu Pro Val Ile Gln Gly Ile Ile Ala Pro Asn Gly Lys Ile Ile 1 5 10 Ala Gln Ala Thr Lys Leu Asn Glu Val Ile Ile Ala Glu Met Asp Leu 25 Asn Glu Val Ala Leu Gln Arg Gln Lys Ile Pro Tyr Leu Gln Asp Phe 40 Asp Thr Lys Leu Thr Lys Lys Gly Phe Gly Lys Leu Thr 55 INFORMATION FOR SEQ ID NO:574: SEQUENCE CHARACTERISTICS: LENGTH: 162 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...162 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:574: Ile Ile Met Phe Gly Met Gly Phe Phe Glu Ile Leu Val Val Leu Ile 1 5 10 Val Ala Ile Ile Phe Leu Gly Pro Glu Lys Phe Pro Gln Ala Val Val 25 Asp Ile Val Lys Phe Phe Arg Ala Val Lys Lys Thr Leu Asn Asp Ala 40 Lys Asp Thr Leu Asp Lys Glu Ile Asn Ile Glu Glu Ile Lys Lys Glu 55 Thr Leu Glu Tyr Gln Lys Leu Phe Glu Asn Lys Val Glu Ser Leu Lys 70 75 Gly Val Lys Ile Glu Glu Leu Glu Asp Ala Lys Val Thr Ala Glu Asn 90 Glu Ile Lys Ser Ile Gln Asp Leu Met Gln Asp Tyr Lys Arg Ser Leu 100 105 110 Glu Thr Asn Thr Ile Pro Asn His Leu Asn Glu Glu Val Ser Asn Glu 115 120 125 Glu Ala Leu Asn Lys Glu Val Ser Ser Asp Glu Ser Pro Lys Glu Val 130 135 140 Gln Leu Thr Thr Asp Asn Asn Ala Lys Glu His Asp Lys Glu Lys Glu 145 150 155 160 His Val INFORMATION FOR SEQ ID NO:575: SEQUENCE CHARACTERISTICS: LENGTH: 185 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...185 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:575: Arg Tyr Arg Ser Phe Ser Arg Asn Gln Ser Asn Phe Arg Arg Cys Glu 1 5 10 Gln Pro Tyr Gly Ser Thr Leu Asn Ser Val Pro Leu Trp Ile Met Val 25 Val Gly Ala Ala Gly Ile Ala Leu Gly Leu
Ser Leu Tyr Gly Pro Lys 40 Leu Ile Lys Thr Val Gly Ser Glu Ile Thr Glu Leu Asp Lys Met Gln 55 Ala Phe Cys Ile Ala Leu Ser Ala Val Ile Thr Val Leu Leu Ala Ser 70 75 Gln Leu Gly Leu Pro Val Ser Ser Thr His Ile Val Val Gly Ala Val 90 Phe Gly Val Gly Phe Leu Arg Glu Arg Leu Arg Glu Gln Ser Arg Arg 100 105 110 Arg Phe Ala Arg Ile Arg Asp Asn Ile Val Ala Ala His Phe Gly Glu 115 120 125 Asp Leu Glu Glu Ile Glu Gly Phe Leu Glu Arg Phe Asp Lys Ala Asn 130 135 140 Leu Lys Glu Lys Ser Leu Met Leu Glu Ser Leu Lys Lys Ser Lys Asn 145 150 155 160 Thr Ala Ile Ala Leu Glu Leu Lys Lys Lys Asp Lys Lys Ser Leu Lys 165 170 175 Lys Val Tyr Lys Glu Glu Val Ile Lys 180 185 INFORMATION FOR SEQ ID NO:576: SEQUENCE CHARACTERISTICS: LENGTH: 105 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:576: Ala Gln Ile Phe Val Arg Phe Asn Tyr Val Leu Gly Ala Ile Gly Phe 1 5 10 Val Val Leu Leu Tyr Glu Ile Ile Ser Phe Ile Tyr Tyr Lys Arg Ser 25 Leu Val Tyr Leu Ile Leu Gly Val Ala Ile Gly Ala Leu Cys Leu Leu 40 Phe Val Phe Tyr Tyr Thr Pro Tyr Ile Leu Asn Ala Gln Lys Val Gly 55 Glu Val Ala Leu Gln Ser Ala Glu Phe Ala Arg Ser His Ala Gln Ser 70 75 Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys Ala Leu Phe Phe 90 Trp Arg Leu Phe Gly Lys Asn Ala Leu 100 105 INFORMATION FOR SEQ ID NO:577: SEQUENCE CHARACTERISTICS: LENGTH: 116 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...116 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:577: Lys Ala Cys Lys Met Lys Cys Ser Ser Phe Thr Ser Asn Ser Val Leu 1 5 10 Asn Phe Phe Val Val Leu Ser Phe Ile Thr Ile Gly Leu Val Phe Phe 25 Phe Leu Arg Ser Gln Pro Thr Ser Val Val Ser Lys Glu Asn Ile Pro 40 Lys Ile Glu Leu Glu
Asn Phe Lys Ala Phe Gin Ile Asn Asp Lys Ile WO 97/37044 PCT/US97/05223 514 55 Leu Asp Leu Ser Ile Glu Gly Lys Lys Ala Leu Gin Tyr Asp Asp His 70 75 Glu Ile Phe Phe Asp Ser Lys Ile Lys Arg Tyr Asp Glu Asp Thr Ile 90 Glu Ser Val Glu Ser Pro Glu Ala Lys Arg Gin Gin Asp Leu Tyr Phe 100 105 110 Phe Pro Asn Gly 115 INFORMATION FOR SEQ ID NO:578: SEQUENCE CHARACTERISTICS: LENGTH: 105 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:578: Asn Gin Thr Lys Pro Glu Arg Lys Glu Lys Ile Pro His Phe Gin Arg 1 5 10 Arg Val Leu Met Arg Trp Trp Cys Phe Leu Val Cys Cys Leu Ser Val 25 Leu Ser Val Met Asp Ala Lys Lys Leu Glu Asn Lys Gly Leu Lys Lys 40 Glu Arg Glu Leu Leu Glu Ile Thr Gly Asn Gin Phe Val Ala Asn Asp 55 Lys Thr Lys Thr Ala Val Ile Gin Gly Asn Val Gin Ile Lys Lys Gly 70 75 Lys Asp Arg Leu Phe Ala Asp Lys Val Ser Val Phe Leu Asn Asp Lys 90 Arg Lys Pro Glu Arg Tyr Glu Ala Ala 100 105 INFORMATION FOR SEQ ID NO:579: SEQUENCE CHARACTERISTICS: LENGTH: 319 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: WO 97/37044 PCT/US97/05223 515 ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...319 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:579: Lys Glu Ser Asn Ser Met Asp Leu Asp Lys Leu 1 Leu Thr Asn Ile Asn Arg Leu Lys Met 145 Lys Arg Glu Thr Leu 225 Phe Ser Lys Val Tyr 305 (2) Arg Glu Gly Leu Ala Val Asn Leu 130 Ile Thr Ile Asn Tyr 210 Met Gly Val Ser Asp 290 Ile Asn Leu Val Gly Asn Ser Ile 115 Ser Asp Ser Val His 195 Glu Val Asn His Gly 275 Val Gin Ala Ile Arg Phe Tyr Met 100 Arg Ser Gly Phe Ser 180 Lys Asn Gly Thr Gly 260 Leu Val Glu 5 Ile Met Glu Gly Pro Asn Val Ile Lys Leu 165 Val Ser Ala Glu Gly 245 Val Asp Val Ile Leu Asn Lys Glu 70 Thr His Pro Cys Asn 150 Asn Glu Leu Leu Ile 230 His Ile Val Gin Leu 
310 Arg Lys Val 55 Gin Leu Phe Ser Gin 135 Leu Ala Asp Leu Asn 215 Asp Lys Glu Asn Ile 295 Pro Leu Glu 40 Phe Leu Asn Ala Asp 120 Tyr Leu Leu Ser Val 200 Met Thr Gly Ala Val 280 Val Ala Leu 25 Lys Asp Ala Thr Ile 105 Lys Asp Ile Ile Glu 185 Asp Ala Arg Met Ile 265 Ala Leu Lys 10 Pro Glu Glu Ser Ser 90 Ser Lys Tyr Ser Glu 170 Glu Lys Met Asn Val 250 Ala Lys Asp Asp Tyr Ile Asn Phe 75 Ile Ala Phe Glu Gly 155 Phe Leu Thr Arg Ser 235 Ser Leu Lys Lys Leu 315 Lys Leu Trp Leu Arg Pro Asp Asp Tyr 140 Gly Ile Asp Glu Met 220 Met Thr Asn Phe Ala 300 Arg Asp Asp Leu Asp Asp Thr Asn Leu 125 Leu Thr Pro Leu Ser 205 Ser Leu Leu Leu Phe 285 Thr Asp Tyr Ser Tyr Lys Leu Ser Glu 110 Lys Lys Gly Lys Arg 190 Ser Pro Phe His Gin 270 Lys Asn Ser Arg Gly Lys Ala Phe Arg Leu Ala Asn Ser His 175 Ala Lys Asp Leu Ala 255 Met Ser Thr Leu Ala Ile Leu Phe Phe Tyr Ser Phe Leu Gly 160 Thr Phe Phe Arg Arg 240 Asp Asn Ser Arg INFORMATION FOR SEQ ID NO:580: SEQUENCE CHARACTERISTICS: LENGTH: 87 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 516 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...87 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:580: Gly Val Lys Ala Met Asn Tyr Asp Val Leu Met Gly Phe Leu Ala Leu 1 5 10 Ile Leu Leu Ile Leu Trp Tyr Ala Tyr Gly Leu Arg Gin Tyr Leu Lys 25 Leu Lys Asp Lys Asn Lys Arg Leu Lys Glu Lys Leu Gin Arg Cys Asn 40 Cys Asn Ile Lys Ile Pro Ser Ile Leu Glu Met Ala His Lys Pro Ile 55 Ile Met Asp Ile Lys Gly Glu Leu Leu Pro His Leu Thr Glu Ser Tyr 70 75 Arg Lys Ser Lys Phe Lys Glu INFORMATION FOR SEQ ID NO:581: SEQUENCE CHARACTERISTICS: LENGTH: 95 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...95 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:581: Leu Ser Arg Asn Gly Val Val Ser Ser Val Ser Ser Asp Gly Ser Lys 1 5 
10 Ile Leu Met Ser Leu Ala Pro Asp Gly Gln Pro Asp Val Tyr Leu Tyr 25 Asp Thr His Lys Lys Thr Lys Thr Lys Ile Thr Arg Tyr Pro Gly Ile 40 Asp Val Ser Gly Val Phe Leu Glu Asp Asp Lys Ser Met Ala Phe Val 55 Ser Asp Arg Ser Gly Tyr Pro Asn Ile Tyr Met Lys Lys Leu Gly Leu 70 75 Lys Glu Arg Arg Ser Asn Ser Phe Met Lys Glu Glu Ala Met Asn 90 INFORMATION FOR SEQ ID NO:582: SEQUENCE CHARACTERISTICS: LENGTH: 146 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...146 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:582: Gly Asp Lys Met Leu Lys Lys Leu Leu Leu Ile Ser Leu Phe Leu Gly 1 5 10 Phe Leu Arg Ala Glu Gly Glu His Tyr Glu Ile Ile Ala Glu Leu Ser 25 Lys Ala Phe Leu Lys Ala Lys Glu Val Leu Thr Ala Ile Asn Lys Thr 40 Tyr Lys Thr Cys Ile Glu Thr Gly His Asp Arg Thr Gln Ile Arg Leu 55 Gln Asn Asp Phe Leu Glu Asn Leu Ser Gln Thr Glu Gln Gln Phe Asp 70 75 Asp Tyr Phe Glu Lys Asp Phe Lys Ser Val Gly Val Leu Lys Thr Leu 90 Leu Lys Asp Ile Gln Ser Leu Glu Lys Thr Ser Asn Lys Leu Val Cys 100 105 110 Val Ala Pro Lys Asn Ala Lys Asn Phe Glu Ile Leu Glu Gly Ala Ile 115 120 125 Thr Gln Ile Ile Gly Leu Glu Glu Gln Met Asn Gln Phe Ile Asn Gly 130 135 140 Ala Lys 145 INFORMATION FOR SEQ ID NO:583: SEQUENCE CHARACTERISTICS: LENGTH: 115 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...115 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:583: Ile Gln Arg Leu Lys Glu Met Leu Arg Asn Gln Phe Arg Ile Val Phe 1 5 10 Val Thr Cys Ile Val Ala Ser Ser Leu Gln Ala Gln Glu Asn Thr Pro 25 Thr Leu Gly Lys Val Thr Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn 40 His Gln Met Tyr Thr Asp Arg Lys Glu Leu Gln Gln Arg Gln Ser Asn 55 Gln Thr Arg Asp Ile Phe Arg Thr Arg Ala Asp Val
Asn Ala Ala Ser 70 75 Gly Gly Leu Met Ala Gln Lys Ile Tyr Ala Arg Gly Ile Glu Ser Arg 90 Leu Leu Arg Val Thr Ile Asp Gly Val Ala Gln Asn Gly Asn Ile Phe 100 105 110 His His Asp 115 INFORMATION FOR SEQ ID NO:584: SEQUENCE CHARACTERISTICS: LENGTH: 264 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:584: Ala Val Ser Trp Ile Met Ala Cys Trp His Lys Arg Leu Ala Val Gly 1 5 10 Cys Cys Ile Val Leu Leu Ser Cys Val Met Ser Ala Asn Asn Val Ser 25 Ile Val Arg Asp Asp Pro Pro Leu Asp Pro Thr Leu Pro Ala Trp Ile 40 Tyr Ser Val Ala Leu Leu Lys Val Tyr Phe Ser Asp Gly Thr Tyr Lys 55 Glu Gly Tyr Ala Thr Leu Leu Glu Asn Gly Arg Tyr Ile Ala Ser Ser 70 75 Glu Thr Leu Tyr Ser Asn Gly Leu Tyr Pro Lys Met Ile Leu Ala Lys 90 Met Gln Asp Ser Ser Ala Lys Glu Leu Ile Cys Ile Ala Ser Leu His 100 105 110 Leu Glu Ala Met Asp Arg Asp Gln Gly Leu Ser Leu Leu Lys Thr Ala 115 120 125 Asp Phe Arg Asp Asp Tyr Cys His Lys Arg Glu Glu Ser Tyr Tyr His 130 135 140 Ala Arg Ile Tyr Ala Lys Tyr Ala Gln Thr Phe His Ser Asn Pro Tyr 145 150 155 160 Thr Asn Gln Lys Thr Pro Thr Ser Asp Leu Tyr Tyr Pro Ala Leu Asn 165 170 175 Glu Gly Asn Ser Phe Ser Ile Gln Thr Thr Asp Ile Ser Val Ala Glu 180 185 190 Leu Leu Lys Ser Lys Lys Phe Leu Ser Leu Asp Ala Ser Phe Lys Lys 195 200 205 Gly Ser Val Leu Trp Gly Gly Arg Pro Tyr Phe Ser Glu Val Gly Glu 210 215 220 Phe Met Gly Met Thr Ser Ser Thr Leu Glu Asn Gln Glu Ser Leu Val 225 230 235 240 Ile Ile Pro Lys Glu Lys Ile Ala Arg Phe Leu Ser Ala Leu Lys Asn 245 250 255 Gln Asn Ile Phe Pro Asn Ile Pro 260 INFORMATION FOR SEQ ID NO:585: SEQUENCE CHARACTERISTICS: LENGTH: 147 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature
LOCATION 1...147 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:585: Leu Gly Gly Leu Asp Met Tyr Lys Leu Gly Ile Phe Leu Leu Ala Thr 1 5 10 Leu Leu Ser Ala Asn Thr Gin Lys Val Ser Asp Ile Ala Lys Asp Ile 25 Gin His Lys Glu Thr Leu Leu Lys Lys Thr His Glu Glu Lys Asn Gin 40 Leu Asn Ser Arg Leu Ser Ser Leu Gly Glu Ala Ile Arg Ser Lys Glu 55 Leu Gin Lys Val Glu Ile Glu Arg Gin Met Val Ala Leu Lys Lys Ser 70 75 Leu Glu Lys Asn Arg Asn Glu Ser Leu Val Gin Glu Lys Val Leu Thr 90 Asn Tyr Arg Lys Ser Leu Asp His Leu Gin Lys Gin Arg Ser Phe Leu 100 105 110 Gin Lys Arg Val Phe Asp Thr Leu Leu Glu Asp Phe Leu Phe Ser Gin 115 120 125 Ala Leu Lys Gly Gin Asn Leu Ala Ser Ser Asn Asp Val Ile Leu Leu WO 97/37044 PCTIUS97/05223 520 130 135 140 Ser Gly Val 145 INFORMATION FOR SEQ ID NO:586: SEQUENCE CHARACTERISTICS: LENGTH: 523 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...523 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:586: Tyr Ile Pro Phe Asn Leu Phe Gin Gly Gin Thr Asn Met Lys Lys Leu 1 5 10 Leu Tyr Thr Met Leu Ala Leu Leu Leu Ile Gly Leu Leu Thr Ala Tyr 25 Leu Ile Leu Phe Thr Glu Trp Gly Asn Lys Ile Ile Ala Ser Tyr Ile 40 Glu Lys Lys Ile Asn Pro Asn Glu Arg Tyr Leu Ser Val Lys Thr Phe 55 Lys Leu Arg Phe Asn Ser Leu Asp Phe Lys Ala Gin Ala Asn Asp Asp 70 75 Ser Thr Leu Ile Leu Lys Gly Asp Phe Ser Leu Leu Thr Gin Ser Val 90 Asn Leu Asp Tyr His Ile Asp Ile Lys Asn Leu Arg Ser Phe Lys Asp 100 105 110 Leu Ile Pro Tyr Pro Leu Arg Gly Ala Ile Val Thr Ser Gly Asn Ile 115 120 125 Lys Gly His Arg Lys Ala Leu Val Val Gin Gly Val Ser Asn Val Ala 130 135 140 Gin Ser His Thr Ala Tyr Asn Ala Leu Leu Asp Asp Phe Lys Leu Ser 145 150 155 160 His Leu Ser Leu Asn Ala Lys Asp Ala Asn Leu Glu Asp Leu Leu Tyr 165 170 175 Leu Ile Asn Arg Pro Ala Tyr Ala Asn Ala Lys Val Ser Leu Gin Ala 180 185 190 Asp Phe Asn Ser Leu Lys Pro Leu Glu Gly His Leu Ile Leu Thr Ala 195 200 
205 Asn Asn Ala Leu Ile Asn Asn Ala Leu Ile Asn Gin Ile Phe His Leu 210 215 220 Asn Leu Lys Asp Thr Leu Val Phe Asn Leu Ser His Ser Ser Asp Phe 225 230 235 240 Lys Gly Asn Lys Ala Ile Ser Asp Thr Thr Leu Thr Ser Pro Leu Val 245 250 255 Asn Phe Thr Ala Leu Lys Ser Glu Tyr Ser Phe Pro Ala Leu Lys Leu WO 97/37044 PCTUS9705223 Asn Ala Pro Tyr Thr Leu Glu Ile Pro His Leu Ala Lys Leu Gin Asn Ile Glu 305 Asp Phe Lys Ser Lys 385 Thr Gin Lys Leu Gly 465 Ile Thr Glu Thr 290 Gin Gly Ser Phe Lys 370 Asn Lys Arg Ile Met 450 Ser Gin Leu Lys 275 Asn Ser Ala Asn Phe 355 Gin Ala Glu Leu His 435 Asp Met Gin Lys Leu His Pro Leu Ile 340 Gin Gly Phe Ile Leu 420 Asn Ala His Asn Lys 500 Glu Pro Lys Asp 325 Ser Ser Val Ser Tyr 405 Ser Gly Glu Gin Leu 485 Gly Lys Leu Leu 310 Phe Thr Ile Leu Asp 390 His Asp Leu Ile Pro 470 Gin Leu Gly Lys 295 Leu Thr Ser Ala Lys 375 Phe Asp Leu Leu Leu 455 Lys Gin Asp Leu 280 Gly Ser Lys Val Leu Leu Lys Ala 345 Asp Ala 360 Ala Asn Leu Tyr Ala Asn Ser Leu 425 Asp Leu 440 Lys Phe Phe Ser Gly Leu His Leu 505 Lys Gly 520 Leu Ser Asn 330 Leu Asn Leu Ser Leu 410 Lys Asn Ile Leu Lys 490 Leu Leu Thr Gly 315 Lys Asp Leu Lys Ile 395 Val Ser Thr Phe Ile 475 Glu Lys Phe Leu 300 His Asp Leu Asp Asn 380 Ser Ser Pro Lys Lys 460 Leu Ile Asp 285 Lys Ser Leu Phe Tyr 365 Ala Arg Gin Lys Gin 445 Met Asn Leu Asp Gly Asn Lys His 350 Asp Arg Phe Ile Thr 430 Met Lys Glu Lys Lys 510 Asp Ile Leu Leu 320 Ala Arg 335 Tyr Pro Leu Ile Phe Leu Asp Ile 400 Asn Gin 415 Gin Leu Asp Ile Leu Gin Lys Ala 480 Asn Asp 495 Leu Lys INFORMATION FOR SEQ ID NO:587: SEQUENCE CHARACTERISTICS: LENGTH: 222 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:587: Ala Phe Leu Tyr Ile His Ser Ile Ala Leu Ala Arg Val Ile Leu Gin 1 5 10 Leu Lys Pro Leu Leu Thr Ile His Leu Asn Ala Lys Ser Gin Ser Ser WO 97/37044 
PCT[US97/05223 522 25 Leu Lys Asn Lys Ile Thr Leu Lys Asn Lys Leu Asn His Ala Arg Ile 40 Ile Leu Glu Phe Ile Pro Ser Leu Ile Tyr Phe Leu Ile Gin Lys Val 55 Ser Val Leu Lys His Leu Ala Pro Leu Ile His Ile Pro Phe Lys Ala 70 75 Leu Trp Leu Gly Thr Ala Leu Ser Met Phe Leu Ser Leu Asn Leu Asn 90 Ala Glu Glu Asn Pro Thr Lys Thr Glu Pro Lys Pro Ala Lys Gly Val 100 105 110 Lys Asn Lys Pro Lys Ser Pro Val Thr Asn Val Met Met Thr Asn Cys 115 120 125 Asp Asn Leu Lys Asp Phe Asn Ala Lys Gin Lys Glu Val Leu Lys Ala 130 135 140 Ala Tyr Gin Phe Gly Ser Thr Glu Asn Leu Gly Tyr Glu Met Ala Gly 145 150 155 160 Ile Ala Trp Lys Glu Ser Cys Ala Gly Thr Tyr Lys Ile Asn Phe Ser 165 170 175 Asn Pro Ser Ala Gly Ile Tyr His Ala Tyr Ile Pro Ser Val Leu Lys 180 185 190 Ser Tyr Gly His Asn Asn Ser Pro Phe Leu Arg Asn Val Met Gly Arg 195 200 205 Ile Ala Thr Leu Lys Thr Met Arg Leu Leu Leu Lys Trp Pro 210 215 220 INFORMATION FOR SEQ ID NO:588: SEQUENCE CHARACTERISTICS: LENGTH: 299 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...299 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:588: Leu Val Ile Ile Ser Leu Leu Thr Thr Leu Lys Leu Lys Ser Ile Lys 1 5 10 Glu Ile Ser Ile Lys Lys Phe Ile Leu Ser Ser Leu Val Phe Ala Cys 25 Ile Asn Thr Gly Val Glu Ala Leu Glu Asn Asp Gly Ser Lys Pro Asn 40 Asp Leu Ala Ser Pro Lys Glu Thr Pro Lys Glu Ala Gin Lys Asn Glu 55 Ala Gin Asn Glu Thr Ser Gin Ser Asn Gin Thr Pro Lys Glu Met Lys 70 75 Val Lys Ser Ile Ser Tyr Val Gly Leu Ser Tyr Met Ser Asp Met Leu WO 97/37044 PCT/US97/05223 523 90 Ala Asn Glu Ile Ala Lys Ile Arg Val Gly Asp Met Val Asp Ser Lys 100 105 110 Lys Ile Asp Thr Ala Val Leu Ala Leu Phe Asn Gln Gly Tyr Phe Lys 115 120 125 Asp Val Tyr Ala Thr Phe Glu Asn Gly Ile Leu Glu Phe His Phe Asp 130 135 140 Glu Lys Ala Arg Ile Ala Gly Val Glu Ile Lys Gly Tyr Gly Thr Glu 145 150 155 160 Lys Glu Lys Asp Gly Leu Lys Ser Gin Met 
Gly Ile Lys Lys Gly Asp 165 170 175 Thr Phe Asp Glu Gin Lys Leu Glu His Ala Lys Thr Ala Leu Lys Thr 180 185 190 Ala Leu Glu Gly Gin Gly Tyr Tyr Gly Ser Val Val Glu Val Arg Thr 195 200 205 Glu Lys Val Ser Glu Gly Ala Leu Leu Ile Val Phe Asp Val Asn Arg 210 215 220 Gly Asp Ser Ile Tyr Ile Lys Gin Ser Ile Tyr Glu Gly Ser Asp Lys 225 230 235 240 Leu Lys Arg Arg Val Ile Glu Ser Leu Ser Ala Asn Lys Gin Arg Asp 245 250 255 Phe Met Gly Trp Met Trp Gly Leu Asn Asp Gly Lys Leu Arg Leu Asp 260 265 270 Gin Leu Glu Tyr Asp Ser Leu Arg Ile Gin Asp Val Tyr Met Arg Arg 275 280 285 Gly Tyr Leu Asp Ala His Ile Ser Ser Pro Phe 290 295 INFORMATION FOR SEQ ID NO:589: SEQUENCE CHARACTERISTICS: LENGTH: 288 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...288 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:589: Glu Phe Gly Val Lys Arg Ile Leu Phe Phe Leu Ala Ala Thr Thr Phe 1 5 10 Leu Leu Arg Ala Glu Thr Ala Ser Ala Thr Ile Asn Thr Thr Val Asp 25 Pro Asn Val Met Phe Ser Glu Ser Ser Thr Gly Asn Val Lys Lys Asp 40 Arg Lys Arg Val Leu Lys Ser Met Val Asp Leu Glu Lys Glu Arg Val 55 Lys Asn Phe Asn Gin Tyr Ser Glu Thr Lys Met Ser Lys Gly Asp Leu WO 97/37044 PCT/US97/05223 524 70 75 Ser Ala Phe Gly Ala Phe Phe Lys Gly Ser Leu Glu Asp Cys Val Glu 90 Gin Lys Ile Cys Tyr Tyr Glu His Arg Asn Gly Lys Val Ser Phe Val 100 105 110 Val Asn Asp Arg Glu Lys Phe Tyr Lys His Val Leu Lys Asp Leu Gly 115 120 125 Thr Glu Leu Ser Leu Pro Leu Phe Asn Trp Leu Tyr Lys Gly Ser Asp 130 135 140 Phe Gly Ala Leu His Glu Gin Phe Gly Asp Met Tyr Asp Gly Tyr Ile 145 150 155 160 Lys Tyr Leu Ile Ser Met Val Arg Val Ser Gin Lys Glu Lys Ala Arg 165 170 175 Lys Val Asp Ala Ile Val Leu Lys Lys Met Glu Glu Gin Ala Glu Lys 180 185 190 Asp Thr Lys Ala Ala Phe Gin Lys Arg Ser Ser Gly Glu Leu Glu Ser 195 200 205 His Thr Asp Ser Pro Glu Phe Ile Ser Ser Ser Lys Thr Gin Asn Ser 210 215 220 Ser Asn 
Pro Asp Leu Asp Pro Met Thr Asn Ala Asn Thr Leu Lys Glu 225 230 235 240 Thr Ala Ser Lys Glu Pro Glu Thr Ser Ser Lys Lys Glu Lys Lys Pro 245 250 255 Lys Lys Lys Arg Arg Leu Ser Lys Lys Glu Lys Gin Gin Gin Ala Leu 260 265 270 Gin Gin Glu Phe Glu Lys Gin Ile Ser Asp Ser Ser Lys Ser Glu Lys 275 280 285 INFORMATION FOR SEQ ID NO:590: SEQUENCE CHARACTERISTICS: LENGTH: 407 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...407 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:590: Ser Arg Thr Leu Cys Ala Lys Ile Val Leu Gin Lys Glu Arg Lys Lys 1 5 10 Met Glu Ile Gin Gin Thr His Arg Lys Ile Asn Arg Pro Leu Val Ser 25 Leu Val Leu Ala Gly Ala Leu Ile Ser Ala Ile Pro Gin Glu Ser His 40 Ala Ala Phe Phe Thr Thr Val Ile Ile Pro Ala Ile Val Gly Gly Ile 55 Ala Thr Gly Thr Ala Val Gly Thr Val Ser Gly Leu Leu Ser Trp Gly WO 97/37044 PCT/US97/05223 525 70 75 Leu Lys Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys 90 Val Trp Arg Ile Gin Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys 100 105 110 Glu Tyr Asp Leu Tyr Lys Ser Leu Leu Ser Ser Lys Ile Asp Gly Gly 115 120 125 Trp Asp Trp Gly Asn Ala Ala Arg His Tyr Trp Val Lys Gly Gly Gin 130 135 140 Trp Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Lys 145 150 155 160 Leu Ser Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met 165 170 175 Gin Lys Ala Thr Leu Arg Leu Gly Gin Phe Asn Gly Asn Ser Phe Thr 180 185 190 Ser Tyr Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asn Phe Asn Ala 195 200 205 Lys Asn Ile Ser Ile Asp Asn Phe Val Glu Ile Asn Asn Arg Val Gly 210 215 220 Ser Gly Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gin Ala 225 230 235 240 Ser Glu Gly Ile Thr Ser Ser Lys Asn Ala Glu Ile Ser Leu Tyr Asp 245 250 255 Gly Ala Thr Leu Asn Leu Ala Ser Asn Ser Val Lys Leu Asn Gly Asn 260 265 270 Val Trp Met Gly Arg Leu Gin Tyr Val Gly Ala Tyr Leu Ala Pro Ser 275 280 285 Tyr Ser Thr Ile Asn Thr 
Ser Lys Val Gin Gly Glu Val Asp Phe Asn 290 295 300 His Leu Thr Val Gly Asp Gin Asn Ala Ala Gin Ala Gly Ile Ile Ala 305 310 315 320 Ser Asn Lys Thr His Ile Gly Thr Leu Asp Leu Trp Gin Ser Ala Gly 325 330 335 Leu Asn Ile Ile Ala Pro Pro Glu Gly Gly Tyr Lys Asp Lys Pro Asn 340 345 350 Ser Thr Thr Ser Gin Ser Gly Thr Lys Asn Asp Lys Lys Glu Ile Ser 355 360 365 Gin Asn Asn Asn Ser Asn Thr Glu Val Ile Asn Pro Pro Asn Asn Thr 370 375 380 Gin Lys Thr Glu Thr Glu Pro Thr Lys Ser Leu Met Gly Leu Leu Leu 385 390 395 400 Lys Ala Lys Thr Arg Leu Ser 405 INFORMATION FOR SEQ ID NO:591: SEQUENCE CHARACTERISTICS: LENGTH: 163 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 526 (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...163 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:591: Gin Ile Phe Cys Ser Thr Gin Ser Ser Lys Leu Pro Leu Lys Lys Ala 1 5 10 Pro Lys Ala Asp Lys Ser Pro Leu Leu Ile Leu Val Ser Glu Tyr Trp 25 Leu Lys Phe Phe Thr Arg Ser Phe Ser Lys Ser Thr Met Leu Phe Lys 40 Thr Leu Leu Arg Ser Phe Phe Thr Phe Pro Val Glu Leu Ser Glu Asn 55 Ile Thr Leu Gly Ser Thr Val Val Leu Ile Val Ala Glu Ala Val Ser 70 75 Ala Leu Asn Lys Lys Val Val Ala Ala Lys Lys Asn Lys Ile Arg Phe 90 Thr Pro Asn Ser Tyr Ile His Asn Arg Asn Lys Asn Arg Arg Tyr Ser 100 105 110 Ser Leu Ser Pro Leu Leu Lys Ser Ser Ser Ile Cys Lys Asn Pro Pro 115 120 125 Arg Ile Gin Ala Ile Leu Ile Ile Leu Lys Tyr Arg Leu Ser Lys Gly 130 135 140 Phe Asn Pro Val Val Val Ile Leu Arg Ile Gin Lys Met Arg Thr Leu 145 150 155 160 Gin Lys Leu INFORMATION FOR SEQ ID NO:592: SEQUENCE CHARACTERISTICS: LENGTH: 201 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...201 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:592: Gly Leu Ile Thr Met Asp Lys Asn Asn Asn Asn Asn Leu Arg Leu 
Ile 1 5 10 Leu Ala Ile Ala Leu Ser Phe Leu Leu Ile Ala Leu Asn Ser Tyr Phe 25 Phe Gin Glu Pro Asn Lys Thr Thr Thr Glu Thr Thr Lys Gin Glu Thr 40 Thr Asn Asn His Thr Ala Thr Ser Pro Thr Ala Ser Asn Thr Ile Thr WO 97/37044 PCT[US97/05223 527 55 Gin Asp Phe Ser Val Thr Gin Thr Ile Pro Gin Glu Ser Leu Leu Ser 70 75 Thr Ile Ser Phe Glu His Ala Lys Ile Glu Ile Asp Ser Leu Gly Arg 90 Ile Lys Gin Val Tyr Leu Lys Asp Lys Lys Tyr Leu Thr Pro Lys Glu 100 105 110 Lys Gly Phe Leu Glu His Val Ser His Leu Phe Ser Ser Lys Glu Asn 115 120 125 Ser Gin Pro Ser Leu Lys Glu Leu Pro Leu Leu Ala Ala Asp Lys Leu 130 135 140 Lys Pro Leu Glu Val Arg Phe Leu Asp Pro Thr Leu Asn Asn Lys Ala 145 150 155 160 Phe Asn Thr Pro Tyr Ser Ala Ser Lys Thr Thr Leu Gly Pro Asn Glu 165 170 175 Gin Leu Val Leu Thr Gin Asp Leu Gly Thr Leu Ser Ile Ile Lys Thr 180 185 190 Leu Thr Phe Tyr Asp Asp Leu His Tyr 195 200 INFORMATION FOR SEQ ID NO:593: SEQUENCE CHARACTERISTICS: LENGTH: 68 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:593: Gly Val Ser Ile Ile Met Val Leu Lys Thr Lys Leu Lys Ile Ile Ser 1 5 10 Ser Val Ile Leu Ser Thr Leu Leu Trp Val Gly Cys Ser Ser Glu Met 25 Ala Thr Tyr Gin Asn Val Asn Asp Ala Thr Lys Asn Thr Thr Ala Ser 40 Ile Asn Ser Thr Asp Leu Leu Leu Thr Ala Asn Ala Met Leu Asp Ser 55 Met Leu Gly Thr INFORMATION FOR SEQ ID NO:594: SEQUENCE CHARACTERISTICS: LENGTH: 176 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 PCT/US97/05223 528 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...176 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:594: Ala Met Lys Asn Gin Val Lys Lys Ile Leu Gly Met 1 Ala Lys Val Val Tyr Asn Thr Lys Ala 145 Lys Met Ser Val Phe Ser Leu Val Ile 130 Arg Gin Val Asn Gly Leu Thr Lys Asp 
115 Ser Tyr Ile Ile Lys Asp Gly Asn Ser 100 Ala Gin Val Val 5 Val Ala Leu Arg Gin Thr Ser Leu Gly Asp Gly Tyr Glu Ala 70 Ala Leu Gly Val Lys 150 Lys Cys Lys Lys 55 Glu Thr Gin Lys Asp 135 Asp Val Ser Glu 40 Val Asp Ala Lys Arg 120 Lys Arg Arg His 25 Ala Ala Leu Lys Asp 105 Ser Glu Val Glu 10 Ala Thr Lys Ile Ala 90 Leu Ile Leu Phe Glu 170 Pro Lys Tyr Thr 75 Arg Glu Ser Ile Val 155 Leu Lys Gly Glu Asn Ala Asn Gly Thr 140 Leu Gly Ser Ser Ala Lys Asn Asn Glu Thr 125 Ser Val Met Val Val Gly Ile Pro Asp Tyr Ser Asp Val Leu Ala Lys Thr 110 Asp Thr Lys Met Gly Leu Val Lys 175 Ala Ser Trp Gly Asp Ala Arg Glu Leu Asp 160 Lys INFORMATION FOR SEQ ID NO:595: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:595: WO 97/37044 PCT/US97/05223 529 Pro Lys Pro Tyr Ala Val Ser Ile Leu Cys Ile Cys Val Met Phe Ile 1 5 10 Leu Phe Lys Phe Gly Arg Val Leu Gly Lys Ala Tyr Ser Leu Tyr Leu 25 Tyr Ile Tyr Glu Ser Leu Ile Cys Gin Ala Phe Gly Leu Ser Leu Ser 40 Cys Asn Asn Ser Met Leu Phe Ser Thr Phe Leu Ile Asn Leu Pro Leu 55 Pro His Asn Glu Ser Leu Cys Cys Cys Arg Asp Ile Leu Ala Tyr Ser 70 75 Asn Ser Ser Ser Leu Lys Thr Tyr Ser Leu Glu Ser Asn Phe Ser Phe 90 Asn Ser Leu Phe 100 INFORMATION FOR SEQ ID NO:596: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:596: Thr Ala Arg Met Asn Arg Met Lys Pro Ile Phe Leu Leu Ile Phe Leu 1 5 10 Leu Leu Ala Ser Leu Ile Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu 25 Phe Asp Leu Ile Asp Lys Gly Ile Asn Arg Glu Gin Glu Leu Lys Glu 40 Gin Glu Gin Lys Thr Arg Leu Lys Leu Ala Gin Ser Pro Leu Val Ala 55 Leu Glu Ile 
Val Pro Gin Glu Thr Pro Tyr Leu Glu Trp Gin Gly Ala 70 75 Arg Glu Ser Tyr Tyr Leu Lys Val Glu Arg Cys Ser Gly Glu Arg Gly 90 Tyr Phe Lys Asp 100 INFORMATION FOR SEQ ID NO:597: SEQUENCE CHARACTERISTICS: LENGTH: 231 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 PCT/US97/05223 530 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:597: Val Ala Arg Ala Phe Ile Ala Leu Tyr Ala Ile 1 Ser Ala Ser Ser Ser Ser Met Arg Lys 145 Arg Ser Leu Asp Asn 225 Lys Pro Gly Gin Val Leu Leu Val 130 Thr Val Phe Glu Ala 210 Pro Val Lys Lys Asn Leu Leu Tyr 115 His Arg Met Ser Asn 195 Asn His Glu Pro Glu Thr Glu Phe 100 Ile Ile Phe Lys Ser 180 Arg Asp Lys 5 Ala Glu Glu Glu Gin Glu Glu Asn Lys Val 165 Tyr Met Leu Gin Leu Ala Glu Thr 70 Ile Asn Arg Val Ser 150 Leu Gly Lys Ser Gin 230 Thr Gin 40 Met Ala Gin Thr Ala 120 Gly Tyr Gin Thr Asn 200 Ile Glu 25 Pro Ala Thr Gly Ser 105 Lys Phe Glu Tyr Asn 185 Arg His 10 Phe Val Ser Ile Ser 90 Asp Ile Thr Leu Gly 170 Pro Val Ser Ile Val Glu Ala 75 Val Ala Ile Asp Ala 155 Val Ile Glu Ile Ser Lys Val Ser Arg Leu Ile Gin Asn 140 Ala Asp Ala Ile Leu 220 Ala Ile Ile Ser Lys Lys Asn Lys 125 Thr Asn Pro Pro Phe 205 Asp Val Phe Pro Lys Gly Leu Gin 110 Leu Pro Arg Asn Asn 190 Phe Glu Asn Asn Pro Pro Glu Pro Asp Pro Leu Ala Gin 175 Asp Ser Glu Lys Tyr Asp Ala Gly Ser Met Lys Asn Tyr 160 Leu Ser Thr Phe INFORMATION FOR SEQ ID NO:598: SEQUENCE CHARACTERISTICS: LENGTH: 98 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 531 (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...98 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:598: Gin Arg Leu Tyr Leu Gly Lys Leu Asn Ala Lys Val Asn His Thr Ile 1 5 10 Phe Gin Phe Leu Val Asn Val Gly Ile Arg Thr Asn Ile Phe Glu His 25 His Gly Ile Glu Phe Gly Ile Lys Ile 
Pro Thr Leu Pro Asn Tyr Phe 40 Phe Lys Gly Ser Thr Thr Ile Arg Ala Lys Lys Gin Gly Pro Leu Glu 55 Asn Gly Asn Pro Thr Thr Ile Thr Gly Ala Glu Thr Asn Phe Ser Leu 70 75 Thr Gin Thr Leu Arg Arg Gin Tyr Ser Met Tyr Leu Arg Tyr Val Tyr 90 Thr Phe INFORMATION FOR SEQ ID NO:599: SEQUENCE CHARACTERISTICS: LENGTH: 357 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:599: Phe His Phe Tyr Cys Leu Tyr Phe Leu Asn Gly Gly Tyr Asn Arg Leu 1 5 10 His Lys Ser Tyr Glu Arg Ile Val Met Leu Ile Ala Arg Phe Lys Lys 25 Ala Leu Ile Ser Tyr Ser Leu Gly Val Leu Leu Val Ser Ser Leu Leu 40 Gly Val Ala Asn Ala Ser Asn Gin Glu Ile Gin Val Lys Asp Tyr Phe 55 Gly Glu Gin Thr Ile Lys Leu Pro Val Ser Lys Ile Ile Tyr Leu Gly 70 75 Ser Phe Ala Glu Val Pro Ala Met Phe Asn Thr Trp Asp Arg Val Val 90 Gly Ile Ser Asp Tyr Ala Phe Lys Ser Asp Ile Val Lys Ala Thr Leu 100 105 110 Lys Asp Leu Glu Arg Ile Lys Pro Met Ser Ser Asp His Val Ala Ala WO 97/37044 PCT/US97/05223 532 115 120 125 Leu Asn Val Glu Leu Leu Lys Lys Leu Ser Pro Asn Leu Val Val Thr 130 135 140 Phe Val Gly Asn Pro Lys Ala Val Glu His Ala Lys Lys Phe Gly Ile 145 150 155 160 Ser Phe Leu Ser Phe Gin Glu Lys Thr Ile Val Glu Val Met Glu Asp 165 170 175 Ile Asp Ala Gin Ala Lys Ala Leu Glu Val Asp Ala Ser Lys Lys Leu 180 185 190 Ala Lys Met Gin Glu Thr Leu Asp Phe Ile Lys Glu Arg Leu Lys Asn 195 200 205 Val Lys Lys Lys Lys Gly Val Glu Leu Phe His Lys Ala Asn Lys Ile 210 215 220 Ser Gly His Gin Ala Leu Asp Ser Asp Ile Leu Glu Lys Gly Gly Ile 225 230 235 240 Asp Asn Phe Gly Leu Lys Tyr Val Lys Phe Gly Arg Ala Asp Ile Ser 245 250 255 Val Glu Lys Ile Val Lys Glu Asn Pro Glu Ile Ile Phe Ile Trp Trp 260 265 270 Val Ser Pro Leu Thr Pro Glu Asp Val Leu Asn Asn Pro Lys Phe Ser 275 280 285 Thr Ile Lys Ala Ile Lys Asn Lys Gin Val Tyr Lys Leu Pro Thr Met 290 295 300 Asp 
Ile Gly Gly Pro Arg Ala Pro Leu Ile Ser Leu Phe Ile Ala Leu 305 310 315 320 Lys Ala His Pro Glu Ala Phe Lys Gly Val Asp Ile Asn Ala Ile Ile 325 330 335 Lys Asp Tyr Tyr Lys Val Val Phe Asp Leu Asn Asp Ala Glu Val Glu 340 345 350 Pro Phe Leu Trp His 355 INFORMATION FOR SEQ ID NO:600: SEQUENCE CHARACTERISTICS: LENGTH: 61 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...61 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:600: Asp His His Lys Glu Phe Arg Met Lys Lys Gin Ile Leu Thr Gly Val 1 5 10 Leu Leu Ser Val Leu Ala Val Ser Ser Ala Tyr Ala His Lys Asp Lys 25 Lys Asp Ala Lys Lys Pro Glu Leu Ser Ser Gin Leu Val Ala His Lys WO 97/37044 PCTfUS9/05223 533 40 Asp Lys Lys Asp Ala Lys Lys Pro Lys Asn Ser Val Ala 55 INFORMATION FOR SEQ ID NO:601: SEQUENCE CHARACTERISTICS: LENGTH: 200 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...200 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:601: Arg Ile Ile Lys Gly Asn Ile Met Ser Ser Gly Leu Ile Tyr Ile Ser 1 5 10 Leu Glu Val Leu Val Ala Cys Leu Ile Thr Ala Leu Ile Met Tyr Tyr 25 Val Met Lys Lys Ile Tyr Tyr Ala Arg Gly Gin Ala Ile Leu Lys Gly 40 Ala Ser Ala Lys Ala Lys Leu Met Glu Phe Gin Ala Lys Ser Phe Val 55 Glu Ala Glu Glu Met Arg Met Lys Ser Gin Glu Cys Lys Leu Gin Gin 70 75 Gin Tyr Glu Asn Lys Asn Leu Gin Leu Gln Thr His Phe Asp Lys Lys 90 Glu Ala His Leu Lys His Leu Glu Ala Gin His Lys Glu Phe Val Arg 100 105 110 Asp Glu Lys Arg Tyr Leu Glu Lys Glu Lys Lys Glu Leu Glu Lys Glu 115 120 125 Arg Gin Ile Leu Glu Gln Glu Arg Glu Asn Phe Lys Lys Gln Arg Ala 130 135 140 Ile Cys Lys Glu Ala Gin Ala Lys Ala Leu Asp Ala Met Leu Asn Tyr 145 150 155 160 Met Ala Tyr Thr Lys Asp Glu Ile Lys Ser Met Ile Leu Glu Gin Leu 165 170 175 Glu Gin Glu Leu Glu Ala Gin Lys Ser 
Ala Leu Ile Arg Arg Tyr Glu 180 185 190 Glu Glu Ala Phe Ile Met Cys Leu 195 200 INFORMATION FOR SEQ ID NO:602: SEQUENCE CHARACTERISTICS: LENGTH: 167 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 PCT/US97/05223 534 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...167 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:602: Gly Arg Ala Phe Met Met Arg Glu Ile Leu Thr Asn Arg Phe Phe Pro 1 5 10 Ser Leu Phe Lys Lys Arg Leu Asp Phe Ser Asn Arg Val Val Leu Gly 25 Leu Gly Ser Asn Leu Lys Asn Pro Leu Lys Ile Leu Lys Ser Cys Phe 40 Leu Tyr Phe Lys Asn His Ser Lys Ile Gly Lys Ile Phe Ser Ser Pro 55 Ile Tyr Ile Asn Pro Pro Phe Gly Tyr Thr Asn Gin Pro Asn Phe Tyr 70 75 Asn Ala Thr Ile Ile Leu Lys Thr Ser Leu Gly Leu Arg His Phe Phe 90 Ala Leu Val Phe Tyr Ile Glu Arg Arg Phe Gly Arg Ala Arg Lys Arg 100 105 110 Asp Phe Lys Asp Ala Pro Arg Thr Leu Asp Ile Asp Ile Ile Ala Phe 115 120 125 Asn Gin Val Ile Leu Arg Gln Asn Asp Leu Thr Leu Pro His Pro Lys 130 135 140 Trp Ser Glu Arg Asp Ser Val Leu Val Pro Leu Thr Leu Gin Gin Ile 145 150 155 160 Leu Phe Lys Arg Glu Glu Trp 165 INFORMATION FOR SEQ ID NO:603: SEQUENCE CHARACTERISTICS: LENGTH: 171 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...171 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:603: WO 97/37044 PCT/US97/05223 535 Ile Ile Glu Ile Leu Val Ile Gin Gly Pro Asn Leu Asn Met Leu Gly 1 5 10 His Arg Asp Pro Arg Leu Tyr Gly Met Val Thr Leu Asp Gin Ile His 25 Glu Ile Met Gin Thr Phe Val Lys Gin Gly Asn Leu Asp Val Glu Leu 40 Glu Phe Phe Gin Thr Asn Phe Glu Gly Glu Ile Ile Asp Lys Ile Gin 55 Glu Ser Val Gly Ser Asp Tyr Glu Gly Ile Ile Ile Asn Pro Gly Ala 70 75 Phe Ser His Thr Ser Ile Ala Ile Ala Asp Ala Ile Met Leu Ala Gly 90 Lys Pro Val Ile Glu Val His Leu Thr Asn Ile Gin Ala Arg Glu Glu 100 
105 110 Phe Arg Lys Asn Ser Tyr Thr Gly Ala Ala Cys Gly Gly Val Ile Met 115 120 125 Gly Phe Gly Pro Leu Gly Tyr Asn Met Ala Leu Met Ala Met Val Asn 130 135 140 Ile Leu Ala Glu Met Lys Ala Phe Gin Glu Ala Gin Lys Asn Asn Pro 145 150 155 160 Asn Asn Pro Asn Asn Pro Ile Asn Asn Gin Lys 165 170 INFORMATION FOR SEQ ID NO:604: SEQUENCE CHARACTERISTICS: LENGTH: 66 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...66 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:604: Pro Arg Ile Ile Ala Phe Phe Trp Ile Trp Gly Ser Asn Thr Asn Ala 1 5 10 Ile Ala Leu Ile Gly Leu Ala Arg Leu Phe Leu Asn Pro Lys Asp Phe 25 Val Phe Lys Arg Glu Gin Ser Phe Lys Asp Lys Glu Arg Gin Lys Ile 40 Tyr Asp Ile Val Lys Glu Ala Gin Glu Lys Ala Ile Gin Ala Leu Glu 55 Arg Gly INFORMATION FOR SEQ ID NO:605: SEQUENCE CHARACTERISTICS: WO 97/37044 PCT/US97/05223 536 LENGTH: 70 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:605: Arg Leu Trp Leu Trp Ala Val Phe Thr His Ser Thr Gly His Gly Ile 1 5 10 Asp Leu Asp Ile His Glu Leu Pro Tyr Ile Ser Ser Arg Ser Glu Thr 25 Ile Leu Glu Glu Gly Met Val Phe Ser Val Glu Pro Gly Ile Tyr Ile 40 Pro Gly Phe Phe Gly Val Arg Ile Glu Asp Leu Val Val Ile Lys Asn 55 Ser Arg Ala Glu Leu Leu INFORMATION FOR SEQ ID NO:606: SEQUENCE CHARACTERISTICS: LENGTH: 99 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...99 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:606: Tyr Gin Tyr Ile Gin Gin Gly Gly Gly Phe Gly Val Asn Val Gly Arg 1 5 10 Thr Leu Gly Asn Arg Thr His Val Ser Leu Gly Tyr Asn Leu Asn Val 25 Thr Lys Leu Leu Gly Phe 
Ser Ser Pro Leu Tyr Asn Arg Tyr Tyr Ser 40 Ser Val Asn Glu Val Ala Ser Pro Arg Gln Cys Ser Thr Pro Ala Ser 55 Val Ile Ile Asn Arg Leu Ser Gly Gly Arg Thr Pro Leu Val Pro Glu 70 75 Ser Cys Ser Ser Pro Gly Ala Ile Thr Ile Phe Thr Arg Asn Lys Arg 90 Tyr Leu Gly INFORMATION FOR SEQ ID NO:607: SEQUENCE CHARACTERISTICS: LENGTH: 92 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:607: Met Ala Cys Glu Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe Ile Glu 1 5 10 Gly Ala Asn Tyr Leu Ser Leu Gly Leu Ser Met Val Val Ala Ile Leu 25 Met Gly Val Ala Ile Gly Tyr Gly Leu Lys Lys Leu Thr His Ile Ser 40 Trp Leu Phe Trp Leu Gly Val Ile Trp Gly Val Leu Ala Ser Phe Leu 55 Asn Val Tyr Lys Ala Tyr Lys Asn Met Gln Lys Asp Tyr Glu Glu Leu 70 75 Ala Lys Asp Pro Lys Tyr Thr Gln Asn Lys Thr Lys INFORMATION FOR SEQ ID NO:608: SEQUENCE
CHARACTERISTICS:
LENGTH: 89 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...89 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:608: Arg Asn Leu Met Lys Thr Thr Glu Asn Thr Asp Glu Thr His Leu Arg 1 5 10 Glu Thr Lys Asn Lys Leu Gly Arg Lys Pro Lys Ala Asp Ala Asn Lys 25 Lys Thr Arg Ala Val Ser Leu Tyr Phe Ser Asp Glu Gln Tyr Gln Lys 40 Leu Glu Lys Met Ala Asn Glu Glu Glu Glu Ser Val Gly Ser Tyr Ile 55 Lys Arg Tyr Ile Leu Lys Ala Leu Arg Lys Ile Glu Gln Gly Gly Gly 70 75 Phe Ile Ala Phe Asn Leu Phe Leu Ile INFORMATION FOR SEQ ID NO:609: SEQUENCE CHARACTERISTICS: LENGTH: 103 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...103 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:609: Ile Leu Lys Gly Gly Phe Leu Gly Phe Phe Ile Val Ala Leu Ser Ser 1 5 10 Tyr Tyr Gly Val Lys Lys Arg Leu Asp Leu Arg Lys Gln Asp Ser Lys 25 Glu Lys Glu Glu Lys Gln Lys Phe Gln Lys Phe Ala Leu Gly Leu Glu 40 Met Ser Phe Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly 55 Ile Leu Gly Val Leu Leu Phe Leu His Leu Phe Asn Gly Leu Pro Phe 70 75 Leu Ile Gly Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe 90 Leu Asn Asn Asn Gly Lys Phe 100 INFORMATION FOR SEQ ID NO:610: SEQUENCE
CHARACTERISTICS:
LENGTH: 257 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: H-elicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .257 (xi) SEQUENCE DESCRIPTION: SEQ ID PCTIUS97/05223 Ala Leu Thr Pro Asn Pro Met Lys Leu Pro Val Val 1 Ser Arg Leu His Leu Phe Leu Val1 Glu 145 Giu Ala His Asn Asp 225 Arg Phe Leu Gin Leu Gly Phe Asn Pro Lys Ile Glu Ile Leu Leu Ser 115 Glu Ser 130 Leu His Ser Lys Lys Ser Ser Ile 195 Asn Glu 210 Lys Asn Leu Ser Giy Gly Asp Phe Arg Thr 100 Vai Asn Phe Arg Val 180 Ala Ile Leu Asp 5 Glu Gly Cys Asn Giu Ile Lys Thr 70 Leu Val Gly Gly Leu Giu Gly Ser Thr Leu 150 Ile Asn 165 His Phe Giu Ile Phe Leu Lys Thr 230 Arg Leu 245 Lys Leu Leu 55 Ser Asn Giu His Ile 135 Ser Leu Lys Gin Met 215 Leu His Ang Ser 40 Thr Trp Leu Pro Phe 120 Phe Val Lys Phe Sen 200 Pro Aia Ile Ile 25 Cys Gly Asp Ala Ser i05 Tyr Phe Lys Ala Val 185 Leu Leu Pro Arg 10 Giy Lys Cys Tyr Pro 90 Leu His Glu Leu Leu 170 Leu Leu Gly Leu Leu 250 Lys Pro Gly Phe Asp Ser Tyr Asn 75 Asn Tyr Tyr Phe Lys Lys Phe Ser 140 Ser Phe 155 Gin Asn Giu Ser Lys Gin Thr Thr 220 Ala Ile 235 Trp Asp Giu Ser Asn Leu Glu Lys Asn Ile 125 Pro Ser Ile Gin Leu 205 Asn Glu Asn Ser Leu Cys Tyr Pro Asp Asn 110 Pro Ile Leu Leu Asn 190 Ser Asn His Lys Phe Phe Lys Ala Lys Phe Pro Leu Leu Glu Asn 175 Ala Leu Giu Gly Lys 255 Phe Leu Thr Val1 Pro Asp Ile Phe Lys Gin 160 Asn Ala Lys Leu Phe 240 Gly INFORMATION FOR SEQ ID NO:6ii: SEQUENCE CHARACTERISTICS: LENGTH: 386 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PTU9/52 PCTIUS97/05223 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .386 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:611: Phe Lys Asp Gin Gly Met Asn Leu Asn Phe Met Pro 1 Tyr Phe His Leu Tyr Leu Gin Ile Lys 145 Ile Lys Thr Gin Lys 225 Gin Gin Pro Arg Lys 305 Gin Phe Asn Cys Ala Gin Ala Pro Glu Lys 130 Met Ala Phe Lys Sen 210 Ile Lys Al a Tyr Phe 290 Ala His Phe His Val1 Val Ile Gly Lys Arg 115 Leu Thr Gin Asn Phe 195 Tyr Ile As n His 
Gly 275 Leu Leu Asn Trp Val1 His Ile Phe Leu Lys 100 As n Gly Pro Phe Asp 180 Ala Leu Ser Leu Pro 260 Lys Asn Tyr Leu Vali 340 5 Ser Giu Gin Ser Lys Tyr Leu His Leu Gly 165 Asn His Phe Ala Sen 245 Phe Phe Lys Ala Leu 325 Phe Ile Val Val1 Gin 70 Asp Ala Lys Leu Asn 150 Met His Gin Asn Phe 230 Val1 Lys Phe Giu Lys 310 Ser Val1 Asp Pro Arg 55 Ile Lys Pro Ile Lys 135 Ala Pro Lys Lys Ser 215 Ser His Ile Asp Ala 295 Asn Giy Glu Phe Leu 40 Lys Leu As n Leu Leu 120 Gly Gin Asn Glu Leu 200 Leu Val1 Ser Leu Ala 280 Val1 Leu His Asri His 25 Tyr Ser Gly Ala Leu 105 Ser Asn Lys Tyr Gly 185 As n Leu Lys Asn Glu 265 Leu Pro Ser Ala Ile Phe Giu Gly Val1 Leu 90 Giu Leu Ang Thr Phe 170 Leu Ala Ser Giu Ala 250 Gly Giu Thr Leu Lys 330 Thr Asn Phe Leu Lys 75 Thr Lys Asn Phe Giu 155 Gly Lys Phe Lys Ser 235 Leu Asp Leu Gly Giu 315 Thr Ser Ser Ser Ser Ile Thr Asn Tyr Phe 140 Gin Sen Ile Leu Arg 220 Leu Lys Val Glu Leu 300 Ile Leu Gin Leu Ser Asn Thr Ala Gin Thr His 125 Met Val Gin Leu Ile 205 Leu Giu Ala Met Lys 285 Leu Giu Gly Tyr Leu Ala Thr Leu Giu Phe Sen 110 His Arg Leu Ang Gin 190 Ser Giu Phe Leu Arg 270 Giu Asp Lys Ser Ile 350 His Arg Gly Giu Leu Ile Asn Asn Phe Giu Phe 175 Asn Ser Ile Phe Lys 255 His Ser Gly Gly Arg 335 Lys Ala Asp Glu Met Gly Ser Phe Lys Lys Gin 160 Gly Giu Tyr Sen Lys 240 Asn Tyr Glu Lys Phe 320 Arg Glu Lys Ala Gin Phe Giu Leu 0Th Phe Tyr Leu Pro Lys Gly Ser Tyr Ala 355 360 365 WO 97/37044 PCT/US97/05223 541 Ser Ala Leu Leu Lys Glu Ile Lys His Glu Lys Gly Glu Asn Asn Asp 370 375 380 Glu Phe 385 INFORMATION FOR SEQ ID NO:612: SEQUENCE CHARACTERISTICS: LENGTH: 63 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...63 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:612: Lys Ile Lys Gly Lys Glu Met Lys Phe Leu Asn Gly Leu Ala Gly Asn 1 5 10 Leu Leu Ile Val Val Ile Leu Leu Cys Val Val Val Phe Phe Ala Leu 25 Lys Ala Ile His Ile Gin 
Lys Glu Gln Ala Thr Asn Tyr Tyr Arg Tyr 40 Lys Asp Ile Asn Ala Leu Glu Ala Lys Asn Thr Gln Asn His Ala 55 INFORMATION FOR SEQ ID NO:613: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:613: Met Leu Glu Ile Glu Leu Lys Lys Lys Phe Thr Lys Asp Leu Lys Lys 1 5 10 His Ile Leu Asn Gln Lys Ile Glu Leu Glu Val Phe Asp Leu Val Val 25 Glu Asn Leu Arg Asn Gln Ile Pro Leu Asp Lys Arg Phe Lys Asp His 40 Ala Leu Ser Gly Thr Tyr Lys Gly Cys Arg Glu Arg His Ile Lys Pro 55 Asp Val Leu Leu Val Tyr Arg Val Lys Gly Asn Val Leu Thr Leu Val 70 75 Arg Leu Gly Ser His Ser Glu Leu Phe Cys Lys Pro Pro Thr Pro Leu 90 Ile Thr Leu Lys 100 INFORMATION FOR SEQ ID NO:614: SEQUENCE CHARACTERISTICS: LENGTH: 646 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...646 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:614: Ser Asn Tyr Asn Asn Leu Asn Thr Leu Val Ser Leu Ser Ser Asp Pro 1 5 10 Ser Ala Val Asn Asp Ala Arg Asp Asn Leu Gly Ser Ser Ala Arg Asn 25 Leu Leu Asp Val Lys Ala Asn Ser Pro Ala Tyr Gin Ala Val Leu Leu 40 Ala Leu Asn Ala Ala Val Gly Leu Trp Gin Val Thr Ser Tyr Ala Phe 55 Thr Ala Cys Gly Pro Gly Ser Asn Glu Asn Ala Asn Gly Gly Ile Gin 70 75 Thr Phe Asn Asn Val Pro Gly Gin Asn Thr Thr Thr Ile Thr Cys Asn 90 Ser Tyr Tyr Glu Pro Gly His Gly Gly Pro Ile Ser Thr Glu Asn Tyr 100 105 110 Ala Ile Ile Asn Lys Ala Tyr Gin Ile Ile Gin Lys Ala Leu Thr Ala 115 120 125 Asn Gly Glu Gly Ile Pro Val Leu Ser Asn Thr Thr Thr Lys Leu Asp 130 135 140 Phe Thr Ile Asn Gly Asp Lys Arg Thr Gly Gly Glu Pro Asn Lys Lys 145 150 155 160 Leu Val Tyr Pro Trp Ser His Gly Lys Ala Ile Ser Thr Ser Trp Asn 165 170 175 Ala Thr Ile Thr Val Pro Thr Thr Glu Asn Ile Asn Thr Thr Asn Ser 180 185 190 Ala Gin Glu Leu Leu Lys Gin Ala Ser Ile Ile Ile Thr Thr Leu Asn WO 97/37044 PCT/US97/05223 543 Ser Ala 210 Ile Ser 225 Ala Ile Lys Ile Lys Pro Lys Asn 290 Val Lys 305 Gly Val Gly Gin Gin Thr Glu Gin 370 Phe Lys 385 Thr Ala Ser Lys Tyr Leu Leu Gly 450 Asn Asn 465 Phe Phe Asp Tyr Asp Val Asn Asp 530 Gly Leu 545 Glu Tyr Asn Val Leu Ala Ile Glu 610 Phe Met 625 Asn Tyr 195 Cys Gly Gin Val Phe 275 Ala Asn Cys Thr Ile 355 Gin Ser Leu Lys Asn 435 Arg Gly Gly Asn Trp 515 Lys Phe Val Ala Arg 595 Leu Gly Val Pro Asn Phe Gln Asn Asn Gly Ser 260 Asn Gin Phe Tyr Thr 340 Thr Ile Arg Ser Asn 420 Gin Asn Ala Gin His 500 Thr Ala Gly Asn Asn 580 Pro Gly Ala Phe Gly Met 245 Glu Pro Ala Glu Glu 325 Ser Asn Gin Tyr Asn 405 Asn Asn Pro Met Lys 485 Ala Tyr Thr Gly Leu 565 Phe Lys Leu Glu Ala 645 Thr 230 Ile Asn Tyr Gin Lys 310 Val Asn Leu Gin Ser 390 Ile Pro Ser Phe Asn 470 Arg Phe Gly Asn Ile 550 Ala Gin Lys Lys Leu 630 Tyr 215 Met Ala Thr Thr Ala 295 Ile Gin Thr Lys Ala 375 Glu Pro Tyr Tyr 
Arg 455 Gly Lys Ile Phe Phe 535 Ala Thr Phe Lys Ile 615 Lys Cys Asn Gin Asp 280 Glu Pro Gly Trp Asn 360 Glu Leu Asn Ser Asn 440 Lys Ile Trp Lys Gly 520 Leu Leu Met Leu Asp 600 Pro Tyr Gly Gly Ala Asn 265 Ala Ile Thr Gly Gly 345 Ser Asn Gly Ala Pro 425 Gin Val Gly Gly Ser 505 Ala Gly Ala Asn Phe 585 Ser Thr Arg Gly Met Gin 250 Gin Ser Leu Ala Glu 330 Ala Ile Ile Asn Gin 410 Gin Ile Gly Ile Ala 490 Ser Asp Lys Gly Asn 570 Asn Asp Ile Arg Ser Phe 235 Glu Asn Phe Asn Phe 315 Arg Gly Ala Ala Thr 395 Ser Gly Gin Ile Gin 475 Arg Phe Ala Asn Thr 555 Val Met His Asn Leu 635 Gly 220 Lys Ala Ser Ala Gin 300 Val Arg Cys His Asp 380 Tyr Leu Ile Thr Val 460 Val Tyr Phe Leu Asn 540 Ser Tyr Gly Ala Thr 620 Tyr 205 Tyr Asn Val Leu Glu 285 Ala Asn Gly Ala Phe 365 Thr Asn Gin Asp Ile 445 Ser Gly Tyr Asn Tyr 525 Lys Trp Asn Val Ala 605 Asn Ser Trp Glu Ala Asp 270 Ser Glu Asp Thr Tyr 350 Gly Leu Ser Asn Thr 430 Asn Ser Tyr Gly Ser 510 Asn Leu Leu Ala Arg 590 Gin Tyr Val Ala Gly Ile Ser 240 Gin Ala 255 Ala Gly Met Leu Gin Val Ser Leu 320 Asn Pro 335 Val Gly Thr Gin Val Asn Ile Thr 400 Ala Val 415 Asn Tyr Gin Glu Gin Thr Lys Gin 480 Phe Phe 495 Ala Ser Phe Ile Ser Val Asn Ser 560 Lys Met 575 Met Asn His Gly Tyr Ser Tyr Leu 640 WO 97/37044 PCT/US97/05223 INFORMATION FOR SEQ ID NO:615: SEQUENCE CHARACTERISTICS: LENGTH: 93 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...93 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:615: Asp Leu Lys Met Leu Thr Ile Glu Thr Ser Lys Lys Phe Asp Lys Asp 1 5 10 Leu Lys Ile Leu Val"Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys 25 Val Val Gly Asn Leu Ala Thr Glu Gin Pro Leu Glu Pro Lys Tyr Lys 40 Asp His Pro Leu Lys Gly Ala Leu Lys Asp Phe Arg Glu Cys His Leu 55 Lys Pro Asp Leu Leu Leu Val Tyr Gin Ile Lys Lys Gin Glu Asn Thr 70 75 Leu Phe Leu Val Arg Leu Gly Ser His Ser Glu Leu Phe INFORMATION FOR SEQ ID NO:616: SEQUENCE 
CHARACTERISTICS: LENGTH: 81 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...81 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:616: Arg Ala Trp Arg Met Lys Ser Met Arg Phe Ser Tyr Ile Glu Pro Arg 1 5 10 Ala Lys Tyr Leu Ile Ser Lys Leu Ser Lys Ile Trp Val Phe Tyr Ile 25 Phe Leu Ser Phe Val Leu Ile Gly Gly Leu Val Trp Phe Met His Asn 40 Ala Ile Lys Arg Ala Gln Asp Asn Ala Ser Ser Leu Thr Ile Gln Glu 55 Glu Leu Tyr Arg Cys Tyr Ile Thr Arg Leu Ser Val Lys Met Ile Ile 70 75 Leu INFORMATION FOR SEQ ID NO:617: SEQUENCE CHARACTERISTICS: LENGTH: 149 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...149 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:617: Val Lys Phe Pro Tyr Leu Leu Asn Ile Arg Ile Met Leu Tyr Leu Arg 1 5 10 Lys Glu Asn Gly Val Arg Thr Leu Ile Ser Leu Gly Ile Leu Leu Ser 25 Val Leu Asn Gly Asp Asp Leu Arg Leu Tyr Ser Lys Pro Leu Val Tyr 40 Ser Ala Gly Ser Gly Ile Ile Gly Ile Asp Ile Asp Lys Arg Thr Phe 55 Tyr Lys Arg Ala Phe Ala Phe Thr Met Lys Ser Leu Phe Gly Glu Asn 70 75 Leu Leu Leu Phe Val Lys Leu Lys His Ser Ala Leu Met Ser Lys His 90 Met Lys Gly Pro Leu Glu Asn Arg His His His Ser Phe Ala Lys Asn 100 105 110 Tyr Glu Lys Ala Val Asn Gly Cys Gln Lys Tyr Val His Ile Lys Leu 115 120 125 Pro Glu Gly Pro Pro Ser Asn Phe Gln Ser Gly Ser Tyr Met Ala Thr 130 135 140 Met Val Met Arg Phe 145 INFORMATION FOR SEQ ID NO:618: SEQUENCE CHARACTERISTICS: LENGTH: 159 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...159 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:618: Val Ile Thr Asn Gln Gln Asn
Glu Val Lys Thr Phe Thr Pro Ile Glu 1 5 10 Thr Lys Lys Ile Thr Ser Lys Glu Gln Ala Phe Leu Thr Leu Ser Ala 25 Leu Met Asp Ala Val Glu Asn Gly Thr Gly Ser Leu Ala Arg Ile Lys 40 Gly Leu Glu Ile Ala Gly Lys Thr Gly Thr Ser Asn Asn Asn Ile Asp 55 Ala Trp Phe Ile Gly Phe Thr Pro Thr Leu Gln Ser Val Ile Trp Phe 70 75 Gly Arg Asp Asp Asn Thr Pro Ile Gly Lys Gly Ala Thr Gly Gly Val 90 Val Ser Ala Pro Val Tyr Ser Tyr Phe Met Arg Asn Ile Leu Ala Ile 100 105 110 Glu Pro Ser Leu Lys Arg Lys Phe Asp Val Pro Lys Gly Leu Arg Lys 115 120 125 Glu Ile Val Asp Lys Ile Pro Tyr Tyr Ser Ser Pro Asn Ser Ile Thr 130 135 140 Pro Thr Pro Lys Lys Thr Asp Asp Ser Glu Glu Arg Leu Leu Phe 145 150 155 INFORMATION FOR SEQ ID NO:619: SEQUENCE CHARACTERISTICS: LENGTH: 80 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:619: Leu Leu Leu Asn Leu Lys Ile Tyr Val Thr Leu Cys Lys Ile Gln Gly 1 5 10 Asp Ser Val Leu Glu Lys Ser Phe Leu Lys Ser Lys Gln Leu Val Leu 25 Cys Gly Leu Gly Val Phe Met Leu Gln Ala Leu His Phe Ala Gln Thr 40 Leu His Lys Glu Ile Leu Phe Tyr Lys Met Cys Leu Ile Gly Cys Cys 55 Lys Ile Ala Ala Ser Ile Ser Arg Lys Gly Trp Ile Ala Arg Thr Leu 70 75 INFORMATION FOR SEQ ID NO:620: SEQUENCE CHARACTERISTICS: LENGTH: 344 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...344 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:620: Val Lys Cys Gly Lys Lys Gly Ile Leu Val Ser Thr Arg Lys Ala Met 1 5 10 Phe Leu Cys Lys Ile Pro Leu Ser Leu Thr Leu Met Glu Ala Arg Ser 25 Val Leu Thr Gln Glu Ile Arg Ser Phe Leu Pro Glu Thr Thr Thr Ser 40 Leu Ser Leu Thr Ile Leu Glu Arg Ser Ile Cys Cys Leu Ile Lys Phe 55 Leu Thr Leu Thr Ser Pro Cys Leu Thr Gln Gln
Arg Pro Lys Ile Asn 70 75 Ala Thr Asn Asn Asn Val Ser Val Ser Gln Gly Asn Leu Phe Ile Asn 90 Ala Ser Cys Val Gln Gln Ser Asp Pro Thr Thr Ala Ser Ala Thr Asn 100 105 110 Pro Cys Thr Thr Ala Gln Asn Asn Ala Ser Ser Ser Asn Ala Ser Asn 115 120 125 Asn Ala Pro Ile Ala Leu Asn Asn Asn Asp Glu Ser Leu Val Val Thr 130 135 140 Ala Asn Gly Phe Asn Phe Ser Gly Asn Ile Tyr Ala Asn Gly Val Val 145 150 155 160 Asp Phe Ser Lys Ile Lys Gly Ser Ala Asn Val Lys Asn Leu Tyr Leu 165 170 175 Tyr Asn Asn Ala Gln Phe Gln Ala Asn Asn Leu Thr Ile Ser Asn Gln 180 185 190 Ala Val Leu Glu Lys Asn Ala Ser Phe Val Thr Asn Asn Leu Asn Ile 195 200 205 Gln Gly Ala Phe Asn Asn Asn Ala Thr Gln Lys Ile Glu Val Leu Gln 210 215 220 Asn Leu Val Ile Ala Ser Asn Ala Ser Leu Ser Thr Gly Ile Tyr Gly 225 230 235 240 Leu Glu Val Gly Gly Ala Leu Asn Asn Leu Gly Ala Ile His Phe Asn 245 250 255 Leu Glu Asn Ser Gln Thr Pro Val Asn Pro Leu Ile Gln Val Gly Gly 260 265 270 Ile Ile Asn Leu Asn Thr Thr Gln Thr Pro Phe Met Asn Val Ser Val 275 280 285 Ala Asn Gly Gly Thr Tyr Thr Leu Leu Lys Ser Ser Arg Tyr Ile Asp 290 295 300 Tyr Asn Ile Asn Pro Asn Ser Leu Gln Ser Tyr Leu Lys Leu Tyr Thr 305 310 315 320 Leu Ile Asn Ile Asn Gly Asn His Ile Glu Glu Lys Asn Gly Val Leu 325 330 335 Thr Tyr Leu Gly Gln Arg Val Leu 340 INFORMATION FOR SEQ ID NO:621: SEQUENCE CHARACTERISTICS: LENGTH: 266 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...266 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:621: Leu Leu Ala Arg Thr Ile Leu Asp Phe Arg Gly Ser Leu Ser Asn Leu 1 5 10 Asn Asn Thr Tyr Asn Ser Ile Thr Thr Thr Ala Ser Asn Thr Pro Asn 25 Ser Pro Phe Leu Lys Asn Leu Ile Ser Gln Ser Thr Asn Pro Asn Asn 40 Pro Gly Gly Leu Gln Ala Val Tyr Gln Val Asn Gln Ser Ala Tyr Ser 55 Gln Leu Leu Ser Ala Thr Gln Glu Leu Gly His Asn Pro Phe Arg Arg 70 75 Val Gly Leu Ile Ser Ser Gln Thr
Asn Asn Gly Ala Met Asn Gly Ile 90 Gly Val Gln Val Gly Tyr Lys Gln Phe Phe Gly Glu Lys Arg Arg Trp 100 105 110 Gly Leu Arg Tyr Tyr Gly Phe Phe Asp Tyr Asn His Ala Tyr Ile Lys 115 120 125 Ser Ser Phe Phe Asn Ser Ala Ser Asp Val Phe Thr Tyr Gly Val Gly 130 135 140 Thr Asp Val Leu Tyr Asn Phe Ile Asn Asp Lys Thr Thr Lys Asn Ser 145 150 155 160 Lys Ile Ser Phe Gly Val Phe Gly Gly Ile Ala Leu Ala Gly Thr Ser 165 170 175 Trp Leu Asn Ser Gln Tyr Val Asn Leu Ala Thr Phe Asn Asn Phe Tyr 180 185 190 Ser Ala Lys Met Asn Val Ala Asn Phe Gln Phe Leu Phe Asn Leu Gly 195 200 205 Leu Arg Met Asn Leu Ala Lys Asn Lys Lys Lys Asp Ser Asp His Ala 210 215 220 Ala Gln His Gly Val Glu Leu Gly Val Lys Ile Pro Thr Ile Asn Thr 225 230 235 240 Asn Tyr Tyr Ser Phe Leu Gly Thr Lys Leu Glu Tyr Arg Arg Leu Tyr 245 250 255 Ser Val Tyr Leu Asn Tyr Val Phe Ala Tyr 260 265 INFORMATION FOR SEQ ID NO:622: SEQUENCE CHARACTERISTICS: LENGTH: 95 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...95 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:622: Arg Asn Val Glu Ala Arg Tyr Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr 1 5 10 Leu His Ala Gly Val Leu Gln Glu Phe Ala His Phe Gly Ser Asn Asp 25 Val Ala Ser Leu Asn Thr Phe Lys Ile Asn Ala Ala Arg Ser Pro Leu 40 Ser Thr Tyr Ala Arg Ala Met Met Gly Gly Glu Leu Gln Leu Ala Lys 55 Glu Val Phe Leu Asn Leu Gly Val Val Tyr Leu His Asn Leu Ile Ser 70 75 Asn Ala Ser His Phe Ala Ser Asn Leu Gly Met Arg Tyr Ser Phe 90 INFORMATION FOR SEQ ID NO:623: SEQUENCE CHARACTERISTICS: LENGTH: 155 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...155 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:623: Arg Leu Pro Lys Lys Glu Asp Tyr Ala Lys Ala Met Val
Phe Ser Phe 1 5 10 Lys Met Glu Ala Ile Lys Glu Ser Gln Ile Val Leu Leu Ile Thr Ser 25 Ala Phe Glu Gly Gln Phe Glu Lys Thr His Lys Glu Glu Lys Glu Glu 40 Thr Thr Lys Ser Ala Thr Glu Glu Thr Lys Thr His Asp Ala Ser Leu 55 Glu Asn Ile Glu Ile Arg Asn Ile Ser Met Leu Leu Asp Val Lys Leu 70 75 Asn Val Lys Val Arg Ile Gly Gln Lys Lys Met Ile Leu Lys Asp Val 90 Val Ser Met Asp Ile Gly Ser Val Val Glu Leu Asp Gln Leu Val Asn 100 105 110 Asp Pro Leu Glu Ile Leu Val Asp Asp Lys Val Ile Ala Lys Gly Glu 115 120 125 Val Val Ile Val Asp Gly Asn Phe Gly Ile Gln Ile Thr Asp Ile Gly 130 135 140 Thr Lys Lys Glu Arg Leu Glu Gln Leu Lys Asn 145 150 155 INFORMATION FOR SEQ ID NO:624: SEQUENCE CHARACTERISTICS: LENGTH: 211 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...211 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:624: Lys Pro Thr Asn Val Trp Ala Asn Ala Ile Gly Gly Ala Ser Leu Asn 1 5 10 Ser Gly Ser Asn Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val Asp Ala 25 Phe Leu Asn Gly Asn Val Glu Ala Ile Val Gly Gly Phe Gly Ser Tyr 40 Gly Tyr Ser Ser Phe Ser Asn Gln Ala Asn Ser Leu Asn Ser Gly Ala 55 Asn Asn Ala Asn Phe Gly Val Tyr Ser Arg Phe Phe Ala Asn His Pro 70 75 Glu Phe Asp Phe Glu Ala Gln Gly Ala Leu Gly Ser Asp Gln Ser Ser 90 Leu Asn Phe Lys Ser Thr Leu Leu Gln Asp Leu Asn Gln Ser Tyr Asn 100 105 110 Tyr Leu Ala Tyr Ser Ala Thr Ala Arg Ala Ser Tyr Gly Tyr Asp Phe 115 120 125 Ala Phe Phe Arg Asn Ala Leu Val Leu Lys Pro Ser Val Gly Val Ser 130 135 140 Tyr Asn His Leu Gly Ser Thr Asn Phe Lys Ser Asn Ser Gln Ser Gln 145 150 155 160 Val Ala Leu Lys Asn Gly Ala Ser Ser Gln His Leu Phe Asn Ala Asn 165 170 175 Ala Thr Trp Lys Arg Val Ile Ile Met Gly Thr Leu His Thr Phe Ile 180 185 190 Cys Met Arg Glu Phe Tyr Lys Ser Ser Leu Thr Leu Asp Arg Met Met 195 200 205 Trp Arg Leu 210 INFORMATION FOR SEQ ID NO:625: SEQUENCE
CHARACTERISTICS: LENGTH: 343 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...343 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:625: His Ala Arg Glu Phe Cys Gin Val Val Met Phe His Lys Ala Leu Ile 1 5 10 Thr Phe Ile Val Leu Trp Phe Phe Leu Asn Gly Leu Gly Ala Tyr Asp 25 Phe Lys His Cys Gin Ala Phe Phe Lys Lys Ala Ser Leu Gin Lys Gly 40 Gly Val Ala Leu Lys Glu Leu Pro Lys Gly Val Tyr Leu Tyr Tyr Ser 55 Lys Thr Tyr Pro Lys His Ala Lys Val Ile Lys Ser Asp Pro Phe Ile 70 75 Gly Leu Tyr Leu Leu Gin Ser Ala Pro Ser Glu Tyr Val Tyr Thr Leu 90 Arg Asp Leu Asp Lys Asp Ala Leu Ile Arg Pro Met Ala Ser Ile Gly WO 97/37044 WO 9737044PCT1US97/05223 100 105 Ala Asn Gin Ala Thr Glii Ala Arg Leu 110 Lys Gly Tyr Asp Ser 145 Phe Tyr Val Asn Giu 225 Val1 Asn Tyr His Trp 305 Ala Glu Arg 130 Asn Ile Tyr Val1 Asp 210 Phe Lys Lys Gly Leu 290 Val1 Leu Phe 115 Tyr Ile Glu Gly Ala 195 Giu Glu Ile Arg Ile 275 Pro Asn Ser Tyr Ala Cys Thr Asp 180 Gin Ile Trp Lys Tyr 260 Ala Lys His Thr Ile 340 Gin Tyr Lys 165 Ile Phe Leu Val Arg 245 Gly Leu Gly Arg Pro 325 Lys Ile Gin i50 Phe Gly Asp Ala Val1 230 Asn Gly Asp Leu Ser 310 Lys Val1 Ser 135 Met Ile Val Pro Ile 215 Ser His Phe Giu Asp 295 Val1 Ile Arg 120 Gin Lys Leu Gly Lys Arg Arg Leu 185 Phe Phe 200 Asn Asp Asn Leu Gin Ile Leu Leu 265 Arg Phe 280 Phe Leu Ser Phe Leu Thr Leu Phe 170 Clu Pro Tyr Ser Lys 250 Lys Ile Lys Asn Val Gly Gin Gin Gly 155 Leu Giu Lys Lys Tyr 235 Giu Asp Ile Leu Pro 315 Lys 140 Val1 Asn Arg Asn Ile 220 Gin Val1 Thr Thr Gly 300 Lys Asn Gly Gin His Pro 205 His Ser Thr Phe Lys 285 Asp Ala Gly Gly Gin Lys 190 Phe Ser Leu Leu Leu 270 Ile Arg Leu Val Ile Asn Gly 160 Glu Pro 175 Arg Leu Leu Lys Leu Ala Ala Lys 240 Lys Val 255 Glu Arg Gly Ala Ile Leu Arg Glu 0Th Leu Leu Val Trp Arg Gin Gly Phe INFORMATION FOR SEQ ID NO:626: SEQUENCE CHARACTERISTICS: LENGTH: 165 amino acids TYPE: amino acid 
TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:626: Ile Lys Ile Asn Met Val Glu Cys Gln Asn Leu Leu Ser Ser Cys Gly 10 Lys Asn Leu Asp Asn Lys Glu Leu Gly Met Arg Arg Ser Leu Ala Phe 25 Cys Leu Leu Ala Leu Leu Gly Leu Gln Val Leu Gly Ala Arg Asp Phe 40 Ser Gln Leu Lys Asn Glu Glu Leu Leu Lys Leu Ala Gly Thr Leu Pro 55 Ser Asn Glu Ala Ile Asp Tyr Arg Met Glu Val Ser Lys Arg Leu Ile 70 75 Ala Leu Ser Ala Glu Asp Ala Lys Asn Phe Arg Ala Asn Phe Ser Arg 90 Ile Ala Arg Lys Asn Leu Ser Lys Met Ser Glu Glu Asp Phe Lys Lys 100 105 110 Met Arg Glu Glu Val Arg Lys Glu Leu Glu Glu Lys Thr Lys Gly Leu 115 120 125 Ser Ala Glu Glu Ile Lys Ala Lys Gly Leu Asn Val Ser Val Cys Ser 130 135 140 Gly Asp Thr Arg Lys Val Trp Cys Arg Ala Val Lys Lys Lys Asp Glu 145 150 155 160 His Cys Ser Pro Lys 165 INFORMATION FOR SEQ ID NO:627: SEQUENCE CHARACTERISTICS: LENGTH: 157 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:627: Phe Lys Arg Lys Asn Met Lys Lys Ala Leu Lys Ile Leu Ser Val Gly 1 5 10 Ala Leu Leu Phe Val Ala Leu Asn Ala Lys Asp Phe Ser Lys Thr Ser 25 Asp Glu Asp Leu Ala Lys Met Ala Gly Val Val Ala Pro Gln Asp Ile 40 Val Asp Tyr Thr Lys Glu Leu Lys Lys Arg Met Glu Lys Met Pro Glu 55 Asp Lys Arg Lys Ala Phe His Lys Gln Leu His Glu Tyr Ala Thr Lys 70 75 Asn Thr Asp Lys Met Thr Val Ala Asp Phe Glu Ala Arg Gln Lys Ala 90 Ile Lys Glu Ala Leu Lys Lys Gly Asn Met Glu Asp Met Asp Asp Asp 100 105 110 Phe Gly Leu Arg Ser Cys Lys His Gly Lys Lys His Lys His Asp Lys 115 120 125 His Gly Lys Lys His Gly Lys Lys His Asp Lys Asp His Asp Asp Lys 130 135 140 Asp His Asp His His Asp Glu Asp His Ser Asp Lys His 145 150 155 INFORMATION FOR SEQ ID
NO:628: SEQUENCE CHARACTERISTICS: LENGTH: 677 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .677 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:628: Ile Gin Arg Tyr Trp Asn Gly Ala Leu Met Lys Asn 1 Pro Phe Ala Leu Lys Ile Leu Ile Val1 145 Asn Pro Ser Asn Sen 225 Gly Sen Leu Phe Tyr Asn Val1 Asp Thn Lys 130 Ala Ile Phe Pro Gly 210 Asn Ile Ser Thr Cys Ala Gly Giu Ile Sen 115 Lys Lys Ala Ser Sen 195 Tyr Gly Asp Ser Lys Phe Leu Ala Ala Glu 100 Sen Sen Ile Gin Asn 180 Asn Gly Ala Thn Sen 5 Ala Ile Ser Ile His Ile Ala Thr Leu Thr 165 Gly Asn Ala Asn Asp 245 Gly Leu Leu Ile Giu 70 Giy Thn Lys Ile Giy 150 Lys Ser Ala Asn Gly 230 Gly Gly Met Gly Ile 55 Ser Phe Ser Thr Leu 135 Val Ala Asp Ile Gly 215 Sen Val1 Ser Lys Ala 40 Thr Arg Tyr Leu Sen 120 Val1 Ser Ala Sen Asn 200 Asn His Leu Val Thr 25 Phe Thr Val1 Phe Leu 105 Leu Leu Lys Asn Ser 185 Gly Asp Sen Gly Gly 10 Tyr Leu Lys Val Arg 90 Ang Lys Lys Giu Asp 170 Phe Lys Gly Asn Val1 250 Gly Pro Leu Giu Leu 75 Asn Asp Ile Gly Giu 155 Pro Tyr Asp Val Asn 235 Asp Tyr Tyr Gly Ile Gly Asn Asn Pro Giu 140 Tyr Met Asp Gly Asn 220 Asn Gly Gin Asn Leu Asp Lys Ala Gin Pro 125 Asn Gin Tyr Asn Ala 205 Gly Ala Val His His Leu Ala Arg Thr Ser 110 Asn Ala Lys Ala Asn 190 Asn Ile Ile Asn Lys Phe Sen Asn Val1 Asn Phe Ala Giu Leu Asn 175 Pro Gly Ser Gly Gly 255 Phe Asn Leu Pro Leu Phe Sen Pro Lys Glu Giu 160 Thr Asn Sen Gly Ser 240 Ser Thr Giu Asn Asn 260 265 270 Asn His Gly Sen Thr Asn Asn Asn Thr Gly Gly Tyr Asp Asn Phe Asn WO 97/37044 PCT/US97/05223 555 275 280 285 Asn Gly Ser Ser Ser Gly Gly Ser Leu Gly Asn Gly Gly Leu Phe Pro 290 295 300 Ile Pro Phe Gly Asn Gly Asp Thr Asn Asn Ser Asn Asn Ser Thr Asn 305 310 315 320 Thr Thr Ser Pro Thr Asn Gly Ser Ser Ser Asn Asn Ala Thr Asn Pro 325 330 335 Ser Ser Gin Glu Asn Asn Tyr Ser Ser Gin Tyr Cys Lys Val Pro Glu 340 345 350 Leu Ser Pro Asn Asn Thr Met Lys Leu 
Asp Val Ile Ala Lys Asp Gly 355 360 365 Ser Cys Ile Ser Met Asn Ala Leu Arg Asp Asp Thr Lys Cys Ala Tyr 370 375 380 Arg Tyr Asp Phe Glu Ala Gly Lys Ala Ile Lys Gln Thr Gln Tyr Tyr 385 390 395 400 Tyr Val Asp Arg Glu Asn Lys Thr Gln Asn Ile Gly Gly Cys Val Asp 405 410 415 Leu Gln Gly Ala Gln Tyr Ala Met Gln Leu Tyr Lys Asp Asp Ser Lys 420 425 430 Cys Ala Leu Gln Thr Thr Ser Asp Lys Gly Tyr Gly Met Gly Lys Thr 435 440 445 Gln Thr Phe Gln Thr Glu Ile Val Phe Arg Gly Met Asp Asn Leu Ile 450 455 460 His Val Ala Val Pro Cys Ser Asp Tyr Ala Arg Val Gln Asp Arg Ile 465 470 475 480 Val Arg Tyr Glu Lys Asn Asp Lys Thr Gln Thr Leu Thr Pro Ile Val 485 490 495 Asp Gln Tyr Tyr Asn Asp Pro Asn Asn Pro Asn Lys Gln Glu Ile Leu 500 505 510 Asn Arg Gly Ile Ala Thr Gln Leu Ser Ser Gln Tyr Gln Glu Phe Ala 515 520 525 Cys Gly Gln Trp Glu Tyr Asn Asp Ala Lys Leu Glu Ala Lys Arg Pro 530 535 540 Thr Met Leu Lys Ser Tyr Asn Lys Leu Asn Gly Glu Trp Val Glu Val 545 550 555 560 Thr Pro Cys Asn Phe Glu Ala Gly Ile Lys Ser Gly Ala Val Val Ser 565 570 575 Pro Tyr Val Met Gly Val Pro Ser Ser Lys Val Leu Ser Asp Ile Thr 580 585 590 Thr Ser His Tyr Phe Arg Ile Glu Arg Lys Asn Tyr Gly Glu Arg Glu 595 600 605 Gln Cys Gln Lys Leu Tyr Gly Val Asn Arg Cys Gln Pro Gln Tyr Ser 610 615 620 Ile Leu Ile Leu Val Ser Pro Ile Gly Ala Pro Leu Thr Lys Pro Leu 625 630 635 640 Pro Pro Lys Pro Leu Asn Leu Ile Tyr Ala Gln Pro Lys Ile Met Lys 645 650 655 Asn Thr Pro Gln Pro Ile Ile Leu Ser Pro Leu Lys Pro Pro Ser Thr 660 665 670 Gly Leu Lys Ala Phe 675 INFORMATION FOR SEQ ID NO:629: SEQUENCE CHARACTERISTICS: LENGTH: 113 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:629: Met Ala Lys Met Asn Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu 1 5 10 Asp Arg Cys Lys Gly Cys Asp Ile Cys Val Ser Val Cys Pro Ala
Gly 25 Val Leu Gly Met Gly Ile Glu Lys Glu Arg Val Leu Gly Lys Val Ala 40 Lys Val Ala Tyr Pro Glu Ser Cys Ile Gly Cys Val Gln Cys Glu Leu 55 His Cys Pro Asp Phe Ala Ile Tyr Val Ala Asp Arg Lys Asp Phe Lys 70 75 Phe Ala Lys Val Ser Lys Glu Ala Gln Glu Arg Ser Glu Lys Val Lys 90 Ala Asn Lys Tyr Met Leu Leu Glu Glu Thr Ile Leu Glu Gly Arg Gly 100 105 110 Lys INFORMATION FOR SEQ ID NO:630: SEQUENCE CHARACTERISTICS: LENGTH: 329 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...329 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:630: Met Thr Thr Lys Arg Val Asn Thr Ala Thr Asn Lys Ile Met Thr Leu 1 5 10 Asn Thr Phe Leu Asp Thr Cys Phe Leu Phe Phe Ile Ser Ile Leu Phe 25 Tyr Leu Ser Ile Pro Ile Tyr Pro Asn Lys Val Val Val Val Pro Gln 40 Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gln Gly Val Asp 55 Ile Asn Ala Leu Asp Leu Leu Leu Leu Arg Leu Met Gly Met Pro Lys 70 75 Lys Gly Tyr Ile Asp Met Gly Asp Gly Ala Leu Arg Lys Gly Asp Phe 90 Leu Val Arg Leu Ile Lys Ala Lys Thr Ala Gln Lys Ser Val Thr Leu 100 105 110 Ile Pro Gly Glu Thr Arg Tyr Phe Phe Thr Gln Ile Leu Ser Glu Thr 115 120 125 Tyr Gln Leu Glu Thr Ser Asp Leu Asn Glu Ala Tyr Glu Ser Ile Ala 130 135 140 Pro Arg Leu Asn Gly Ala Val Ile Glu Asp Gly Val Ile Trp Pro Asp 145 150 155 160 Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys Ile Met Gln Thr 165 170 175 Leu Ile Gly Gln Ser Met Lys Lys His Glu Ala Leu Ser Lys Gln Trp 180 185 190 Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys Ile Ile Leu Ala 195 200 205 Ser Ile Val Gln Lys Glu Ala Ala Asn Val Glu Glu Met Pro Leu Ile 210 215 220 Ala Ser Val Ile Phe Asn Arg Leu Lys Lys Gly Met Pro Leu Gln Met 225 230 235 240 Asp Gly Ala Leu Asn Tyr Gln Glu Phe Ser His Ala Lys Val Thr Lys 245 250 255 Glu Arg Ile Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr Lys Phe Lys 260 265 270 Gly Leu Pro Lys Asn Pro Val Gly Ser Val
Ser Leu Glu Ala Val Arg 275 280 285 Ala Val Val Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe Val Lys Met 290 295 300 Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu His Leu Lys 305 310 315 320 Asn Ile Asn Ile Ser Asn Asn His Phe 325 INFORMATION FOR SEQ ID NO:631: SEQUENCE CHARACTERISTICS: LENGTH: 148 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...148 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:631: Lys Phe Met Arg Leu Leu Phe Leu Leu Leu Ser Ala Thr Leu Met Leu 1 5 10 Leu Ala Glu Glu Lys Ile Pro Leu Ser Asp Asp Ala Pro Ile Lys Leu 25 Val His Trp Gln Asn Ala Leu Lys Glu Val Gln Pro Asp Ser Asn Ala 40 Pro Ala Thr Pro Pro Ile Lys Ala Val Gln Thr Thr Leu Thr Phe Glu 55 Thr Pro Phe Asn Lys Thr Pro Lys Ile Met Glu Val Glu Gly Gln Lys 70 75 Val Ile Val Leu Lys Asn Ala Gln Leu Asp Ser Lys Lys Thr Met Asp 90 Phe Lys Glu Ala Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gln Asn 100 105 110 Asp Ile Tyr Leu Leu Ser Lys Lys Ala Lys Ala Gly Leu Glu Ile Gln 115 120 125 Ala Ser Ser Ser Lys Asp Lys Lys Gln Leu Ala Phe Phe Phe Tyr Pro 130 135 140 Lys Val Phe Ile 145 INFORMATION FOR SEQ ID NO:632: SEQUENCE CHARACTERISTICS: LENGTH: 261 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...261 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:632: Glu Leu Ile Leu Lys Lys Lys Lys Glu Arg Asn Leu Met Lys Lys Gly 1 5 10 Ser Leu Ala Ile Val Leu Gly Ser Leu Leu Ala Ser Gly Thr Phe Tyr 25 Thr Ala Leu Ala Asp Gly Met Pro Met Lys Gln Gln His Asn Asn Met 40 Gly Glu Ser Val Glu Leu His Phe His Tyr Pro Ile Lys Gly Lys Gln 55 Glu Pro Lys Asn Asn His Leu Val Val Leu Ile Asp Pro Lys Ile Glu 70 75 Ala Asn Lys Val Ile Pro Glu Asn Tyr Gln Lys Glu Phe Glu Lys Ser 90 Leu Phe Leu Gln Leu Ser Asn Phe Leu Glu
Arg Lys Gly Tyr Ser Val 100 105 110 Ser Gln Phe Lys Asp Val Ser Glu Ile Pro Gln Asp Ile Lys Glu Lys 115 120 125 Ala Leu Leu Val Leu Arg Met Asp Gly Asn Val Ala Ile Leu Glu Asp 130 135 140 Ile Val Glu Glu Ser Asp Ala Leu Ser Glu Glu Lys Val Ile Asp Met 145 150 155 160 Ser Ser Gly Tyr Leu Asn Leu Asn Phe Val Glu Pro Lys Ser Glu Aso 165 170 175 Ile Ile His Ser Phe Gly Ile Asp Val Ser Lys Ile Lys Ala Val Ile 180 185 190 Glu Arg Val Glu Leu Arg Arg Thr Asn Ser Gly Gly Phe Val Pro Lys 195 200 205 Thr Phe Val His Arg Ile Lys Glu Thr Asp His Asp Arg Ala Ile Lys 210 215 220 Lys Ile Met Asn Gln Ala Tyr His Lys Val Met Ala His Ile Thr Lys 225 230 235 240 Glu Leu Ser Lys Lys His Met Glu Arg Tyr Glu Lys Val Ser Ser Glu 245 250 255 Met Lys Lys Arg Lys 260 INFORMATION FOR SEQ ID NO:633: SEQUENCE CHARACTERISTICS: LENGTH: 268 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...268 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:633: Asp Gln Lys Phe Lys Gly Ile Ile Leu Ala Met Lys Ile Ser Val Ser 1 5 10 Lys Asn Asp Leu Glu Asn Thr Leu Arg Tyr Leu Gln Ala Phe Leu Asp 25 Lys Lys Asp Ala Ser Ser Ile Ala Ser His Ile His Leu Glu Val Ile 40 Lys Glu Lys Leu Phe Leu Lys Ala Ser Asp Ser Asp Ile Gly Leu Lys 55 Ser Tyr Ile Ser Thr Gln Ser Thr Asp Lys Glu Gly Val Gly Thr Ile 70 75 Asn Gly Lys Lys Phe Leu Asp Ile Ile Ser Cys Leu Lys Asp Ser Asn 90 Ile Val Leu Glu Thr Lys Asp Asp Ser Leu Val Ile Lys Gln Asn Lys 100 105 110 Ser Ser Phe Lys Leu Pro Met Phe Asp Ala Asp Glu Phe Pro Glu Phe 115 120 125 Pro Val Ile Asp Pro Lys Val Ser Leu Glu Ile Asn Ala Pro Phe Leu 130 135 140 Val Asp Ala Phe Lys Lys Ile Ala Pro Val Ile Glu Gln Thr Ser His 145 150 155 160 Lys Arg Glu Leu Ala Gly Val Leu Met Gln Phe Asn Gln Lys His Gln 165 170 175 Thr Leu Ser Val Val Gly Thr Asp Thr Lys Arg Leu Ser Tyr Thr Gln 180 185 190 Leu Glu Lys Ile Ser Ile His Ser Thr Glu Glu Asp Ile Ser Cys Ile 195 200 205 Leu Pro Lys Arg Ala Leu Leu Glu Ile Leu Lys Leu Phe Tyr Glu Asn 210 215 220 Phe Ser Phe Lys Ser Asp Gly Met Leu Ala Val Val Glu Asn Glu Thr 225 230 235 240 His Ala Phe Phe Thr Lys Leu Ile Asp Gly Asn Tyr Pro Asp Tyr Gln 245 250 255 Lys Ile Leu Pro Lys Glu Tyr Thr Ser Leu Ser Leu 260 265 INFORMATION FOR SEQ ID NO:634: SEQUENCE CHARACTERISTICS: LENGTH: 416 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...416 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:634: Asn Pro Leu Ile Gln Lys Asn Lys Ser Leu Ser Ile Phe Leu Ile Ser 1 5 10 Asn Ser Val Val Phe Leu Gly Lys Ile Ile Leu His Lys Val Phe Ile 25 Met Glu Ala Leu Glu Cys Leu Lys Arg Ile Glu Lys Glu Ser Ile Gln 40 Thr Ile Tyr Ile Asp Pro Pro Tyr Asn Thr Lys Ser Ser Asn Phe Glu 55
Tyr Glu Asp Ala His Ala Asp Tyr Glu Lys Trp Ile Glu Glu His Leu 70 75 Ile Leu Ala Lys Ala Val Leu Lys Gin Ser Gly Cys Ile Phe Ile Ser 90 Met Asp Asp Asn Lys Met Ala Glu Val Lys Ile Ile Ala Asn Glu Ile 100 105 110 Phe Gly Thr Arg Asn Phe Leu Gly Thr Phe Ile Thr Lys Gin Ala Thr 115 120 125 Arg Ser Asn Ala Lys His Ile Asn Ile Thr His Glu Tyr Val Leu Ser 130 135 140 Tyr Ala Lys Asn Lys Ala Phe Ala Pro Gly Phe Lys Ile Leu Arg Thr WO 97/37044 PTU9/52 PCT/IJS97/05223 145 Leu Lys Lys Lys Asp 225 Asn Lys Arg Ser Leu 305 Lys Phe Lys Lys Tyr 385 Ile Leu Asn Glu Asn 210 Leu Leu Leu Pro Val1 290 Gly Tyr Phe Asp Ile 370 Lys Lys Pro Val1 Gin 195 Tyr Ser Phe Lys Tyr 275 Leu Leu Leu Ala Tyr 355 Lys Asn Arg Ile Tyr Ala Lys Ala Leu Phe 180 Ile Asn Thr Leu Glu 260 Glu Asp Lys Leu Gly 340 Tyr Asn Thr Ser 165 Lys Lys Leu Pro Glu 245 Leu Lys Phe Gly Leu 325 Ser Leu Asn Ile Giu 405 Gin Giu Val1 Ser 230 Pro Tyr Tyr Tyr Leu 310 Cys Gly Asn Pro Ser 390 Tyr Lys Leu Asp 215 Asn Leu Tyr Tyr Ser 295 Phe Ser Thr Trp Gin 375 Asp Glu Gly Ser 200 Giu Pro Lys Gin Leu 280 Arg Lys Thr Thr Ser 360 Ala Ile Ile Gin 185 Gin Lys Arg Ser Asn 265 Lys Gin Thr Pro Ala 345 Phe Val1 Met Leu Lys 170 Ala Lys Gly Ser Arg 250 Arg Giu Gly Pro Lys 330 Gin Tyr Ser Leu Lys 410 155 Asp Gin Glu Giu Val1 235 Gly Leu Ser Thr Lys 315 Asp Ala Leu Ile Leu 395 Thr Leu Ala His Ile 220 Ala Trp Ile Gin Lys 300 Pro Ser Vali Cys Leu 380 Arg Lys Met Gin Phe 205 Tyr Ile Ser Phe Asp 285 Asp Val1 Ile Ile Gin 365 Lys Leu Ser Arg Thr 175 Leu Ile 190 Asn Phe Phe Ala Gin Glu Ser Asp 255 Lys Asn 270 Asn Cys Leu Glu Aia Leu Ile Leu 335 Giu Val 350 Lys Glu Asn Lys Giu Lys Ile Leu 415 160 Ile Leu Leu Lys Ile 240 Glu Asn Leu Lys Ile 320 Asp Asn Giu Gly Ile 400 Phe INFORMATION FOR SEQ ID NO:635: SEQUENCE CHARACTERISTICS: LENGTH: 76 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iiiJ) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misojfeature LOCATION .76 (xi) SEQUENCE 
DESCRIPTION: SEQ ID NO:635: Asn Phe Ser Gly Glu Ser Lys Lys Arg Leu Lys His Pro Ala Pro Phe 10 Pro Arg Glu Leu Pro Arg Arg Cys Ile Gln Leu Phe Ser Phe Leu Glu 25 Asp Thr Ile Phe Asp Pro Phe Ser Gly Ser Gly Thr Thr Ile Leu Glu 40 Ala Asn Ala Leu Gly Arg Phe Ser Val Gly Leu Glu Ile Glu Lys Glu 55 Tyr Cys Glu Leu Phe Lys Lys Arg Ile Leu Glu Ser 70 INFORMATION FOR SEQ ID NO:636: SEQUENCE CHARACTERISTICS: LENGTH: 217 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...217 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:636: Arg Arg Cys Cys Arg Arg Glu Asp Thr Lys Arg Tyr Val Ile Glu Val 1 5 10 Val Ile Ser Leu Ser Thr Met Thr Ser Leu Leu Ser Ala Glu Thr Pro 25 Lys Gln Glu Lys Ala Ile Lys Thr Ser Pro Thr Lys Lys Gly Glu Arg 40 Asn Ala Ala Phe Ile Gly Ile Asp Tyr Gln Leu Gly Met Leu Ser Thr 55 Thr Ala Gln Asn Cys Ser His Gly Asn Cys Asn Gly Asn Gln Ser Gly 70 75 Ala Tyr Gly Ser Asn Thr Pro Asn Met Pro Thr Ala Ser Asn Pro Thr 90 Gly Gly Leu Thr His Gly Ala Leu Gly Thr Arg Gly Tyr Lys Gly Leu 100 105 110 Ser Asn Gln Gln Tyr Ala Ile Asn Gly Phe Gly Phe Val Val Gly Tyr 115 120 125 Lys His Phe Phe Lys Lys Ala Pro Gln Phe Gly Met Arg Tyr Tyr Gly 130 135 140 Phe Phe Asp Phe Ala Ser Ser Tyr Tyr Lys Tyr Tyr Thr Tyr Asn Asp 145 150 155 160 Tyr Gly Met Arg Asp Ala Arg Lys Gly Ser Gln Ser Phe Met Phe Gly 165 170 175 Tyr Gly Ala Gly Thr Asp Val Leu Phe Asn Pro Ala Ile Phe Asn Arg 180 185 190 Glu Lys Leu Ala Phe Gly Val Phe Leu Gly Arg Cys Asp Trp Trp His 195 200 205 Leu Leu Gly Ser Asn Lys Leu Leu Phe 210 215 INFORMATION FOR SEQ ID NO:637: SEQUENCE CHARACTERISTICS: LENGTH: 107 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...107 (xi) SEQUENCE
DESCRIPTION: SEQ ID NO:637: Gly Gly Val Cys Gly Val Leu Ser Leu Gin Ala His Leu Ser Phe Glu 1 5 10 Asn Cys Ser Lys Pro Leu Asp Cys Ser Leu Phe Ala Thr Thr Cys Thr 25 Pro Gin Asn Pro Ile Gly Ser Cys Met Val Ser Ser Gin Gly Gly Val 40 Cys Gly Val Leu Ser Leu Gin Ala His Leu Ser Phe Glu Asn Cys Ser 55 Lys Pro Leu Asp Cys Ser Leu Phe Ala Thr Thr Cys Thr Pro Gin Asn 70 75 Pro Ile Gly Ser Tyr Met Val Ser Ser Gin Gly Gly Val Arg Gly Val 90 Leu Ser Leu Gin Ala Arg Leu Ser Phe Lys Asn 100 105 INFORMATION FOR SEQ ID NO:638: SEQUENCE CHARACTERISTICS: LENGTH: 384 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...384 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:638: Asn Tyr Ser Asn Lys Asp Asn Pro Met Arg Phe Phe Cys Phe Phe Leu 1 5 10 WO 97/37044 WO 9737044PCT1US97/05223 Phe Phe Leu Thr Gin Thr Asn Ala Leu Tyr Lys Leu Asn Asn Gin Asp Thr Ser Gin Arg Thr Asp Ala Leu 100 Ala Leu Giu Lys 115 Lys Glu Ser Gin 130 Gly Val Lys Asn 145 Lys Asn Asn Ala Leu Ser Arg Leu 180 Val Leu Glu Asn 195 Glu Tyr Asn Ala 210 Arg Leu Lys Lys 225 Gilu Asp Ala Leu Leu Gly Ser Ser 260 Asn Gin Leu Gin 275 Giu Leu Lys Asn 290 Ala Asn Tyr Glu 305 Giu Leu Ser Giu Ala Gin Val Leu 340 Asn Val Ser Asp 355 Arg Leu Asp Pro 370
INFORMATION
The Lys Asn Gin Arg Leu His Ala Leu Leu 165 Asp Giu Leu Ile Lys 245 Asp Gin Ala His Gin 325 Ala Gly His Ser Leu Giu Leu 70 Phe Lys Leu Leu Giu 150 Phe Leu Ile Met Lys 230 Thr Cys Arg Lys Thr 310 Met Leu Leu Gly Asn Ser Ser 55 Lys Phe Gin Lys Phe 135 Glu Leu Met Gin Asn 215 Asn Phe Cys Tyr Asn 295 Leu Ala Leu Ser Phe 375 Ala Gin 25 Arg Ser 40 Leu Arg Giu Ile Asn Ala Ser Ala 105 Giu Ser 120 Leu Gin Ala Ser Leu Lys Ser Ala 185 Asp Gin 200 His Asp Lys Leu Leu Pro Asp Lys 265 Gin Asn 280 Asn Lys Lys Thr Phe Leu Ala Gin 345 Gly Gly 360 Pro Ser Ile met Asn Giu Ile Tyr Lys Lys 75 Ser Gin 90 Leu Giu Met Glu Giu His Asn Ala 155 Giu Pro 170 Leu Asn Gin Ser Phe Gin Gin Ser 235 Leu Giu 250 Glu Asn Ala Leu Giu Lys Leu Asn 315 Asn Glu 330 Gin Gin Lys Ala Phe Lys Met Gin Gin Ala Ile Leu His Cys 140 Leu Lys Ala His Ala 220 Gin Lys Leu Ile His 300 Ile Thr Thr Leu Asn 380 Thr Phe Leu Ser Asn Val Asn Ser Arg Leu Glu Lys 110 Glu Arg 125 Pro Tyr Giu Val Leu Phe Leu Cys 190 Asn Lys 205 Tyr Lys Ile Gin Arg Pro Lys Ser 270 Giu Arg 285 Ala Leu Glu Phe Met Ala Lys Lys 350 Ile Lys 365 Phe Lys Asp Asp Leu Thr Met Leu Leu Leu Gin His 175 Asp Thr Ala Ala Glu 255 Cys Asp Ile Leu Leu 335 Pro Asn Gin Ser Met Ser Leu Asp Gin Ile Ser Giu 160 Leu Gin Leu Met Lys 240 Thr Ala Lys Leu Ser 320 Asn Phe Ile Glu FOR SEQ ID NO:639: SEQUENCE CHARACTERISTICS: LENGTH: 298 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 WO 9737044PCTIUS97/05223 565 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION 1 .298 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:639: Leu Val. 
Phe Lys Lys Pro Phe Phe Lys Asn Arg Leu 1 Asn His Lys Ala Leu Gin Val Thr Pro 145 Lys Ser Ile Asn Leu 225 Val1 Tyr Ala Ile Lys Tyr Gin s0 Met Arg Asn Leu His 130 Asn Tyr Lys Ser Ser 210 Ala Met Gly Thr Gin 290 Leu Cys Gly Ala Lys Pro Lys 115 Ile Phe Gly Lys Leu 195 Arg Arg Leu Leu Gin 275 Asp Lys Arg Ala Ile Ile Ser 100 Asp Ser Phe Lys Gly 180 Asp Trp Arg Ala Ala 260 Phe His 5 Phe Lys Lys Ile Val1 Pro Ile Gly Tyr Pro 165 Arg Val1 Ser Val1 Lys 245 Val1 Asn Glu Val1 Ile Lys Val1 70 Phe Cys Asn Ile Leu 150 Ile Ser Arg Pro Thr 230 Glu Val Pro Cys Lys Thr Met 55 Val Glu Ile Gly Giu 135 Ser Pro Gin Lys Leu 215 Giu Val1 Gli Leu Ala 295 Ile Arg 40 Lys Ser Lys Glu Pro 120 Asn Trp Asp Asn Gin 200 Ser Ser Pro Gin Thr 280 Ile Thr 25 Gly Lys Leu Cys Val 105 Leu Pro Gin Tyr His 185 Leu Giy Glu Asn Ser 265 Leu Leu 10 Ile Phe Ala Asn Leu 90 Lys Gin Leu Ala Ala 170 Phe Asp Gly Leu Ala 250 Asp Asn Arg Met Val1 Gly Al a 75 Pro Pro Tyr Leu Arg 155 Ile His Asn Leu.
Ala 235 His Asn Arg Ile Lys Phe Lys Asn Asp Leu Leu 140 Asp Ser Ile Asn Asn 220 Gin Lys Ser Ala Leu Asn Lys Asp Ile Pro Leu Phe Asp Pro Tyr Glu Ala Gly 1710 Leu Met 125 Asp Pro Phe Met Leu Thr His Ile 190 Leu Lys 205 Gly His Lys Ser Arg Met Phe Val 270 Ser Ala 285 Val1 Phe Thr Leu Asn Lys Tyr Pro Ser Ser Ile 17S Ser Asn Lys Pro Gly 255 Leu Glu Thr Asn Lys Ala Val Asn Val Thr Thr Lys 160 Asn Cys Ile Tyr Phe 240 Asp Leu Glu INFORMATION FOR SEQ ID NO:640: SEQUENCE CHARACTERISTICS: LENGTH: 179 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 566 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...179 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:640: Leu Arg Arg Leu Ala Leu Asn Thr Met Asn Ser Val Leu Glu Cys Lys 1 5 10 Glu Leu Ala Leu Tyr Gly Gly Ser Phe Asp Pro Leu His Lys Ala His 25 Leu Ala Ile Ile Glu Gin Thr Leu Glu Leu Leu Pro Phe Val Gin Leu 40 Ile Val Leu Pro Ala Tyr Gin Asn Pro Phe Lys Lys Pro Cys Phe Leu 55 Asp Ala Lys Thr Arg Phe Lys Glu Leu Glu Arg Ala Leu Lys Gly Met 70 75 Pro Arg Val Leu Leu Ser Asp Phe Glu Ile Lys Gin Glu Arg Ala Val 90 Pro Thr Ile Glu Ser Val Leu His Phe Gin Lys Leu Tyr Arg Pro Lys 100 105 110 Thr Leu Tyr Leu Val Ile Gly Ala Asp Cys Leu Arg His Leu Ser Ser 115 120 125 Trp Thr Asn Ala Lys Glu Leu Leu Lys Arg Val Glu Leu Val Val Phe 130 135 140 Glu Arg Ile Gly Tyr Glu Lys Ile Gin Phe Lys Gly Arg Tyr His Pro 145 150 155 160 Leu Lys Gly Ile Asp Ala Pro Ile Ser Ser Ser Ala Ile Arg Ala Ser 165 170 175 Leu Gly Val INFORMATION FOR SEQ ID NO:641: SEQUENCE CHARACTERISTICS: LENGTH: 269 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...269 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:641: Ser Ser Ile Phe Asp Tyr Asn Tyr Lys Gly Leu His Phe Ser Asn Lys 1 5 10 Arg Tyr Ile Met Leu Gly Ser Val Lys Lys Thr Leu Phe Gly Val Leu 25 Cys Leu Gly Ala Leu Cys Leu Arg Gly Leu Met Ala Glu Pro Asp Ala 40 Lys Glu Leu Val Ser Leu Gly Ile Glu Ser Val Lys Lys Gln Asp Phe 55 Ala Gln Ala Lys Ala His Phe Glu Lys Ala Cys Glu Leu Lys Glu Gly 70 75 Phe Gly Cys Val Phe Leu Gly Ala Phe Tyr Glu Glu Gly Lys Gly Val 90 Gly Lys Asp Leu Lys Lys Ala Ile Gln Phe Tyr Thr Lys Gly Cys Glu 100 105 110 Leu Asn Asp Gly Tyr Gly Cys Arg Leu Leu Gly Asn Leu Tyr Tyr Asn 115 120 125 Gly Gln Gly Val Ser Lys Asp Ala Lys Lys Ala Ser Gln Tyr Tyr Ser 130 135 140 Lys Ser Cys Glu Leu Asn His Ala Glu Gly Cys Thr Val Leu Gly Ser 145 150 155 160 Leu His His Tyr Gly Val Gly Thr Pro Lys Asp Leu Arg Lys Ala Leu 165 170 175 Asp Leu Tyr Glu Lys Ala Cys Asp Leu Lys Asp Ser Pro Gly Cys Ile 180 185 190 Asn Ala Gly Tyr Met Tyr Gly Val Ala Lys Asn Phe Lys Glu Ala Ile 195 200 205 Val Arg Tyr Ser Lys Ala Cys Glu Leu Lys Asp Gly Arg Gly Cys Tyr 210 215 220 Asn Leu Gly Val Met Gln Tyr Asn Ala Gln Gly Thr Ala Lys Asp Glu 225 230 235 240 Lys Gln Ala Val Glu Asn Phe Lys Lys Gly Cys Lys Ser Ser Val Lys 245 250 255 Glu Ala Cys Asp Ala Leu Lys Glu Leu Lys Ile Glu Leu 260 265 INFORMATION FOR SEQ ID NO:642: SEQUENCE CHARACTERISTICS: LENGTH: 86 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...86 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:642: His Phe Asn Ile Phe Thr Glu Asp Asn Arg Glu Ile Ser Gly Ser Thr 1 5 10 Asp Lys Leu Ser Tyr Asn Ala Leu Asn Gly Glu Tyr 25 Asn Ala Val Val Arg Glu Val Gly Lys Ser Asn Val 40 Glu Ile Ile Leu Asn Lys Thr Lys Gly Tyr Ala Asp 55 Ala Lys Arg Pro
Leu Asn Leu Cys Leu Ile Trp Lys 70 Lys Ile Val Arg Leu Asn INFORMATION FOR SEQ ID NO:643: SEQUENCE CHARACTERISTICS: LENGTH: 92 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:643: Pro Lys Ser Pro Lys Lys Ile Val Lys Arg Tyr Glu 1 5 10 Lys Met Pro Lys Glu Asn Thr Thr His Glu Asp Ala 25 Phe Ala Pro Ala Ile Phe Lys Ala Ile Met Leu Asn 40 Ala Val Val Ala Ala Pro Ile Ile Lys Met Leu Lys 55 Ile Trp Trp Lys Lys Met Gly Lys Lys His Ala Ile 70 75 Arg Ile Lys Ile Leu Ala Ser Leu Ala Pro Ser Phe INFORMATION FOR SEQ ID N0:644: SEQUENCE CHARACTERISTICS: LENGTH: 185 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori Lys Leu Leu Ile Thr Gly Val Leu Gly Thr Leu Met Lys Ser Ala Trp Leu Ala Lys Cys Lys Leu Ile Leu Leu Ser Lys Glu Asp Phe Lys Met WO 97/37044 PCT/US97/05223 569 (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...185 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:644: Asn Lys Asp His Leu Met Leu Asn Lys Phe Lys Lys Ile Val Gly Val 1 5 10 Gly Val Leu Val Gly Cys Leu Gly Val Leu Gin Ala Lys Asn Ser Leu 25 Phe Val Leu Pro Tyr Glu Gin Arg Asp Ala Leu Asn Ser Leu Ile Ser 40 Gly Ile Ser Ser Ala Arg Glu Asn Val Lys Ile Ala Ile Tyr Ser Phe 55 Thr His Arg Asp Ile Ala Arg Ala Ile Lys Ser Val Ala Ser Arg Gly 70 75 Ile Lys Val Gin Ile Ile Tyr Asp Tyr Glu Ser Asn His Asn Asn Lys 90 Gin Ser Thr Ile Gly Tyr Leu Asp Lys Tyr Pro Asn Thr Lys Val Cys 100 105 110 Leu Leu Lys Gly Leu Lys Ala Lys Asn Gly Asn Tyr Tyr Gly Ile Met 115 120 125 His Gin Lys Val Ala Ile Ile Asp Asp Lys Ile Val Phe Leu Gly Ser 130 135 140 Ala Asn Trp Ser Lys Asn Ala Phe Glu Asn Asn Tyr Glu Val Phe Leu 145 150 155 160 Lys Arg Asp Asp Thr Glu Thr Ile Phe Lys Ala Lys Ser Tyr Tyr Gin 165 170 175 Lys Met Leu Glu Gly Cys Val Gly Phe 180 185 
INFORMATION FOR SEQ ID NO:645: SEQUENCE CHARACTERISTICS: LENGTH: 215 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:645: Gly Arg Thr Met Lys Tyr Leu Trp Leu Phe Leu Ile Tyr Ala Ile Gly 1 5 10 Leu Phe Ala Thr Asp Lys Thr Leu Asp Ile Ile Lys Thr Ile Gln Lys 25 Leu Pro Lys Ile Glu Val Arg Tyr Ser Ile Asp Asn Asp Ala Asn Tyr 40 Ala Leu Lys Leu His Glu Val Leu Ala Asn Asp Leu Lys Thr Ser Gln 55 His Phe Asp Val Ser Gln Asn Lys Glu Gln Gly Ala Ile Asn Tyr Ala 70 75 Glu Leu Lys Asp Lys Lys Val His Leu Val Ala Leu Val Ser Val Ala 90 Val Glu Asn Gly Asn Lys Ile Ser Arg Leu Lys Leu Tyr Asp Val Asp 100 105 110 Thr Gly Thr Leu Lys Lys Thr Phe Asp Tyr Pro Ile Val Ser Leu Asp 115 120 125 Leu Tyr Pro Phe Ala Ala His Asn Met Ala Ile Val Val Asn Asp Tyr 130 135 140 Leu Lys Ala Pro Ser Ile Ala Trp Met Lys Arg Leu Ile Val Phe Ser 145 150 155 160 Lys Tyr Ile Gly Pro Gly Ile Thr Asn Ile Ala Leu Ala Asp Tyr Thr 165 170 175 Met Arg Tyr Gln Lys Glu Ile Ile Lys Asn Asn Arg Leu Asn Ile Phe 180 185 190 Pro Lys Trp Ala Asn Ala Glu Gln Thr Glu Phe Tyr Tyr Thr Gln Met 195 200 205 Ala Glu Lys Arg Pro Trp Phe 210 215 INFORMATION FOR SEQ ID NO:646: SEQUENCE CHARACTERISTICS: LENGTH: 176 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...176 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:646: Ala Ala Met Met Phe Asn Asn Asp Ile Asp Ser Ala Thr Gly Phe Tyr 1 5 10 Lys Pro Leu Ile Lys Ile Asn Ser Ala Gln Asp Leu Ile Lys Asn Thr 25 Glu His Val Leu Leu Lys Ala Lys Ile Ile Gly Tyr Gly Asn Val Ser 40 Thr Gly Thr Asn Gly Ile Ser Asn Val Asn Leu Glu Glu Gln Phe Lys 55 Glu Arg Leu Ala Leu Tyr Asn Asn Asn Asn Arg Met Asp Thr Cys Val 70 75 Val Arg Asn Thr
Asp Asp Ile Lys Ala Cys Gly Met Ala Ile Gly Asn 90 Gln Ser Met Val Asn Asn Pro Asp Asn Tyr Lys Tyr Leu Ile Gly Lys 100 105 110 Ala Trp Arg Asn Ile Gly Ile Ser Lys Thr Ala Asn Gly Ser Lys Ile 115 120 125 Ser Val Tyr Tyr Leu Gly Asn Ser Thr Pro Thr Glu Asn Gly Gly Asn 130 135 140 Thr Thr Asn Leu Pro Thr Asn Thr Thr Asn Asn Ala His Ser Ala Asn 145 150 155 160 Tyr Ala Leu Val Lys Asn Ala Pro Phe Ala His Asn Thr Thr Pro Asn 165 170 175
INFORMATION FOR SEQ ID NO:647: SEQUENCE CHARACTERISTICS: LENGTH: 188 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...188 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:647: Ile Phe Arg Ala Phe Arg Arg Gly Phe Phe Arg 1 Arg Asn Asn Glu Thr Ser Asn Leu Lys 145 Ile Tyr Phe Arg Phe Ser His Leu Ala Gln 130 Asp Lys Glu Thr Thr Ala Val Ile Met Ile 115 Leu Gly Ser Ile Ala Lys Ile Phe His Arg 100 Lys Lys Glu Asp Lys 180 5 His Ser Gly Glu Pro Asp Asp Leu Phe Lys 165 Glu His Asp Thr Lys 70 Phe Ser Leu Asn Lys 150 Pro Arg Asp Arg Asp 55 Ala Ile Val Ile Thr 135 Ala Leu Glu Phe Glu 40 Phe Phe Tyr Ser Phe 120 Pro Leu Arg Asn Met 25 Leu Ile Lys Ser Leu 105 His Leu Asp Ala His 185 10 Leu Leu Val Asn Lys 90 Glu Lys Lys Gln Ser 170 Ala Glu Glu Gly Leu 75 Arg Asp Asn Ala Phe 155 Phe Val Lys Arg Ile His Glu Lys Ser Lys Leu 140 Phe Leu Phe Thr Met Ile Pro Ser Asp Lys Ala 125 Val Asn Glu Phe Asn Ala Gly Leu Thr Lys 110 Phe Glu Pro Ile Ala Arg Ser Glu Pro Pro Arg Arg Ala Ile Lys 175 Tyr Arg Lys Ser Leu Ser Leu Gln Gln Lys 160 Glu INFORMATION FOR SEQ ID NO:648: SEQUENCE CHARACTERISTICS: LENGTH: 79 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...79 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:648: Lys Ala Leu Pro Gln Gly Arg Asn Arg Met Arg Ser Asp Val Glu Val 1 5 10 Leu Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His Leu Lys 25 Ile Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val Tyr Leu 40 Ala His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser Tyr Gly 55 Phe Ser His
Gln Asp Arg Ala Ile Ile Cys Leu Leu Gly Ala Ile 70 INFORMATION FOR SEQ ID NO:649: SEQUENCE CHARACTERISTICS: LENGTH: 235 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...235 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:649: Lys Gly Thr Lys Met Lys Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg 1 5 10 Thr Asn Leu Phe Asp Thr Gln Val Met Gly Glu Asn Leu Lys Asp Phe 25 Ser Ala Thr Leu Glu Glu Gln Glu Ala Asp Ile Ile Ile Ile Asn Ser 40 Cys Thr Val Thr Asn Gly Thr Asp Ser Ala Val Arg Ser Tyr Ala Arg 55 Lys Met Ala Arg Leu Asp Lys Glu Val Leu Phe Thr Gly Cys Gly Val 70 75 Lys Thr Gln Gly Lys Glu Leu Phe Glu Lys Gly Leu Leu Lys Gly Val 90 Phe Gly His Asp Asn Lys Glu Lys Ile Asn Ala Leu Leu Gln Glu Lys 100 105 110 Lys Arg Phe Phe Ile Asp Asp Asn Leu Glu Asn Lys His Leu Asp Thr 115 120 125 Thr Met Val Ser Glu Phe Val Gly Lys Thr Arg Ala Phe Ile Lys Ile 130 135 140 Gln Glu Gly Cys Asp Phe Asp Cys Asn Tyr Cys Ile Ile Pro Ser Val 145 150 155 160 Arg Gly Arg Ala Arg Ser Phe Glu Lys Arg Lys Ile Leu Glu Gln Val 165 170 175 Gly Leu Leu Cys Ser Gln Gly Val Gln Glu Val Val Leu Thr Gly Thr 180 185 190 Asn Val Gly Ser Tyr Gly Lys Asp Arg Gly Ser Asn Ile Ala Arg Leu 195 200 205 Ile Lys Lys Leu Ser Gln Ile Thr Gly Leu Lys Arg Ile Arg Ile Gly 210 215 220 Ser Leu Asp Leu Ile Lys Ser Thr Met Asn Phe 225 230 235 INFORMATION FOR SEQ ID NO:650: SEQUENCE CHARACTERISTICS: LENGTH: 101 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...101 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:650: Val Thr Ala Leu Ala Thr Arg Ile Gly Arg Ser Phe Ala Tyr Trp Ala 1 5 10 Gln Phe Ser His Lys Lys Ile Leu Lys Asp Tyr Val Ile Ala His Met 25 Ser Ala Met Met Pro Ser Leu Leu Thr Leu Gln Trp Leu Ser Phe Ile 40 Leu
Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys 55 Tyr Thr Leu Glu Lys Asn Lys Leu Val Ile His Ser Asn Asp Ala Leu 70 75 Tyr Leu Ala Lys Glu Met Leu Pro Lys Leu Ile Lys Pro Ile Pro Trp 90 Thr Ile Glu Phe Ala 100 WO 97/37044 PTU9/52 PCTIUS97/05223 574 INFORMATION FOR SEQ ID NO:651: SEQUENCE CHARACTERISTICS: LENGTH: 409 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 409 (xi) SEQUENCE DESCRIPTION: SEQ ID Ser Asp Ala Phe Leu Ile Ile Gin Asp Phe Lys Giu 1 His Val1 Asp Phe Leu Phe Val1 Val Leu 145 Ala S er Lys Ser Glu 225 Giu Phe Lys Lys Glu Ile Gly Ala Gly Ile 130 Asn Pro Arg Arg Pro 210 Val Asp Phe Ile Arg Lys Ser Lys Tyr Leu 115 Gin Ala Phe Lys Leu 195 Lys Asn His Leu Ile Asp Leu Ang Gly Ang 100 Tyr Pro Val1 Gly Ala 180 Lys Phe Phe Gin Lys 260 5 Gin Ser Ile Leu Giu His Arg Arg Tyr Lys 165 Met Ser Tyr Asp Lys 245 Lys Thr Glu Leu Pro 70 Ile Ile Asn Asp His 150 Ser Met Tyr His Phe 230 Lys His His Lys Ile 55 Asn Met Gly Asp Ile 135 Gin Ile Arg Lys Lys 215 Tyr Met Arg Phe Thr 40 Asp Ile Glu Ser Val1 120 Leu Val Tyr Asp Leu 200 Phe Gly Gly Arg Lys 25 Leu Giu Pro Ile Ile 105 Leu Leu Lys Leu Val1 185 Tyr Leu Lys Leu Ala 265 10 Arg Giu Phe Ser Asp 90 Ang Leu Val1 Sen Tyr 170 Tyn Ile Ser Ser Ile 250 Leu Met Asn Giu Thr 75 Val1 Gin Leu Ala Asn 155 Ile Gin Gin Leu Phe 235 Val1 His Arg Asn Val1 Pro Pro Lys Ser Gly 140 Val1 Asp Ala Val1 Giu 220 Ile Val Lys Gin Val1 Giu Leu Arg Phe Giu Thr 125 Asn Gly Met Leu Leu 205 Thr Gin Gly Thr Arg Val1 Glu Ala Glu Gly Tyr 110 Lys Pro Gin Arg Phe 190 His 0Th Lys Arg Ala 270 Ile Leu Asn Asn Phe Sen Ang Sen 0Th Phe Leu 175 Leu Pro Ser Leu Glu 255 Thr Ile Ser Lys Lys Gly Ile Ile Leu Ile Pro 160 Gin His Thr Ile His 240 Leu Pro Val Tyr Lys 275 ValTyrLysThr Asn Thr Ser Gly Leu Ser Lys Thr Thr Gin Ser Ile 275280 285 WO 97/37044 PCT/US97/05223 Val Ile 305 Phe Asn Ile His Phe 385 Pro Val 290 Phe 
Asp Leu Lys Phe 370 Leu Gin Leu Asp Pro Ala Asn 355 Met Ser Ile Asn Val Asn Asn 340 Pro Pro Thr Phe Glu Ser Lys 325 Thr Ile Phe Lys Ile Ser Met 310 His Phe Met Glu Val 390 Pro Leu 295 Gin Tyr Asn Tyr Glu 375 Glu Val Ser Met Lys Arg Leu 360 Cys Lys Ala lie Asp Asn Lys 345 Asn Ile Leu Glu 575 Asn Glu Leu Gly 315 Glu Ile 330 Ile Glu Ser Leu Thr Gin Ala Phe 395 Asp 300 Leu Val Ile Arg Thr 380 Leu Met Leu Asn Phe Asn 365 Arg Asn Ser Leu His Gin 350 Pro Tyr Asp Ser Tyr Tyr 335 Thr Ile Leu Asp Val Asp 320 Glu Asp Leu Trp His 400 405 INFORMATION FOR SEQ ID NO:652: SEQUENCE CHARACTERISTICS: LENGTH: 64 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...64 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:652: Pro Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys Ile Ser Lys 1 5 10 Ser Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met Thr His 25 Ser Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser Lys Ala 40 Leu Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala Lys Val 55 INFORMATION FOR SEQ ID NO:653: SEQUENCE CHARACTERISTICS: LENGTH: 85 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 576 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...85 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:653: Ser Ile Val Tyr Gly Lys Asn Lys Leu Phe Ser Asp Glu Phe Tyr Glu 1 5 10 Lys Ile Glu Asp Ile Leu Thr Asn Asn Asn Pro Arg Tyr Lys Gin Val 25 Cys Ile Ile Phe Asp Ala Asp Ile Lys Lys Glu Asn Gin Glu Ser Asp 40 Ala Gly Phe Asp Asn Lys Leu Lys His Ile Arg Glu Lys Phe Lys Glu 55 Lys Gly Thr Asp Phe Pro Lys Glu Gin Ile Phe Tyr Ser Leu Thr Ile 70 75 Lys Met Met Ala Ile INFORMATION FOR SEQ ID NO:654: SEQUENCE CHARACTERISTICS: LENGTH: 146 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) 
ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...146 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:654: Val Gln Ala Phe Asp Tyr Lys Ile Glu Val Leu Ala Glu Ser Phe Ser 1 5 10 Lys Val Gly Phe Asn Lys Lys Lys Ile Asp Ile Ala Arg Gly Ile Tyr 25 Pro Thr Glu Thr Phe Val Thr Ala Val Gly Gln Gly Asn Ile Tyr Ala 40 Asp Phe Leu Ser Lys Ser Leu Lys Asp Gln Gly His Val Leu Glu Gly 55 Lys Val Gly Gly Thr Ile Gly Gly Ile Ala Tyr Asp Ser Thr Lys Phe 70 75 Asn Gln Gly Gly Ser Val Ile Tyr Asn Tyr Ile Gly Tyr Trp Asp Gly 90 Tyr Leu Gly Gly Lys Arg Ala Leu Leu Asp Gly Thr Ser Ile His Glu 100 105 110 Cys Ala Leu Gly Ser Asp Gly Lys Val Ile Asp Ser Ile Ala Cys Gly 115 120 125 Asn Ala Arg Ala Asn Lys Ile Arg Arg Asn Tyr Leu Met Asn Thr Pro 130 135 140 Phe Ser 145 INFORMATION FOR SEQ ID NO:655: SEQUENCE CHARACTERISTICS: LENGTH: 247 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...247 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:655: Phe Arg Tyr Lys Asp Ile Phe Ala Ala Lys Gly Gly Arg Tyr Gln Ser 1 5 10 Asn Ala Pro Tyr Met Ser Ser Tyr Thr Gln Gly Phe Glu Ile Ser Ala 25 Lys Ile Lys Asp Lys Asn Glu Gly Ser His Lys Leu Trp Trp Phe Ser 40 Ser Trp Gly Arg Ala Phe Ala Tyr Gly Glu Trp Ile Tyr Asp Phe Tyr 55 Ser Pro Arg Thr Val Ile Lys Asn Gly Arg Thr Leu Asn Tyr Gly Ile 70 75 His Leu Val Asp Tyr Thr Tyr Glu Arg Lys Gly Val Ser Val Ser Pro 90 Phe Phe Gln Phe Ser Pro Gly Thr Tyr Tyr Ser Pro Gly Val Ala Val 100 105 110 Gly Tyr Asp Ser Asn Pro Asn Phe Asn Gly Val Gly Phe Arg Ser Glu 115 120 125 Thr Lys Ala Tyr Ile Leu Leu Pro Val His Ala Pro Leu Lys Arg Asp 130 135 140 Thr Tyr Arg Tyr Ala Val Lys Ala Gly Thr Ala Gly Gln Ser Leu Leu 145 150 155 160 Ile Arg Gln Arg Phe Asp Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe 165 170 175 Tyr Lys Val Trp Lys Asn Ala Asn Ala Tyr Ile Gly Thr Thr Gly Asn 180
185 190 Pro Leu Gly Ile Asp Phe Trp Thr Asn Ser Val Tyr Asp Ile Gly Gln 195 200 205 Ala Leu Ser His Val Val Thr Ala Asp Ala Val Ser Gly Trp Val Phe 210 215 220 Gly Gly Gly Val His Lys Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp 225 230 235 240 Thr Ser Gly Ala Leu Ala Lys 245 INFORMATION FOR SEQ ID NO:656: SEQUENCE
CHARACTERISTICS:
LENGTH: 130 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...130 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:656: Phe Ser Gin Arg Thr Asn Leu Leu Phe Pro Asn Asn Gin 1 5 10 Asp Leu Glu Thr Leu Leu Leu Glu Ile Ala Lys His Asp 25 Lys Cys Phe Glu Gly Tyr Leu Glu Cys Ile Lys Ser Lys 40 Lys Pro Ile Lys Asn Ile Arg Lys Asn Met Leu Tyr Ala 55 Ala Leu Gly Leu Glu Asn Leu Thr Lys Thr Asn Ile Asp 70 75 Ser Lys Gly Lys Ile Lys Ser Arg Tyr Glu Glu Asn Tyr 90 Thr Glu Glu Val Ile Asp Phe Ser Ser Asn Ser Leu Ile 100 105 Asn Phe Leu Gly Gin Phe Ala Glu Asn Lys Gin Lys Thr 115 120 125 Ile Phe 130 INFORMATION FOR SEQ ID NO:657: SEQUENCE
CHARACTERISTICS:
LENGTH: 71 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature Asp Asp Glu Phe Glu His Tyr Leu Val Phe Lys Lys Pro Leu 110 Asn Pro Gly Leu Tyr Glu Asp Leu Lys Lys WO 97/37044 PCT/US97/05223 579 LOCATION 1...71 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:657: Ile Tyr Phe Ser Ala Lys Arg Phe Asn Thr Lys Asn Thr His Gly Thr 1 5 10 Gly Cys Thr Leu Ser Ser Leu Ile Val Gly Leu Leu Ala Gin Gly Leu 25 Asp Leu Lys Asn Ala Ile Thr Lys Ala Lys Glu Leu Leu Thr Ile Ile 40 Ile Gin Asn Pro Leu Asn Ile Gly His Gly His Gly Pro Leu Asn Leu 55 Trp Ser Ile Lys Lys His Val INFORMATION FOR SEQ ID NO:658: SEQUENCE CHARACTERISTICS: LENGTH: 233 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...233 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:658: Arg Pro Ser Phe Gin Lys Arg Cys Ala His Phe Asn Ala Thr Leu Pro 1 5 10 Ile Leu Leu Lys Val Leu Glu Lys Gin Asp Lys Asp Leu Phe Leu Leu 25 Gin Val Gly Asn Arg Ile Ile Pro Thr Lys Ser Glu Gin Glu Leu Lys 40 Ile Asn Gln Pro Tyr Phe Ala Thr Met Gin Arg Asn Gin Leu Gly Asp 55 Ile Val Leu Lys Asn Leu Val Pro Ala Pro Lys Ile Leu Asp Ala Leu 70 75 Asp Asp Leu Pro Val Ile Glu Met Lys Lys Leu Lys Glu Ile Leu Ser 90 Ala Lys Asp Asn Thr Pro Leu Lys Glu Tyr Lys Glu Leu Leu Ser Glu 100 105 110 Lys Leu Ile His Ala Lys Ser Ser Gin Glu Phe Leu Asn Thr Ala Asn 115 120 125 Met Leu Leu Ser Leu Gin Ser Gin Val Leu Ser Phe Val Val Glu Asn 130 135 140 Glu Arg Lys Lys Ala Phe Leu Gin Met Lys Ala Lys Lys Gin Ser Val 145 150 155 160 Asp Phe Tyr Ala Leu Tyr Pro Asn Leu Gly Glu Ile Gly Gly Val Ile 165 170 175 WO 97/37044 PCTIUS97/05223 580 Tyr Leu Lys Glu Lys Glu Lys Gin Leu Phe Leu Lys Thr Thr Leu Gin 180 185 190 Arg Thr Lys Glu Val Leu Lys Glu Ala Gin Asn Thr Leu Leu Gly Phe 195 200 205 Ser Phe Val Glu Ile Val Cys Glu Lys Thr Pro Met Leu Phe Ala Phe 210 215 220 Glu Asp Arg Leu Leu Asp Thr Leu Gly 225 230 INFORMATION 
FOR SEQ ID NO:659: SEQUENCE CHARACTERISTICS: LENGTH: 154 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...154 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:659: Leu Leu Glu Met Ile Phe Arg Ser Arg Arg Ser Lys Trp Gly Ala Leu 1 5 10 Cys Asn Ala Gln Ile Ile Glu Cys Val Ala Asn Ala Leu Glu Thr Cys 25 Asp Phe Gly Leu Cys Val Leu Asp Pro Val Met Val Ala Lys Asn Gly 40 Ala Leu Leu Leu Glu Glu Glu Ala Ile Leu Ser Leu Lys Lys Arg Leu 55 Leu Pro Lys Thr Asn Leu Leu Thr Pro Asn Leu Pro Glu Val Tyr Ala 70 75 Leu Thr Gly Val Gln Ala Arg Asp Asp Lys Ser Ala Ser Lys Ala Met 90 Gly Val Leu Arg Asp Leu Gly Val Lys Asn Ala Val Ile Lys Gly Gly 100 105 110 His Thr Glu His Phe Gln Gly Glu Phe Ser Asn Asp Trp Val Phe Leu 115 120 125 Glu Asp Ala Glu Phe Thr Leu Ala Pro Ser Asp Ser Thr Pro Lys Thr 130 135 140 Arg Met Ala Arg Val Val Leu Cys Leu Ala 145 150 INFORMATION FOR SEQ ID NO:660: SEQUENCE CHARACTERISTICS: LENGTH: 323 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...323 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:660: Trp Leu Gln Ser Arg Cys Phe Arg Asp Ser Ala Phe Cys Leu Ser Ser 1 5 10 Glu Lys Leu Phe Gly Lys Asn Leu Asn Lys Val Val Ile Leu Ala Leu 25 Arg Asp Ile Ile His Glu Tyr Gly His Thr Leu Gly Tyr Thr His Asn 40 Gly Asn Met Thr Tyr Gln Arg Val Arg Leu Cys Gln Glu Gly Asn Gly 55 Pro Glu Ala Arg Cys Glu Gly Gly His Glu Val Glu Lys Asn Gly Lys 70 75 Glu Glu Leu Glu Phe Ser Asn Gly His Glu Val Arg Asp His Asp Gly 90 Tyr Thr Tyr Asp Val Cys Ser Arg Phe Gly Gly Lys Asn Gln Pro Ala 100 105 110 Phe Pro Ser Asn Tyr Pro Asn Ser Ile Tyr Thr Asn Cys Ala Gln Val 115 120 125 Pro Ala Gly Leu Ile Gly Val Thr Thr Ala Val Trp Gln Gln Leu Ile 130 135 140 Asn Gln Asn Ala Leu Pro Ile
Asn Phe Ala Asn Leu Asn Ser Gln Thr 145 150 155 160 Ser His Leu Asn Ala Gly Leu Asn Ala Gln Asn Phe Ala Thr Ser Met 165 170 175 Val Ser Ala Ile Ala Gln Asn Phe Ser Thr Thr Ser Thr Thr Thr Tyr 180 185 190 Arg Ser Ser Ser Lys Asn Phe Arg Ser Pro Ile Leu Gly Val Asn Val 195 200 205 Lys Ile Gly Tyr Gln His Tyr Phe Asn Asp Tyr Ile Gly Leu Ala Tyr 210 215 220 Tyr Gly Ile Ile Gln Tyr Asn Tyr Ala Gln Ala Asn Asp Glu Lys Ile 225 230 235 240 Gln Gln Leu Ser Tyr Gly Gly Gly Met Asp Val Leu Phe Asp Phe Ile 245 250 255 Thr Thr Tyr Thr Asn Lys Lys Gln Asp His Pro Thr Lys Lys Val Phe 260 265 270 Ala Ser Ser Phe Gly Val Phe Gly Gly Leu Arg Gly Leu Tyr Asn Ser 275 280 285 Tyr Tyr Val Phe Asn Gln Val Lys Gly Ser Gly Asn Leu Asp Ile Val 290 295 300 Thr Gly Phe Asn Tyr Arg Tyr Lys His Ser Lys Tyr Ser Ile Ala Leu 305 310 315 320 Ala Phe Leu INFORMATION FOR SEQ ID NO:661: SEQUENCE CHARACTERISTICS: LENGTH: 142 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...142 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:661: Tyr Asp Phe Leu Gly Val Ser Leu His Ala Leu Ser Pro Leu Glu Glu 1 5 10 Gln Glu Phe Leu Ile Ser Tyr Arg Leu Lys Ile Val Asp Ser Arg Val 25 Met Gly Glu Glu Tyr Ser Val Ser Lys Pro Ile Val Ser Arg Ile Lys 40 Thr Ala Pro Tyr Val Leu Asp Tyr His Cys Ser Ile Ile Thr Arg Asn 55 Leu Pro Asp Leu Lys Asn Pro Leu Leu Gln Ile Lys Leu Glu Arg Phe 70 75 Leu Leu Glu Ile Ala Leu Lys Lys Glu Lys Glu Arg Val Ile Asp Cys 90 Leu Leu Lys Ser Gln Val Ala Ile Thr His Tyr Asp His Ser Tyr Lys 100 105 110 Asn Gly Thr Thr Thr Thr Ser Ile Leu Asn Leu Lys Ala Leu Ser Val 115 120 125 Lys Ala Ser Leu Asp Gly Arg Tyr Ala Val Phe Arg Tyr Phe 130 135 140 INFORMATION FOR SEQ ID NO:662: SEQUENCE CHARACTERISTICS: LENGTH: 344 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...344 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:662: WO 97/37044 PCT/US97/05223 Glu Trp Leu Met Leu Lys Lys Ile Phe Leu Thr Ala Asp Phe Gly Lys Val Lys 145 Glu Tyr Lys Pro Ala 225 Asn Thr Gin Leu Phe 305 Thr Asp Ile Thr Ser Lys Val Gly Ser Lys 130 Glu Ile Gly Leu Ser 210 Asn Glu Ser Leu Thr 290 Gly Asn Ser Ile Asp Gin Glu Glu Ile Gly 115 Asn Ala Leu Val Thr 195 Phe Asp Leu Thr Asp 275 Ile His Ala Tyr Val Lys Ile Phe Ser Asn 100 Arg Met Ile Glu Lys 180 Leu Tyr Ile Lys Gin 260 Gin Asp Gin Ser Arg 340 Gly Asp Leu Arg Leu Leu Tyr Val Ile Arg 165 Thr Lys Asp Leu Ser 245 Asn Leu Leu Lys Asn 325 Asn Leu Ile Asp Phe 70 Leu Asp Thr Leu Ser 150 Tyr Ala Glu Pro Arg 230 Ala Ile Asp Asp Ile 310 Asp Glu Leu Ala Arg 55 Tyr Ala Ala Glu Thr 135 Ile Leu Ser Ile Thr 215 Arg Leu Ala Gly Tyr 295 Leu Lys His Ala Val 25 Lys Ile 40 Lys Gly Ala Arg Val Glu Val Met 105 Gly Gly 120 Arg Glu Arg Ile Asn Gin Leu Gly 185 Thr Met 200 Lys Asn Leu Tyr Asn Glu Pro Tyr 265 Leu Lys 280 Gin Arg Glu Lys Asp Glu Arg Tyr Leu Lys Arg Phe Asp 90 Arg Ser Lys Glu Thr 170 Tyr Leu Leu Ser Val 250 Val Thr Leu Ile Gly Val Asp Leu Glu 75 Thr Ala Thr Thr Lys 155 Phe Phe Val Glu Leu 235 Pro Val Gin Ala Ala 315 Phe Ala Tyr Ile Glu Leu Met Leu Leu 140 Val Phe Lys Ala Phe 220 Gly Ile Asp Gly Leu 300 Lys Ile Val Gin Val Arg Pro Ala Asn Ile Pro Phe Phe Ile Lys 110 Thr Gin 125 Thr Arg Leu Ser Gly His Lys Pro 190 Leu Pro 205 Ser Leu Trp Ile Val Tyr Glu Val 270 Tyr Thr 285 Glu Ser Glu Lys Leu Phe Trp Val Ser Val Ile Tyr Pro Arg Glu His Asn Ala Gin Leu Lys Leu Lys Glu 160 Gly Tyr 175 Leu Asp Arg Ala Ser Arg Ser Ser 240 Asn Gin 255 Leu Lys Ile Lys Leu Arg Pro Lys 320 Asp Asn Leu Asn Ala Gin His 330 335 INFORMATION FOR SEQ ID NO:663: SEQUENCE CHARACTERISTICS: LENGTH: 117 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 584 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) 
FEATURE: NAME/KEY: misc_feature LOCATION 1...117 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:663: Thr Pro Ser Met Ile Val Thr Glu Thr Ser Thr Gly Lys Ile Leu Ala 1 5 10 Leu Val Gly Gly Ile Asp Tyr Lys Lys Ser Ala Phe Asn Arg Ala Thr 25 Gln Ala Lys Arg Gln Phe Gly Ser Ala Ile Lys Pro Phe Val Tyr Gln 40 Ile Ala Phe Asp Asn Gly Tyr Ser Thr Thr Ser Lys Ile Pro Asp Thr 55 Ala Arg Asn Phe Glu Asn Gly Asn Tyr Ser Lys Asn Ser Val Gln Asn 70 75 His Ala Trp His Pro Ser Asn Tyr Thr Arg Lys Phe Leu Gly Leu Val 90 Thr Leu Gln Glu Ala Leu Ser His Ser Leu Asn Leu Ala Thr Ile Asn 100 105 110 Leu Ala Ile Ala Trp 115 INFORMATION FOR SEQ ID NO:664: SEQUENCE CHARACTERISTICS: LENGTH: 208 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...208 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:664: Pro Arg Asp His Thr Tyr Ser Leu Glu Asp Ser Thr Pro His Gly Ser 1 5 10 Leu Leu Gly Arg Asn Gly Val Thr Leu Asn Ile Arg Gln Val Phe Trp 25 Trp Asp Asn Phe Asn Trp Ser Ile Gly Phe Tyr Asn Thr Phe Gly Asn 40 Ser Asp Ala Phe Leu Gly Ser His Thr Met Pro Arg Gly Asn Asn Thr 55 Ser Tyr Ile Ser Asn Glu Ile Ser Val Thr Thr Arg His Ala Gly Met 70 75 Ile Gly Tyr Asp Phe Trp Asp Asn Thr Ala Tyr Asp Gly Leu Ala Asp 90 Ala Ile Thr Asn Ala Asn Thr Phe Thr Phe Tyr Thr Ser Val Gly Gly 100 105 110 Ile His Lys Arg Phe Ala Trp His Val Phe Gly Arg Val Ser His Ala 115 120 125 Asn Lys Asn Ala Leu Gly Gln Val Gly Arg Ala Asn Glu Tyr Ser Leu 130 135 140 Gln Phe Asn Ala Ser Tyr Ala Ser Thr Glu Ser Val Leu Leu Asn Phe 145 150 155 160 Arg Ile Thr Tyr Tyr Gly Ala Arg Ile Asn Lys Gly Tyr Gln Ala Gly 165 170 175 Tyr Phe Gly Ala Pro Lys Phe Asn Asn Pro Asp Gly Asp Phe Ser Ala 180 185 190 Asn Tyr Gln Asp Arg Ser Tyr Met Met Thr Asn Leu Thr Leu Lys Phe 195 200 205 INFORMATION FOR SEQ ID NO:665: SEQUENCE CHARACTERISTICS: LENGTH: 132 amino acids TYPE: amino acid TOPOLOGY:
linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...132 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:665: Pro Arg Ile His Asp Met Gly Asp Leu Asp Ser Leu Asn Ile Ser Pro 1 5 10 Asp Pro Asn Thr Pro Thr Leu Leu Ile Leu Ser Ala Leu Asp Asn Ser 25 Leu Lys Asp Tyr Ala Pro Thr Phe Asn Val Leu Lys Lys Thr Phe Lys 40 Asp Arg Leu Arg Val Leu Ile Leu Leu Asn Gln Pro Tyr Ser Ser Asp 55 Ala Ile Lys Gly Phe Ile Ala Pro Ser Gln Thr Asp Leu Met Ile Leu 70 75 Asn Pro Lys Asp Thr Ala Leu Phe Asp His Leu Asn His Asp Ala Leu 90 Asn His Ser Phe Asn Met Leu Leu Tyr Asp Lys His Gln Leu Ile Lys 100 105 110 Met Tyr Gln Gly Ile Val Pro Ala Glu Met Leu Gln Phe Asp Ile Ser 115 120 125 Asn Leu Lys Asp 130 INFORMATION FOR SEQ ID NO:666: SEQUENCE CHARACTERISTICS: LENGTH: 122 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...122 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:666: Arg Arg Ser Leu Tyr Tyr Val Phe Ile Lys Ser Tyr Ala Ile Leu Ala 1 5 10 Glu Ala Thr Ala Arg Phe Ala Gly Asp Tyr Ala Thr Glu Asn Leu Thr 25 Ser Arg Ile Ala Leu Pro Cys Ser Asp Tyr Val Gly Arg Val Ile Gly 40 Lys Asp Gly Lys Asn Ile Glu Ala Phe Lys Lys Ile Ser Gly Val Asp 55 Ile Glu Phe Ser Glu Asp Ser Ser Glu Leu Cys Leu Ser Ser Phe Asn 70 75 Ile Tyr Arg Arg Glu Val Ala Ser Glu Thr Ile Lys Ile Leu Ile Glu 90 Asp Gly Arg Ile Gln Pro Asn Arg Ile Glu Glu Val Tyr Pro Lys Ile 100 105 110 His Pro Gln His Gly Lys Arg Ile Ala Phe 115 120 INFORMATION FOR SEQ ID NO:667: SEQUENCE CHARACTERISTICS: LENGTH: 250 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...250 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:667: Lys Thr Ala Val Ser Ser Leu Thr Gly Leu Lys Arg Phe Ile Leu Lys 1 5 10 WO 97/37044 PCT/US97/05223 587 Phe Thr Arg Asn Met Glu Lys Glu Leu Leu Ser Glu Gly Glu Ser Val 25 Val Leu Glu Leu Glu Leu Gly Thr Met Glu Asp Glu Leu Lys Ile Leu 40 Ile Gly Lys Met Arg Tyr Arg Phe Ser Phe Gly Gin Asn Ala Leu Gin 55 His Ser Lys Glu Ile Ala Leu Leu Ser Gly Leu Ile Ala Glu Gin Leu 70 75 Gly Gly Asp Lys Lys Leu Ala Arg Arg Ala Gly Ile Leu His Asp Ile 90 Gly Lys Ala Leu Thr Gin Glu Leu Gly Arg Asp His Val Asn Leu Gly 100 105 110 Val Glu Val Cys Lys Arg His Lys Glu Asp Pro Val Val Ile Asn Ala 115 120 125 Ile Tyr Ala His His Gly His Glu Glu Ile Met Ser Val Giu Cys Ala 130 135 140 Ser Val Cys Ala Ala Asp Ala Leu Ser Ala Gly Arg Pro Gly Ala Arg 145 150 155 160 Arg Lys Ser Asp Glu Glu Tyr Ala Lys Arg Met Gin Ala Leu Glu Glu 165 170 175 Ile Ala Leu Glu Phe Asp Gly Val Glu Lys Ala Tyr Ala Met Glu Ser 180 185 190 Gly Arg Glu Leu Arg Val Ile Val Lys Ser Asn Gin Val Arg Asp Asn 195 200 205 Gin Val Pro Ile Ile Ala Arg Lys Ile Ala Lys Arg Ile Glu Glu Ser 210 215 220 Thr Gin Tyr Val Gly Glu Val Gly Val Gin Val Val Arg Glu Asn Arg 225 230 235 240 Phe Lys Thr Thr Ala Thr Leu Ser Asn Asp 245 250 INFORMATION FOR SEQ ID NO:668: SEQUENCE CHARACTERISTICS: LENGTH: 804 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...804 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:668: Ser Leu Trp Leu Lys Ser Lys Phe Phe Leu Leu Met Gly Val Gly Leu 1 5 10 Leu Ser His Ser Leu Asn Ala Leu Ser Leu Thr Leu Thr Gin Gly Lys 25 Glu Glu Gly Glu Asp Phe Ser Val Leu Thr Leu Arg Asn Asp Lys Ala 40 WO 97/37044 PCTT[US97/05223 588 Phe Ser Ser Ile Lys Lys Gln 145 Asn Thr Thr Asp Lys 225 Ile Ile Glu Lys Ser 305 Glu Ser Ala Ile Glu 385 Glu Leu Arg Gir 
G1 465 His Ser C Leu Ile Thr Vai Ala 130 Ile Ala Pro Lys Ser 210 Asn Ile Gly Ala Gin 290 Arg Gly Asn Glu Asp 37.0 Leu Ser Lys Ile Asp 450 Lys 3 Tyr :ys 3er ?ro [yr 4et 1.15 lie lie Gln Ile Gly 195 ln Tyr Ala Thr Leu 275 Ala Tyr Ser Ala Ala 355 Lys Ala Ala Ala Lys 435 His Ala Asj c ;er Tyr Ala Asn Glu Lys 55 Ile Lys Ser 100 Arg Pro Gly Lys Ile 180 Tyr Ala Pro Leu Gin 260 Tyr Val Ala Asp Lys 340 Glu Va1 Leu Ile Lys 420 Asp Ala Leu Lys lie Glu Met Arg Leu Tyr Gly 165 Gin Asp Tyr Gin Gly 245 Trp Tyr Arg Pro Leu 325 Asp Ile Val Glu Glu 405 Gl.
PhE Gi\ Ph~ lie His 1 70 Gly I Arg C Leu Phe Asp 150 Leu Glu I Leu Phe Thr 230 Lys Ile Val Tyr Leu 310 Ser Lys Asn Gin Leu 390 Ile Gin Lys i Leu Ser 470 e Ile la The ln 'hr lal 135 31n %sn Leu Asn Asp 215 lie Leu Lys Ala Tyr 295 Ala Asn Glu Tyr Ser 375 Leu Ala Ala Asn Glu 455 Met Glr Lys Thr Gin Leu 120 Glu Lys Phe Asp Ala 200 Ala Phe Gly Asn Lys 280 Lys Gin Ala Ser Gin 360 Asn Lys His Leu Ala 440 Arg Glu 1 Asn ,rg Pro Mln 105 Phe Asn Ile Pro Val 185 Tyr Leu Lys Ile Tyr 265 Ala Arg Met Asn Ala 345 Asn Prc Leu Leu Tyr 425 His Al G1 Phf Pro I Pro I Leu C 90 Phe I Ser I Asp Pro I Ile 170 Asn Leu Arg Lys Lys 250 Pro Leu Ile Arg Met 330 Ser Phe Asp Leu Leu 410 Asp Leu Ser Asn Pro )ro le 15 ;iu 2 le I ?he la The 155 lie %sn Mlu Thr Asp 235 Lys Thr Asp Leu Leu 315 Leu Glu Asn Tyr Lys 395 Leu Leu Tyr Val Thr 475 Asn 3er lu %sn Leu Asp Lys 140 Leu Ile Lys Ala Ile 220 Leu Ser Asp Glu Leu 300 Ala Phe Ile Asn Leu 380 Lys Asn Glj Asr Va] 46( Glir Se Gly I Cys Ala I His I Arg 125 Ala I Ser Lys 2 Pro Lys 205 Ser Tyr Leu Pro Asn 285 Glu Ile Lys Ala Ala 365 Ser Asn Gin Ala 1 Leu 445 Arg I Glu Asn le ial The ile tsp .ys lu ksp Leu 190 Lys Arg Leu Leu Asn 270 Asn Tyr Glu Glu Leu 350 Lys Met Gir Asp Let 43C Glr Ali Ly Gi Glu 2 ile Phe Lys Tyr Met Lys Ala 175 Leu Gin Ala Leu Ile 255 Ile Asn Lys Ala Ala 335 Asn Tyr His Met Asp 415 1 Tyr Tyr Arg s Ile Ala kla Asp Asn Pro Lys Trp Asp 160 Gin Thr Met Phe Glu 240 Asp Pro Tyr Asn Ala 320 Phe Trp Leu Ser Asn 400 Asp Ala Leu Asp Ala 480 Gln 485 490 495 Lys Ala Leu Giu Leu Lys Ala Gin Leu Leu Phe Asp Asn Lys His Tyr WO 97/37044 PCT/US97/05223 Ala Ile Arg 545 Phe Ser Ala Asn Asp 625 Ile Leu Lys Pro Ala 705 Asp Asp Leu Ser Thr 785 Asn Glu Gin 530 Cys Asn Leu Lys Tyr 610 Ala Ala Ala Arg Lys 690 Tyr Ala Lys Tyr Lys 770 Asn Lys 500 Val Leu 515 Lys Thr Lys Glu Pro Gin Lys Glu 580 Thr Pro 595 Tyr Arg Leu Ile Phe Val Leu Asn 660 Met Ala 675 Ser Val Lys Asp Tyr Arg Leu Leu 740 Leu Gin 755 Ala Ser Ala Trp Glu Ser Gly Leu Ala Glu 565 Lys Ser Leu Leu Leu 
645 Leu Leu Lys Tyr Thr 725 Asn Ser Leu Gin Met Asn Leu 550 Glu Ala Glu Gly Ala 630 Phe Tyr Val Ile Ser 710 Thr Arg Ser Glu Asn 790 Gin Ile 535 Lys Ile Gin Lys Asp 615 Gln Ser Ala Tyr Tyr 695 Tyr Lys Arg Leu Lys 775 Lys 520 Leu Tyr Gln Ile Leu 600 Phe Ser Asp Phe Phe 680 Ala Thr Asp Leu Leu 760 Cys 505 Asn Ala Leu Ala Ile 585 Thr Lys Leu Tyr Leu 665 Lys Thr Pro Tyr Ser 745 Asp Val Leu Lys Ser Phe 570 Ala Trp Asn Asn Met 650 Glu Leu Ser Phe Ser 730 Leu Leu Gin Pro Thr Gin 555 Asp Leu Leu Ser Lys 635 Gin Lys Leu Leu Gly 715 Lys Glu Thr Leu Lys Pro 540 Ile Cys Asn Tyr Thr 620 Lys Asn His Glu Leu 700 Glu Ala Asp Asn Lys 780 Asp 525 Leu Thr Leu Ala Arg 605 Leu Glu Asn Phe Asn 685 Lys Phe Leu His Gin 765 Gin 510 Ser Glu Ala Tyr Leu 590 Leu Ala Phe Glu Lys 670 Glu Leu Ala Glu Gin 750 Lys Lys Pro Leu Asp His Phe Ala 560 Phe Ala 575 Lys Ala Gly Arg Ser Lys Tyr Asp 640 Lys Gly 655 Asp Asp Lys Asp Gin Asp Leu Ile 720 Thr Leu 735 Lys Ala Ala Lys Asp Gin Leu Cys Glu Gin Gly Leu Asn Leu Phe Lys 795 800 INFORMATION FOR SEQ ID NO:669: SEQUENCE CHARACTERISTICS: LENGTH: 382 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...382 WO 97/37044 590 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:669: PCT/US97/05223 Lys His Pro Asn Arg Phe Phe Phe Lys Arg Ile 1 Lys Pro Lys Thr Asp Leu Ser Val1 Gly 145 Ile Pro Pro Gin Ser 225 Arg Giu Lys Ala Leu 305 Leu Asn Lys Val1 Pro Ser Leu Pro Ala Phe Ala Glu 130 Lys Leu Cys Tyr Thr 210 Thr Gin Tyr Gin Tyr 290 Asn Asn His Gin Val 370 Asn Ser Gly Ser Asn Val1 Asp 115 Lys Trp Lys Asp Thr 195 Phe Giu Cys Glu Asp 275 Ser Giu Asp Val1 Pro 355 Cys Thr Ser Ser Lys Pro Ala 100 Ala Gin Val Arg Tyr 180 Lys Glu Lys Tyr Ile 260 Asp Ser Lys Ile Arg 340 Arg Val1 5 Phe Gin Lys Vali Pro Pro Ser Ala Tyr Val 165 Ser Ile Ala Cys Leu 245 Thr Gin Thr Phe Ile 325 Phe Ala Lys Leu Gin Asn Ser 70 Leu Pro Giu Ile Asp 150 Asp Thr Ser Lys Lys 
230 Ile Thr Val Arg Met 310 Lys Lys Lys Lys Asn Ser Ser 55 Pro Lys Thr Asn Arg 135 Asp Lys Ala Val1 Asn 215 Arg Giu Gin Giu Lys 295 Giu Giu Giu Ser Gly 375 Ala Pro 40 Lys Thr His Giu Asn 120 Asp Glu Asp Glu His 200 Asn Ala Glu Leu Pro 280 Ser Phe Ser Arg Thr 360 Asn Pro 25 Gin Asn Asn Ser Lys 105 Giu Pro Asn Lys Asn 185 Lys Phe Arg Pro Val1 265 Thr Giu Val Ser Val1 345 Pro Tyr 10 Met As n Ser Giu Ser 90 Thr Ser Asn Leu Giu 170 Lys Thr Ala Ala Leu 250 Lys Phe Ile Giu Giu 330 Cys Leu Gin Phe Leu Val 75 Gin Leu Asn Ile Gin 155 Ile Ser Glu Ile Arg 235 Lys Ala Tyr Thr Val 315 Tyr Met Ser Giu Lys Ser Leu Lys Asp Pro Giu Lys 140 Ala Thr Gly Pro Leu 220 Lys Gin Ile Giu Arg 300 Tyr Lys Ala Ile Arg Pro Tyr Gin Thr Gin Asn Asn 125 Glu Tyr Thr Lys Leu 205 Gin Asp Ala Tyr Thr 285 Asn Giu Glu Leu Giu 365 Thr Gin Pro Pro Pro Giu Asn 110 Arg Phe Arg Asp Ile 190 Giu Ala Gly Trp Glu 270 Ser Giu Gly Trp Lys 350 Asn Ala Asn Giu Leu Thr Asn Thr Asp Ala Pro Ile 175 Ile Asp Arg Thr Giu 255 Arg Giu Leu His Val1 335 Ile Ser Leu Lys Ser Val Asn Asn Ser Asn Cys Ser 160 Thr Thr Pro Ser Thr 240 Ser Pro Leu Asn Tyr 320 Lys Lys Arg Leu Phe Asn Giu Val INFORMATION FOR SEQ ID NO:670: SEQUENCE CHARACTERISTICS: LENGTH: 105 amino acids TYPE: amino acid WO 97/37044 PCT/US97/05223 591 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:670: Thr Lys Lys Leu Asn Asn Thr Leu Phe Asn Lys Gly Leu Ile Ile Phe 1 5 10 Lys Met Phe Lys Lys Ile Ile Phe Leu Cys Val Phe Leu Ile Gly Gly 25 Phe Val Ile Pro Pro Leu Glu Ala Met Pro Ile Leu Arg Asn Lys Thr 40 Pro Lys Lys Asn Tyr Gin Glu Ala His Glu Lys Leu Tyr Arg Ser Ile 55 Ile Asn Arg Gin Lys Leu Thr Arg Lys Lys Ser Gly Trp Tyr Phe Leu 70 75 Gly Gly Val Gly Ala Val Glu Ala Ile Lys Asp Tyr Gin Gly Lys Glu 90 Met Lys Asp Trp Met Pro Arg Ser Ile 100 105 INFORMATION FOR SEQ ID NO:671: SEQUENCE CHARACTERISTICS: 
LENGTH: 69 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...69 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:671: Ser Lys Thr Ser Tyr Cys Asn Ser Tyr Lys Arg Lys Leu Leu Pro Tyr 1 5 10 His Phe Leu Leu Phe Pro Leu Trp Ala Cys Phe His Asn Val Ser Arg 25 Pro Phe Ser Leu Ser His Leu Phe Leu Val Lys Arg Pro Tyr Ser Arg 40 Leu Tyr Leu His Asn Gln Ser Asn His Lys Met Arg His His Ser Ile 55 Leu Phe Val Ser Phe INFORMATION FOR SEQ ID NO:672: SEQUENCE CHARACTERISTICS: LENGTH: 458 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...458 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:672: Gln Asp Lys Gly Leu Leu Leu Ser Val Ala Leu Pro Asn Ser Asn Asn 1 5 10 Ala Ser Gln Asn Asn Ile Leu Ser Leu Ser Val Leu His Asn Gln Ile 25 Lys Met Ser Tyr Gly Asn Lys Val Met Asp Phe Thr Pro Pro Thr Leu 40 Gln Asp Tyr Ile Val Gly Ile Gln Gly Gln Ser Ala Leu Asn Gln Ile 55 Glu Ala Val Gly Gly Asn Asn Ala Ile Lys Trp Leu Ser Thr Leu Met 70 75 Met Glu Thr Lys Glu Asn Pro Leu Phe Ala Pro Ile Tyr Leu Glu Asn 90 His Ser Leu Asn Glu Ile Leu Gly Val Thr Lys Asp Leu Gln Asn Thr 100 105 110 Ala Ser Leu Ile Ser Asn Pro Asn Phe Arg Asn Asn Ala Thr Ser Leu 115 120 125 Leu Glu Met Ala Ser Tyr Thr Gln Gln Thr Ser Arg Leu Thr Lys Leu 130 135 140 Ser Asp Phe Arg Ala Arg Glu Gly Glu Ser Asn Phe Ser Glu Arg Leu 145 150 155 160 Leu Glu Leu Lys Asn Lys Arg Phe Ser Asp Pro Asn Pro Ser Glu Val 165 170 175 Phe Val Lys Tyr Ser Gln Leu Ser Lys His Pro Asn Asn Leu Trp Ile 180 185 190 Gln Gly Val Gly Gly Ala Ser Phe Ile Ser Gly Gly Asn Gly Thr Leu 195 200 205 Tyr Gly Leu Asn Val Gly Tyr Asp Arg Leu Val Lys Ser Val Ile Leu 210 215 220 Gly Gly Tyr Val Ala Tyr Gly Tyr Ser Gly Phe Asn Gly Asn Ile Met 225 230 235 240 His
Ser Leu Ala Asn Asn Val Asp Val Gly Met Tyr Ala Arg Ala Phe 245 250 255 Leu Lys Arg Asn Glu Phe Thr Leu Ser Ala Asn Glu Thr Tyr Gly Gly 260 265 270 Asn Ala Ser His Ile Asn Ser Ser Asn Ser Leu Leu Ser Val Leu Asn 275 280 285 Gln Arg Tyr Asn Tyr Asn Thr Trp Thr Thr Ser Val Asn Gly Asn Tyr 290 295 300 Gly Tyr Asp Phe Met Phe Lys Gln Lys Ser Val Val Leu Lys Pro Gln 305 310 315 320 Val Gly Leu Ser Tyr His Phe Ile Gly Leu Ser Gly Met Lys Gly Lys 325 330 335 Met Gln Asn Pro Ala Tyr Gln Gln Phe Val Met His Ser Asn Pro Ser 340 345 350 Asn Glu Ser Val Leu Thr Leu Asn Met Gly Leu Glu Ser Arg Lys Tyr 355 360 365 Phe Gly Lys Asn Ser Tyr Tyr Phe Val Thr Ala Arg Leu Gly Arg Asp 370 375 380 Leu Leu Ile Lys Ala Lys Gly Asp Asn Val Val Arg Phe Val Gly Glu 385 390 395 400 Asn Thr Leu Leu Tyr Arg Lys Gly Glu Ile Phe Asn Thr Phe Ala Ser 405 410 415 Val Ile Thr Gly Gly Glu Met His Leu Trp Arg Leu Met Tyr Val Asn 420 425 430 Ala Gly Val Gly Leu Lys Met Gly Leu Gln Tyr Gln Asp Leu Asn Ile 435 440 445 Thr Gly Asn Val Gly Met Arg Val Ala Phe 450 455 INFORMATION FOR SEQ ID NO:673: SEQUENCE CHARACTERISTICS: LENGTH: 186 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...186 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:673: Pro Phe Glu Ala Leu Asn Phe Tyr Ser Lys Lys Ala Leu Asn Glu Ile 1 5 10 Phe Ala Asn Ala Arg Lys Ile Cys Gly Asn Lys Pro Leu Gly Ala Asn 25 Ile Leu Tyr Ala Ile Asn Asp Tyr Gly Arg Val Leu Arg Asp Ser Cys 40 Glu Ala Gly Ala Asn Ile Ile Ile Thr Gly Ala Gly Leu Pro Thr Asn 55 Met Pro Glu Phe Ala Lys Asp Phe Ser Asp Val Ala Leu Ile Pro Ile 70 75 Ile Ser Ser Ala Lys Ala Leu Lys Ile Leu Cys Lys Arg Trp Ser Asp 90 Arg Tyr Lys Arg Ile Pro Asp Ala Phe Ile Val Glu Gly Pro Leu Ser 100 105 110 Gly Gly His Gln Gly Phe Lys Tyr Glu Asp Cys Phe Lys Glu Glu Phe 115 120 125 Gln Leu Glu Asn Leu
Val Pro Lys Val Val Glu Ala Ser Lys Glu Trp 130 135 140 Gly Asn Ile Pro Ile Ile Ala Ala Gly Gly Ile Trp Asp Arg Lys Asp 145 150 155 160 Ile Asp Thr Met Leu Ser Leu Gly Ala Ser Gly Val Gin Met Ala Ile 165 170 175 Ser Phe Phe Arg His Glu Arg Met Arg Arg 180 185 INFORMATION FOR SEQ ID N0:674: SEQUENCE CHARACTERISTICS: LENGTH: 88 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...88 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:674: Ala Ile Arg Leu Arg Leu Ser Ile Arg Gly Leu Ile Lys Arg Ile Glu 1 5 10 Glu Gly Asn Ala Pro Lys Ile Ala Cys Val Ser Asn Cys Val Thr Pro 25 Cys Asn Arg Gly Glu Glu Thr Lys Lys Val Gly Tyr Cys Ile Ala Asp 40 Gly Leu Gly Arg Ser Tyr Leu Gly Asn Arg Glu Glu Gly Leu Tyr Phe 55 Thr Gly Ala Asn Gly Tyr Arg Val Asp Lys Ile Ile Ser Val His Glu 70 75 Leu Ile Lys Glu Leu Thr Glu Gly INFORMATION FOR SEQ ID NO:675: SEQUENCE CHARACTERISTICS: LENGTH: 284 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 WO 9737044PCTIUS97/05223 595 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .284 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:675: Asn Phe Asn Gin Arg Ala Phe Leu Lys Arg Ala Leu 1 Gly Lys Arg Gly Asn Asn Val Phe Ile 145 Tyr Ala Leu Leu Phe 225 Asn Leu Asn Leu Gly Tyr Ser Lys Ser Gly Tyr 130 Ile Asn Gly Gin Val 210 Gly Asn Giu Tyr Tyr Tyr Asp Gin Gin Val1 100 His Leu Sen Asp Tyr 180 Thr Leu Lys Ser Gly 260 Val1 5 Aia Thr Arg His Ile Ser Glu Ser Thr Lys 165 Asp Phe Giy Ile Asn 245 Ile Tyr Leu Phe Ser Asn 70 Ile Asn Giu Ala Tyr 150 Phe Arg Gly Val Pro 230 Asn Asn Phe As n Phe Ala Ser Phe Gly Thr Sen 135 Gly Phe Leu Gly Arg 215 Thr Ser Sen Asn Ala Asn Phe Ser Ser Leu Lys 120 Leu Thr Ala Sen Lys 200 Leu Tyr Giu Leu Tyr 280 Giu 25 Lys Tyr Asn Asp Gly 105 Trp Tyr Tyr Gly Asp 185 Val1 Gly Tyr Asn Leu 265 Thr Phe Val1 Gly Asn 
75 Leu Gin Gly Sen Asp 155 Asn Leu Leu Glu Asn 235 Leu Gin Ser Trp Asp Ser Gly Ser Pro Gly Arg 125 Glu Leu Gly Tyr Gly 205 Asn Tyr Val1 Asp Leu Val Pro Tyr Gin Val Tyr 110 Trp Ser Leu Ile Gin 190 Phe Gin Tyr Leu Phe 270 Ile Leu Ile Gin Arg Phe Lys Gly Gin Asn Ala 175 Thr Gin Phe Ser Arg 255 Arg Leu Thr Lys Leu Phe Lys Trp Leu Ser Ala 160 Phe Leu Phe Gly Met 240 Phe Arg INFORMATION FOR SEQ ID NO:676: SEQUENCE CHARACTERISTICS: LENGTH: 66 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 596 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...66 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:676: Lys Met His Thr Ser Phe Phe Gin Ile Pro Leu Asn Phe Gly Val Arg 1 5 10 Val Asn Val Asp Arg His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro 25 Leu Ala Val Asn Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Ala 40 Ser Leu Phe Phe Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr 55 Ser Phe INFORMATION FOR SEQ ID NO:677: SEQUENCE CHARACTERISTICS: LENGTH: 138 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...138 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:677: Ile Lys Arg Ile Ile Lys Ser Asn Ala Ser Leu Asn Gin Leu Asn Thr 1 5 10 Thr Arg Tyr Asn Thr Pro Ser His Leu Phe Phe Lys Lys Gly Val Gly 25 Met Ala Thr Ile Gin Pro Phe Asn His Ser Thr Ile Gin Pro Phe Asn 40 His Ser Thr Ile Gin Pro Phe Asn His Ser Ile Ile Gin Ser Phe Asn 55 His Ser Thr Ile Gin Ala Thr Leu Pro Tyr Phe Tyr Asn Tyr Leu Ser 70 75 Phe Tyr Lys Asn Leu Phe Lys Asn Pro Leu Phe Phe Ile Ile Pro Pro 90 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 100 105 110 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 115 120 125 WO 97/37044 PCT/US97/05223 597 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile 130 135 INFORMATION FOR SEQ ID NO:678: SEQUENCE 
CHARACTERISTICS: LENGTH: 193 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...193 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:678: Arg Tyr His His Phe Ser Gly Leu Ser Ser Cys Gly Phe Ser Ala Gin 1 Cys Pro Ser Thr Leu Gin Ser Pro Gly 145 Ala Thr Arg Gly Trp Phe Thr Ser Thr Ser Thr 130 Gly Glu Gin Gly Asn Asn Thr Ser Leu 100 Asn Glu Ser Leu Pro 180 5 His Leu Thr Cys Glu Asn Met Tyr Ile Leu 165 His Ala Pro Asn 55 Gly Gin Asn Val Tyr 135 Ile Gin Asn Ser Glu Tyr Ala Val Gin Asn 120 Pro Gin Ala Gly 10 Ser Leu Tyr Asn Asn 90 Gly Asn Gly Lys Thr 170 Gly Leu Glu Asn Val 75 Thr Gly Gin Asn Ile 155 Ile Gly Trp Asn Thr Gly Ala Met Thr Gly 140 Ser Ile Val His Gly Gly Pro Tyr Pro Phe 125 Asn Ser Asn Met Gin Gly Ser Asn Gin Ala 110 Thr Tyr Val Val Gly 190 Leu Val Gly Gly Thr Leu Lys Tyr Asn Leu 175 Val Trp Arg Thr Ile Ile Asn Asn Ser Asp 160 Thr Trp INFORMATION FOR SEQ ID NO:679: SEQUENCE CHARACTERISTICS: LENGTH: 418 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 WO 9737044PCTIUS97/05223 598 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .418 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:679: Met Val Vai Val Gly Ser Trp Gly Phe Gly Gly Lys 1 Met Asn As n Asp Giu Gin Gly Giu Val1 145 Al a Ser Gin Ser Pro 225 Met Lys Al a Tyr Thr 305 Gly Asp Ala Thr Thr Ile Gly Val1 Thr 130 Asn Leu As n Asn Lys 210 Phe Asn Arg Tyr Gly 290 Asn Phe Ile Gin Gin Gin Leu Pro Ile 115 Leu Gin Ser Leu Pro 195 Tyr Arg Gly Asn Ile 275 Val1 Phe Ala Phe Ala Ile Phe Ser Ile 100 Asn Asn 5 Gly Val Thr Ala Leu Gin Asp Ser Asp Arg Thr Leu 165 Pro Asn 180 Asn Sen Asn Gin Arg Ile Ile Gly 245 Trp Gly 260 Lys Ser Giy Met Leu Gly Leu Ala 325 Asp Leu Gin Gin 70 Ala Gin Asn Leu Ala i50 Gly Ala Pro Leu Gly 230 Val1 Leu Asn Asp Lys 310 Gly Ser 0Th Pro Glu Gin Asp Thr 0Th 135 Leu 
Asn Lys Glu Gin 215 Val1 Gin Arg Phe Ala 295 Asn Thr Phe Lys 40 Asp Met Gin Leu Tyr 120 Gin Ser Asp Ser Gly 200 Thr Ile Ala Tyr Phe 280 Leu Asn Ser Asn 25 Thr Asn Leu Val1 Giu 105 Gly His Gin Ser Leu 185 Leu Val1 Asn Gly Tyr 265 Asn Tyr Lys Trp Asn 345 Ala Gin Phe Asn Ala Glu Ser Thr Thr Lys 170 Gin Leu Ala Tyr Tyr 250 Gly Ser Asn Leu Leu 330 Ile Gin Asn Arg 75 Asp Cys Gly Ala Ile 155 Ala Asn Thr Gin Gin 235 Lys Phe Ala Phe Ser 315 Asn Thr Glu Asn Tyr Asn Phe Ala Ala 125 Tyr Asn Asn Thr Ser 205 Leu Asn Phe Asp Asp 285 Asn Gly Gin Gly Met Ala Thr Ala His Gly 110 Phe Gly Phe Ser His 190 Leu Gly Asn Phe Tyr 270 Val Asp Leu Gin Val1 Lys Glu Lys Ala Ile Ala Lys Gin Glu 160 Ile Thr Thr Asn Ala 240 Lys His Thr Asn Gly 320 Asn Leu Thr Met Met Asn Gly Ile Tyr 340 Ala Asn Val Ser Ala Ser Asn 350 WO 97/37044 PCT/US97/05223 599 Phe Gin Phe Leu Phe Asp Leu Gly Leu Arg Met Asn Leu Ala Arg Pro 355 360 365 Lys Lys Lys Asp Ser Asp His Ala Ala Gin His Gly Met Glu Leu Gly 370 375 380 Val Lys Ile Pro Thr Ile Asn Thr Asp Tyr Tyr Ser Phe Met Gly Ala 385 390 395 400 Glu Leu Lys Tyr Arg Arg Leu Tyr Ser Val Tyr Leu Asn Tyr Val Phe 405 410 415 Ala Tyr INFORMATION FOR SEQ ID NO:680: SEQUENCE CHARACTERISTICS: LENGTH: 163 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...163 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:680: Ala Leu Ser Arg Cys Leu Ser Asp Arg Phe Asn Gin Tyr Pro Ala Leu 1 5 10 Gin Lys Thr Glu His His Phe Val Asp Phe Leu Asn Gin Asp Lys His 25 Tyr Ala Ile Ile Gin Arg Ala Asp Lys Ser Ile Ser Ser Asn Glu Ala 40 Leu Ala Arg Ser Leu Ile Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile 55 Asn Arg Ile Asp Asp Lys Ser Arg Tyr Glu Leu Val Arg Leu Gin Ser 70 75 Ser Ser Lys Val Trp Gin Arg Phe Glu Asp Leu Ile Lys Ala Gin Asn 90 Ser Ile Tyr Val Gin Ser His Leu Glu Arg Glu Val His Ile Val Asn 100 105 110 Ile Ala Ile Tyr Gin Gin Asp Asn Asn Pro Ile Ala Ser Val 
Ser Ile 115 120 125 Ala Ala Lys Leu Leu Asn Glu Asn Lys Leu Val Tyr Glu Lys Arg Tyr 130 135 140 Lys Ile Val Leu Ser Tyr Leu Phe Asp Thr Pro Asp Phe Asp Tyr Ala 145 150 155 160 Ser Met Pro INFORMATION FOR SEQ ID NO:681: SEQUENCE CHARACTERISTICS: WO 97/37044 600 LENGTH: 194 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...194 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:681: PCTIUS97/05223 Ser 1 Asn Ile Phe Asn Thr Leu Asn Asn Arg 145 Ser Ala Ser Phe Lys Lys Asn Glu Arg Leu Leu Leu Leu Val Ser Arg Phe Leu Ala Lys Thr Ile Lys Asp Asp Ser 130 Ile Val Leu Arg Ile Asn Pro Leu Val Glu Phe 115 His Gin Cys Gly Asp Gly Gly Ser Lys Asn 100 Phe Pro Ser Lys Lys 180 5 Pro Cys Tyr Lys Gly Ser Asn Glu Phe Gin 165 Ala Phe Ile Glu His 70 Thr Phe Arg Leu Ser 150 Ile Ile Asn Tyr Glu 55 Gin Phe Tyr Ser His 135 Asp Gly Leu Leu Gly 40 Ser Ile Val Lys His 120 Arg Asp Asn Lys Gly 25 Val Lys Trp Phe Lys 105 Leu Ser Tyr Ala Ser 185 10 Val Cys Ala Gin Ile 90 Leu Val Ile Ile Val 170 Ala Leu Ser Arg Ser 75 Leu Leu Thr Thr Phe 155 Pro Arg Leu Tyr Val Asn Glu Asn Pro Pro 140 Tyr Pro Asn Ser Lys Leu Gin Asn Leu Ser 125 Arg Gly Leu Asp Arg Val Asn Glu Asp Ile 110 Asn Glu Asn Leu Thr 190 Phe Ser Ala Ser Leu Ile Gly Ala Lys Ala 175 Asn Gin Lys Leu Val His Asp Thr Ala Thr 160 Leu Pro INFORMATION FOR SEQ ID NO:682: SEQUENCE CHARACTERISTICS: LENGTH: 165 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 601 (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...165 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:682: Met Gin Asp Asn Leu Val Ser Val Ile Glu Lys Gin Thr Asn Lys Lys 1 5 10 Val Arg Met Leu Glu Ile Lys Pro Leu Lys Ser Ser Gin Asp Leu Lys 25 Met Val Val Ile Glu Asp Pro Asp Thr Lys Tyr Asn Ile Pro Leu Val 40 Val Ser Lys Asp Gly Asn Ser Val Ile Gly 
Leu Ser Asn Ile Phe Phe 55 Ser Asn Lys Ser Asp Asp Val Lys Leu Val Ala Glu Thr Asn Gln Lys 70 75 Ile Gln Ala Leu Asn Ala Thr Gln Gln Asn Ser Ala Lys Leu Asn Ala 90 Ile Phe Asn Glu Ile Pro Ala Asp Tyr Ala Ile Glu Leu Pro Ser Thr 100 105 110 Asn Ala Glu Asn Lys Asp Lys Ile Leu Tyr Ile Val Ser Asp Pro Met 115 120 125 Cys Pro His Cys Gln Lys Glu Leu Thr Lys Leu Arg Asp His Leu Lys 130 135 140 Glu Asn Thr Val Arg Met Val Val Val Gly Trp Leu Glu Ala Ile Arg 145 150 155 160 Leu Lys Lys Arg Leu 165 INFORMATION FOR SEQ ID NO:683: SEQUENCE CHARACTERISTICS: LENGTH: 115 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...115 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:683: Thr Gly Ala Ile Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu 1 5 10 Gly Ala Ile Leu Leu Val Leu Val Cys Leu Val Gly Tyr Leu Tyr Leu 25 Lys Glu Lys Glu Phe Tyr His Lys Met Arg Arg Leu Glu Lys Thr Leu 40 Asp Glu Ser Tyr Gln Glu Asn Tyr Leu Tyr Ser Lys Arg Leu Arg Glu 55 60 Leu Glu Gly Arg Leu Glu Gly Leu Ser Leu Glu Lys Ser Ala Lys Glu 70 75 Asp Ser Ser Leu Lys Thr Thr Leu Ser His Leu Tyr Asn Gln Leu Gln 90 Glu Ile Gln Lys Ser Met Asp Lys Glu Arg Asp Tyr Leu Glu Glu Lys 100 105 110 Ile Ile Thr 115 INFORMATION FOR SEQ ID NO:684: SEQUENCE
CHARACTERISTICS: LENGTH: 324 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1. .324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:684: Glu Arg Lys Lys Met Asn Thr Asn Leu Lys Lys Aia 1 Thn Asp Lys Sen Giu Thr Ser Leu Leu 145 Lys Asp Asn Leu Glu Met Ala Val1 Lys Leu Thr 130 Ile Ile Ile Phe Gly Asn Glu Pro Lys Asn Lys 115 Tyr Gly Sen Lys Gly Tyr Lys Gin Leu Giu Giy 100 Gin Ser Tyn Sen Ile 180 Ile 5 Leu Gin Lys Gin Gin Phe Sen Lys Gin His 165 Tyr Asp Gin Giu Ile Ser 70 Lys Phe Leu Thr His 150 Leu Lys Val1 Ala Asp Ala 55 Ala Lys Leu Ser Sen 135 Tyr Tyn Ser Lys Asn Ile 40 Pro Giu Ser Gly Leu 120 Lys Phe Gly Lys Tyr Giu 25 Sen Gin Leu Gin Sen 105 Ile Asn Gly Gly Val1 185 Leu 10 Gly Gin Asn Leu Lys 90 Glu Ser Ala Ala Thr 170 Sen Phe Giu Thr Gin 75 His Ile Asn Asn Sen 155 Pro Thr Asp Leu Gin Leu Sen Leu Ala Ala Phe 140 Lys Sen Asn Leu Ala Lys Lys Gin Val Ile Thr 125 Asn Arg Asp Tyr Phe Ile Leu Lys Val1 Ala Thr 110 Asp Met His Val Thr 190 Leu Met Glu Gin Lys Met Gly Asp Gly Gly Ser 175 Pro Val Thr Pro His Gin Ang Gin Thr Leu Leu Ile 160 Thr Leu Ile 195 Phe Phe Leu Asn 200 205 Lys Lys Asn Arg His Thr Leu Gly Leu Sen Val Gly Phe Gly Trp Ang WO 97/37044 PCT/US97/05223 Met 225 Glu Pro Asn Lys Asn 305 Ala 210 Gin Phe Thr Tyr Ile 290 Phe Tyr Tyr Ile Ile Arg 275 Val Leu Leu Tyr Phe Thr Asn 245 Gly Leu 260 Phe Gly Val Gin Thr Lys Phe Ala 230 Lys His Gly Asn Ala 310 215 Lys Pro Tyr Ala Gly 295 Thr Ile Lys Tyr Ile 280 Asn Asn Asp Asp Leu 265 Asn Phe Ser Pro Ile 235 Ile Phe 250 Asn His Tyr Gin Lys Gly Ser Tyr 315 220 Lys Asn His Ser Glu 300 Ile Thr His Gin Val 285 Val Ala Ser Gly Phe 270 Val Leu Phe His Ser 240 Phe Tyr 255 Glu Val Pro Ser Phe Ser Asn Tyr 320 INFORMATION FOR SEQ ID NO:685: SEQUENCE CHARACTERISTICS: LENGTH: 137 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...137 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:685: Ala Asn 1 Val Leu Leu Ile Phe Lys Lys Thr Thr Phe Ile Pro Asn Ala Thr Ile 130 Glu Lys Met Glu Leu Ile Lys Lys Leu Lys Ile Ala Lys Leu Lys Leu 115 Asn Lys Asp Trp Phe Val Met 100 Arg Pro 5 Asp Asn Val Asp Arg Pro Ser Asn Leu Glu Glu Gly 70 Asp Ile Leu Lys Gin Asp Ile 55 Glu Phe Phe Val Ile 135 Gin Leu 40 Val Met Phe Cys Tyr 120 Pro His 25 Phe Lys Ile Asn Gly 105 Leu Phe 10 Ser Lys Met Gly Gly 90 Asp Ser Asn Glu Met Tyr 75 Ile Val Val Glu Lys Glu Leu Gin Phe Phe Glu Thr Glu Phe Lys Lys Cys Leu Glu 125 Glu Phe Glu Leu Glu Ser Glu 110 Leu Ser Glu Lys Met Ile Met Thr Lys Leu Leu Lys Val Asp Phe Glu Glu INFORMATION FOR SEQ ID NO:686: WO 97/37044 WO 9737044PCTIUS97/05223 604 SEQUENCE CHARACTERISTICS: LENGTH: 227 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .227 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:686: Asn Lys Giu Asn Asn Ile Thr Ile Cys Val Ala Asn 1 Ser Asp Giu Leu Ser Glu Thr Ile Ile 145 Ile Giu Gly Val1 His 22S Gly Asn Thr Phe Lys Thr Pro Giu 130 Asn Glu Ser Leu Asn 210 Ala Lys Lys Phe Asn Tyr Gin Sen 115 Gin Arg Phe Ser Gly 195 Phe Phe Ser Giu Ala Arg Giu Lys 100 Gin Leu Met Ile Leu 180 Val1 Tyr 5 Thr Val1 Thr Ser As n Ala Leu Gin Pro Lys 165 Ser Ile Asn Leu Val Ile Ser 70 le Met Asp Glu Thr 150 Giu Giu Giu Glu Cys Leu Val Leu 40 Arg Ala 55 Gly Phe Leu Ile Leu Leu Thr Giu 120 Leu Asn 135 Ile Pro Asn Asn Arg Ile Tyr Ser 200 Leu Lys 215 Asn 25 Asp Giu Ser Asp Ser 105 Val Giu Thr Pro Val 185 Asp 10 Leu Aia Thr Asp Lys Glu Asp Thr 75 Thr Lys 90 Asn Ile Leu Ala Asn Leu Leu Lys i55 Ser Asp 170 Tyr Lys Lys Lys Vali Ser Arg Leu Gly Val Asn Arg 140 Giu Arg Arg Ala Glu Gin Gin Pro Lys Giu Leu Met 125 Ala Arg Ile Ser Ile Lys Leu Lys Thr Gin Tyr Val 110 Leu Leu Gin Thr Val1 190 Asn Gly Leu Ser Phe Met Ser Pro Giu Ile Ala Leu 175 Ser Glu Gly Lys Met Ser Val1 Lys Thr Arg Val1 Leu 160 Leu Glu Trp Sen His Leu Glu Lys Giu Lys Ile 220 INFORMATION FOR SEQ ID NO:687: SEQUENCE CHARACTERISTICS: LENGTH: 105 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 605 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:687: Arg Ser Thr Thr Leu Ile Tyr Phe Asn Phe Lys Glu Pro Met Met Ser 1 5 10 Asp Glu Ile Thr Gin Glu Asn Glu Leu Glu Ile Asn Ser Asn Asn Gin 25 Asn Gin Glu Pro Lys Glu Val Glu Lys Met Pro Leu Asn Asn Ile Gin 40 Lys Ala Lys Lys Leu Lys Asn His Ala Asn Leu Ile Val Arg Arg Thr 55 Asp Glu Leu Asp Lys Val Ile Asn Lys Arg Glu Ser Leu Gin Arg Glu 70 75 Phe Lys Arg Arg Ile Lys His Leu Asp Asn Lys Ile Glu Thr Leu Ser 90 Asn Asn Ile Glu Glu Leu Lys Arg Lys 100 105 INFORMATION FOR 
SEQ ID NO:688: SEQUENCE CHARACTERISTICS: LENGTH: 173 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:688: Ser Met Lys Lys Tyr Gin Asn Lys Arg Lys Leu Gly Lys Arg Ile Asn 1 5 10 Cys Phe Ile Ser Asn Glu Cys Lys Asp Ile Leu Glu Asn Tyr Leu Lys 25 Val Lys Lys Ile Ser Val Thr Glu Cys Ile Glu Ser Leu Leu Leu Ala 40 Leu Asp Pro Ser Lys Glu Thr Leu Arg Ala Arg Asp Phe Tyr Glu Ser 55 Leu Phe Lys Val Ile Ser Leu Asn Gin His Val Tyr Lys Glu Leu Met 70 75 WO 97/37044 PCT/US97/05223 Ala Asn Gly Asn Asn Ile Asn Gin Ile Ala Lys Asn 90 Ile Lys Tyr Asn Lys Pro Phe Tyr Leu Gin Asn Leu 100 105 Glu Ile Phe Lys Val Met Asp Thr Leu Asn Lys Asn 115 120 Lys Lys Ile Ser Leu Glu Leu Leu Leu Gin Ile Tyr 130 135 140 Ser Gin Thr Glu Phe Asn Ala Ile Ser Lys Ile Leu 145 150 155 Glu Lys Gin Ser Thr Thr Glu Pro Lys Lys Asp Asn 165 170 INFORMATION FOR SEQ ID NO:689: SEQUENCE CHARACTERISTICS: LENGTH: 109 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...109 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:689: Lys Ser Pro Tyr Arg His Ser Ile Ser Ser Gin Val 1 5 10 Asp Glu Glu Ile Ile Glu Val Glu Lys Gly Glu Asn 25 Ser Tyr Phe Leu Gly Gly Pro Thr Cys Leu Ala Gly 40 Ser Phe Ser Phe Glu Thr Pro Leu Lys Arg Gly Asp 55 Gln Asp Met Leu His Tyr Thr Ile Val Lys Asn Asn 70 75 Val Pro Leu Pro Ser Leu Ala Lys Ile Asp Ser Gin 90 Leu Lys Ser Phe Ser Tyr Glu Asp Tyr Lys Asn Arg 100 105 INFORMATION FOR SEQ ID NO:690: SEQUENCE CHARACTERISTICS: LENGTH: 133 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein Thr Asn Ile Ala Phe Leu 110 Val Glu Glu 125 Lys Glu Gin Asn Ala Leu Ala Ala Glu Cys SLys Asn 160 Asn Phe Gly Phe Gly Ile Ser Gin Asp Lys Ser Gly Asn Val Gly Phe Ile 
Phe Phe Glu Ala Met Val Asn Lys (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...133
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:690:
Ala Leu Glu Lys Asn Gly Thr Ala Thr Ala Asn Ser Thr Ser Ser Thr
1 5 10
Ser Gly Ala Thr Gly Ser Asp Gly Gln Thr Tyr Ser Gln Gln Ala Ile
25
Gln Tyr Leu Gln Gly Gln Gln Asn Ile Leu Asn Asn Ala Ala Asn Leu
40
Leu Lys Gln Asp Glu Leu Leu Leu Glu Ala Phe Asn Ser Ala Val Ala
55
Ala Asn Ile Gly Asn Lys Glu Phe Asn Ser Ala Ala Phe Thr Gly Leu
70 75
Val Gln Gly Ile Ile Asp Gln Ser Gln Leu Val Tyr Asn Glu Leu Thr
90
Lys Asn Thr Ile Ser Gly Ser Ala Val Asn Asn Ala Gly Ile Asn Ser
100 105 110
Asn Gln Ala Asn Ala Cys Lys Gly Val Leu Val Ser Ser Leu Thr Leu
115 120 125
Phe Thr Thr Cys Lys
130
INFORMATION FOR SEQ ID NO:691:
SEQUENCE CHARACTERISTICS: LENGTH: 84 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...84
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:691:
Leu Ile Lys Glu Ala Met Ile Pro Val Phe Gly Ser Glu Thr Gly Ile
1 5 10
Tyr Asn His Lys Glu Gln Asn Phe Lys Gly Lys Gly Arg Phe Ile Leu
25
Thr Ser Lys Asp Ser Lys Val Glu Gly Leu Asp Ile Ser Tyr Ser His
40
Ala Leu Ala Ile Ile Glu Ala Gln Ser Ile Gln Ala Asn Leu Phe Leu
55
Asp Glu Ile Lys Gln Ser Gln Lys Glu Lys Lys Lys Phe Pro Thr Phe
70 75
Lys Gly Gly Phe
INFORMATION FOR SEQ ID NO:692:
SEQUENCE CHARACTERISTICS: LENGTH: 161 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...161
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:692:
Pro Phe Leu Lys Lys Asn Lys Phe Leu Gln Ile Cys Gln Tyr Phe Ser
1 5 10
Ala His Phe Lys Gln Val Leu Lys Asn Glu Lys Pro Leu Val Tyr Tyr
25
Gly Val Leu Lys Ala Lys Ala Pro Asn Trp Ala Leu Trp Val Tyr Glu
40
Lys Pro Leu Lys Lys Glu Ile Tyr Met Asn Asp Lys Glu Val Val Val
55
Tyr Glu Pro Asn Leu Phe Gln Ala Thr Ile Thr Pro Leu Lys Asp Lys
70 75
Thr Asp Phe Phe Thr Ile Leu Lys Gln Leu Lys Lys Gln Thr Asp Gly
90
Ser Phe Lys Thr Thr Ile Asn Lys Thr Thr Tyr Arg Leu Val Phe Lys
100 105 110
Asp Gly Lys Pro Phe Ser Leu Glu Phe Lys Asp Asp Met Asn Asn Leu
115 120 125
Val Thr Ile Thr Phe Ser Gln Ala Glu Ile Asn Pro Lys Ile Pro Asn
130 135 140
Glu Ile Phe Val Phe Asn Pro Lys Asp Glu Asn Ile Asp Ile Val Arg
145 150 155 160
Gln
INFORMATION FOR SEQ ID NO:693:
SEQUENCE CHARACTERISTICS: LENGTH: 71 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...71
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:693:
Pro Ser Phe Leu Lys Glu Lys Phe Asp Phe Phe Lys Gly Lys Asn Phe
1 5 10
Lys Ile Val
Tyr Cys Ile Gly Glu Asp Leu Thr Thr Arg Glu Lys Gly
25
Phe Arg Ala Val Lys Glu Phe Leu Ser Glu Gln Leu Glu Asn Ile Asp
40
Leu Asn Tyr Ser Asn Leu Ile Val Ala Tyr Glu Pro Ile Trp Ala Ile
55
Gly Thr Lys Lys Ala Arg Phe
INFORMATION FOR SEQ ID NO:694:
SEQUENCE CHARACTERISTICS: LENGTH: 155 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...155
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:694:
Arg Asp Val Arg Phe His Val Arg Asp Leu Val Arg Gln Leu Lys Gly
1 5 10
Lys His Leu Ile Glu Val Ser Asp Val Ile Asn Asp Thr Thr Gln Pro
25
Asn Leu Asp Met Asn Leu Leu Thr Thr Glu Ile Ala Arg Gln Leu Arg
40
Leu Arg Ser Asn Gly Arg Phe Asn Ile Thr Arg Ala Ser Gly Gly Ser
55
Gly Ile Glu Ala Asp Ser Arg Met Val Lys Gln Arg Glu Lys Glu Arg
70 75
Glu Ser Glu Glu Tyr Asn Gln Asp Thr Thr Val Glu Lys Gly Thr Leu
90
Lys Ala Ala Asp Leu Ser Leu Ser Gly Lys Val Ser Ser Ile Ala Ala
100 105 110
Ser Ile Ser Ser Ser Arg Gln Arg Leu Asp Tyr Asp Phe Thr Leu Ser
115 120 125
Leu Thr Asn Arg Lys Thr Gly Glu Glu Val Trp Ser Asp Val Lys Pro
130 135 140
Ile Val Lys Asn Ala Ser Asn Lys Arg Met Phe
145 150 155
INFORMATION FOR SEQ ID NO:695:
SEQUENCE CHARACTERISTICS: LENGTH: 219 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...219
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:695:
Ile Lys Ser Val Leu Gly Leu Ile Leu Ile Gln Gln Asn Phe Lys Thr
1 5 10
Leu Lys Glu Leu Tyr Met Leu Val Thr Lys Leu Ala Pro Asp Phe Lys
25
Ala Pro Ala Val Leu Gly Asn Asn Glu Val Asp Glu His Phe Glu Leu
40
Ser Lys Asn Leu Gly Lys Asn Gly Val Ile Leu Phe Phe Trp Pro Lys
55
Asp Phe Thr Phe Val Cys Pro Thr Glu Ile Ile Ala Phe Asp Lys Arg
70 75
Val Lys Asp Phe His Glu Lys Gly Phe Asn Val Ile Gly Val Ser Ile
90
Asp Ser Glu Gln Val His
Phe Ala Trp Lys Asn Thr Pro Val Glu Lys
100 105 110
Gly Gly Ile Gly Gln Val Ser Phe Pro Met Val Ala Asp Ile Thr Lys
115 120 125
Ser Ile Ser Arg Asp Tyr Asp Val Leu Phe Glu Glu Ala Ile Ala Leu
130 135 140
Arg Gly Ala Phe Leu Ile Asp Lys Asn Met Lys Val Arg His Ala Val
145 150 155 160
Ile Asn Asp Leu Pro Leu Gly Arg Asn Ala Asp Glu Met Leu Arg Met
165 170 175
Val Asp Ala Leu Leu His Phe Glu Glu His Gly Glu Val Cys Pro Ala
180 185 190
Gly Trp Arg Lys Gly Asp Lys Gly Met Lys Ala Thr His Gln Gly Val
195 200 205
Ala Glu Tyr Leu Lys Glu Asn Ser Ile Lys Leu
210 215
INFORMATION FOR SEQ ID NO:696:
SEQUENCE CHARACTERISTICS: LENGTH: 113 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...113
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:696:
Gly Val Ile Met Glu His His Lys Ala His Thr Thr Ile Gln Ala Leu
1 5 10
Gln Ala Lys Arg Lys Arg Leu Leu Thr Glu Leu Ala Glu Leu Glu Ala
25
Glu Ile Lys Val Ser Ser Glu Arg Arg Ser Ser Phe Asn Val Ser Leu
40
Ser Pro Ser Leu Leu Ala Glu Ile Glu Glu Ile Glu Tyr Glu Glu Lys
55
Met Ser Lys Glu Arg Arg Ile His His Asn Leu Leu Leu Ser Pro Ser
70 75
Phe Met Ala Lys Val Asp Glu Tyr Met Lys Glu Lys Gly Phe Pro Asn
90
Arg Ser Leu Leu Phe Glu Lys Ala Leu Glu Phe Tyr Met Leu Lys His
100 105 110
Pro
INFORMATION FOR SEQ ID NO:697:
SEQUENCE CHARACTERISTICS: LENGTH: 187 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...187
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:697:
Gly Phe Val Cys Gly Ser Val Leu Phe Gln Leu Gly Val Phe Met Gly
1 5 10
Leu Lys Asn Lys Ile Lys Gly Phe Val Lys Glu Arg Met Pro Phe Val
25
Met Arg Cys Val Arg Gly Leu Lys Gly Ala Lys Asp Thr His Glu Asn
40
Ala His Asp Arg Asp Ala Tyr Cys Gly Ile Asn Arg Glu Ile Lys Glu
55
Met Leu Glu Ala Lys Lys Leu His Phe Leu Gln Glu Lys Ala Leu Phe
70 75
Asn His Asp His Gln Glu Ser Val Phe Leu Ala Ile Ala Ser Leu Asn
90
Asn Glu Ser Phe Ile Glu Tyr Asn Lys Ser Ile Tyr Lys Asn Ser Ser
100 105 110
Leu Asn Tyr Asn Tyr Gly Gly His Leu Glu Asp Arg Val Ile His Pro
115 120 125
Thr Leu Thr Leu Pro Asn Pro Thr His Ser Gly Tyr Phe Asp Tyr Asp
130 135 140
Lys Lys Ser Gln Asn Pro Lys Ser Pro Leu Asn Pro Trp Ala Phe Ile
145 150 155 160
Arg Val Lys Asn Glu Ile Val Thr Leu Glu Glu Ser Leu Phe Ser Met
165 170 175
Leu Pro Ala Val Gln Arg Gly Gly His Trp Phe
180 185
INFORMATION FOR SEQ ID NO:698:
SEQUENCE CHARACTERISTICS: LENGTH: 230 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...230
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:698:
Lys Arg Val Cys Phe Leu Cys Phe Leu Pro Phe Lys Gly Gly Val Ile
1 5 10
Gly Phe Asn Asp Cys Asp Asp Gly Ser Lys Glu Val Val Leu Glu Phe
25
Cys Lys Lys Phe Pro Ser Phe Ile Pro Ile Ser Tyr Pro Tyr Glu Val
40
Ile Leu Lys Asp Cys Pro Ser Leu Trp His Gln Leu Tyr His Tyr Cys
55
Asn Tyr Thr Leu Ser Phe Ile Pro Lys Asn Glu Trp Val Val Lys Ile
70 75
Asp Cys Asp His Ile Tyr Asp Ala Lys Lys Leu Tyr Lys Ser Phe Tyr
90
Ile Pro Lys Asn Ile Lys Glu Val Val Met Tyr Ser Arg Ile Asn Phe
100 105 110
Val Val Arg Asp Phe Glu Val Phe Val Arg Asn Asp Gly Asp Phe Gly
115 120 125
Phe Leu Asp Ala Trp Gly Asp His Trp Leu Leu Tyr Asn Asp Cys Glu
130 135 140
Pro Phe Glu Ile Trp Arg Tyr Asn Asp Glu Ser Tyr Glu Val Leu Lys
145 150 155 160
Leu Lys Asp Lys His His Ile Lys Asp Lys Glu Met Val Gln Trp His
165 170 175
Phe Pro Leu Ala Lys Lys Arg Arg Asn Ala Ile Val Tyr Asp Asp Leu
180 185 190
Ile Pro Leu Glu Glu Phe Lys Lys Arg His Ala Asp Leu Ile Gly Thr
195 200 205
Arg Ile Glu Glu Ser Met Leu Asp Glu Lys Arg Ile Leu Glu Val Tyr
210 215 220
Gln Lys Phe Arg Leu Pro
225 230
INFORMATION FOR SEQ ID NO:699:
SEQUENCE CHARACTERISTICS: LENGTH: 86 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...86
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:699:
Leu Ser Leu Asn Lys Lys Met Met Phe Glu Leu Thr Lys Lys Thr Lys
1 5 10
Phe Asp Gly Glu Met Ile Gly Tyr Thr Glu Glu Leu Leu Thr Phe Leu
25
Val Arg Asp Phe Phe Asn Gly Ile Phe Lys Ser Lys Val Ile Pro Lys
40
Met Pro Ile Phe Cys Gly Asp Val Lys Cys Glu Asp Phe Asn Ala Leu
55
Arg Ser Leu Val Tyr Leu Ser Val Leu Glu Leu Glu Glu Thr Ile Asn
70 75
Pro Asn Lys Ile Pro Phe
INFORMATION FOR SEQ ID NO:700:
SEQUENCE CHARACTERISTICS: LENGTH: 164 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...164
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:700:
Phe Ser Arg Lys Thr Asp Lys Asn Ala Gln Lys Asp Glu Gln Lys Asn
1 5 10
Glu Gln Glu Glu Gln Arg Arg Leu Arg Glu Gln Gln Arg Leu Lys Gln
25
Asn Gln Glu Asn Gln Glu Met Leu Lys Gly Leu Gln Gln Asn Leu Asn
40
Gln Phe Thr Gln Lys Leu Glu Ser Val Lys Asn Lys Thr Leu Asp Leu
55
Gln Ile Pro Lys Gln Asp Gly Val Asp Glu Lys Ala Tyr Gln Glu Trp
70 75
Tyr Ala Gln Ile Tyr Gln Ile Leu Tyr Lys Gly Trp Arg Gly Val Phe
90
Tyr His Lys Ala Ser Val Ser Val Leu Ile Met Ile Thr Lys Asp Gly
100 105 110
Glu Phe Asp Tyr Thr Ile Leu Ser Tyr Ser Asp Phe Lys Asp Tyr Asn
115 120 125
Lys Ser Val Met Thr Leu Leu Asp Asp Leu Lys Lys Val Asp Phe Pro
130 135 140
Pro Tyr Pro Gly Gly Asn Met Ile Ser Ile Lys Val Asn Phe Thr Thr
145 150 155 160
Lys Glu Glu Gln
INFORMATION FOR SEQ ID NO:701:
SEQUENCE CHARACTERISTICS: LENGTH: 68 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...68
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:701:
Arg Ala Arg Leu Leu Arg Arg Lys Asn His Tyr Leu Glu Asn Lys Phe
1 5 10 Lys Asp Met Gly His Tyr Ala Ala Ser Asp Glu Val Asn Glu Lys Gin 25 Val Leu Lys Met Tyr Gin Glu Gly Tyr Ser Val Asp Ser Ile Ser Lys 40 Glu Phe Lys Val Ser Lys Gly Glu Val Glu Phe Ile Leu Asn Met Ala 55 Gly Leu Lys Trp INFORMATION FOR SEQ ID NO:702: SEQUENCE CHARACTERISTICS: LENGTH: 97 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...97 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:702: Thr Lys Leu Asp Thr Ile Ser Leu Lys Arg Ile Gin Met Glu Lys Leu 1 5 10 Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gin Lys Leu Ile His Ser 25 Leu Thr Thr Gin Lys Asn Arg Ala Phe Leu Lys Lys Ile Ser Ala Asn 40 Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys Lys Asn Glu Ala 55 Tyr Phe Ile Ser Asp Glu Glu Asp Lys Asn Tyr Val Leu Val Pro Asp 70 75 Asn Val Ile Ser Leu Leu Ala Glu Asn Ala Arg Lys Ala Phe Glu Ala 90 Arg INFORMATION FOR SEQ ID NO:703: SEQUENCE CHARACTERISTICS: LENGTH: 272 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97137044 WO 9737044PCT1US97105223 616 (vi) ORIGINAL SOURCE: ORGAN~ISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1..-272 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:703: Arg Ser Ile Lys Met Lys Gin Ser Leu Arg Glu Gin 1 Ile Leu Gly Phe Asp Asn Asp Thr Ser 145 Tyr Leu Ile Asn Met 225 Ala Val1 Leu Phe Giu Leu Asp Ala Phe Leu 130 Phe Phe Giu Giu Lys 210 Asp Lys Thr Giu Giu Ile Giu Val1 Asn Arg 115 Sen Lys Met Lys Asn 195 Ile Ser Lys Asn Asn Leu Thr Asp Lys Leu 100 Phe Arg Ala Leu Pro 180 Phe Tyr Ile Lys Lys 260 5 Asp Arg Giu His Thr His Phe Leu Asn Giu 165 Ile Phe Pro Ile Giu 245 Met Val Giu Gin Val 70 Giu Phe Lys Ile Asn 150 Thr Asn Vali Thr Pro 230 Leu Leu Leu Giu Asn 55 Asn His Val1 Giu Pro 135 Sen Ile Asn Gin Asn 215 Asn Phe Ala Thr Leu 40 Leu Val Leu Lys Ile 120 Leu Phe Ang Asn Ala 200 His Trp Giu Gin Ile 25 Asp Thr Phe Tyr Sen 105 Asn Gin Val1 Gin 
Ile 185 Asn Ala Ile Lys Glii 265 10 Leu Phe Ala Tyr Ser 90 Phe Asp Ser Ser Ser 170 Ser Phe Tyr Gin Tyr 250 Asn Asp Ile Leu Giu 75 Gly Leu Gly Gly Leu 155 Tyr Giu Leu Asp Ile 235 Phe Gin Ser Giu Tyr Asn Leu Ser Gin Lys 140 Val Arg Asp Giu Phe 220 Asp Gin Asn Lys Phe Giu Asp Val1 Ile Asn Asp 125 Asn Tyr Ile Met Tyr 205 Thr Met Asn Lys Leu Ser Giu Phe Leu Asp Gin 110 Pro Asp Val Leu Gin 190 Tyr His Ser Ile Asn 270 Leu Asn Met Ser Asn Ser Asp Gin Ala Tyr Arg 175 Ser Val1 Leu Val Asp 255 Ser Lys Tyr Giu Asn Ile Leu Leu Lys Ser Ala 160 Leu Asp Gin Ile Giu 240 Giu Asp INFORMATION FOR SEQ ID NO:704: SEQUENCE CHARACTERISTICS: LENGTH: 229 amino acids TYPE: amino acid TOPOLOGY: iinear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCTJUS97/05223 617 (ix) FEATURE: NAME/KEY: misc-feature LOCATION .229 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:704: Val1 1 Gly Asn Gin Arg Val Leu Asp Asn Gly 145 Pro Leu Asp Gly Lys Oly Glu Lys Asn Ala Trp Tyr Leu OWy Ile Ser Th-r Gin Tyr Gly Trp Phe Asn Met Lys 130 Asn Tyr Phe Phe ksn Ala Pro Leu Phe Gly Giu Phe 115 Glu Ser Lys Asn Gly 195 Leu *Ser Lys Gdy Gly Al a Pro 100 Thr Asp Trp His Leu 180 Vai Ser 5 Glr Phe Leu Ala Asn Cys Tyr Al a Gly Thr 165
G
1 y Lays Phe Ser Val Lys Pro Val Gly 40 *Thr Val Gly 55 Arg Tyr Tyr 70 Ala Leu Thr Ala Thr Lys Gly Val Gly 120 Ser Phe Gly 135 Asn Thr Thr 150 Ser Tyr Ser Ile Arg Thr Ile Pro Thr 200 Thr Tyr Ara Asr 25 Lys Tyr Gly Ser Val1 105 Ile Phe Gly Leu His 185 Ile 10 Pro *Thr *Lys Phe Asp 90 Gly Asp Phe Ala Asp 170 Ile Asn Prc Asp Glrj Met 75 Asn Thr Thr Phe Phe 155 Pro 3 iy Val1 Lys Tyr Phe Asp Gly Met Leu Gly 140 Leu Ala Gin Tyr Ser 220 Ser Leu Phe Tyr Gly Tyr 125 Ala Giu Ile His Tyr *Ser Ala Gly Gly Val1 Asn 110 Asn Gin Thr Phe Gin 190 Phe Gln Glu Val1 Gl u His Cys Leu Val1 Ile Lys Gin 175 Glu Asn 1Val Phe Met Lys Ala Lys Ser Ile Ala Ser 160 Phe Phe His Arg Gin Tyr 215 Ueu Tyr Val Gly Arg Tyr Asn Phe INFORMATION FOR SEQ ID NO:705: SEQUENCE
CHARACTERISTICS: LENGTH: 65 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc -feature LOCATION (xi) SEQUENCE DESCRIPTION: SEQ ID NO:705: WO 97/37044 PCT/US97/05223 Ser 1 Gly Arg Trp Ile Ala Arg Asn Ile Val Val Glu Ser 5 Ser Cys Ser Leu Gin Ser Val Ala Glu Val Ser Thr Val Val Ile Leu Lys Ile Asp Ile Asn Gin 10 Tyr Pro Thr Pro Lys Ser Val Ser Leu Val 25 Tyr Glu Ile Leu Cys Glu Asn Gin Pro Leu 40 Asn Leu Gly Gin Thr Leu Pro Phe Ser Leu 55 INFORMATION FOR SEQ ID NO:706: SEQUENCE CHARACTERISTICS: LENGTH: 365 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...365 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:706: Arg 1 Ala Ser Asn Asn Gin Asn Val Asn Thr 145 Phe Ile Phe Lys His Phe Arg Gly Gly Gly Phe Pro His Cys Leu Asn Gin Phe Gin Ala Leu 130 Ile Ser Gly Gly Leu Thr Asn Ile Ala Ile 115 Asn Glu Glu Thr Ala Gly Leu Asn Gin Asn 100 Asn Asn Gin Pro Val 5 Lys Ser Leu Pro Asn Gin Ala Leu Tyr Lys 165 Thr Pro Thr Met Asn 70 Leu Gin Leu His Asn 150 Asn Asn Thr Ser Asn 55 Gly Thr Val Asp Asn 135 Asn Leu Asp Gly Ser 40 Thr Cys Pro Gin Asn 120 Ala Ala Leu Gin 10 Gin Val 25 Asn Ser Leu Gin Ala Asn Leu Ala 90 Ala Ile 105 Asn Ala Leu Asn Leu Lys Lys Asn 170 Gly Gin Val Asn Gly Gin 75 Ala Ala Ile Phe Gin 155 Thr Asn Phe Asn Glu Ile Thr Gin Asn Gin 140 Ile Ser Ile His Asn Tyr Pro Gin Gin Leu Ser Gin Cys Pro Thr Lys Leu 110 Asn Thr 125 Ala Tyr Ser Trp Asn Asn Ser Ala Leu Thr Gin Thr Leu Ser Gin Thr Gin Ile Tyr 175 Tyr Asn Asn Tyr Asn Glu Thr Ser Tyr Ser Ser 160 Gin Asp 180 185 190 Cys Thr Ser Ala Thr Gly Ser Leu Ser Ser Asp Ala Ser Ser Gly Ile WO 97/37044 PCT/US97/05223 619 195 200 205 Ser Cys Ser Ala Thr Ser Ser Thr Ser Asn Thr Asn Ser Phe Asp Asn 210 215 220 Ser Leu Val Ala Thr Ser Lys Val Gin Thr Ile Asn Gly Lys Glu Gin 225 230 235 240 Ile Gly Val Asn Ser Phe Asn Leu Val Ser Gin Val Trp Ser Val Tyr 245 250 255 Asn Ser Leu Lys Thr Ser Glu Glu Asn Leu Gin Lys Asn 
Ala Lys Ile
260 265 270
Leu Cys Asn Asn Gly Ser Gln Ser Gly Thr Ser Pro Cys Asn Ser Ser
275 280 285
Ser Gly Gly Leu Ser Ile Ser Gly Asn Ala Gln Leu Gln Asn Ile Leu
290 295 300
Ser Pro Thr Asn Gly Thr Thr Thr Asn Thr Gln Ala Lys Ser Asn Ala
305 310 315 320
Ser Lys Leu Lys Ala Met Val Met Val Asn Asn Glu Glu Glu Ala Lys
325 330 335
Thr Thr Asn Phe Asn Gln Ser Ser Gly Pro Thr Thr Gln Ser Ser Asn
340 345 350
Ser Thr Val Met Gly Ala Leu Asn Thr Val Leu Gln Asn
355 360 365
INFORMATION FOR SEQ ID NO:707:
SEQUENCE CHARACTERISTICS: LENGTH: 135 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...135
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:707:
Asn Pro Phe Phe Thr Ile Thr Ala Lys Leu Leu Ser Asn Thr Gln Ser
1 5 10
Ala Phe Asp Gln Gly Ile Ala Leu Ser Ser Asn Ile Ile Ser Ala Val
25
Asn Ser Leu Asn Pro Ser Asn Asn Ser Gln Glu Val Lys Ala Gln Leu
40
Gln Asn Thr Ala Gln Ser Met Thr Glu Leu Leu Gln Gln Ile Glu His
55
Ser Ile Thr Lys Thr Thr Ser Thr Thr Tyr Ala Gln Ser Leu Leu Ser
70 75
Asn Leu Thr Asp Ala Val Asn Ala Ser Ser Asn Asn Thr Thr Tyr Val
90
Ser Ala Leu Val Asn Ala Leu Asn Thr Leu Gly Val Gly Val Phe Pro
100 105 110
Thr Thr Thr Ser Thr His Val Val Leu Asn Pro Pro Asp Lys Ser Tyr
115 120 125
Ser Ile Gln Leu Ile Pro Phe
130 135
INFORMATION FOR SEQ ID NO:708:
SEQUENCE CHARACTERISTICS: LENGTH: 91 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...91
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:708:
Gly Arg Gly Met Lys Leu Lys Lys Arg Lys Val Ala Ala Thr Leu Leu
1 5 10
Lys Arg Leu Thr Leu Pro Leu Leu Phe Thr Thr Gly Ser Leu Gly Ala
25
Val Thr Tyr Glu Val His Gly Asp Phe Ile Asn Phe Ser Lys Val Gly
40
Phe Asn Arg Ser Pro Ile Asn Pro Val Lys Gly Ile Tyr Pro Thr Glu
55 Thr Phe Val Asn Leu Thr Gly Lys Leu Glu Gly Ser Val His Leu Gly 70 75 Arg Gly Trp Thr Val Asn Val Gly Gly Val Leu INFORMATION FOR SEQ ID NO:709: SEQUENCE CHARACTERISTICS: LENGTH: 721 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...721 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:709: WO 97/37044 PCTIUS97/05223 Met 1 Phe Leu Gly Val Gly Gin Va1 Thr Asn 145 Phe Phe Gly Leu Leu 225 Ala Glu Lys Asp Ser 305 Lys Leu Leu Gin Leu 385 Ile Tyr Ala Glu Glu Ala Thr Met Phe Glu lie Glu Leu 130 Gin Leu Thr Val Gly 210 Leu Phe Phe Ile Leu 290 Thr Ser lu ksp ly 370 lu lie 31n Phe ryr Gin Tyr Asn Va1 Ala Tyr Pro Va1 115 Ser Leu Ala Asp Lys 195 Ser Ser Leu Asp Lys 275 Glu Pro Arg Lys Lys 355 Tyr Phe Gly I Val Leu I 435 Leu Pro Leu Asp Lys Leu Lys Ile 100 Gly Pro Leu Lys Tyr 180 Gly Leu Pro Ser Phe 260 Asp Asn Ile Met Leu 340 Asp Phe Leu His Pro 420 Lys Lys Val 5 Phe Lys Lys Glu Gin Ala Gly Tyr Ser Asp 165 Gin Ile Glu Lys Lys 245 Leu Glu Ser Leu Ile 325 Glu Lys Leu Gin Asp 405 Leu Asn Glu Ile Arg Gly Phe Ser 70 Asn Leu Phe Lys Asp 150 Cys Gly Gly Lys Met 230 Glu Ser Leu Pro Asp 310 Val Asn Lys Pro Asn 390 Leu Glu Pro Asp I Lys Ser Phe Tyr 55 Gin Arg Glu Glu Thr 135 Lys Val Ile Ser Ile 215 Tyr Leu Cys Lys Phe 295 Asn Leu Pro Ile Leu 375 Ala Lys Asn lu Leu Glu Tyr Pro 40 Lys Thr Lys Trp Ala 120 Arg Ile Glu Va1 Lys 200 Tyr Gin Ala Ala Glu 280 Ile Thr Glu Asn Leu 360 Glu Phe Pro Ile Lys 440 Ile 1 Gly Tyr 25 Thr Asp Lys Asp Leu 105 Asp Ile Ala Lys Gly 185 Asn Glu Ala Thr Phe 265 Tyr Va1 Pro Ser Ala 345 Ala Glu Ser Leu Arg 425 Val Pro Thr 10 Met Gly Arg Thr Ala 90 Gin Asp Tyr Leu Tyr 170 Asp Ala Asn Leu Leu 250 Pro Gly Glu Ala Ala 330 Arg Leu Ala Leu 410 lie Gly His Leu Ser Leu Lys Lys 75 Pro Lys Val Ser Phe 155 Gly Ser Lys Leu Ile 235 Glu Ser Phe Asn Leu 315 Glu Va1 Ala Leu Met 395 Ser Gin Phe Glu 1 Ala Ala Leu Asn Arg Lys Met Ile Lys 140 Asp 
Ile Ser Glu Asp 220 Gin Arg Glu Ile Val 300 Asp Pro Phe Phe Phe 380 Leu Phe Asp Asp Lys Leu Lys Thr Met Ala Glu Gly Ala 125 Asp Gly Leu Asp Leu 205 Leu Asp Gly Asn Ser 285 Pro Asn Leu Met Leu 365 Ser Gln Leu rhr lu 445 ile Ile Asn Gly Pro Glu Met Phe 110 Ser Lys Lys Pro Asn 190 Leu Ala Lys Cys Pro 270 Thr Ile Ala Ser Arg 350 Leu Pro His Lys Gin 430 Val Lys As Lys Leu Phe Lys Leu Thr Leu Asp Thr Ser 175 Tyr Gin Lys Gly Ile 255 Leu Leu Leu Pro Met 335 Leu 3ml Phe Ala Ala 415 Ile Leu ksp Thr Pro Val Ile Leu Leu Cys Ala Phe Glu 160 Gin Lys Arg Asn Ser 240 Lys Leu Arg Asn Lys 320 Phe Val Asp Ser Cys 400 Lys Leu Lys Phe WO 97/37044 PCT/US97/05223 Lys 465 Asn Glu Val Glu Ala 545 Ile Ser Gin Gin Ser 625 Val Asn Gly Leu Asn 705 Ile 450 Thr Ala Asp Leu Asn 530 Ile Lys Asn Thr Lys 610 Ile Pro Leu Asn Lys 690 Leu Lys Leu Leu Met 515 Ile Thr Ala His Asn 595 Gin Pro His Asn Arg 675 Ser Gly Ser Lys Leu 500 Gly Val Ser Met Thr 580 Arg Ile Lys Leu Lys 660 Leu Asn Pro Lys Ala 470 Arg Leu 485 Thr Leu Met Glu Ile Ala Thr Gly 550 Leu Pro 565 Leu Ser Glu Phe Leu Ser Asp Gin 630 Gly Lys 645 Glu Ile Asp Ser Gin Thr Gly Val 710 455 Glu Lys Cys Glu Ala Arg Phe Gin 520 Gly Val 535 His Val Ile Leu Thr Gin Ala Lys 600 Asn Ala 615 Leu Lys Thr Pro Asn Ala Ala Leu 680 Glu Ile 695 Ser Lys Ser Tyr Asp 505 Gly Thr Thr Gin Leu 585 Asp Ser Tyr Thr Val 665 Ser Val Leu Glu Phe 490 Ile Phe Asn Asp Gin 570 Gin Ile Ser Leu Asn 650 Gin Val Thr Pro 460 Leu Leu 475 Glu Lys Glu Thr Lys Ile Lys Pro 540 Tyr Ala 555 Ala Leu Ala Arg Tyr Ala Ile Phe 620 Glu Asn 635 Pro Tyr Asp Asn Ala Lys Thr Tyr 700 Cys Arg 715 Ser Gly Pro Asp 525 Asn Val Thr Ala Leu 605 Asn Ala Arg Val Asp 685 Asn Cys Met Glu Gly Leu 495 Phe Val 510 Ala Ala Gly Ala Phe Asn Leu Ser 575 Met Gly 590 Ala Gin Leu Phe Tyr Leu Gin Asn 655 Ala Asn 670 Val Tyr Asp Ala Leu Glu Leu 480 Glu Lys Leu Gly Asn 560 Gin Ser Asn Asn Lys 640 Val Tyr Asn Lys Lys 720 INFORMATION FOR SEQ ID NO:710: SEQUENCE CHARACTERISTICS: LENGTH: 76 amino acids TYPE: amino acid 
TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...76 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:710: Thr Ser Leu Ser Leu Phe Gly Glu Asp Leu Ala Lys Glu Lys Arg Ser WO 97/37044 WO 9737044PCT/US97/05223 1 Ile Lys Ile Gly (2) 5 10 Ala Lys Ser Ile Asn Phe Gly Leu Val Tyr G 25 Leu Ser Glu Thr Leu Ser Ile Pro Leu Ser G 40 Glu Ala Tyr Phe Lys Arg Phe Pro Ser Ile L 55 6 Met Arg Glu Glu Ile Leu Lys Thr Ser Lys A 70 INFORMATION FOR SEQ ID NO:711: SEQUENCE CHARACTERISTICS: LENGTH: 325 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .325 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:711: ly Met Gly Ser Lys iu Ala Lys Ser Tyr ys Asp Tyr Leu Asn 0 la Lys Giu Lys Asp Asn Pro Leu Gin Asn Phe Val Phe 1 Leu Ala Ala Ser Pro Met Gly Pro Asn 145 Asn Asn Asp Ile Glu Val Thr Leu Gin Gly Pro 130 Asn Pro Gin Ala Tyr Asp Gin Tyr Ala Met Tyr 115 Arg Leu Glu Sen Leu Sen Asp Ang Ile Tyn Leu 100 Gin Gly Asn Asn Thn 180 Asn 5 Sen Gly Val1 Arg Tyr Cys Asn Asn Lys Leu 165 Val1 Asn Leu Phe Asp Gin 70 Leu Pro Gly Val1 Leu 150 Pro Ile Asp Leu Phe Asn 55 Asn Giu Asp Gin Asn 135 Thr Asn Ala Ile Pro Met 40 Ser Ala Ala Pro Asn 120 Ala Gli Sen Leu Thn 200 Leu 25 Gly Gly Ile Met Sen 105 Asn Thr Leu Lys Pro 185 Asn 10 Phe Vai Leu Ala Gly 90 Lys Asn Phe Ile Val1 170 Glu Ala Phe Ser Asn Leu Gin Ang Gly Asp Gly 155 Phe Gly Leu Leu Tyr Ala Giu Gin Cys Asp Met 140 Giu Asn Leu Thn Asn Asn Gin Ser Ser Thr Leu Thr 125 Gin Thr Val1 Ala Thr 205 Lys Pro Thr Gin Ala Arg Leu 110 Gly Ser Leu Lys Asn 190 Leu Lys Leu Ser Asp Ala Val Tyr Asn Leu Ile Phe 175 Thr Trp Asn Gin Thr Leu Thr Asn Lys Ser Phe Ser Thr Pro Ser Asn Thn Ser WO 97/37044 PCT/US97/05223 624 210 215 220 Val Asn Phe Ser Pro Gin Val Leu Gin His Leu Leu Gin Asp Gly Leu 225 230 235 240 Ala Thr Ala Asn Asn Asn 
Gln Thr Ile Cys Ser Thr Gln Asn Gln Cys
245 250 255
Thr Ala Thr Asn Glu Ala Lys Ser Ile Ala Gln Asn Ala Gln Asn Ile
260 265 270
Phe Gln Ala Leu Met Gln Ala Gly Ile Leu Gly Gly Leu Ala Asn Glu
275 280 285
Lys Gln Phe Gly Phe Thr Tyr Asn Lys Ala Pro Asn Gly Gly Asp Ser
290 295 300
Gln Gln Gly Tyr Ser Lys Leu Cys Gly Pro Gly Ile Thr Pro Lys Arg
305 310 315 320
Gln His His Ala Ser
325
INFORMATION FOR SEQ ID NO:712:
SEQUENCE CHARACTERISTICS: LENGTH: 280 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...280
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:712:
Tyr Asn Gln Met Gln Asp Val Ile Asn Tyr Gly Glu Ser Leu Leu Ser
1 5 10
Asn Thr Val Ala Tyr Gly Asp Phe Ile Thr Asn Trp Val Ala Pro Tyr
25
Leu Asp Leu Asn Asn Lys Gly Leu Asn Phe Leu Pro Asn Tyr Gly Gly
40
Gln Leu Asn Gly Ala Asn Asn Gln Thr Pro Gln Leu Thr Pro Gln Gln
55
Ala Gln Gln Glu Gln Lys Val Ile Met Asn Gln Leu Glu Gln Ala Thr
70 75
Asn Ala Pro Thr Pro Ala Gln Ile Asn Arg Ile Leu Ala Asn Pro Tyr
90
Ser Pro Thr Ala Lys Thr Leu Met Ala Tyr Gly Leu Tyr Arg Ser Lys
100 105 110
Ala Val Ile Gly Gly Val Ile Asp Glu Met Gln Thr Lys Val Asn Gln
115 120 125
Val Tyr Gln Met Gly Phe Ala Arg Asn Phe Leu Glu His Thr Leu Tyr
130 135 140
Ile Leu Ile Thr Leu Asn Gly Phe Gly Val Lys Met Gly Tyr Lys Gln
145 150 155 160
Phe Phe Gly Lys Lys Arg Met Phe Gly Leu Arg Tyr Tyr Gly Phe Tyr
165 170 175
Asp Phe Gly Tyr Ala Gln Phe Gly Thr Glu Ser Ser Leu Val Lys Ala
180 185 190
Thr Leu Ser Ser Tyr Gly Ala Gly Thr Asp Phe Leu Tyr Asn Val Phe
195 200 205
Thr Arg Lys Arg Gly Thr Glu Ala Ile Asp Ile Gly Phe Phe Ala Gly
210 215 220
Ile Gln Leu Ala Gly Gln Thr Trp Lys Thr Asn Phe Leu Asp Gln Val
225 230 235 240
Asp Gly Asn His Leu Lys Pro Lys Asp Thr Ser Phe Gln Phe Leu Phe
245 250 255
Asp Leu Gly Ile Met Asp Gln Ile Phe Pro Lys Ser Leu Ile Ser Lys
260 265 270
Lys Asn Leu Val Phe Leu Lys Gly
275 280
INFORMATION FOR SEQ ID NO:713:
SEQUENCE CHARACTERISTICS: LENGTH: 135 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...135
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:713:
Gln Pro His Lys Gln Arg Asn Ile Ala His Leu Gln His Ala Gln Ala
1 5 10
Val Ile Thr Ser Val Leu Ala Phe Trp Ser Leu Tyr Ala Gly Asn Ala
25
Leu Ser Phe His Val Thr Gly Leu Asn Asp Gly Ser Asn Ser Pro Leu
40
Gly Arg
Ile His Arg Asp Gly Asn Cys Thr Gly Leu Gln Gln Cys Phe
55
Met Ser Lys Glu Thr Tyr Asp Lys Met Lys Thr Leu Ala Glu Asn Leu
70 75
Gln Lys Ala Gln Gly Asn Leu Cys Ala Leu Ser Glu Cys Ser Ser Asn
90
Gln Asp Ser Gly Phe Pro Val Leu Asp Ser Ala Gly Lys Gln Val Thr
100 105 110
Ile Thr Ile Thr Thr Gln Thr Asn Gly Ala Asn Lys Ser Glu Thr Thr
115 120 125
Thr Thr Thr Thr Thr Thr Asn
130 135
INFORMATION FOR SEQ ID NO:714:
SEQUENCE CHARACTERISTICS: LENGTH: 124 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...124
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:714:
Gly Leu Ser Arg Gln Gly Asn Glu Arg Leu Asp Ala Thr Leu Asn Leu
1 5 10
Lys Thr Gly Val Gln Ser Phe Phe Lys Lys Tyr Ile Gly Ile Arg Gly
25
Val Phe Ala Trp Asp Leu Gly Ser Gly Lys Val Asn Tyr Gln Ser Tyr
40
Lys Asp Pro Thr Asn Ser Phe Phe Thr Met Leu Ala Val Gly Leu Asp
55
Val Ile Met Glu Phe Pro Leu Gly Ser Tyr Lys His Tyr Leu Gly Ala
70 75
Phe Gly Gly Ala Arg Gly Ala Leu Val Val Tyr Thr Asp Lys Gln Asn
90
Phe Lys Phe Phe Lys His Ser Val Val Ser Gly Gly Leu Ala Ile Ser
100 105 110
Gly Gly Trp Leu Ile Leu Leu Arg Thr Pro His Phe
115 120
INFORMATION FOR SEQ ID NO:715:
SEQUENCE CHARACTERISTICS: LENGTH: 103 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...103
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:715:
Pro Val Gly Leu Tyr Pro Leu Glu Ser Pro Leu Ile Tyr Glu Glu Asn
1 5 10
His Leu Leu Pro Met Gly Phe Ile His Leu Ala Phe Arg Gly Gly Gly
25
Ser Leu Ser Asp Lys Asn Gln Leu Gly Leu Ala Lys Leu Phe Ala Gln
40
Val Leu Asn Glu Gly Thr Lys Glu Leu Gly Ala Val Gly Phe Ala Gln
55
Leu Leu Glu Gln Lys Ala Ile Ser Leu Asn Val Asp Thr Ser Ala Glu
70 75
Asp Leu Gln Ile Thr Leu Glu Phe Leu Lys
Glu Tyr Glu Asp Glu Ala
90
Ile Met Arg Leu Lys Glu Leu
100
INFORMATION FOR SEQ ID NO:716:
SEQUENCE CHARACTERISTICS: LENGTH: 85 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...85 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:716: Tyr Ser Val Tyr Ile Arg Ser Asn Phe Ser Lys Val Ala His Phe Ala 1 5 10 Ser Gly Tyr Leu Gin Thr Lys Leu Ser Thr Gin Ala Lys Ser Val Ala 25 Leu Ala Lys Lys Thr Thr Lys Glu Phe Ile Glu Lys Gly Met Thr Gin 40 Gin Glu Leu Asp Asp Ala Lys Lys Phe Leu Leu Gly Ser Glu Pro Leu 55 Arg Asn Glu Thr Ile Ser Ser Arg Leu Asn Thr Thr Tyr Asn Tyr Phe 70 75 Ile Trp Val Cys Leu INFORMATION FOR SEQ ID NO:717: SEQUENCE CHARACTERISTICS: LENGTH: 343 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
WO 97/37044 WO 9737044PCTIUS97/05223 628 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .343 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:717: Arg Arg Ala Trp Ile Leu Leu Gly Leu Lys Asn 1 Val1 Val1 Leu Giu Lys Al a Gin Lys Arg 145 Ile Leu Giu Phe Val 225 Giu Giu Ala Ile His 305 Arg Giu Leu Lys Tyr Ile Arg Val Thr Lys Ile Ala Met Ser Ile Lys 115 Tyr Cys 130 Phe Lys Tyr Asp Leu Leu Ile Phe 195 Ile Giu 210 Asp Leu Ala Ile Asp Ala Asp Val 275 Asp Phe 290 Lys Ala Leu Asp Asn Glu Gly Ser Lys Val1 Arg Lys 100 Gin Val1 Ile Phe Giu 180 Leu Ser Phe Giy Tyr 260 Leu Val1 Leu Phe Lys 340 5 Leu Phe Asn Giu Thr Ala Giu Tyr Gin Val 165 Arg Ala Lys Leu Leu 245 Ser Gin Giu Met Ser 325 Hi q Lys Asn His Thr 70 Tyr Lys Asn Val1 Asp IS0 Arg Ser Lys Giu Ser 230 Asp Leu Giu Lys His 310 Leu Ala Lys Vali Arg 55 Lys Ser Giu Gin Giu 135 Val Asp Arg Ser Giu 215 Giu Ser Ile Giy Vali 295 Leu Val1 Phe Ala Asp 40 Ile Asn Gin Vai Asp 120 Ser Asp Asn Phe Val1 200 Asp Ile Ser Giu Leu 280 Val Gin Giu Phe 25 His Lys Gly Lys Leu 105 Tyr Lys Phe Leu Tyr 185 Phe Ser Gin Lys Gly 265 Arg Val Giu Arg 10 Lys Asn Giu Giu Tyr 90 Cys Gin Asp Leu Glu i7 0 Phe Leu Met Giu Asp 250 Met Gly Leu Thr Leu 330 Giu Leu Lys Val1 75 Pro Giu Ala Phe Phe 155 Asn Leu Giu Gly Asp 235 Asn Thr Val1 Asp Leu 315 Asn Leu Arg Leu Phe Pro Tyr Lys Cys Leu 140 Ser Lys Ile Giu Met 220 Ile Ser Asn Tyr Ser 300 Met Val Lys Phe Ser Phe Ile Thr Gin Giu 125 Lys Pro Pro Ala Gin 205 Asp Asp Giu Ile His 285 Cys Ile Leu Gly Cys Ala Lys Gin Tyr Ala 110 Vai Asp Phe Leu Asp 190 Pro Asn Ser Lys Pro 270 Ser Gin Giu Ala Val1 Ser Gin Thr Ala Phe Phe Asn Phe Ser Leu 175 Lys Giu Giu Leu Ile 255 Leu Arg Ile Val Arg 335 Trp Gin Val1 Phe Leu Ser Giu Gin Lys Leu 160 Tyr Lys Giu Ala Glu 240 Thr Ile Giu His Asp 320 Met INFORMATION FOR SEQ ID NO:7i8: SEQUENCE CHARACTERISTICS: WO 97I37044 629 LENGTH: 420 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL 
SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .420 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:718: Met Ile Phe Gly Asp Phe Lys Tyr Gin Lys Ser Val PCT/US97/05223 Lys Lys Leu Thr 1 Ala Asn Leu Phe Giu Tyr Thr Lys Phe 145 Pro Ile Ile Ser Leu 225 Sen Pro Gly Ser Thr Arg Al a Glu His Phe Tyr Pro 130 Lys Glu Thr Asp Glu 210 Ala Leu Leu Ser Leu 290 Asn Gly Phe Gln Ala Lys Gln 115 Lys Ala Leu Lys Glu 195 Asn Leu Pro Lys Val1 275 Glu Leu Lys Leu Phe Phe Gln 100 Val1 Arg Phe Phe Pro 180 Lys Val1 Lys Ser Thr 260 Thr Lys 5 Asn Gly Asp Leu Tyr Phe Asn Vai Ile Phe 165 Met Asn Met As n Val1 245 Sen Gly Arg G1u Tyr Glu Glu 70 Pro Lys Leu Phe Glu 150 Glu Lys Arg Ile Ser 230 Tyr Leu Cys Pro Leu Phe Asn 55 Arg Lys Ala Thr Lys 135 Asn Leu Gly Leu Val 215 Val1 Gin Phe Pro Arg 295 Lys Val 40 Phe Lys Ile Val1 Met 120 Giu Glu Glu Thr Phe 200 Asp Lys Met Glu Lys 280 Gly Asn 25 Gly Gin Lys His Lys 105 Asp Val Phe Phe Ile 185 Leu Leu Vali Ile Ile 265 Ile Val 10 Ala Tyr Ser Tyr Ser 90 Giu Leu Ile Gly Leu 170 Ala Gin Leu Asfl Ser 250 Phe Lys Tyr Leu Leu Gin Leu 75 Ser His Leu His Ser 155 Asp Arg Asn Ang Gln 235 Giu Lys Thr Cys Asp Leu Thr Val Leu Leu Leu Asn 140 Val Thr Ser Asp Asn.
220 Leu Ilie Ala Met Gly 300 Phe Tyr Pro Glu Asp Lys Asn 125 Gln Leu Ala Asn Asp 205 Asp Phe Glu Leu Gin 285 Ala Ile Glu Phe His Gin Asn 110 Thr Asn Ser Ile Asn 190 Lys Leu Glu Aia Phe 270 Ile Ile Ser Ala Leu Leu Lys Gly Lys Thr Phe Lys 175 Pro Asn Ser Ile Gln 255 Pro Ile Gly Gin Arg Tyr Lys Thr Asp Ala Pro Ser 160 Ile Leu Arg Arg Ile 240 Leu Gys Glu Met Val Gly Gly Lys Lys Ala Leu Phe Ser Vai Pro Ile Arg Thr Leu Glu 310 315 320 WO 97/37044 PCTIUS97/05223 630 Lys Arg Ala His Glu Asp Phe Leu His Leu Gly Val Gly Ser Gly Val 325 330 335 Thr Tyr Lys Ser Lys Ala Ser Lys Glu Tyr Glu Glu Ser Phe Leu Lys 340 345 350 Ser Phe Phe Val Met Pro Lys Ile Glu Phe Glu Ile Val Glu Thr Met 355 360 365 Arg Val Ile Lys Arg Asp Gin Lys Leu Glu Ile Asn Asn Lys Asn Ala 370 375 380 His Lys Glu Arg Leu Met His Ser Ala Gin Tyr Phe Asn Phe Asn Thr 385 390 395 400 Asp Glu Asn Leu Leu Asp Phe Glu Leu Glu Gly Glu Gly Val Leu Arg 405 410 415 Val Leu Ile Gin 420 INFORMATION FOR SEQ ID NO:719: SEQUENCE CHARACTERISTICS: LENGTH: 285 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...285 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:719: Lys Gin Ile Arg Lys Glu Arg Leu Asn Lys Leu Leu Lys Lys Gly Phe 1 5 10 Leu Ala Phe Phe Leu Ser Val Tyr Leu Arg Ala Asp Asp Leu Val Thr 25 Tyr Thr Ile Ala Lys Glu Glu Asp Leu Gly Tyr Gin Arg Phe Leu Ala 40 Lys Lys Cys Leu Arg Gly Lys Thr His Pro Pro Cys Phe Thr Ala Ser 55 Lys Lys Pro Lys Arg Lys Pro Phe Asn Ile Asp Lys Ser Ser His Tyr 70 75 Tyr Gly Thr Ser Val Val Gin Met Ser Trp Leu Gin Ser Arg Glu Lys 90 Phe Glu Asn His Ser Lys Tyr Arg Asn Ile Pro Phe Ala Glu Val Ser 100 105 110 Leu Ile Tyr Gly Tyr Lys Gin Phe Phe Pro Lys Lys Glu His Tyr Gly 115 120 125 Phe Arg Phe Tyr Val Ser Leu Asp Tyr Ala Tyr Gly Phe Phe Leu Lys 130 135 140 Asn Lys Gly Ala Leu Gly Asp Ser Leu Arg Ala Ser Ser Gin Ile Pro 145 150 155 160 Lys Ser Tyr Arg Glu Lys 
Leu Gin Arg Lys Glu Thr Phe Ile Asn Ala 165 170 175 WO 97/37044 PCT/US97/05223 631 Ile Phe Tyr Gly Val Gly Ala Asp Phe Leu Tyr Lys Arg Ala Phe Gly 180 185 190 Thr Leu Ile Leu Gly Val Asn Phe Val Gly Glu Thr Trp Phe Tyr Glu 195 200 205 Thr Lys Ile Phe Lys Gin Trp Ala Lys Asp Ser Leu Asn Thr Tyr Arg 210 215 220 Pro Asn Met Phe Gin Val Met Leu Asn Val Gly Tyr Arg Tyr Arg Phe 225 230 235 240 Ser Arg Tyr Lys Asn Trp Ala Ile Glu Phe Gly Ala Arg Ile Pro Phe 245 250 255 Leu Ile Asn Asp Tyr Phe Lys Thr Pro Leu Tyr Thr Leu His Phe Lys 260 265 270 Arg Asn Ile Ser Val Tyr Leu Thr Ser Thr Tyr Asp Phe 275 280 285 INFORMATION FOR SEQ ID NO:720: SEQUENCE CHARACTERISTICS: LENGTH: 231 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:720: Asn Thr Leu Phe Tyr Phe Phe Leu Leu Gin Ala Val Ile Val Leu Asp 1 5 10 Ser Phe Glu Ile Leu Lys Ala Leu Lys Ser Leu Asp Leu Leu Lys Asn 25 Ala Pro Ala Trp Trp Trp Pro Asn Ala Leu Lys Phe Glu Ala Leu Leu 40 Gly Ala Val Leu Thr Gin Asn Thr Lys Phe Glu Ala Val Leu Lys Ser 55 Leu Glu Asn Leu Lys Asn Ala Phe Ile Leu Glu Asn Asp Asp Glu Ile 70 75 Asn Leu Lys Lys Ile Ala Tyr Ile Glu Phe Ser Lys Leu Ala Glu Cys 90 Val Arg Pro Ser Gly Phe Tyr Asn Gin Lys Ala Lys Arg Leu Ile Asp 100 105 110 Leu Ser Lys Asn Ile Leu Lys Asp Phe Gin Ser Phe Glu Asn Phe Lys 115 120 125 Gin Glu Val Thr Lys Glu Trp Leu Leu Asp Gin Lys Gly Ile Gly Lys 130 135 140 Glu Ser Ala Asp Ala Ile Leu Cys Tyr Val Cys Ala Lys Glu Val Met 145 150 155 160 Val Val Asp Lys Tyr Ser Tyr Leu Phe Leu Lys Lys Leu Gly Ile Glu 165 170 175 WO 97137044 WO 9737O~PCT1US97/05223 632 Ile Giu Ala Gin 225 (2) Glu Asp Tyr Asp Giu Leu Gin His Phe Phe G 180 185 Asn Leu Asn Ser Ala Leu Ala Leu Tyr Glu A 195 200 C-in Leu Tyr Ala Arg Phe His Gly Lys Ile V 210 215 2 Lys Leu Giu Leu Lys Leu 230 INFORMATION FOR SEQ ID NO:721: SEQUENCE 
CHARACTERISTICS: LENGTH: 231 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:721: iu Lys Gly Val Gin 190 sn Thr Ile Ser Leu 205 al Giu Phe Ser Lys Ile Ser Phe Asn Ala Asp Tyr Phe Arg Ile Trp Ala 1 Thr Arg Arg Thr Leu Gin Ile Ala Tyr 145 Thr Val1 Leu Gly Giy Pro Pro Arg Lys Phe Ser Ala 130 Giu Gin Ser Gin Ser 210 Gin Ile Ile Val1 Gly Ile Ser 115 Giy Ser His Gin Ile 195 Ser Tyr Asn Arg Thr Thr Phe 100 Tyr Giy Val Giu Ile 180 Asn Pro 5 Ser Gly Giy Ser Ser Asp Phe Tyr Leu Gly 165 Phe Asn Ala Val1 Tyr Leu His 70 Tyr Ala Tyr Tyr Asn 150 Leu Trp Ile Gly Tyr Ser Gin 55 Gly Asn Arg Sen Giy 135 Ser Leu Giu Phe Leu 215 Thr Gin 40 Phe Pro Lys Tyr Arg 120 Met Gly Pro Asn Asn 200 Gin Ser 25 Gly His Leu His Asn 105 Ala Gin Tyr Trp Gly 185 Met Pro 10 Gly Val Ala Thr Phe 90 Trp Tyr Tyr Gin Tyr 170 Arg Lys Ala Pro Glu Ala Asp Pro Arg Ser Tyr Cys 155 Trp His Tyr Pro Met Leu Phe Leu Phe Lys Gly Ser 140 Giu Val1 Arg Tyr Gly 220 Arg Lys Glu Asn Asn Val1 Thr Ile 125 Gly Ala Trp Val1 Phe 205 Arg Asp Giy Leu Tyr Giy Sen Thr 110 S er Gly Trp Asn Thr 190 Thr Ser Phe Asn Tyr Ile Asp Pro Ile Asn Asn Cys Ile 175 Giy Gly Val1 WO 97/37044 PCT/US97/05223 633 Ala Tyr Leu Asn Tyr Thr Phe 225 230 INFORMATION FOR SEQ ID NO:722: SEQUENCE CHARACTERISTICS: LENGTH: 268 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...268 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:722: Ser Gly Val Val Lys Met Lys Asp Lys Lys Ile Asp Glu Glu Thr Asn 1 Ala Gly Val Lys Met Val Ile Asn Thr 145 Val Met Ala Leu Leu 225 Leu Val Cys Arg Glu Gly Glu Ser Leu 130 Arg Leu Phe Lys Ile 210 Glu Cys Leu Glu Gin Val Asp Leu Ala Asn Met Gly Ile Phe 100 Thr Ser 115 Gly Val Ser Ser Asn Glu Glu Tyr 180 Lys Leu 195 
Leu Phe Ser Ala Thr Ile 5 Gly Gly Lys Asn Glu Asn Gly Gin Leu Val 165 Leu Leu Asn Arg Arg 245 Glu Cys Ala Leu 70 Pro Thr Val Leu Met 150 Arg Leu Lys Pro Met 230 Glu Lys Ser Ser 55 Pro Leu Gly Ala Ala 135 Pro Lys Ile Leu His 215 Phe Ser Tyr Phe 40 Glu Ile Asn Met Asp 120 Ile Leu Trp Lys Leu 200 Glu Ala Lys Thr 25 Cys Ile Glu Asn Gin 105 Lys Ser Asn Pro Asp 185 Asn Gly Asn Ala 10 Val Phe Ile Lys Leu 90 Ile Ile Leu Lys Leu 170 Leu Gly Ser Phe Leu 250 Cys Thr Gin Ala 75 Asp Ser Pro His Lys 155 Glu Asn Ile Lys Leu 235 Asp Val Gin Gin Leu Glu Pro Ile Ala 140 Tyr Gin Asp Lys Phe 220 Asn Ile Ser Lys Ala Asn Val Lys Leu 125 Val Asn Arg Ser Ser 205 Glu Ala Glu Cys Gly Leu Ile Cys Arg 110 Ala Asp Ile Lys Leu 190 Lys Arg Lys Ala Gin Ile Gly Phe Leu Ile Val Phe Lys Ala Val Thr Gly Lys Asp Lys Glu Cys 160 Arg Val 175 Asp Cys Val Asn Pro Ser Gly Leu 240 Ala Cys 255 Gly Gin Leu Arg Glu Lys Lys Leu Gin Gin Lys Ile 260 265 WO 97/37044 WO 9737044PCT/US97/05223 634 INFORMATION FOR SEQ ID NO:723: SEQUENCE CHARACTERISTICS: LENGTH: 236 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .236 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:723: Arg Leu Ser Giu Pro Ile Asp Arg Phe Thr Arg Ile 1 Lys Vai Ile Gin Val Asp Leu Cys Lys 145 Tyr Arg Cys Gin Trp 225 Asn Gly Giy Asn Leu Giu Asp Gin 130 Arg Gily Phe Ile Ile 210 Asn Asp Gly Gin Arg Gin Aia Cys 115 Asn Leu Asp Lys Giu Aia Giu Phe Val Ile Gin Asp Phe 100 Met Phe Asn Lys Giy 180 Leu Ser Arg 5 Giu Gly Thr Ile Leu Leu Asp Ala Pro Phe 165 Asp Giy Giu Leu Lys Gly Ile Giy 70 Tyr Asn Asp Tyr Lys 150 Giy Phe Ser Val1 Arg 230 Ile Phe Ile 55 Ser Lys Ser Leu Giy 135 His Arg Lys Phe Vai 215 Arg Arg Ala 40 Asp Giu Gly Phe Pro 120 Lys Ile Lys Val1 Asn 200 Gin Arg Gin 25 Leu Lys Arg Ile Asn 105 Ile Phe Gin Phe Val1 185 Ala Asp Ile 10 Gin Asp Asp Ile Gin 90 Phe Lys Ile Val Arg 170 Phe Val Ile Giy Arg Ser Met Gly 75 Aia 
Ang Thr Ser Gly i55 Asp Ser Thr Ile Arg 235 Val Leu Phe Giu Leu Asp Ser Ser 140 Ser Phe Pro Ala Lys 220 Phe Arg Leu Tyr Asp Sen Asn Tyr Leu 125 Met Val1 Leu Giu Ser 205 Asn Trp Ile Arg Val1 Lys Cys Asp 110 Ala Gly Trp Lys Val 190 Phe Glu Leu Cys Val Thr Val Arg Tyr Ile Ser Giu Lys 175 Pro Gly Arg Phe Gly Gly Asn Leu Ile Ile Lys Ala Ser 160 Ang His Leu Gin INFORMATION FOR SEQ ID NO:724: SEQUENCE CHARACTERISTICS: LENGTH: 333 amino acids TYPE: amino acid WO 97137044 PTU9/52 PCTIUS97/05223 635 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Heiicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .333 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:724: Ala Lys Ala Ang Val Gin Phe Asn Arg Ala Asn Glu 1 Ala Asp Val Ala Phe Giu Phe Asp Phe 145 Glu Lys Ser Sen Gly 225 Phe Met Ser Giu Thr 305 Lys Ser Ser Tyr Tyr Ala Phe Gly 130 Met Lys Val1 Giu Ai'a 210 Phe Ala Arg Ile Sen 290 Ala Giu Ser Glu Gin Ala Tyr Lys 115 Lys Ile Gly Thr Phe 195 Thn Val Asp Ile Asp 275 Pro Pro 5 Phe Tyr Lys Lys Ile Leu Gly Leu Leu Asn Gly Ile 100 Asn Pro Giu Pro Asp Ser Giu Ala 165 Pro Phe 180 Arg Lys Phe Lys Gly Giy Thr Ang 245 Lys Asn 260 Ala Phe Lys Ile Gin Val Tyr His Asn Leu 70 Gly Ser Asn Gin Val1 150 Phe Asp Gin Val1 Leu 230 As n Lys Sen Ile Phe 310 Giu As n Asn 55 Asn Asp Gin Arg Lys 135 Tyr Tyr Thr Leu Gly 215 Val1 Thr Ala Phe Asp 295 Cys Giu Ile 40 Thr Pro Lys Sen Ser 120 Sen Asn Gin Leu Pro 200 Met Thr Sen Phe Ser 280 Ala Ser Val1 25 Asn Tyr Ala Asn Pro 105 His Giu Val Asn Al a 185 Tyr Gin Ser Ile Giu 265 Leu Arg Asn Gin Met Ser Val1 Lys 90 Phe Phe Phe Sen Phe 170 Ser Ile Ala Leu Phe 250 Ser Lys Giu Arg Lys Giu Asn Sen 75 Gly Val1 Thr Lys Val 155 Thr Ile Ile Val Tyr 235 Ala Tyr Pro Leu His 315 Ala Arg Leu Tyr Leu Ala Trp Ile 140 Ala Leu Asp Thr Al a 220 Ser His Giu Cys Leu 300 Asn Ang Ile Ser Asp Leu Gly Gin Ile 125 Asp Leu Lys Ala Arg 205 Asn Gly Lys Val1 Lys 285 Ser Ile Gin Lys Arg Lys Ser Tyr Asp 110 Ile Val Pro Asp Val1 190 Ala Tyr 
Val1 Ile Arg 270 Arg Gly Leu Arg Glu Ala Tyr Gly Leu Leu Ile Pro Lys Gly 175 Val Ile Tyr S er Tyr 255 Aia Sen Phe Tyr Arg Ile Giu Glu Leu Asn Val1 Giu Ile Leu 160 Giu Ala Leu Leu Thr 240 Leu Asp Leu Val Val 320 Arg Ser Phe Lys Asn Gly Phe Val Leu Ser His Leu Lys WO 97/37044 PCT/US97/05223 INFORMATION FOR SEQ ID NO:725: SEQUENCE CHARACTERISTICS: LENGTH: 91 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:725: Arg Asn Lys Asp Lys Arg Glu Tyr Phe Glu Asn Ala Thr Ala Gln Ile 1 5 10 Gly Ala Lys Leu Leu Val Cys Asp Ile Arg Glu Gin Phe Phe Asn Asp 25 Val Leu Phe Lys Pro Lys Tyr Gly Tyr Gly Lys Tyr Phe Asn Pro Cys 40 Ile Asp Cys His Ala Asn Met Phe Arg Asn Ala Phe Tyr Lys Met Leu 55 Glu Leu Asp Ala Asp Phe Val Leu Ser Gly Glu Val Leu Gly Asn Ala 70 75 Leu Asn Pro Lys Gly Lys Lys Arg Ser Ile Arg INFORMATION FOR SEQ ID NO:726: SEQUENCE CHARACTERISTICS: LENGTH: 302 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...302 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:726: Gly Glu Val Met Ala Asp Ser Leu Ala Gly Ile Asp Gln Val Thr Ser 1 5 10 Leu His Lys Asn Asn Glu Leu Gin Leu Leu Cys Phe Arg Leu Gly Lys WO 97/37044 PCT/US97/05223 637 Asn Lys Asp Leu Tyr Ala Val Asn Val Lys Val Met Arg Ile Asp Leu 145 Phe Asp Leu Pro Lys 225 Asn Glu Thr Gly Tyr Glu Lys Pro Cys Arg 130 Arg Asp Val Ser Ser 210 His Pro Met Pro Ser 290 His Gly Lys Tyr Glu 115 Ile Gly Gly Phe Lys 195 Val Ile Thr Pro Leu 275 Ser Gly Leu Trp Arg 100 Phe Ser Ser Arg Pro 180 lie Leu Asp Thr Glu 260 Thr Tyr Asn Ile Phe Ile Ser Ser Ala Leu 165 Trp His Lys Phe Asp 245 Ala Ser Glu Leu lle 70 Tyr Gin Arg Lys Gly 150 Val Ile Ser Thr Ile 230 Val Ser Thr Lys Thr 55 Ile Tyr Lys Trp Lys 135 Asn Gin Glu Asn 
Met 215 Asn Ser Gly Ile Asn 295 40 Ile Arg Asp Glu Thr 120 Trp Asn Val Asp Gin 200 Gin Gly Asn Phe Pro 280 Gly Ile Glu Ser Lys 105 Ile Thr Lys Val Glu 185 Cys Met Lys Ile Glu 265 Ile Lys Phe Ser Leu Gin Ser Gly Glu Leu Asp 170 Lys Val Ile Thr Asp 250 Val Val Asn Lys His Thr 75 Asn Glu Val Met Val 155 Ile His Leu Leu Leu 235 Leu Ile Val Leu Ile Glu Ile Lys Asp Arg Glu 140 Ser Glu Asn Leu Asp 220 Leu Ile Asn Asn Asn 300 Arg Asn Pro Ser Asp Ile 125 Gin Arg Lys Asp Ala 205 Lys Glu Ile Gin Ser 285 Gly Glu Asn Leu Lys Ile 110 Tyr Ser Thr Met Val 190 Asp Leu His Thr Val 270 Ser Gin Val Leu Asp Leu Met Ala Gly Tyr 160 Ile Thr Ser Val Phe 240 Leu Asn Ser INFORMATION FOR SEQ ID NO:727: SEQUENCE CHARACTERISTICS: LENGTH: 126 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...126 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:727: Ala Val Ala Phe Cys Leu Pro Tyr Ser Ser Pro Lys Thr Ser Leu Ala WO 97/37044 PCT/US97/05223 638 1 5 10 Thr Glu Leu His Tyr Ser Ala Asn Glu Arg Ile Glu Ala Phe Ser Ser 25 Asn Asp Glu Lys Thr Glu Ser Phe Glu Leu Asn Glu Gin Ser Phe Glu 40 Ala Ile Lys Glu Asn Ala Ala Lys Tyr Ser Tyr Leu Lys Val Tyr Leu 55 Lys Asn Val Ala Leu Lys Asp Ser Ala Pro Leu Val Phe Val Asp Met 70 75 Pro Gly Phe Asp Ser Pro Ile Ser Ser His Thr His Ala Ile Leu Glu 90 Tyr Leu Glu Arg Gly Val His Phe Val Ile Leu Thr Ser Val Glu Glu 100 105 110 Gly Asn Leu Thr Lys Arg Met Val Arg Glu Leu Lys Thr Phe 115 120 125 INFORMATION FOR SEQ ID NO:728: SEQUENCE CHARACTERISTICS: LENGTH: 325 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...325 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:728: Arg His Pro Ile Glu Asn Ser Ile Cys Asp Leu Ser Glu Leu Asn Asn 1 5 10 Leu Lys Pro Leu Gly Ala Glu Met Phe Val Gly Leu Asp Gly Ile 
Asp 25 Ala Met Ile Glu Glu Cys Val Ser Asn Leu Val Ile Asn Ala Ile Val 40 Gly Val Pro Gly Leu Lys Ala Ser Phe Lys Ser Leu Gin Arg Asn Lys 55 Lys Leu Ala Leu Ala Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu 70 75 Leu Asp Ile Ser Gin Ile Thr Pro Val Asp Ser Glu His Phe Gly Leu 90 Trp Ala Leu Leu Gin Asn Lys Thr Leu Lys Pro Lys Ser Leu Ile Ile 100 105 110 Ser Ala Ser Gly Gly Ala Phe Arg Asp Thr Pro Leu Asp Leu Ile Ala 115 120 125 Ile Gin Asn Ala Gin Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly 130 135 140 Asp Lys Ile Thr Ile Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu 145 150 155 160 Ile Leu Glu Thr Tyr Trp Leu Phe Gly Ala Ser Leu Lys Ile Asp Ala WO 97/37044 PCT/US97/05223 639 165 170 175 Leu Ile Glu Arg Asn Ser Ile Val His Ala Leu Val Glu Phe Glu Asp 180 185 190 Asn Ser Val Ile Ala His Leu Ala Ser Ala Asp Met Gin Leu Pro Ile 195 200 205 Ser Tyr Ala Ile Asn Pro Lys Leu Ala Ser Leu Ser Ala Ser Ile Lys 210 215 220 Pro Leu Asp Leu Tyr Ala Leu Ser Ala Ile Lys Phe Glu Pro Ile Ser 225 230 235 240 Val Glu Arg Tyr Thr Leu Trp Arg Tyr Lys Asp Leu Leu Leu Glu Asn 245 250 255 Pro Lys Leu Gly Val Val Leu Asn Ala Ser Asn Glu Val Ala Met Lys 260 265 270 Lys Phe Leu Asn Gin Glu Ile Ala Phe Gly Gly Phe Ile Gin Ile Ile 275 280 285 Ser Gin Ala Leu Glu Leu Tyr Ala Lys Lys Ser Phe Lys Leu Ser Ser 290 295 300 Leu Asp Glu Val Leu Ala Leu Asp Lys Glu Val Arg Glu Arg Phe Gly 305 310 315 320 Ser Val Ala Arg Val 325 INFORMATION FOR SEQ ID NO:729: SEQUENCE
CHARACTERISTICS:
LENGTH: 132 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...132
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:729:
Glu Thr Leu Ala Arg Val Leu Lys Val Ser Cys Leu Ala Met Gly Leu 1 5 10
Ala Gln Met Cys Cys Leu Thr Gln Leu Phe Ser Ile Val Arg Asn Leu 25
His Leu Gly Phe Phe Leu Gly Val Ala Ile Gly Gly Thr Ser Trp Gly 40
Pro Thr Asn Tyr Tyr Phe Lys Asp Leu Ala Glu Glu Tyr Arg Gly Ser 55
Phe His Pro Ser Asn Phe Gln Val Leu Val Asn Gly Gly Ile Arg Leu 70 75
Gly Thr Lys His Gln Gly Phe Glu Ile Gly Leu Lys Ile Gln Thr Ile 90
Arg Asn Asn Tyr Tyr Thr Ala Ser Ala Asp Asn Val Pro Glu Gly Thr 100 105 110
Thr Tyr Arg Phe Thr Phe His Arg Pro Tyr Ala Phe Tyr Trp Arg Tyr 115 120 125
Ile Val Ser Phe 130

INFORMATION FOR SEQ ID NO:730:
SEQUENCE CHARACTERISTICS:
LENGTH: 60 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...60
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:730:
Val Leu Lys Asn Glu Ile Ser Tyr Asn Lys Pro Ile Leu Asn Thr Asn 1 5 10
Asp Tyr Leu Asn Lys Asp Asn Thr Met Lys Asp Ser Phe Leu Phe Thr 25
Ser Glu Ser Val Thr Glu Gly His Pro Asp Lys Met Ala Asp Gln Ile 40
Ser Asp Ala Val Leu Asp Tyr Met Ile Glu Arg Asp 55

INFORMATION FOR SEQ ID NO:731:
SEQUENCE CHARACTERISTICS:
LENGTH: 111 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...111
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:731:
Lys Thr Lys Val Ala Cys Lys Thr Leu Val Ser Asn Gly Leu Cys Met 1 5 10
Ile Thr Gly Glu Leu Glu Thr Ser Val Tyr Ala Pro Met Leu Glu Ile 25
Ala Arg His Met Val Arg Lys Ile Gly Tyr Thr Asn Ala Leu Tyr 40
Phe Asp Tyr Arg Ser Ala Ala Val Leu Asn Gly Ile Gly Glu Gln 55
Pro Asp Ile Asn Gln Gly Val Asp Arg Glu Asp Gly Glu Ile Gly
70 Gly Asp Gin Gly Leu Met Phe Gly Tyr Ala Cys Lys Glu Thr Glu 90 Leu Met Pro Leu Pro Ile His Leu Ala His Gin Leu Ala Phe Ala 100 105 110 INFORMATION FOR SEQ ID NO:732: SEQUENCE CHARACTERISTICS: LENGTH: 169 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...169 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:732: Ser Leu Ser Thr Leu Ala Val Leu Leu Lys Ala Leu Asn Lys Arg Val 1 Phe Leu Glu Lys Val Val Phe Asp Asp 145 Ala Gin Ile Leu Ile Phe His Cys Ser 130 Met Lys Thr Ala Ile Asp Leu Ala Ala 115 Ile Thr Ser Ile Ser Pro Ile Ser Tyr 100 Ile Leu Phe Arg 5 Leu Gly Glu Leu Glu Val Pro Lys Ile Leu 165 Asn Cys Val Ile 70 His Lys Ser Glu Ala 150 Asp Ala Leu Asp 55 Ala Tyr Ile Phe Val 135 Gin Pro Ala Ser 40 Ile Lys Asn Ser Lys 120 Glu Asp Ala Lys 25 Glu Phe Lys Ala Glu 105 Gly Asp Ser His 10 Asp Arg Thr Gin Arg 90 Gly Lys Leu Ser Lys Tyr Gly Asn 75 Ile Cys Leu Ala Ser 155 Lys Lys Val Gin Ile Asn Gin Leu 140 Phe Glu Asp Gly Phe Thr Gin Ser 125 Lys Leu Gly Glu Asp Ser Gly Lys 110 Arg Gly Ser Ala Ile Tyr Glu Ser Cys Glu Tyr Asp Ile Lys Asp Gin Ser Ser Leu Lys Arg 160 INFORMATION FOR SEQ ID NO:733: SEQUENCE CHARACTERISTICS: WO 97/37044 PCT/US97/05223 642 LENGTH: 158 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...158 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:733: Asn Leu Arg Thr Ile Met His Tyr Ser Tyr Glu Ala Phe Leu Lys Asp 1 5 10 Ser Leu Glu Leu Ala Lys Gin Val Glu Arg Leu Cys Gly Ile Pro Glu 25 Ala Leu Val Cys Val Met Arg Gly Gly Met Thr Leu Val His Phe Leu 40 Ser Leu His Trp Asn Leu Arg Glu Val Tyr Gly Ile Asn Ala Ile Ser 55 Tyr Asp Thr Thr Lys Arg Gin Asn Ala Leu Lys Ile Glu Asn Thr Pro 70 75 Thr Ile Lys Asp His Leu Lys Thr Ile Leu Val Val Asp Glu Ile Val 90 Asp Ser Gly Asn Ser Leu Glu Ala Val 
Leu Lys Val Leu Gin Asp Lys 100 105 110 His Pro Asp Lys Lys Phe Tyr Ser Ala Ser Leu Phe Gin Lys Thr Ser 115 120 125 Ala Lys Tyr Lys Ala Asp Ala Phe Leu Lys Asp Ala Pro Glu Trp Ile 130 135 140 Asp Phe Phe Trp Glu Val Asp Leu Lys Asn Leu Lys Ser His 145 150 155 INFORMATION FOR SEQ ID NO:734: SEQUENCE CHARACTERISTICS: LENGTH: 227 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...227 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:734: WO 97137044 WO 9737044PCTIUJS97/05223 Ser Leu Glu Ile His Ile Phe Asn Glu Lys 145 Lys Lys Ie Ile Ser Ser Tyr Asp Ser His Arg Giu Ile Lys 130 Ile Pro Ala Asn Val 210 Pro Ser Leu Ser Asp Leu Ser Giu Phe 115 Val1 Ala Ile Arg Asp 195 Pro Phe Leu Tyr Pro Ser Lys Thr Leu 100 Ala Pro Leu.
Lys Asp 180 Ser Ser Giu Pro Ile Met Leu Ile Ser Phe Lys Lys Ala 165 Leu Glu Ala Arg Ser Phe Leu Leu 70 Ile Ala Ser Lys His 150 Leu Arg Leu Phe Leu Ser Gin Lys 55 Asn Val1 Phe Ala Ile 135 Gin Val1 Trp Thr Lys 215 Pro Thr Asn 40 Lys Ala Gly Leu Giu 120 Ile Asn Glu Ala Thr 200 Asp Ala Thr 25 Tyr Met Met His Asp 105 Glu Asn His His Pro 185 Pro Asn Ser 10 Leu Phe Arg Lys Pro 90 Glu Asn Ala Ser Lys 170 Giu Leu Ile Leu Glu Tyr Arg Gin 75 Giu Phe Thr Arg Phe 155 Giu Val1 Lys Leu Lys Leu Met Asn Val1 Glu Arg His Ile 140 Lys Gly Asp Pro Leu 220 Asp Ile Pro Ser Lys Asn Phe Ala 125 Lys Ala Glu Gly Gly 205 Ala Ala Gly Ile Ser Giu Giu Asp 110 Tyr Ala Leu Tyr Giu 190 His Lys Arg Ala Gln Gin Ser Ser Arg Ser Leu Leu Phe 175 Ile Tyr Val1 Ile Ile His Ala Phe Glu Leu Leu Asn Asn 160 Tyr Leu Thr Leu INFORMATION FOR SEQ ID NO:735: SEQUENCE CHARACTERISTICS: LENGTH: 108 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .108 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:735: Ile Thr Asn Thr Thr Phe Gin Leu Gin Gly Val Leu Leu Glu Tyr Asp 10 Phe Giu Arg Pro Ala Ile Ser Tyr Thr Pro Lys Lys Ser Val Phe Asn 25 Glu Arg Leu Lys Asp Leu Arg Giu Asn Phe Sen Ala Sen Leu Tyr Ala WO 97/37044 PCT/US97/05223 644 40 Asp Leu Lys Asp Lys Ile His His Asn Ala Leu Ser Asn Asp Asp Leu 55 Glu Arg Met Ile Ala Phe Arg Glu Gin Glu Phe Glu Lys Ser Leu Glu 70 75 Asp Trp Met Gly Ala Tyr Ser Tyr Asp Ala Thr Leu Arg Cys Phe Ser 90 Ser Glu Arg Ile Tyr Leu Asp Arg Leu Gly Trp Glu 100 105 INFORMATION FOR SEQ ID NO:736: SEQUENCE CHARACTERISTICS: LENGTH: 364 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...364 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:736: Lys Lys Val Val Met Asn Gly Phe Cys Ala Arg Leu Arg Ala Ile Thr 1 5 10 Phe Asn Glu Arg Leu 
Lys Met Lys Ile Ala Val Leu Leu Ser Gly Gly 25 Val Asp Ser Ser Tyr Ser Ala Tyr Ser Leu Lys Glu Gin Gly His Glu 40 Leu Val Gly Ile Tyr Leu Lys Leu His Ala Ser Glu Lys Lys His Asp 55 Leu Tyr Ile Lys Asn Ala Gin Lys Ala Cys Glu Phe Leu Gly Ile Pro 70 75 Leu Glu Val Leu Asp Phe Gin Lys Asp Phe Lys Ser Ala Val Tyr Asp 90 Glu Phe Ile Asn Ala Tyr Glu Glu Gly Gin Thr Pro Asn Pro Cys Ala 100 105 110 Leu Cys Asn Pro Leu Met Lys Phe Gly Leu Ala Leu Asp His Ala Leu 115 120 125 Lys Leu Gly Cys Glu Lys Ile Ala Thr Gly His Tyr Ala Arg Val Lys 130 135 140 Glu Ile Asp Lys Val Ser Tyr Ile Gin Glu Ala Leu Asp Lys Thr Lys 145 150 155 160 Asp Gin Ser Tyr Phe Leu Tyr Ala Leu Glu His Glu Val Ile Ala Lys 165 170 175 Leu Val Phe Pro Leu Gly Asp Leu Leu Lys Lys Asp Ile Lys Pro Leu 180 185 190 Ala Leu Asn Ala Met Pro Phe Leu Gly Thr Leu Glu Thr Tyr Lys Glu 195 200 205 Ser Gin Glu Ile Cys Phe Val Glu Lys Ser Tyr Ile Asp Thr Leu Lys WO 97/37044 PCT/US97/05223 645 210 215 220 Lys His Val Glu Val Glu Lys Glu Gly Val Val Lys Asn Leu Gin Gly 225 230 235 240 Glu Val Ile Gly Thr His Lys Gly Tyr Met Gin Tyr Thr Ile Gly Lys 245 250 255 Arg Lys Gly Phe Ser Val Lys Gly Ala Leu Glu Pro His Phe Val Val 260 265 270 Gly Ile Asp Ala Lys Lys Asn Glu Leu Ile Val Gly Lys Lys Glu Asp 275 280 285 Leu Ala Thr His Ser Leu Lys Ala Lys Asn Lys Ser Leu Thr Lys Asp 290 295 300 Phe Lys Asp Gly Glu Tyr Phe Ile Lys Ala Arg Tyr Arg Ser Val Pro 305 310 315 320 Thr Lys Ala Phe Val Ser Leu Lys Asp Gly Met Ile Glu Val Glu Phe 325 330 335 Lys Glu Pro Phe Tyr Gly Val Ala Lys Gly Gin Ala Leu Val Val Tyr 340 345 350 Lys Asp Asp Ile Leu Leu Gly Gly Gly Val Ile Val 355 360 INFORMATION FOR SEQ ID NO:737: SEQUENCE CHARACTERISTICS: LENGTH: 133 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...133 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:737: Leu Phe Lys Thr Asn Gly Ile Ser Val Gly Glu Tyr Thr 
His Phe Ser 1 5 10 Glu Asp Ile Gly Ser Gin Ser Arg Ile Asn Thr Val Arg Leu Glu Thr 25 Gly Thr Arg Ser Ile Phe Ser Gly Gly Val Lys Phe Lys Ser Gly Glu 40 Lys Leu Val Ile Asn Asp Phe Tyr Tyr Ser Pro Trp Asn Tyr Phe Asp 55 Ala Arg Asn Val Lys Asn Val Glu Ile Thr Arg Lys Phe Ala Ser Ser 70 75 Thr Pro Glu Asn Pro Trp Gly Thr Ser Lys Leu Met Phe Asn Asn Leu 90 Thr Leu Gly Gin Asn Ala Val Met Asp Tyr Ser Gin Phe Ser Asn Leu 100 105 110 Thr Ile Gin Gly Ile Leu Ser Thr Ile Lys Arg Tyr His Leu Thr Lys 115 120 125 Gly Phe Phe Thr Pro WO 97/37044 PCT/US97/05223 646 130 INFORMATION FOR SEQ ID NO:738: SEQUENCE CHARACTERISTICS: LENGTH: 82 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...82 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:738: Met Lys Lys Thr Arg Ile Arg Lys Lys Ser Ala Leu Met Asn Gly Cys 1 5 10 Trp Ala Glu Phe Asp Ser Val Phe Ser Ala Ile Val Pro Leu Glu Asp 25 Leu Asn Lys Thr Ala Cys Ala His His Ala Leu Lys Ala Leu Gin Ala 40 Thr Leu Lys Thr Thr Ile Trp Ala Leu Met Arg Gin Ser Trp Asn Arg 55 Ser Gin Lys Asp Ser Ser Leu Gly Gly Ile Cys Gly Ile Leu Thr Arg 70 75 Met Phe INFORMATION FOR SEQ ID NO:739: SEQUENCE CHARACTERISTICS: LENGTH: 105 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...105 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:739: Trp Thr Leu Ala Lys Ile Arg Cys Val Lys Asn Phe Lys Glu Ala Ile 1 5 10 WO 97/37044 WO 9737044PCTIUS97/05223 647 Giu Gly Phe Thr Giu Lys Ile Lys Giu Ser Pro Asn Asp Ser Asn 25 Ile Asn Glu Ala Phe Asp Asn Leu Glu Thr Glu Leu Giu Arg Ala 40 Giu Asn Leu Ser Gin Lys Ile Asp Pro Val Leu Glu Arg Asn Glu 55 Tyr Thr Gin Lys Ala Leu Giu Tyr Arg Glu Phe Leu Giu Ser Arg 70 75 Glu Ser Phe Ile Val Asp Giu Lys Asn Pro Tyr Pro Giu Giu Vai 90 Phe Asn Giu Trp Leu Leu Gly 
Gly Ile 100 105 INFORMATION FOR SEQ ID NO:740: SEQUENCE CHARACTERISTICS: LENGTH: 284 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .284 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:740: Aia Thr Asn Lys Ser Ile Ile Leu Lys Arg Glu Giu Phe Ala Sen 160 Gly Asp Arg Pro Leu Ala Leu Ile Met Trp Tnp Arg Asn Ile 1 Sen Thr Giu Asn Glu Glu Ang Lys Leu 145 Val1 Leu Ile Gin Glu Glu Aia Thr Giu Thr 130 Asp His Glu Ang Trp Leu Ala Giu Leu 100 Phe Pro His Ser Gly 180 5 Val Ile Lys Leu Ile Leu Val1 Thr Asn Lys 165 Phe Val Phe Ile Lys 70 Ser Lys Lys His Thr 150 Gly Phe Asn Ser Gly Lys Val Ser Giu Ser 135 Glu Leu Pro Lys Leu Ala Phe Glu Tyr Leu 120 Leu Asn Giu His Pro 25 Leu Phe Thr Lys Giu 105 Leu Leu Ala Phe Arg 185 10 Pro Asp Lys Ala Phe 90 Lys Ser Asp Gin Lys 170 Gly Arg Giu Asp Met 75 Cys Giu Leu Phe Lys 155 His Phe Asp Leu Gly Leu Gly Ang Asn Lys 125 Asn Ser Phe Gin Asn Gly Leu Asn Arg Phe Tyr 110 Giu Giu Cys Val1 Glu 190 Arg Lys Asn Pro Leu Leu Giu His Ser Met Ile 175 Ser WO 97/37044 PCT/US97/05223 648 Leu Glu Glu Glu Arg Arg Leu Ala Tyr Val Ala I: 195 200 Glu Glu Leu Gin Leu Ser Tyr Val Lys Glu Arg S( 210 215 2: Lys Ile Ser Cys Ser Pro Ser Val Phe Leu Glu G: 225 230 235 Gin Gin Asp Lys Pro Pro Lys Gin Asn His Gin L' 245 250 Lys Val Gly Asp Leu Ile Lys His Lys Ile Phe G: 260 265 Phe Arg Arg Arg Lys Arg Leu Ala Ala Val Cys A.
275 280 INFORMATION FOR SEQ ID NO:741: SEQUENCE CHARACTERISTICS: LENGTH: 242 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...242 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:741: Thr Arg Ala 205 Tyr Phe Gly Ala Gin Leu Asp Thr Pro 255 Thr Gly Glu 270 Lys Arg Leu 240 Ile Gly Asp Gly Ser Lys Lys Met Lys Ser Asn Lys Lys Ser Asn His Leu Arg 1 Ala Val Arg Ile Ser Cys Ala Gly Arg 145 Lys Gin Ile Phe Ala Gly Leu Trp Leu lie 130 Pro Phe Leu Tyr Asn Cys Ser Leu Ile Thr 115 Val Leu Leu Lys Arg Tyr Ser Phe Asp Ala 100 Asp Ser Val Pro Ile 180 5 Ala Phe Cys Asp Ile Lys Thr Leu Ile Phe 165 Gin Leu Asn Phe Thr 70 Ile Lys Gly Leu Phe 150 Lys Pro Val Arg Phe 55 Gly Tyr Glu Met Lys 135 Pro Gin Met Ile Lys 40 Pro Ala Leu Leu Ile 120 Ala Glu Gly Val Ala 25 Asn Pro Lys Glu Gly 105 Leu Cys Gly Ala Leu 185 10 Ile Asn Thr Leu Ala 90 Glu Ile Lys Thr Lys 170 Ile Gly Asn Gly Ile 75 Tyr Ile Asp Glu Arg 155 Ile Asn Leu Ala Val Val His Pro Arg Lys 140 Gly Ile Ser Ala Arg Asn Leu Pro Phe Glu 125 Leu Lys Ala Ile Val Ser Leu Asn Ser Tyr 110 Asp Asp Gly Glu Lys 190 Ile Ser Glu His Asn Gly Lys Gin Gly Lys 175 Ile Ile Arg Lys Gin Ile His Lys Asn Glu 160 Phe Phe WO 97/37044 PCT/US97/05223 Asn Leu Leu 225 Asn Ser Lys Pro Leu 195 Glu Ser Tyr Thr 210 Gin Glu Arg Met Ala Glu Pro Gin 230 649 Ala Tyr Lys Ala Arg Thr Arg Leu Val Met 200 205 Asp Phe Ser Ser Pro Thr Trp Tyr Glu Glu 215 220 Lys Glu Tyr Leu Lys His Tyr His Glu Leu 235 240 INFORMATION FOR SEQ ID NO:742: SEQUENCE CHARACTERISTICS: LENGTH: 277 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...277 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:742: Val Met Lys Thr Ser Asn Thr Lys Thr Pro Lys Pro 1 Gly Lys Ala Pro Phe Val Arg Asn Val 145 Tyr Ser Met Met Pro Leu Asn Gly Gly 
Al-a Gin Ile 130 Leu Glu Phe Arg Pro 210 Cys Gin Phe Leu Tyr Ala Thr 115 Lys Lys Thr Gly Glu 195 Gly Val Pro Asp Glu Lys Lys 100 Asp Lys Ala Ala Tyr 180 Phe Gly 5 Ile Leu Asn Lys Ile Val Leu Gly Leu Leu 165 Gly Ala Ala Glu Ala Ala Gly 70 Leu Ala Ile Gin Lys 150 Lys Asn Pro Asn Ser Asn Asn 55 Leu Thr Asp Val Phe 135 Thr Asn Leu Val Gly 215 Leu Asn 40 Arg Glu Asp Ile Glu 120 Met Arg Gly Val Ile 200 Lys Glu 25 Glu Thr Met Val Leu 105 Val Asn Asp Val Val 185 Phe Ser 10 Asn Arg Ser Leu His 90 Gin Ser Pro Ser Trp 170 Asp Asp Ser Leu Leu Leu Gin 75 Glu Ile Gin Lys Ser 155 Leu Met Ala Gly Arg Asp Glu Thr Ser Pro Thr Asp 140 Ile Cys Arg Thr Asp 220 Val Ser Phe Ser Ile Tyr Ala Asn 125 Met Gln Glu Ser His 205 Ser Leu Ile Tyr Tyr Lys Gin Phe 110 Ala Gin Ser Arg Leu 190 Ser Ser Ile Ala Phe Lys Asp Ala Leu Ile Tyr Pro Gly 175 Lys Val Phe Ala Ile Lys Glu Glu Ser Cys Val Ser Thr 160 Ser Ile Gin Pro WO 97/37044 WO 9737044PCT/US97/05223 650 Pro Ile Leu Pro Arg Ala Ala Ala Ala Val Gly Ile 225 230 235 Ala Glu Thr His Ile Asp Pro Lys Asn Ala Leu Ser 245 250 Melt Leu Lys Pro Asp Glu Levi Glu His Levi Val Thr 260 265 Ile Gln Asn Levi Phe 275 INFORMATION FOR SEQ ID NO:743: SEQUENCE CHARACTERISTICS: LENGTH: 313 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAMdE/KEY: miscfeatvire LOCATION .313 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:743: Gin Arg Asp His Gly Val Levi Tyr Levi Leu Asn Asn Asp Gly Leu Phe 240 Asp Gly Ala Asn 255 Asn Met Leu Lys 270 1 Phe Phe Givi Glu Leu Gly Lys Leu Gin 145 S er Lys Asp Pro Levi Arg Gin Gin Lys His Ala Tyr 130 Tyr Leu Lys Gly Lys 210 Ile Val1 Asp Asp Glu Gly Cys 115 Tyr Tyr Gly Ala Cys 195 Asp Arg Levi Pro Phe Asn Val1 100 Asp Ser Ser Gly Val 180 Thr Leu 5 Arg Cys Lys Thr Ser Glu Levi Gly Lys Ile 165 Giu Ile Lys Tyr Levi Givi Gin 70 Gly Lys Asn Gin Ala 150 Tyr Tyr Levi Lys Ile Gly Leu 55 Ala Cys Asn Tyr Gly 135 Cys His Phe Gly Al a 215 Met Ala 40 
Val Lys Phe Levi Ser 120 Val Asp Asp Thr Ser 200 Levi Leu 25 Leu Asn Lys Asn Lys 105 Asn Ser Levi Gly Lys 185 Leu Ala 10 Givi Cys Levi Tyr Leu 90 Lys Gly Gin Lys Lys 170 Ala Tyr Ser Asn Leu Gly Phe 75 Gly Ala Cys Asn Tyr 155 Val1 Cys Asp Tyr Val Gly Thr Givi Val1 Ala His Thr 140 Ala Phe Asp Ala Asp 220 Thr Lys Gly Lys Lys Leu Ser Leu 125 Asn Glu Thr Leu Gly 205 Lys Ile Lys Levi Ser Ala Tyr Phe 110 Levi Lys Gly Arg Asn 190 Arg Ala Ile Sen Ile Tyr Cys Tyr Tyr Gly Ala Cys Asp 175 Asp Gly Cys Lys Levi Ala Lys Asp Gin Sen Asn Trp Ala 160 Phe Gly Thr Asp WO 97/37044 PCTIUS97/05223 651 Leu Lys Asp Ser Pro Gly Cys Phe Asn Ala Gly Asn Met Tyr His 225 230 235 Gly Glu Gly Ala Ala Lys Asn Phe Lys Glu Ala Leu Ala Arg Tyr 245 250 255 Lys Ala Cys Glu Leu Glu Asn Gly Gly Gly Cys Phe Asn Leu Gly 260 265 270 Met Gln Tyr Asn Gly Glu Gly Ala Thr Arg Asn Glu Lys Gln Ala 275 280 285 Glu Asn Phe Lys Lys Gly Cys Lys Leu Gly Ala Lys Gly Ala Cys 290 295 300 Ile Leu Lys Gin Leu Lys Ile Lys Val 305 310 INFORMATION FOR SEQ ID NO:744: SEQUENCE CHARACTERISTICS: LENGTH: 529 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...529 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:744: His 240 Ser Ala Ile Asp Met Ser Ser Gly Leu Ile Tyr Ile Ser Leu Glu Val 1 Leu Ala Met Lys Gin Glu Lys Arg Lys 145 Ile Ile Arg Glu Ser Leu Ala Glu Glu 130 Ala Lys Thr Gly Phe Gin Gln Gln Lys 115 Asn Leu Ser Ala Gln Gln Glu Thr His 100 Lys Phe Asp Met 5 Leu Ala Ala Cys His Lys Glu Lys Ala Ile 165 Ile Ile Lys Lys 70 Phe Glu Leu Lys Met 150 Leu Met Leu Ser 55 Leu Asp Phe Glu Gin 135 Leu Glu Tyr Lys 40 Phe Gin Lys Val Lys 120 Arg Asn Gin Tyr 25 Gly Val Gin Lys Arg 105 Glu Ala Tyr Leu 10 Val Ala Glu Gin Glu 90 Asp Arg Ile Met Glu 170 Met Ser Ala Tyr 75 Ala Glu Gin Cys Ala 155 Glu Lys Ala Glu Glu His Lys Ile Lys 140 Tyr Glu Leu Lys Lys Glu Asn Leu Arg Leu 125 Glu Thr Leu Val Ile Ala Met Lys 
Lys Tyr 110 Glu Ala Lys Glu Ala Tyr Lys Arg Asn His Leu Gin Gin Asp Ala 175 Cys Tyr Leu Met Leu Leu Glu Glu Ala Glu 160 Gin Lys Ser Ala Leu Ile Arg Arg Tyr Glu Lys Glu Ala Lys Glu Glu Gly 185 190 WO 97/37044 PCT/US97/05223 652 Lys Lys Lys Ser Tyr Ala Ile Leu Ala Glu Ala Thr Ala Arg Phe Ala 195 200 205 Gly Asp Tyr Ala Thr Glu Asn Leu Thr Ser Arg Ile Ala Leu Pro Cys 210 215 220 Ser Asp Tyr Val Gly Arg Val Ile Gly Lys Asp Gly Lys Asn Ile Glu 225 230 235 240 Ala Phe Lys Lys Ile Ser Gly Val Asp Ile Glu Phe Ser Glu Asp Ser 245 250 255 Ser Glu Leu Cys Leu Ser Ser Phe Asn Ile Tyr Arg Arg Glu Val Ala 260 265 270 Ser Glu Thr Ile Lys Ile Leu Ile Glu Asp Gly Arg Ile Gin Pro Asn 275 280 285 Arg Ile Glu Glu Val Tyr His Arg Val Ala Arg Asn Met Glu Lys Glu 290 295 300 Leu Leu Ser Glu Gly Glu Ser Val Val Leu Glu Leu Glu Leu Gly Thr 305 310 315 320 Met Glu Asp Glu Leu Lys Ile Leu Ile Gly Lys Met Arg Tyr Arg Phe 325 330 335 Ser Phe Gly Gin Asn Ala Leu Gin His Ser Lys Glu Val Ala Leu Leu 340 345 350 Ala Gly Leu Ile Ala Glu Gin Leu Gly Gly Asp Lys Lys Leu Ala Arg 355 360 365 Arg Ala Gly Ile Leu His Asp Ile Gly Lys Ala Leu Thr Gln Glu Leu 370 375 380 Gly Arg Asp His Val Asn Leu Gly Val Glu Val Cys Lys Arg His Lys 385 390 395 400 Glu Asp Pro Val Val Ile Asn Ala Ile Tyr Ala His His Gly His Glu 405 410 415 Glu Ile Met Ser Val Glu Cys Ala Ser Val Cys Ala Ala Asp Ala Leu 420 425 430 Ser Ala Gly Arg Pro Gly Ala Arg Arg Lys Ser Asp Glu Glu Tyr Ala 435 440 445 Lys Arg Met Gln Ala Leu Glu Glu Ile Ala Leu Glu Phe Asp Gly Val 450 455 460 Glu Lys Ala Tyr Ala Met Glu Ser Gly Arg Glu Leu Arg Val Ile Val 465 470 475 480 Lys Ser Asn Gln Val Arg Asp Asn Gin Val Pro Ile Ile Ala Arg Lys 485 490 495 Ile Ala Lys Arg Ile Glu Glu Ser Thr Gin Tyr Val Gly Glu Val Gly 500 505 510 Val Gin Val Val Arg Glu Asn Arg Phe Lys Thr Thr Ala Thr Leu Lys 515 520 525 Gin INFORMATION FOR SEQ ID NO:745: SEQUENCE CHARACTERISTICS: LENGTH: 249 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) 
HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: WO 97/37044 WO 9737044PCTIUS97/05223 653 ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .249 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:745: Met Lys.Lys Gly Ser Leu Ala Ile Val Leu Gly Ser 1 Gly His Lys Pro Phe Gly Ile Ile Val1 145 Lys Lys Phe Arg His 225 Val1 Thr As n Gly Lys Glu Tyr Lys Leu 130 Ile Ser Ala Val1 Ala 210 Ile Ser Phe Asn Lys Ile Lys Ser Glu 115 Glu Asp Giu Val1 Pro 195 Ile Thr Ser Tyr Met Gin Giu Ser Val1 100 Lys Asp Met Asp Ile 180 Lys Lys Lys Giu 5 Thr Gly Giu Ala Leu Ser Al a Ile Ser Ile 165 Glu Thr Lys Glu Met 24S Ala Glu Pro Asn 70 Phe Gin Leu Val Ser 150 Ile Arg Phe Ile Leu 230 Lys Leu Ser Lys 55 Lys Leu Phe Leu Giu 135 Gly His Val1 Val1 Met 215 Ser Lys Ala Val1 40 Asn Val1 Gin Lys Val1 120 Giu Tyr Ser Glu His 200 Asn Lys Arg Asp 25 Giu Asn Ile Leu Asp 105 Leu Sen Leu Phe Leu 185 Ang Gin Lys Lys 10 Gly Leu His Pro Ser 90 Val1 Ang Asp Asn Gly 170 Arg Ile Ala Met His Leu Giu 75 Asn Ser Met Ala Leu 155 Ile Arg Lys Tyr Pro Phe Val1 Asn Phe Giu Asp Leu 140 Asn Asp Thr Giu His 220 Leu Met His Val1 Tyr Leu Ile Gly 125 Ser Phe Val Asn Thr 205 Lys Leu Lys Tyr Leu Gin Glu Pro 110 Asn Giu Val Ser Ser 190 Asp Val Ala Gin Pro Ile Lys Arg Gin Val Glu Glu Lys 175 Gly His Met Sen Gin Ile Asp Glu Lys Asp Ala Lys Pro 160 Ile Gly Asp Ala His Met Glu Arg Tyr Giu Lys INFORMATION FOR SEQ ID NO:746: SEQUENCE CHARACTERISTICS: LENGTH: 119 amino *acids TYPE: amino acid TOPOLOGY: linear (ii4) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc-feature WO 97/37044 PCT/US97/05223 654 LOCATION 1...119 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:746: Met Gin Asn Thr Phe Asn Tyr Thr Asn Asn Ala Leu Lys Asn Asn Ala 1 5 10 Lys Leu Thr Pro Thr Glu Met Gin Ala Glu Gin Tyr Tyr Leu Gin Ser 25 Thr Leu Gin Asn Ile Glu Lys Ile Val Met Leu Ser Gly Gly Val Ala 40 Ser Asn Pro Lys Leu Val Gin Ala Leu Glu Lys Met Gin Glu Pro Ile 55 Thr Asn Pro Leu Glu Leu Val 
Glu Asn Leu Lys Asn Leu Glu Leu Gln 70 75 Phe Ser Gln Ser Gln Asn Ser Met Leu Ser Ser Leu Ser Ser Gln Ile 90 Ala Gln Ile Ser Asn Ser Leu Asn Ala Leu Asp Pro Ser Ser Tyr Ser 100 105 110 Lys Asn Val Ser Ser Met Tyr 115
INFORMATION FOR SEQ ID NO:747: SEQUENCE CHARACTERISTICS: LENGTH: 314 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...314 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:747:
Met Asp Leu Asp Lys Leu Lys Asp Tyr Arg Ala Leu Arg Asn Ala Ile 1 5 10 Leu Arg Leu Leu Pro Tyr Leu Asp Ser Gly Ile Thr Glu Leu Ile Met 25 Asn Lys Glu Lys Glu Ile Trp Leu Tyr Lys Leu Asn Gly Val Arg Glu 40 Lys Val Phe Asp Glu Asn Leu Asp Lys Ala Phe Ile Leu Gly Phe Gly 55 Glu Gln Leu Ala Ser Phe Arg Asp Leu Phe Phe Asn Ala Asn Tyr Pro 70 75 Thr Leu Asn Thr Ser Ile Pro Thr Ser Arg Tyr Arg Val Ser Met Asn 90 His Phe Ala Ile Ser Ala Asp Asn Glu Leu Ser Leu Asn Ile Arg Val 100 105 110 Pro Ser Asp Lys Lys Phe Asp Leu Lys Ala Phe Lys Leu Ser Ser Ile 115 120 125 Cys Gln Tyr Asp Tyr Glu Tyr Leu Lys Asn Leu Met Ile Asp Gly Lys 130 135 140 Asn Leu Leu Ile Ser Gly Gly Thr Gly Ser Gly Lys Thr Ser Phe Leu 145 150 155 160 Asn Ala Leu Ile Glu Phe Ile Pro Lys His Thr Arg Ile Val Ser Val 165 170 175 Glu Asp Ser Glu Glu Leu Asp Leu Arg Ala Phe Glu Asn His Lys Ser 180 185 190 Leu Leu Val Asp Lys Thr Glu Ser Ser Lys Phe Thr Tyr Glu Asn Ala 195 200 205 Leu Asn Met Ala Met Arg Met Ser Pro Asp Arg Leu Met Val Gly Glu 210 215 220 Ile Asp Thr Arg Asn Ser Met Leu Phe Leu Arg Phe Gly Asn Thr Gly 225 230 235 240 His Lys Gly Met Val Ser Thr Leu His Ala Asp Ser Val His Gly Val 245 250 255 Ile Glu Ala Ile Ala Leu Asn Leu Gln Met Asn Lys Ser Gly Leu Asp 260 265 270 Val Asn Val Ala Lys Lys Phe Phe Lys Ser Ser Val Asp Val Val Val 275 280 285 Gln Ile Val Leu Asp Lys Ala Thr Asn Thr Arg Tyr Ile Gln Glu Ile 290 295 300 Leu Pro Ala Lys Asp Leu Arg Asp Ser Leu 305 310
INFORMATION FOR SEQ ID NO:748: SEQUENCE CHARACTERISTICS: LENGTH: 144 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...144 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:748:
Met Asn Ile Ser Val Asn Pro Tyr Leu Met Ala Val Val
Phe Val Val 1 5 10 Phe Val Leu Leu Leu Trp Ala Met Asn Val Trp Val Tyr Arg Pro Leu 25 Leu Ala Phe Met Asp Asn Arg Gln Ala Glu Ile Lys Asp Ser Leu Ala 40 Lys Ile Lys Thr Asp Asn Thr Gln Ser Val Glu Ile Gly His Gln Ile 55 Glu Thr Leu Leu Lys Glu Ala Ala Glu Lys Arg Arg Glu Met Leu Ala 70 75 Glu Ala Ile Gln Lys Ala Thr Glu Ser Tyr Asp Ala Val Ile Lys Gln 90 Lys Glu Asn Glu Leu Asn Gln Glu Phe Glu Ala Phe Ala Lys Gln Leu 100 105 110 Gln Asn Glu Lys Gln Ile Leu Lys Glu Gln Leu Gln Ala Gln Met Thr 115 120 125 Val Phe Glu Asp Glu Leu Asn Lys Arg Val Ala Met Gly Leu Gly Ser 130 135 140
INFORMATION FOR SEQ ID NO:749: SEQUENCE CHARACTERISTICS: LENGTH: 91 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:749:
Met Ala Ser Gly Leu Phe Glu Asn Asp Glu Ile Lys Asn Asn Lys Ala 1 5 10 Arg Asp Phe Phe Tyr Ser His Ser Ser Leu Ile Val Phe Phe Leu Leu 25 Leu Leu Gly Phe Gly Tyr Tyr Leu Gly Lys Leu Leu Phe Gly Gly Ser 40 Ser Leu Glu Val Tyr Leu Asp Leu Arg Asp Lys His Glu Arg Leu Gln 55 Gln Glu Ile Thr Glu Leu Gln Ser Lys Asn Val Arg Leu Gln Lys Arg 70 75 Leu Phe Glu Leu Arg Glu Leu Arg Pro Arg Asp
INFORMATION FOR SEQ ID NO:750: SEQUENCE CHARACTERISTICS: LENGTH: 299 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...299 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:750:
Met Lys Lys Asn Ile Leu Asn Leu Ala Leu Val Gly 1 Ser Asn Asp Asn Leu Ala Val Glu Asn 145 His Ile Ala Gly Lys 225 Lys Pro Leu Leu Phe Thr Gly Pro Ile Glu Lys Val 130 Ala Ile Asp Asn Asp 210 Ala Thr Val Gln Arg 290 Leu Lys Arg Asn Glu Lys Lys 115 Lys Asn Leu Lys Arg 195 Leu Ala Glu Thr Glu 275 Lys Met Glu Pro Phe Gln Leu 100 Gln Lys Lys
Val Gin 180 Asp Gly Phe Phe Tyr 260 Lys His 5 Ala Thr Ile Asp Ala Asn Ala Ile Asp Lys 165 Pro Thr Lys Ala Gly 245 Thr Leu Ala Lys Thr Thr Phe 70 Ile Gin Leu Gln Gin 150 Thr Lys Ile Phe Leu 230 Tyr Tyr Phe Lys Ala Ala 40 Ser Lys Thr Pro Glu 120 Pro Phe Asp Lys Pro 200 Lys Pro Ile Gin Glu 280 Val His 25 Ser Asp Leu Ala Glu 105 Phe Glu Val Glu Lys 185 Asn Asn Gly Ile Ala 265 Arg Ile 10 Asn Ala Phe Lys Leu 90 Phe Trp Lys Lys Ala 170 Glu Ser Gin Asp Tyr 250 Lys Met Asn Ala Gly Asp Glu 75 Val Lys Ala Glu Gin 155 Lys Ala Lys Met Tyr 235 Leu Pro Asn Lys Asn Val Met Lys Glu Ala Lys Met 140 Glu Arg Lys Asn Ala 220 Thr Ile Thr Gin Ala Asn Leu Ile Glu Asn Met Lys 125 Gin Ala Ile Phe Ala 205 Pro Lys Ser Ile Arg 285 Leu Ser Ala Lys Lys Glu Met 110 Gin Asp His Ile Ile 190 Gin Asp Thr Lys Lys 270 Ile Ser Thr Thr Gin Glu Ala Glu Ala Phe Ala Ser 175 Glu Asn Phe Pro Asp 255 Gly Glu Ala His Val Arg Ala Lys Ala Glu Tyr Arg 160 Glu Leu Gly Ser Val 240 Ser Met Glu INFORMATION FOR SEQ ID NO:751: SEQUENCE CHARACTERISTICS: LENGTH: 305 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: WO 97/37044 WO 9737044PCTIUS97/05223 658 NAME/KEY: misc-feature LOCATION .305 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:751: Met Ile Lys Ser Trp Thr Lys Lys Trp Phe Leu Ile 1 Ala Lys Ala Ser Ser Leu Gin Leu Gly 145 Tyr Met Ala Ser Glu 225 Ser Pro Leu Ser Gin 305 Ser Met Phe Leu Lys Ala Lys Lys 130 Thr Ala Tyr Asn Leu 210 Gin Gly Lys Gly Asp 290 Cys Ala Tyr Gly Al a Cys Asp 115 Gly Giy Cys Lys Phe 195 Gly Ala Cys Asp Phe 275 Asn Phe Asn Lys Ser Vai Ala 100 Phe Gly Val1 Ser Ser 180 Lys Tyr Leu His Leu 260 Ser Leu 5 Gly Gin Arg Met Phe Ser Pro Val Lys Leu 165 Ala Arg Leu Asn Asn 245 Giu Gly Gin His Ala Ser Tyr 70 Tyr Leu Lys Ser Gin 150 Asn Lys Gly Tyr Leu 230 Val Lys Ser Asp Leu Leu Cys 55 Giu Tyr Gly Ala Cys 135 Asn Tyr Gly Cys Giu 215 Tyr Ala Ala Cys Asp 295 Val Lys 40 Asn Tyr Arg Ser Ile 120 Gly Tyr Gly 
Val1 His 200 Ala Lys Val Thr Lys 280 Ala Ala 25 Arg Leu Gly Arg Met 105 Tyr Ser Ala Ile Glu 185 Leu Gly Lys Met Ser 265 Val1 Gin 10 Thr Gly Arg Asp Gly 90 Tyr Tyr Leu Lys Ser 170 Lys Lys Met Gly Tyr 250 Tyr Leu Asn Thr Asp Met Gly 75 Cys Giu Tyr Gly Ala 155 Cys Asp Asp Asp Cys 235 Tyr Tyr Giu Asp Gly Tyr Gly Val1 Asn Asp Arg Phe 140 Leu Asn Leu Gly Val 220 Ser Thr Lys Val1 Thr 300 Leu Glu His Val Asp Leu Gly Arg 125 Met Ser Phe Lys Ala 205 Lys Leu Gly Lys Ile 285 Gln Phe Lys Arg Gly Gin Arg Asp 110 Gly Tyr Phe Val1 Lys 190 Ser Gin Lys Lys Gly 270 Gly Asp Leu Tyr Ala Cys As n Asn Gly Cys Phe Ser Gly 175 Ala Cys Asn Giu Gly 255 Cys Lys Ser Met Phe Val1 Thr Ile His Val His Asn Lys 160 Tyr Leu Val Giu Gly 240 Ala Ala Glu Val1 INFORMATION FOR SEQ ID NO:752: SEQUENCE CHARACTERISTICS: LENGTH: 502 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 WO 9737044PCTIUS97/05223 659 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1 .502 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:752: Leu Lys Ile Phe Leu Val Ile Leu Ser Val Phe Phe 1 Phe Asp Glu Ala Asp Ala Leu Leu Ile 145 Arg Ala Asp Val1 Arg 225 Asn Giu Phe Ala Ile 305 Ala Asni Ser Gly Pro Asn Leu Met Lys Leu Asn 130 Arg Ile Val1 Leu Ala 210 Ser Val1 Asp Gin Asp 290 Ala Ser Giln Ser Leu Tyr Pro Leu Gin Giu Asp 115 Phe Asn Lys Ile Asp 195 Ser Ile Lys Ala Lys 275 Leu Phe Ser Ile Thr 355 Al a Thr Lys His Thr Leu 100 Asp His Lys Lys Ile 180 Thr Lys Pro Glu Asn 260 Tyr Pro Giu Tyr Ser 340 Asp 5 Tyr Thr His Arg Tyr Leu Asn Lys Gly Arg 165 Gly Asn Ala Val Ile 245 Giu Gin Ala Lys Phe 325 Lys Ala Lys Thr Ser Val1 70 Ile Asn Gly Asn Leu 150 Met Gly Phe Lys Ser 230 Ala Phe Tyr Lys Al a 310 Ile Gly Ile Thr Ile Ala 55 Gly Tyr Ala Leu Ile 135 Arg His Arg Leu Glu 215 Leu Lys Glu Pro Ile 295 Leu Pro Ile Val1 Pro Gly 40 Ala Leu Lys.
Ala Asp 120 Giu Tyr Asn Asn Asp 200 Sen Leu Leu Lys Ile 280 Asp Lys Gly Giu Val1 360 Ile 25 Ser Ile Ile Asn As n 105 Ser Val Phe Lys Ile 185 Leu Phe Arg His Lys 265 Tyr Thr Asn Lys Leu 345 Tyr 10 Ser Leu Leu Arg Asp 90 Arg Asp Lys Giu Leu 170 Gly Asp Giu Thr Giu 250 Val1 Tyr Pro Ala Lys 330 Asn Gly Asn Tyr Leu Met 75 Leu Gly Phe Ile Met 155 Phe Asp Ala Asn His 235 Lys Asn Gly Leu Lys 315 Ile Ile Ala Pro Ala Giu Sen Ser Val1 Ser Phe 140 Leu Ile Asn Leu Tyr 220 Lys Ile Asp Asn Tyr 300 Asp Met Leu Trp Phe Pro Lys Asp Gin Ser Lys Asp 125 Asn Ala Val1 Tyr Phe 205 Trp Arg Pro Phe Ala 285 Ser Ser Lys Thr Giu 365 Asn Ile Asn Gili Lys Gin Val 110 Ile Pro 'Asp Asp Phe 190 Phe Arg Leu Ile Ile 270 Ile Pro Val Ile Asn 350 Arg Gly Ser Leu Phe Sen Val1 Ang Met Tyr Tyr Asn 175 Asp Gly Phe Lys Sen 255 Giu Phe Ile Phe Phe 335 Ser Tyr Cys Tyr Lys Asp Ile Ile Ile Leu Tyr Giu 160 Phe Asn Gly His Asn 240 Ala Arg Leu Lys Ile 320 Lys Leu Arg Asn Lys Leu Val Ang Met Gly Ala Asn Val Tyr Giu Ile Arg Asn Asp 370 375 380 WO 97/37044 PCTIUS97/05223 660 Phe Phe Asn Arg Gin Ile Lys Gly Arg Phe Ser Thr Lys His Ser Leu 385 390 395 400 His Gly Lys Thr Ile Val Phe Asp Asp Ala Leu Thr Leu Leu Gly Ser 405 410 415 Phe Asn Ile Asp Pro Arg Ser Ala Tyr Ile Asn Thr Glu Ser Ala Val 420 425 430 Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg Val Arg Leu Ser Leu Lys 435 440 445 Asp His Ala Gin Gin Ser Trp His Leu Val Leu Tyr Arg His Arg Val 450 455 460 Ile Trp Glu Ala Thr Glu Glu Gly Ile Leu Ile His Glu Lys Asn Ser 465 470 475 480 Pro Asp Thr Ser Phe Phe Leu Arg Leu Ile Lys Glu Trp Ser Lys Val 485 490 495 Leu Pro Glu Arg Glu Leu 500 INFORMATION FOR SEQ ID NO:753: SEQUENCE CHARACTERISTICS: LENGTH: 169 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...169 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:753: Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys Leu Gly 1 5 10 Cys 
Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu Val Leu 25 Glu Lys Lys Ile Gin Ala Ser Arg Lys Gly Glu Ile Thr Gin Asp Asn 40 Val Pro Ile Ile Thr Ala Ile Ala Thr His Leu Asn Asp Val Asp Ser 55 Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu Ile Phe Thr Gin 70 75 Asn Asn Asp Trp Ile Asp Asp Gly Tyr Ile Ser Tyr Glu Leu Phe Gly 90 Thr Lys Pro Thr Gly Ser Glu Pro Leu Trp Val Arg Glu Ile Thr Arg 100 105 110 Asp Glu Phe Asp Gly Ile Leu Glu Thr Thr Asn Arg Trp Ser Arg Ala 115 120 125 Phe Leu Ile Ala Phe Asp Lys Leu Asp Tyr Leu Ala Val Gin Glu Ala 130 135 140 Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys Ile Val Phe Asn Phe 145 150 155 160 WO 97/37044 WO 9737044PCT/US97/05223 Ala Tyr Gin Val Pro Leu Pro Gin Phe 165 INFORMATION FOR SEQ ID NO:754: SEQUENCE CHARACTERISTICS: LENGTH: 511 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1 .511 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:754: Met Lys Lys Leu Leu Tyr Thr Met Leu Ala Leu Leu Leu Ile Gly Leu 1 Leu Ala Val Ala Thr Ser Ser Ser Phe 145 Asp Ser Ile Ile Sen 225 Ser Thr Sen Lys Asn Gin Phe Gly Asn 130 Lys Leu Leu Leu Phe 210 Ser Pro Ala Tyr Thn Asp Sen Lys Asn 115 Val Leu Leu Gin Thr 195 His Asp Leu Tyr Ile Phe Asp Val1 Asp 100 Ile Ala Sen Tyr Ala 180 Ala Leu Phe Val1 5 Leu Giu Lys Ser Asn Leu Lys Gin His Leu 165 Asp Asn Asn Lys Asn 245 Ile Lys Leu Thn 70 Leu Ile Gly Ser Leu 150 Ile Phe Asn Leu Gly 230 Phe Leu Lys Arg 55 Leu Asp Pro His His 135 Ser Asn Asn Ala Lys 215 Asn Thr Phe Ile 40 Phe Ile Tyr Tyr Arg 120 Thr Leu Arg Sen Leu 200 Asp Lys Ala Thn 25 Asn Asn Leu His Pro 105 Lys Ala Asn Pro Leu 185 Ile Thr Ala Leu 10 Glu Pro Ser Lys Ile 90 Leu Ala Tyn Ala Ala 170 Lys Asn Leu Ile Lys 250 Trp Asn Leu Gly 75 Asp Arg Leu Asn Lys 155 Tyr Pro Asn Vai Ser 235 Sen Gly Glu Asp Asp Ile Gly Val Ala 140 Asp Ala Leu Ala Phe 220 Asp Glu Asn Ang Phe Phe Lys Ala Val1 125 Leu Ala Asn Glu Leu 205 
Asn Thn Tyr Lys Tyr Lys Ser Asn Ile 110 Gin Leu Asn Al a Gly 190 Ile Leu Thr Ser Ile Leu Ala Leu Leu Val1 Gly Asp Leu Lys 175 His Asn Sen Leu Phe 255 Ile Sen Gin Leu Arg Thr Val1 Asp Glu 160 Val Leu Gin His Thr 240 Pro Ala Leu Lys Leu Asn Ala Pro Tyr Thr Leu Glu Ile Pro His Leu Ala 260 265 270 WO 97/37044 PCT/US97/05223 662 Lys Lys Ser 305 Leu Phe Tyr Ala Arg 385 Gin Lys Gin Met Asn 465 Leu Asp Leu Gly 290 Asn Lys His Asp Arg 370 Phe Ile Thr Met Lys 450 Glu Lys Lys Gin 275 Asp Leu Ala Tyr Leu 355 Phe Asp Asn Gin Asp 435 Leu Lys Asn Leu Asn Ile Leu Arg Pro 340 Ile Leu Ile Gin Leu 420 Ile Gin Ala Asp Lys 500 Ile Thr Asn Glu Asp Phe 325 Lys Ser Lys Thr Gin 405 Lys Leu Gly Ile Thr 485 Glu Gin Gly 310 Ser Phe Lys Asn Lys 390 Arg Ile Met Ser Gin 470 Leu Lys Ser 295 Ala Asn Phe Gin Ala 375 Glu Leu His Asp Met 455 Gin Lys Leu His Pro Leu 280 Pro Lys Leu Leu Asp Phe Ile Ser Thr 330 Gin Ser Ile 345 Gly Val Leu 360 Phe Ser Asp Ile Tyr His Leu Ser Asp 410 Asn Gly Leu 425 Ala Glu Ile 440 His Gin Pro Asn Leu Gin Lys Gly Leu 490 Glu Lvs Glv Lys Leu Thr 315 Ser Ala Lys Phe Asp 395 Leu Leu Leu Lys Gin 475 Asp Gly Lys 300 Leu Lys Asp Ala Leu 380 Ala Ser Asp Lys Phe 460 Gly His Ser 285 Val Leu Ala Ala Asn 365 Tyr Asn Leu Leu Phe 445 Ser Leu Leu Leu Ser Asn Leu Asn 350 Leu Ser Leu Lys Asn 430 Ile Leu Lys Leu Thr Gly Lys Asp 335 Leu Lys Ile Val Ser 415 Thr Phe Ile Glu Lys 495 Leu His Asp 320 Leu Asp Asn Ser Ser 400 Pro Lys Lys Leu Ile 480 Asp 505 Leu Lys Gly Leu Phe 510 INFORMATION FOR SEQ ID NO:755: SEQUENCE CHARACTERISTICS: LENGTH: 160 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...160 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:755: Met Phe Gly Met Gly Phe Phe Glu Ile Leu Val Val Leu Ile Val Ala 1 5 10 Ile Ile Phe Leu Gly Pro Glu Lys Phe Pro Gin Ala Val Val Asp Ile 25 Val Lys Phe Phe Arg Ala Val Lys Lys Thr Leu Asn Asp Ala Lys Asp 40 WO 
97/37044 PCT/US97/05223 663 Thr Leu Asp Lys Glu Ile Asn Ile Glu Glu Ile Lys Lys Glu Thr Leu 55 Glu Tyr Gln Lys Leu Phe Glu Asn Lys Val Glu Ser Leu Lys Gly Val 70 75 Lys Ile Glu Glu Leu Glu Asp Ala Lys Val Thr Ala Glu Asn Glu Ile 90 Lys Ser Ile Gln Asp Leu Met Gln Asp Tyr Lys Arg Ser Leu Glu Thr 100 105 110 Asn Thr Ile Pro Asn His Leu Asn Glu Glu Val Ser Asn Glu Glu Ala 115 120 125 Leu Asn Lys Glu Val Ser Ser Asp Glu Ser Pro Lys Glu Val Gln Leu 130 135 140 Thr Thr Asp Asn Asn Ala Lys Glu His Asp Lys Glu Lys Glu His Val 145 150 155 160
INFORMATION FOR SEQ ID NO:756: SEQUENCE CHARACTERISTICS: LENGTH: 80 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:756:
Met Leu Arg Ile Leu Ile Pro Leu Leu Ile Ile Val Trp Val Leu Trp 1 5 10 Arg Leu Phe Leu Arg Gln Lys Pro His Lys Asp Asp His Arg Asp Asn 25 His Ser Tyr Thr Gln Gln Thr Pro Lys Glu Leu Glu Asp His Met Ile 40 Val Cys Ser Lys Cys Gln Thr Tyr Val Ser Ser Lys Asp Ala Ile Tyr 55 Ser Gly Ala Val Ala Tyr Cys Ser Glu Thr Cys Leu Lys Asp Lys Gly 70 75
INFORMATION FOR SEQ ID NO:757: SEQUENCE CHARACTERISTICS: LENGTH: 214 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...214 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:757:
Leu Asp Gly Leu Lys Lys Glu Arg Gln Glv Phe 1 Tyr Leu Arg Ile Ala Glu Asn Leu Phe Met Arg Pro Leu Gln Gly Tyr Tyr Tyr Phe Phe Thr Leu Gln Ala 100 Val Ala Tyr Ala 115 Leu Gln Leu Ala 130 Leu Val Val Asn 145 Thr Tyr Phe Arg Val Leu Tyr His 180 Thr Pro Glu Arg 195 Ser His Ser Trp 210
5 Leu Ser Leu Glu Ile Asn Lys Ala Thr Ala 165 Lys Arg His Ile Tyr Asn Val 70 Asp Met Asn Asn Trp 150 Ile Val Ser Phe Leu Met Thr 55 Asn Tyr Phe Pro Thr 135 Asp Gly Asp Leu Ser Ser 40 Asn Pro Gly Thr Ile 120 Trp Ser Lys Val Phe 200 Met 25 Ser Lys Lys Asn Tyr 105 Asn Ile Leu Phe Glu 185 Glu 10 Ser Ser Leu Asn Val 90 Gly Arg Leu Lys Gly 170 Ile Arg Phe Tyr Leu Asp 75 Leu Val Trp Asn Asp 155 Val Gly Ser Tyr Leu Gln Gln Trp Phe Gly Ala Asn 140 Phe Gln Met Phe Lys Asn Ile Gly Ala Asn Gly Phe 125 Lys Asn Phe Lys Leu 205 Gln Ile Gly Ala Tyr Asn Asp 110 Phe Val Phe Arg Ile 190 Phe Val Leu Thr Ser Ser Asp Phe Phe Lys His Thr 175 Phe Phe His Asn Val Ile Arg Ser Met Gly Asp Asn 160 Ile Leu Val
INFORMATION FOR SEQ ID NO:758: SEQUENCE CHARACTERISTICS: LENGTH: 111 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...111 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:758:
Met Lys Pro Thr Asn Glu Pro Lys Lys Pro Phe Phe Gln Ser Pro Ile 1 5 10 Val Leu Ala Val Leu Gly Gly Ile Leu Leu Ile Phe Phe Leu Arg Ser 25 Phe Asn Ser Asp Gly Ser Phe Ser Asp Asn Phe Leu Ala Ser Ser Thr 40 Lys Asn Val Ser Tyr His Glu Ile Ile Gln Leu Ile Ser Asn His Glu 55 Val Gly Asn Val Ser Ile Gly Gln Thr Leu Ile Lys Ala Ser His Lys 70 75 Glu Gly Asn Asn Arg Val Ile Tyr Ile Ala Lys Arg Val Leu Ile Tyr 90 Leu Ser Ala Phe Val Arg Arg Glu Lys Asn Gln Leu Phe Trp Phe 100 105 110
INFORMATION FOR SEQ ID NO:759: SEQUENCE CHARACTERISTICS: LENGTH: 207 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:759:
Met Leu Arg Val Leu Ser Val Gly Val Val Phe Ile Leu Leu Gly Cys 1 5 10 Gln Phe Phe Asn Lys Thr Thr Leu His Leu Lys Tyr Lys Asp Tyr Pro 25 Lys Asn Ser Pro Leu Lys Thr Ala Ser Thr Leu Thr Pro Pro Lys Ile 40 Phe Phe Asn Ala His Phe Val Pro Pro Phe Tyr Gln Lys Glu Phe Lys 55 Lys Ala Leu Ala Gln Gln Ile Ala Tyr Phe Leu Lys Asp Lys Ser Ala 70 75 Leu Thr Phe Asn Ile Ser Gly Asn Val Phe Phe Ser Phe Glu Glu Ser 90 Pro Lys Asp Leu Lys Ala Ile Lys Glu Arg Leu Lys Lys Thr Ile Glu 100 105 110 Pro Asn Thr Asp Pro Lys Ala Val Met Arg Phe Leu Asn Leu Gln Ala 115 120 125 Ser Leu Ile Leu Glu Cys Val Pro Gln Thr Ala Cys Pro Phe Asp Thr 130 135 140 Leu Leu Ile Pro Thr Ala Leu Ser Val Pro Ile Asp Tyr Ala Asn Arg 145 150 155 160 Leu Gly Asp Asn Pro Ser Leu Phe Pro Gln Glu Asp Lys Ser Tyr His 165 170 175 Asn Ala Leu Ile Lys Ala Leu Asn Lys Ala Tyr Tyr Ser Leu Met Glu 180 185 190 Gly Leu Glu Lys Arg Leu Asn Ala Ile Glu Asn Ala Ala Trp Leu 195 200 205
INFORMATION FOR SEQ ID NO:760: SEQUENCE
CHARACTERISTICS: LENGTH: 175 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...175 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:760:
Met Lys Asn Gln Val Lys Lys Ile Leu Gly Met Ser Val Val Ala Ala 1 5 10 Met Val Ile Val Gly Cys Ser His Ala Pro Lys Ser Gly Ile Ser Lys 25 Ser Asn Lys Ala Tyr Lys Glu Ala Thr Lys Gly Ala Pro Asp Trp Val 40 Val Gly Asp Leu Glu Lys Val Ala Lys Tyr Glu Lys Tyr Ser Gly Val 55 Phe Leu Gly Arg Ala Glu Asp Leu Ile Thr Asn Asn Asp Val Asp Tyr 70 75 Ser Thr Asn Gln Ala Thr Ala Lys Ala Arg Ala Asn Leu Ala Ala Asn 90 Leu Lys Ser Thr Leu Gln Lys Asp Leu Glu Asn Glu Lys Thr Arg Thr 100 105 110 Val Asp Ala Ser Gly Lys Arg Ser Ile Ser Gly Thr Asp Thr Glu Lys 115 120 125 Ile Ser Gln Leu Val Asp Lys Glu Leu Ile Ala Ser Lys Met Leu Ala 130 135 140 Arg Tyr Val Gly Lys Asp Arg Val Phe Val Leu Val Gly Leu Asp Lys 145 150 155 160 Gln Ile Val Asp Lys Val Arg Glu Glu Leu Gly Met Val Lys Lys 165 170 175
INFORMATION FOR SEQ ID NO:761: SEQUENCE CHARACTERISTICS: LENGTH: 88 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...88 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:761:
Met Lys Lys Ile Val Val Ser Leu Cys Val Ala Leu Gly Phe Leu Ser 1 5 10 Ala Asp Pro Ala Gln Ala Asn Lys Ala Ile Ser Asp Ala Asp Leu Ile 25 Glu Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala Gln Asn Thr Glu 40 Ile Asn Gln Leu Arg Lys Val Gln Glu Val Leu Ser Gly Gln Leu Gly 55 Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr Cys Ile Ser Leu 70 75 Arg Pro Tyr Ile Tyr Asn Trp Arg
INFORMATION FOR SEQ ID NO:762: SEQUENCE CHARACTERISTICS: LENGTH: 257 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter
pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...257 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:762:
Leu Asn Ser Gly Ser Asn Ala Ser Leu Tyr Gly Thr Ser Ala Gly Val 1 5 10 Asp Ala Phe Leu Asn Gly Asn Val Glu Ala Ile Val Gly Gly Phe Gly 25 Ser Tyr Gly Tyr Ser Ser Phe Ser Asn Gln Ala Asn Ser Leu Asn Ser 40 Gly Ala Asn Asn Ala Asn Phe Gly Val Tyr Ser Arg Phe Phe Ala Asn 55 Gln His Glu Phe Asp Phe Glu Ala Gln Gly Ala Leu Gly Ser Asp Gln 70 75 Ser Ser Leu Asn Phe Lys Ser Thr Leu Leu Gln Asp Leu Asn Gln Ser 90 Tyr Asn Tyr Leu Ala Tyr Ser Ala Thr Ala Arg Ala Ser Tyr Gly Tyr 100 105 110 Asp Phe Ala Phe Phe Arg Asn Ala Leu Val Leu Lys Pro Ser Val Gly 115 120 125 Val Ser Tyr Asn His Leu Gly Ser Thr Asn Phe Lys Ser Asn Ser Gln 130 135 140 Ser Gln Val Ala Leu Lys Asn Gly Ala Ser Ser Gln His Leu Phe Asn 145 150 155 160 Ala Asn Ala Asn Val Glu Ala Arg Tyr Tyr Tyr Gly Asp Thr Ser Tyr 165 170 175 Phe Tyr Leu His Ala Gly Val Leu Gln Glu Phe Ala His Phe Gly Ser 180 185 190 Asn Asp Val Ala Ser Leu Asn Thr Phe Lys Ile Asn Ala Ala Arg Ser 195 200 205 Pro Leu Ser Thr Tyr Ala Arg Ala Met Met Gly Gly Glu Leu Gln Leu 210 215 220 Ala Lys Glu Val Phe Leu Asn Leu Gly Val Val Tyr Leu His Asn Leu 225 230 235 240 Ile Ser Asn Ala Ser His Phe Ala Ser Asn Leu Gly Met Arg Tyr Ser 245 250 255 Phe
INFORMATION FOR SEQ ID NO:763: SEQUENCE CHARACTERISTICS: LENGTH: 283 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...283 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:763:
Met Phe Glu Asp Phe Tyr Arg Thr Thr Leu Ser Phe Leu Arg Ser Leu 1 5 10 Leu Leu Leu Leu Gly Leu Leu Leu Pro Phe Ser Leu Cys Ile Ala Asp 25 Glu Tyr Ile Ser Ile Ser Asp Asp Trp Asp Glu Arg Ala Arg Asn Gln 40 Trp Asp Glu Thr Ala Arg Asn His Lys Thr Tyr Tyr Phe Glu Asn Gly 55 Leu Asp His Phe Asn Gln Gly Gln Tyr Lys Gln Ala Phe Lys Asp Phe 70 75 Lys Leu Ala Gln Glu Tyr Ser
Ile Gly Leu Gly Asn Val Tyr Leu Ala 90 Lys Met Tyr Leu Glu Gly Lys Gly Val Lys Val Asp Tyr Lys Lys Ala 100 105 110 Gln Phe Tyr Ala Gln Asn Ala Ile Lys Gly Tyr Gly Ser Gly Leu Leu 115 120 125 Gly Gly Ala Leu Ile Leu Gly Arg Met Gln Ala Glu Gly Leu Gly Met 130 135 140 Lys Lys Asp Leu Lys Gln Ala Leu Lys Thr Tyr Arg His Val Val Arg 145 150 155 160 Met Phe Ser Asn Lys Ser Ala Asn Phe Ala Asn Lys Phe Gly Ser Asn 165 170 175 Leu Ala Glu Phe Thr Ser Met Leu Ile Gly Ser Arg Phe Ile Asp Leu 180 185 190 Ser Gly Leu Ser Ala Asn Pro Ile Lys Phe Gly Asn Lys Phe Gly Ile 195 200 205 Leu Val Lys Lys Ala Leu Gln Ile Lys Asp Asn Thr Leu Ser Trp Glu 210 215 220 Asp Ile Ala Glu Ile Ser Ser Asn Ile Ile Leu Leu Lys Gln Gln Met 225 230 235 240 Gly Glu Ile Leu Tyr Arg Ile Gly Ile Ala Tyr Lys Glu Gly Leu Gly 245 250 255 Thr Arg Lys Lys Lys Asp Arg Ala Lys Lys Phe Leu Gln Lys Ser Ala 260 265 270 Glu Phe Gly Tyr Glu Lys Ala Met Glu Ala Leu 275 280
INFORMATION FOR SEQ ID NO:764: SEQUENCE CHARACTERISTICS: LENGTH: 786 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...786 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:764:
Leu Cys Lys Ile Pro Leu Ser Leu Thr Leu Met Glu Ala Arg Ser Val 1 5 10 Leu Thr Gln Glu Ile Arg Ser Phe Leu Pro Glu Thr Thr Thr Ser Leu 25 Ser Leu Thr Ile Leu Glu Arg Ser Ile Cys Cys Leu Ile Lys Phe Leu 40 Thr Leu Thr Ser Pro Cys Leu Thr Gln Gln Arg Pro Lys Ile Asn Ala 55 Thr Asn Asn Asn Val Ser Val Ser Gln Gly Asn Leu Phe Ile Asn Ala 70 75 Ser Cys Val Gln Gln Ser Asp Pro Thr Thr Ala Ser Ala Thr Asn Pro 90 Cys Thr Thr Ala Gln Asn Asn Ala Ser Ser Ser Asn Ala Ser Asn Asn 100 105 110 Ala Pro Ile Ala Leu Asn Asn Asn Asp Glu Ser Leu Val Val Thr Ala 115 120 125 Asn Gly Phe Asn Phe Ser Gly Asn Ile Tyr Ala Asn Gly Val Val Asp 130 135 140 Phe Ser Lys Ile Lys Gly Ser Ala Asn Val Lys Asn Leu
Tyr Leu Tyr 145 150 155 160 Asn Asn Ala G1n Phe Gin Ala Asn Asn Leu Thr Ile Ser Asn Gin Ala 165 170 175 Val Leu Giu Lys Asn Ala Ser Phe Val Thr Asn Asn Leu Asn Ile Gin 180 185 190 Gly Ala Phe Asn Asn Asn Ala Thr Gin Lys Ile Glu Val Leu Gin Asn 195 200 205 Leu Val Ile Ala Ser Asn Ala Ser Leu Ser Thr Gly Ile Tyr Gly Leu 210 215 220 Glu Val Gly Gly Ala Leu Asn Asn Leu Gly Ala Ile His Phe Asn Leu 225 230 235 240 Glu Asn Ser Gin Thr Pro Val Asn Pro Leu Ile Gin Val Gly Gly Ile 245 250 255 Ile Asn Leu Asn Thr Thr Gin Thr Pro Phe Met Asn Val Ser Val Ala 260 265 270 Asn Gly Gly Thr Tyr Thr Leu Leu Lys Ser Ser Arg Tyr Ile Asp Tyr 275 280 285 Asn Ile Asn Pro Asn Ser Leu Gin Ser Tyr Leu Lys Leu Tyr Thr Leu 290 295 300 Ile Asn Ile Asn Gly Asn His Ile Glu Glu Lys Asn Gly Val Leu Thr 305 310 315 320 Tyr Leu Gly Gin Arg Val Leu Leu Gin Asp Lys Gly Leu Leu Leu Ser 325 330 335 Val Ala Leu Pro Asn Ser Asn Asn Ala Ser Gin Asn Asn Ile Leu Ser 340 345 350 Leu Ser Val Leu His Asn Gin Ile Lys Met Ser Tyr Gly Asn Lys Val 355 360 365 Met Asp Phe Thr Pro Pro Thr Leu Gin Asp Tyr Ile Val Gly Ile Gin 370 375 380 Gly Gin Ser Ala Leu Asn Gin Ile Glu Ala Val Gly Gly Asn Asn Ala 385 390 395 400 Ile Lys Trp Leu Ser Thr Leu Met Met Glu Thr Lys Glu Asn Pro Leu 405 410 415 Phe Ala Pro Ile Tyr Leu Glu Asn His Ser Leu Asn Glu Ile Leu Gly 420 425 430 Val Thr Lys Asp Leu Gin Asn Thr Ala Ser Leu Ile Ser Asn Pro Asn 435 440 445 Phe Arg Asn Asn Ala Thr Ser Leu Leu Glu Met Ala Ser Tyr Thr Gin 450 455 460 Gin Thr Ser Arg Leu Thr Lys Leu Ser Asp Phe Arg Ala Arg Glu Gly 465 470 475 480 Glu Ser Asn Phe Ser Glu Arg Leu Leu Glu Leu Lys Asn Lys Arg Phe 485 490 495 Ser Asp Pro Asn Pro Ser Glu Val Phe Val Lys Tyr Ser Gin Leu Ser 500 505 510 Lys His Pro Asn Asn Leu Trp Ile Gin Gly Val Gly Gly Ala Ser Phe 515 520 525 Ile Ser Gly Gly Asn Gly Thr Leu Tyr Gly Leu Asn Val Gly Tyr Asp 530 535 540 Arg Leu Val Lys Ser Val Ile Leu Gly Gly Tyr Val Ala Tyr Gly Tyr 545 550 555 560 Ser Gly Phe Asn Gly Asn Ile Met His Ser Leu Ala Asn 
Asn Val Asp WO 97/37044 PCTIUS97/05223 Val Gly Met Tyr Ala Arg Ala Phe Leu Ser Ala Asn Ser 610 Thr Thr 625 Lys Ser Gly Leu Phe Val Met Gly 690 Val Thr 705 Asn Val Glu Ile Leu Trp Asn 595 Leu Ser Val Ser Met 675 Leu Ala Val Phe Arg 580 Glu Leu Val Val Gly 660 His Glu Arg Arg Asn 740 Leu Thr Ser Asn Leu 645 Met Ser Ser Leu Phe 725 Thr Met Tyr Val Gly 630 Lys Lys Asn Arg Gly 710 Val Phe Tyr Gly Gly 600 Leu Asn 615 Asn Tyr Pro Gin Gly Lys Pro Ser 680 Lys Tyr 695 Arg Asp Gly Glu Ala Ser Val Asn 760 Asn Ile 775 585 Asn Gin Gly Val Met 665 Asn Phe Leu Asn Val 745 Ala 570 Lys Ala Arg Tyr Gly 650 Gin Glu Gly Leu Thr 730 Ile Gly Arg Ser Tyr Asp 635 Leu Asn Ser Lys Ile 715 Leu Thr Val Asn His Asn 620 Phe Ser Pro Val Asn 700 Lys Leu Gly Gly Glu Ile 605 Tyr Met Tyr Ala Leu 685 Ser Ala Tyr Gly Leu 765 Phe 590 Asn Asn Phe His Tyr 670 Thr Tyr Lys Arg Glu 750 Lys 575 Thr Ser Thr Lys Phe 655 Gin Leu Tyr Gly Lys 735 Met Met Leu Ser Trp Gin 640 Ile Gin Asn Phe Asp 720 Gly His Gly 755 Leu Gin 770 Ala Phe 785
Tyr Gln Asp Leu Thr Gly Asn Val Gly Met Arg Val INFORMATION FOR SEQ ID NO:765: SEQUENCE CHARACTERISTICS: LENGTH: 481 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...481 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:765: Phe Arg Lys Leu Ala Thr Ser Val Ser Leu Ile Ser Leu Leu Ile 5 10 Asn Ala Leu Tyr Ala Lys Glu Ile Ser Glu Ala Asp Lys Val Ile 25 Ala Thr Lys Glu Thr Lys Glu Thr Lys Lys Glu Ala Lys Arg Leu 40 Lys Glu Ala Lys Gln Arg Gln Gln Ile Pro Asp Asn Lys Lys Pro Met 1 Ser Lys Lys Gln Tyr Asn Asp Ala Gln 145 Ala Thr Asp Lys Glu 225 Asp Arg Ala Leu Lys 305 Lys Glu Ile Glu Lys 385 Lys Lys Asn Cys Leu 465 Lys Tyr Val Asp Thr Ser Ala Tyr Asn 115 Met Gly 130 Val Asn Lys Phe Phe Leu Lys Gly 195 Met Tyr 210 Val Cys Ser Val Leu Lys Thr Gln 275 Ile Ala 290 Leu Leu Leu Lys Leu Lys Pro Pro 355 Lys Glu 370 Arg Ser Thr Pro Ser Tyr Leu Tyr 435 Gly Tyr 450 Lys Tyr Ser Leu Leu 100 Asn Thr Gly Val Arg 180 Ala Phe Ser Thr Glu 260 Thr Asn Ala Asp Lys 340 Lys Asn Tyr Ile Tyr 420 Val Glu Asp Val Asp 70 Asn Val Lys Asn Tyr Leu Tyr Ala Tyr Tyr 150 Leu Lys 165 Val Pro Ser Ile Asn Tyr Pro Leu 230 Gln Lys 245 Thr Asn Ala Pro Ser Gln Glu Lys 310 Leu Glu 325 Lys Asn Thr Ser Tyr Asn Lys Gly 390 Asn Leu 405 Val Lys Lys Ile Ser Val Lys Gln 470 55 Asp Asn Lys Ser Asp 135 Asn Ile Lys Asp Ala 215 Arg Pro Asn Glu Leu 295 Glu Asn Thr Asp Gly 375 Thr Glu Ser Lys Gln 455 Thr Asp Thr Ile 120 Leu Ile Asn Arg Glu 200 Asn Asp Asn Ala Asn 280 Ile Lys Gln Lys Ser 360 Leu Leu Asp Asn Asn 440 Lys Lys Lys Tyr 105 Glu Ile Leu Asp Ser 185 Ser Asp Glu Ile Asn 265 Ser Ala Gln Lys Lys 345 Asp Leu Ile Leu Gly 425 Asp Leu Thr Ser 90 Leu Asn Ile Lys Lys 170 Asp Lys Val Met Ile 250 Glu Lys Asn Glu Lys 330 Pro Glu Val Ser Arg 410 Leu Pro Leu Gln 75 Phe Tyr Pro Ile Ala 155 Ile Pro Leu Ile Val 235 Ala Ala Glu Glu Thr 315 Leu Arg Thr Asp Glu 395 Ser Cys Tyr
Ser Ala 475 Ala Gly Ala Ile Thr 140 Leu Pro Asn Phe Cys 220 Ala Pro Gin Lys Glu 300 Glu Lys Val Met Lys 380 Asn Leu Tyr Lys Pro 460 Leu Asp Met Ile 125 Gly Asn Tyr Ala Glu 205 Arg Met Tyr Pro Leu 285 Glu Leu Ala Val Arg 365 Glu Ser Glu Ala Glu 445 Leu Phe Trp Asp 110 Lys Ser Lys Ala His 190 Gin Pro Pro Ser Ser 270 Ile Arg Ala Leu Glu 350 Val Thr Tyr Glu Asn 430 Gly Arg Asp Phe Leu Thr Leu Arg Gin 175 Thr Gin Asn Thr Leu 255 Pro Glu Glu Lys Glu 335 Val Ile Thr Ser Glu 415 Gly Met Asp Ile Gly Leu Arg Glu Asn 160 Ala Leu Lys Asp Asn 240 Tyr Tyr Glu Lys Tyr 320 Ala Pro Lys Ile Lys 400 Ile Ile Leu Lys Leu 480 Lys Leu Gin Lys Leu Leu Lys Asp INFORMATION FOR SEQ ID NO:766: WO 97/37044 PCTfUS97/05223 673 SEQUENCE CHARACTERISTICS: LENGTH: 59 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...59 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:766: Leu Leu Ser Arg Ser Met Pro Lys Ile His Ala Val Phe Lys Ala Phe 1 5 10 Ile Pro Ile Pro Phe Ala Leu Phe Ala Ile His Phe Val Val Leu Gly 25 Ile Gly Ser Val Phe Asn Leu Asn Arg Ile Lys Asp Lys Lys Phe Ile 40 Leu Arg Ala Lys Ile Ser His Ile Ala Gin Ala INFORMATION FOR SEQ ID NO:767: SEQUENCE CHARACTERISTICS: LENGTH: 263 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...263 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:767: Met Arg Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu 1 5 10 Ser Ser Val His Ala Ser Asn Ala Leu Ile Leu Gin Thr Asp Phe Ser 25 Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val 40 Asp Ser Asn Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr 55 Asn Ile Trp Glu Gly Ala Tyr Arg Leu Tyr Gin Thr Ala Ser Tyr Trp WO 97/37044 PCT/US97/05223 674 70 75 Pro Lys Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr 90 Asn Arg 
Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gln Tyr Phe Val 100 105 110 Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gln Thr Leu Gly Ile 115 120 125 Asp Ser Val Arg Glu Ile Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser 130 135 140 Glu Lys Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly 145 150 155 160 Ala Arg Leu Ala Ser Gly Ala Ile Thr Phe Glu Gln Val Gly Pro Glu 165 170 175 Leu Pro Ile Lys Val Val Glu Ile Pro Tyr Gln Lys Ala Lys Ala Thr 180 185 190 Lys Gly Gly Val Lys Gly Asn Ile Pro Ile Leu Asp Ile Gln Tyr Gly 195 200 205 Asn Val Trp Ser Asn Ile Ser Asp Lys Leu Leu Asn Gln Ala Gly Ile 210 215 220 Lys Arg Asn Asp Thr Val Cys Val Thr Ile Leu Lys Ile Pro Arg Asn 225 230 235 240 Asn Thr Lys Gly Lys Cys Arg Met Ser Arg Ala Leu Ala Met Cys Gln 245 250 255 Lys Ala Ser Arg Trp Ser Ile 260 INFORMATION FOR SEQ ID NO:768: SEQUENCE CHARACTERISTICS: LENGTH: 114 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:768: Met Asn Ile Lys Thr His Ser Ser Asn Glu Lys Glu Arg Phe Val Arg 1 5 10 Ile Glu Glu Asp Glu Lys Lys Glu Leu Phe Ala Glu Ala Thr Asn Glu 25 Asn Pro His Gly Leu Ser Leu Met Ala Leu Ile Gly Val Leu Val Phe 40 Gly Gly Ala Phe Leu Ala Leu Leu Val Pro Lys Ile Tyr Leu Ser Asn 55 Asn Ile Tyr Tyr Ile Ser Arg Lys Ile Asn Thr Leu Glu Asp Gln Lys 70 75 Arg Leu Leu Leu Glu Glu Gln Gln Ile Leu Lys Asn Glu Leu Glu Lys 90 Glu Arg Phe Lys Tyr Tyr Ile Glu Asn Ser Glu Asn Ile Gly Asp Ile 100 105 110 Ala Phe INFORMATION FOR SEQ ID NO:769: SEQUENCE CHARACTERISTICS: LENGTH: 941 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...941 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:769: Leu Gln His Lys Thr Ile Met Asp Lys Ile Ile Ile 1 Glu
Val Asp Ser Lys Ser Tyr Cys Ile 145 Ile Leu Asp Val Ser Asn Val Thr Tyr Ile Lys Leu Leu 130 Cys Lys Lys Glu Val 210 Ala Asn Phe Leu Ala Glu Asn Arg 115 Glu His Asp Gly Glu 195 Asp Val Leu Thr Tyr Arg Gly Pro 100 Leu Pro Leu Lys Tyr 180 Ile Arg Glu 5 Lys Gly Ala Gin Leu Arg Leu Ile Glu Lys 165 Val His Val Lys Asn Leu Glu Phe 70 Thr Ser Phe Ser Glu 150 Gly Arg Leu Val Ala Ile Ser Gly 55 Leu Pro Thr Ala Ser 135 Asn Ser Ala His Ile 215 Leu Phe Gly 40 Gin Asp Ala Val Arg 120 Met Ser Phe Phe Lys 200 Asn Lys Leu 25 Ser Arg Lys Ile Gly 105 Val Ser Lys Asn Val 185 Thr Ser Glu 10 Glu Gly Arg Val Ala 90 Thr Gly Ala Ile Asp 170 Asp Lys Glu Ser Ile Lys Tyr Gly 75 Ile Ile Glu Ser Ile 155 Lys Gly Lys Asn Tyr Pro Ser Leu Lys Asp Thr Gin Asp 140 Ile Leu Val His Ala 220 Gly Gin Lys Thr Glu Pro Gin Glu Phe 125 Ile Leu Glu Met Thr 205 Ser Glu Gly Asn Leu Ser Asn Lys Ile 110 Cys Ile Ala Ser Val 190 Ile Arg Leu Ala Gin Ala Leu Val Thr Tyr Pro Ser Pro Leu 175 Arg Glu Ile Glu Arg Phe Phe Ser Asp Thr Asp Thr Gin Ile 160 Arg Leu Ala Ala Val 240 225 230 235 Glu Ile Leu Gln Asp Asn Ala Pro Ser Ile Arg Lys His Tyr Ser Glu WO 97/37044 PCT/US97/05223 676 245 250 255 His Lys Ala Cys Phe Lys Cys Lys Met Ser Phe Glu Glu Leu Glu Pro 260 265 270 Leu Ser Phe Ser Phe Asn Ser Pro Lys Gly Ala Cys Glu Ser Cys Leu 275 280 285 Gly Leu Gly Thr Lys Phe Ser Leu Asp Ile Ser Lys Ile Leu Asp Pro 290 295 300 Asn Thr Pro Leu Asn Gln Gly Ala Ile Lys Val Ile Phe Gly Tyr Asn 305 310 315 320 Arg Ser Tyr Tyr Ala Gin Met Phe Glu Gly Phe Cys Thr Tyr Asn Gly 325 330 335 Ile Asp Ser Ala Leu Cys Phe Asn Glu Leu Asn Lys Glu Gin Gin Asp 340 345 350 Ala Leu Leu Tyr Gly Asn Gly Thr Glu Ile Ser Phe His Phe Lys Asn 355 360 365 Ser Pro Leu Lys Arg Pro Trp Lys Gly Ile Ile Gin Ile Ala Tyr Asp 370 375 380 Met Phe Lys Glu Gin Lys Asp Leu Ser Asp Tyr Met Ser Glu Lys Thr 385 390 395 400 Cys Ser Ser Cys Asn Gly His Arg Leu Lys Ala Ser Ser Leu Ser Val 405 410 415 Gin Val Ala Gly Leu Lys Met Ala Asp Phe Leu Thr Lys Pro Ile Glu 420 425 430 
Glu Val Tyr His Phe Phe Asn Asp Pro Thr His Phe Asn Tyr Leu Asn 435 440 445 Glu Gln Glu Lys Lys Ile Ala Glu Pro Ile Leu Lys Glu Ile Leu Glu 450 455 460 Arg Val Phe Phe Leu Tyr Asp Val Gly Leu Gly Tyr Leu Thr Leu Gly 465 470 475 480 Arg Asp Ala Arg Thr Ile Ser Gly Gly Glu Ser Gln Arg Ile Arg Ile 485 490 495 Ala Ser Gln Ile Gly Ser Gly Leu Thr Gly Val Leu Tyr Val Leu Asp 500 505 510 Glu Pro Ser Ile Gly Leu His Glu Lys Asp Thr Leu Lys Leu Ile Asn 515 520 525 Thr Leu Arg Asn Leu Gln Lys Lys Gly Asn Thr Leu Ile Val Val Glu 530 535 540 His Asp Lys Glu Thr Ile Lys His Ala Asp Phe Val Val Asp Ile Gly 545 550 555 560 Pro Lys Ala Gly Arg His Gly Gly Glu Val Val Phe Ser Gly Ser Val 565 570 575 Lys Asp Leu Leu Gln Asn Asn His Ser Thr Ala Leu Tyr Leu Asn Gly 580 585 590 Thr Lys Lys Ile Glu Arg Pro Lys Phe Glu Pro Pro Lys Glu Lys His 595 600 605 Phe Leu Glu Ile Lys Asn Val Asn Ile Asn Asn Ile Lys Asn Leu Ser 610 615 620 Val Gln Ile Pro Leu Lys Gln Leu Val Cys Ile Thr Gly Val Ser Gly 625 630 635 640 Ser Gly Lys Ser Ser Leu Ile Leu Gln Thr Leu Leu Pro Thr Ala Gln 645 650 655 Thr Leu Leu Asn His Ala Lys Lys Asn Gln Ser Leu Asn Gly Val Glu 660 665 670 Ile Val Gly Leu Glu Tyr Leu Asp Lys Val Ile Tyr Leu Asp Gln Ala 675 680 685 Pro Ile Gly Lys Thr Pro Arg Ser Asn Pro Ala Thr Tyr Thr Gly Val 690 695 700 Met Asp Glu Ile Arg Ile Leu Phe Ala Glu Gln Lys Glu Ala Lys Ile 705 710 715 720 Leu Gly Tyr Ser Thr Ser Arg Phe Ser Phe Asn Val Lys Gly Gly Arg 725 730 735 Cys Glu Lys Cys Gln Gly Asp Gly Asp Ile Lys Ile Glu Met His Phe 740 745 750 Leu Pro Asp Val Leu Val Gln Cys Asp Ser Cys Lys Gly Ala Lys Tyr 755 760 765 Asn Pro Gln Thr Leu Glu Ile Lys Val Lys Gly Lys Ser Ile Ala Asp 770 775 780 Val Leu Asn Met Ser Val Glu Glu Ala Tyr Glu Phe Phe Ala Lys Phe 785 790 795 800 Pro Lys Ile Ala Val Lys Leu Lys Thr Leu Ile Asp Val Gly Leu Gly 805 810 815 Tyr Ile Thr Leu Gly Gln Asn Ala Thr Thr Leu Ser Gly Gly Glu Ala 820 825 830 Gln Arg Ile Lys Leu Ala Lys Glu Leu Ser Lys Lys
Asp Thr Gly Lys 835 840 845 Thr Leu Tyr Ile Leu Asp Glu Pro Thr Thr Gly Leu His Phe Glu Asp 850 855 860 Val Asn His Leu Leu Gln Val Leu His Ser Leu Val Ala Leu Gly Asn 865 870 875 880 Ser Met Leu Val Ile Glu His Asn Leu Asp Ile Ile Lys Asn Ala Asp 885 890 895 Tyr Ile Ile Asp Met Gly Pro Asp Gly Gly Asp Lys Gly Gly Lys Val 900 905 910 Ile Ala Ser Gly Thr Pro Leu Glu Val Ala Gln Asn Cys Glu Lys Thr 915 920 925 Gln Ser Tyr Thr Gly Lys Phe Leu Ala Leu Glu Leu Lys 930 935 940 INFORMATION FOR SEQ ID NO:770: SEQUENCE CHARACTERISTICS: LENGTH: 83 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...83 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:770: Met Asn Tyr Asp Val Leu Met Gly Phe Leu Ala Leu Ile Leu Leu Ile 1 5 10 Leu Trp Tyr Ala Tyr Gly Leu Arg Gln Tyr Leu Lys Leu Lys Asp Lys 25 Asn Lys Arg Leu Lys Glu Lys Leu Gln Arg Cys Asn Cys Asn Ile Lys 40 Ile Pro Ser Ile Leu Glu Met Ala His Lys Pro Ile Ile Met Asp Ile 55 Lys Gly Glu Leu Leu Pro His Leu Thr Glu Ser Tyr Arg Lys Ser Lys 70 75 Phe Lys Glu INFORMATION FOR SEQ ID NO:771: SEQUENCE CHARACTERISTICS: LENGTH: 279 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...279 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:771: Met Arg Val Ile Ile Lys Phe Met Leu Ile Ser Leu Lys Thr Phe Leu 1 5 10 Lys Ile Leu Leu Lys Ile Phe Leu Lys Thr Phe Gln Lys Ile Trp Ile 25 Val Cys Val Val Ile Trp Gly Leu Gly Cys Ser Phe Leu Asn Ala Asn 40 Ser Val Gln Leu Glu Glu Thr Leu Arg Arg Asn Pro Lys Asn Leu Ile 55 Trp Gln His Phe Lys Lys Lys Phe Lys Lys Ser Asn Thr Ile Pro Tyr 70 75 Ala Pro Asn Ser Arg Trp Lys Tyr Leu Gly Thr Ser Ile Gly Ile Leu 90 Gly Val Ser Leu Val Ile Gly Ile Val Gly Leu Tyr Leu Met Pro Glu 100 105 110 Ser Val Thr Asn Trp Asp
Lys Glu Lys Phe Gly Val Lys Ser Trp Phe 115 120 125 Glu Asn Val Arg Met Gly Pro Lys Leu Asp Asn Asp Ser Phe Ile Phe 130 135 140 Asn Glu Ile Leu His Pro Tyr Phe Gly Ala Met Tyr Tyr Met Gin Pro 145 150 155 160 Arg Met Ala Gly Phe Ser Trp Met Thr Ser Ala Phe Phe Ser Phe Ile 165 170 175 Thr Ser Thr Leu Phe Trp Glu Tyr Gly Leu Glu Ala Phe Val Glu Val 180 185 190 Pro Ser Trp Gin Asp Leu Val Ile Thr Pro Leu Leu Gly Ser Ile Leu 195 200 205 Gly Glu Gly Phe Tyr Gin Leu Thr Arg Tyr Ile Gin Arg Asn Glu Gly 210 215 220 Lys Leu Phe Gly Ser Leu Phe Leu Gly Arg Leu Ala Ile Ala Leu Met 225 230 235 240 WO 97/37044 WO 9737044PCTIUS97/05223 679 Asp Pro Ile Gly Phe Ile Ile Arg Asp Leu Gly Leu 245 250 Gly Ile Tyr Asn Lys His Giu Ile Arg Ser Asn Leu 260 265 Leu Asn Leu Thr Tyr Lys Phe 275 INFORMATION FOR SEQ ID NO:772: SEQUENCE CHARACTERISTICS: LENGTH: 177 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .177 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:772: Met Lys Leu Phe Asn Pro Arg Leu Ile Val Phe Ile Gly Giu Ala Leu 255 Ser Pro Asn Gly 270 1 Leu Lys Gly Ser Ile Giu Gin Pro Gly 145 Val Arg Vali Thr Gin Leu Ser Lys Giu 115 Giu Ile Gin Gly Leu Thr Giu Ser Lys 100 Ile Gin Arg Gin 5 Phe Gly Asp Tyr Leu Leu Lys Giu Asn Gly 165 Ser Leu Giu Asn 70 Giu Asp Lys Giu Arg 150 Arg Val Asp Aia Ala Gly Ala Giu Leu 135 Leu Vali Ser Arg Lys Lys Ser Leu 105 Giu Lys Arg Phe 10 Leu Giy Asn Gin Phe 90 Leu Phe Asn Phe Arg 170 Leu Giy Lys Asn 75 Glu Glu Tyr Thr Giy 155 Cys Giu Leu Tyr Ile Leu Leu Ser Ile 140 Leu Asn Phe Thr Asn Leu Leu Leu Gin Val1 125 Leu Ala Cys Leu Gly Leu Leu Lys Giu His Leu Val1 Pro Ala 175 Leu Pro Leu Ala Asp Asp Ser Thr Ile Val 160 Leu INFORMATION FOR SEQ ID NO:773: SEQUENCE CHARACTERISTICS: LENGTH: 436 amino acids TYPE: amino acid WO 97/37044 PCT/US97/05223 680 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL 
SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...436 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:773: Met Met Lys Phe Phe Leu Leu Lys Lys Phe Ser Glu Phe Leu Asn Thr 1 5 10 Gln Thr His Phe Asn Leu Lys Arg Leu Asn Ala Ser Ser Phe Leu Leu 25 Glu Thr Phe Ser Lys Glu Lys His Ala Phe Val Val Asp Leu Ser Ala 40 Pro Tyr Ile Gly Leu Ser Lys Lys Pro Pro Glu Ser Val Leu Lys Asn 55 Thr Leu Ala Leu Asp Phe Cys Leu Asn Lys Phe Thr Lys Asn Ala Lys 70 75 Ile Leu Gin Ala Asn Val Ile Asp Asn Asp Arg Ile Leu Glu Ile Lys 90 Gly Ala Lys Asp Leu Ala Tyr Lys Ser Glu Thr Phe Ile Leu Arg Leu 100 105 110 Glu Met Ile Pro Lys Lys Ala Asn Leu Met Ile Leu Asp Gin Glu Lys 115 120 125 Cys Val Ile Glu Ala Phe Arg Phe Asn Asp Arg Val Ala Lys Asn Asp 130 135 140 Ile Leu Gly Ala Leu Pro Pro Asn Ile Tyr Glu His Gin Glu Glu Asp 145 150 155 160 Leu Asp Phe Lys Gly Leu Leu Asp Ile Leu Glu Lys Asp Phe Leu Ser 165 170 175 Tyr Gin His Lys Glu Leu Glu His Lys Lys Asn Gin Ile Ile Lys Arg 180 185 190 Leu Asn Ala Gin Lys Glu Arg Leu Lys Glu Lys Leu Glu Lys Leu Glu 195 200 205 Asp Pro Lys Thr Leu Gin Leu Glu Ala Lys Glu Leu Gin Thr Gin Ala 210 215 220 Ser Leu Leu Leu Thr Tyr Gin His Leu Ile Asn Arg Arg Glu Asn Arg 225 230 235 240 Val Ile Leu Lys Asp Phe Glu Asp Lys Glu Cys Met Ile Glu Ile Asp 245 250 255 Lys Ser Met Pro Leu Asn Ala Phe Ile Asn Lys Lys Phe Thr Leu Ser 260 265 270 Lys Lys Lys Lys Gin Lys Ser Gin Phe Leu Tyr Leu Glu Glu Glu Asn 275 280 285 Leu Lys Glu Lys Ile Ala Phe Lys Glu Asn Gin Ile Asn Tyr Val Arg 290 295 300 Asp Ala Ala Glu Glu Ser Val Leu Glu Met Phe Met Pro Val Lys Asn 305 310 315 320 Ser Lys Ile Lys Arg Pro Met Asn Gly Tyr Glu Val Leu Tyr Tyr Lys 325 330 335 WO 97/37044 PCT/US97/05223 681 Asp Phe Lys Ile Gly Leu Gly Lys Asn Gin Lys Glu Asn Ile Lys Leu 340 345 350 Leu Gin Asp Ala Arg Ala Asn Asp Leu Trp Met His Val Arg Asp Ile 355 360 365 Pro Gly Ser His Leu Ile Val Phe Cys Gin Lys Asn Thr Pro Lys Asp 370 375 380 Glu Val Ile Met Glu Leu Ala Lys Met Leu Ile 
Lys Met Gin Lys Asp 385 390 395 400 Ala Phe Asn Gly Tyr Glu Ile Asp Tyr Thr Gin Arg Lys Phe Val Lys 405 410 415 Ile Ile Lys Gly Ala His Val Ile Tyr Ser Lys Tyr Arg Thr Ile Ser 420 425 430 Leu Lys Asp Thr 435 INFORMATION FOR SEQ ID NO:774: SEQUENCE CHARACTERISTICS: LENGTH: 97 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...97 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:774: Met Gly Gin Thr Lys Glu Ile Ile Thr Thr Leu Leu Pro Leu Leu Val 1 5 10 Leu Phe Leu Ile Phe Tyr Phe Leu Ile Val Arg Pro Gin Arg Gin Gin 25 Gin Lys Lys His Lys Glu Met Ile Glu Gly Leu Thr Lys Gly Asp Lys 40 Ile Val Thr Gin Gly Gly Leu Ile Val Glu Val Leu Lys Ala Glu Ala 55 Asn Phe Phe Ser Val Lys Leu Asn Asp Asp Thr Thr Ala Lys Leu Ser 70 75 Lys Asn Tyr Val Ala Phe Lys Leu Asp Glu Glu Thr Thr Pro Asn Asn 90 Asn INFORMATION FOR SEQ ID NO:775: SEQUENCE CHARACTERISTICS: LENGTH: 206 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 PCT/US97/05223 682 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...206 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:775: Leu Lys Phe Gin Ile Val Ser Leu Leu Leu Ala Phe Leu Leu Ala Ser 1 Cys Ala Gly Lys Phe Phe Thr Asn Val 145 Ile Glu Leu Leu His Val Leu Met Lys Gin Gly 130 Lys Lys Lys Lys Pro Gin Val Lys Leu Leu Lys 115 Tyr Pro Thr Gly Asn 195 Pro Gly Ala Asp Leu Glu 100 Gly Asp Asp Pro Ala 180 Ala 5 Lys Gin Lys Pro Trp Pro Ile Phe Ser Arg 165 Asn Ser Gly His Ser Val His Asn 55 Lys Gly 70 Lys Asn Gly Phe Leu Gin Lys Asn 135 Lys Ile 150 Gly Phe Thr Pro Phe Lys His Arg 40 Glu Pro Arg Tyr Ser 120 Asn Val Leu Trp Asp 200 Ser 25 Thr Thr Leu Tyr Tyr 105 Ala Arg Leu Gly Ile 185 Ala 10 Gly Tyr Leu Phe Thr 90 Leu Pro Pro Pro Val 170 Glu Trp Leu Trp Lys Met 75 Leu Asp Gly Phe Ser 155 Phe Gly Glu Val Arg Glu Leu Ala Ser Tyr Phe 140 Val 
Leu Ser Leu Asn Lys Asn Gly Lys Phe Ser 125 Leu Glu Phe Leu Glu 205 Met Val Pro Ser Val Ser 110 Tyr Ala Leu Asp Asn 190 Arg Tyr Asp Lys Asn Gin Val Thr Phe Ser Asn 175 Leu Ile Arg Ile Arg Ser Glu Lys Glu Leu 160 Asn Lys INFORMATION FOR SEQ ID NO:776: SEQUENCE CHARACTERISTICS: LENGTH: 533 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...533 WO 97/370"4 PCTUS97/05223 683 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:776: Met Arg Lys Val Ile Ile Met Asn Gly Tyr Leu Arg Val Lys Thr Pro 1 Tyr Met Glu Va1 Lys Gly Ser Met Ile 145 Asp Gly Asn Asn Gly 225 Gly Gly Ala Tyr Tyr 305 Gly Pro Asp I Met Ser 385 Ser Phe Sex Gin Arg Lys Ile Va1 Ile 130 Glu Va1 Gly Gin Phe 210 Asn Lys Phe Ile Tyr 290 Ala Arg Asp Ser 370 kla Pro Leu Ala Lys Asn Thr Gin Arg 115 Leu Leu Ile Vai Ala 195 Val Gin Tyr Arg Tyr 275 Gin Tyr Ala I Arg I Ser 2 355 Gly C Lys 2 Cys AlE Lys PhE Sex Gly Ile 100 Gly Va1 Ala Lys Val 180 Ala Asp Met Ile Gln 260 Lys Tyr ksn Lys -ys 340 krg in ~sn ?rp 5 Ser Asp Ser Thr Asn Arg Phe Asn Ile Gly 165 Asn Glu Pro Leu Gly 245 Asn Ile Asn Arg Arg 325 Val Asp Asn Pro 2 Gin I 405 Val Lys Sex Sex 70 Leu Asp Gly Gly Phe 150 Gly Va1 Arg Lys Phe 230 Ile Ser Asn Ser Phe 310 Phe ly Phe Lys ksn 390 ?he Val SHis Sex 55 Sex Asn Ala Gly Ile 135 Pro Thr Ile Ile Glu 215 Asn Ser Pro Ala Tyr 295 Ile Gly Gly Gly Ile 375 Cys Phe Leu His 40 Ala Arg Ile Thr Gly 120 Pro Va1 Ser Thr Thr 200 Lys Thr Ala Thr Thr 280 His Asn Ile Asp Phe 360 Leu Gly Asp 2 Thl 25 Ph Prc Thi Gli Gi4 105 Gl) Ile Thr Vai Lys 185 Phe Gly Tyr Gin Lys 265 Asn Pro Glu Va1 Phe 345 Ser Pro Leu %sn 10 r Phe 2 Leu Ile Val 1 Asn 90 Thr Asn Tyr Phe Gin 170 Glu Trp Lys Gly Gly.
250 Val Thr Gly Arg Tyr 330 Lys I Asn C Phe I Tyr Ile 410 Trp Lys Sex Ile 75 Ala Gly Gly Gly Gin 155 Tyr Ile Gly Pro Arg 235 Asn Glm Phe Thr Pro 315 21n ?he 3In -ys er 195 ~rg Thl Lyc Trj Ser Leu Val His Ala 140 Ser Gly Pro Arg Leu 220 Thr Trp Asn Lys Leu 300 Asp Asn Thr Tyr Gly 380 Tyr Arg Phe Val Gin Asn I Gin Leu Ser 125 Pro Va1 Pro Lys Ser 205 Ala Ala Ile Tyr Ala 285 Ser Asn Tyr Tyr Gin 365 Lys C Ser I Ser As] Thl Se] Lys Asr Pro 110 Asn Tyr Asp Asn Glu 190 Ser Gin Gly Asn Leu 270 Tyr kia "ln ?he ?he 350 3er ly isp Tal n Ser r Thr Glu Glu Val Lys Thr Ser Arg Thr 175 Trp Asn Thr Met Gly 255 Leu Tyr Gin Asp Gly 2 335 Thr I Val Glu I Thr 4 Val I 415 Phe Thr Glu Leu Pro Ile Asn Asn Ile 160 Phe Glu Gly Leu Leu 240 Gin Asp 3ml Asp 3 iy 320 %sp iis ryr [le ~sn
LO
Asn Ala Phe Glu Pro Lys Leu Asn Leu Ile Val Asn Thr Gly Lys Val Lys 430 Gln Thr Phe Asn Met Gly Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg 435 440 445 Arg Ser Thr Thr Arg Lys Asn Pro Ser Met Pro Asn Asn Gly Ser Gly 450 455 460 Phe Asp Ala Gly Thr Ser Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val 465 470 475 480 Tyr Ala Ser Asp Glu Ile Asn Phe Asn Asn Gly Met Leu Thr Ile Thr 485 490 495 Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro 500 505 510 Pro Phe Lys Val Gly Gln Thr Pro Lys Thr Thr Lys Glu Arg Tyr Asn 515 520 525 Gln Trp Asn Pro Ala 530 INFORMATION FOR SEQ ID NO:777: SEQUENCE CHARACTERISTICS: LENGTH: 255 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...255 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:777: Met Lys Lys Ile Phe Leu Gly Met Ala Leu Ala Phe Ser Val Ser Met 1 5 10 Ala Glu Lys Ser Gly Ala Phe Leu Gly Gly Gly Phe Gin Tyr Ser Asn 25 Leu Glu Asn Gin Asn Thr Thr Arg Thr Pro Ser Ala Asn Asn Asn Thr 40 Pro Ile Asn Thr Ser Met Phe Gly Asn Asn Gin Ala Ala Pro Ala Gin 55 Glu Thr Pro Ser Val Ile Asn Thr Asn Asn Tyr Gly Gin Met Tyr Gly 70 75 Val Asp Ala Met Ala Gly Tyr Lys Trp Phe Phe Gly Lys Thr Lys Arg 90 Phe Gly Phe Arg Thr Tyr Gly Tyr Tyr Ser Tyr Asn His Ala Asn Leu 100 105 110 Ser Phe Val Gly Ser Lys Leu Gly Ile Met Asp Gly Ala Ser Gin Val 115 120 125 Asn Asn Phe Thr Tyr Gly Val Gly Phe Asp Ala Leu Tyr Asn Phe Tyr 130 135 140 Glu Ser Lys Glu Gly Tyr Asn Thr Ala Gly Leu Phe Val Gly Phe Gly 145 150 155 160 Leu Gly Gly Asp Ser Phe Ile Val Gin Gly Glu Ser Tyr Leu Lys Ser 165 170 175 WO 97/37044 PCT/US97/05223 Gin Met Gin Ile Cys Asn Asn Thr Ala Gly Cys Ser Ala Ser Met 180 185 190 Thr Ser Tyr Phe Gin Met Pro Val Glu Phe Gly Phe Arg Ser Asn 195 200 205 Ser Lys His Ser Gly Ile Glu Val Gly Phe Lys Leu Pro Leu Phe 210 215 220 Asn Gin Phe Tyr Lys Glu Arg Gly Val Asp Gly Ser Val Asp Val 225 230 235 Tyr Lys Arg Asn Phe Ser Ile Tyr Phe Asn Tyr Met Ile Asn Leu 245 250 255 INFORMATION FOR SEQ ID NO:778: SEQUENCE CHARACTERISTICS: LENGTH: 333 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...333 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:778: Asn Phe Thr Phe 240 Met 1 Val Glu Val Phe Ser Met Leu Glu Thr 145 Glu Phe Leu Ile Ala Arg Phe Lys Lys Ala Leu Ile Ser Tyr Ser Leu Glv Leu lie Ser Asn Asp Ser Ser His 130 Ile Val Ile Leu Val Gln Val Lys Ile Thr Trp Ile Val Ser Asp 100 Pro Asp 115 Ala Lys Val Glu Asp Ala Lys Glu 180 5 Ser Lys Ile Asp Lys His Leu Lys Val Ser 165 Arg Ser Asp Tyr Arg 70 Ala 
Val Val Phe Met 150 Lys Leu Leu Tyr Leu 55 Val Thr Ala Val Gly 135 Glu Lys Lys Leu Phe 40 Gly Val Leu Ala Thr 120 Ile Asp Leu Asn Gly 25 Gly Ser Gly Lys Leu 105 Phe Ser Ile Ala Val 185 10 Val Glu Phe Ile Asp 90 Asn Val Phe Asp Lys 170 Lys Ala Gin Ala Ser 75 Pro Val Gly Leu Ala 155 Met Lys Asn Thr Glu Asp Glu Glu Asn Ser 140 Gin Gin Lys Ala Ile Val Tyr Arg Leu Pro 125 Phe Ala Glu Lys Ser Lys Pro Ala Ile Leu 110 Lys Gin Lys Thr Gly 190 Asn Leu Ala Phe Lys Lys Ala Glu Ala Leu 175 Val Gin Pro Met Lys Pro Lys Val Lys Leu 160 Asp Glu Leu Phe His Lys Ala Asn Lys Ile Ser Gly His Gin Ala Leu Asp Ser 195 200 205 WO 97/37044 PCT/US97/05223 686 Asp Ile Leu Glu Lys Gly Gly Ile Asp Asn Phe Gly Leu Lys Tyr Val 210 215 220 Lys Phe Gly Arg Ala Asp Ile Ser Val Glu Lys Ile Val Lys Glu Asn 225 230 235 240 Pro Glu Ile Ile Phe Ile Trp Trp Ile Ser Pro Leu Ser Pro Glu Asp 245 250 255 Val Leu Asn Asn Pro Lys Phe Ser Thr Ile Lys Ala Ile Lys Asn Lys 260 265 270 Gin Val Tyr Lys Leu Pro Thr Met Asp Ile Gly Gly Pro Arg Ala Pro 275 280 285 Leu Ile Ser Leu Phe Ile Ala Leu Lys Ala His Pro Glu Ala Phe Lys 290 295 300 Gly Val Asp Ile Asn Ala Ile Ile Lys Asp Tyr Tyr Lys Val Val Phe 305 310 315 320 Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His 325 330 INFORMATION FOR SEQ ID NO:779: SEQUENCE CHARACTERISTICS: LENGTH: 54 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...54 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:779: Met Lys Lys Gin Ile Leu Thr Gly Val Leu Leu Ser Val Leu Ala Val 1 5 10 Ser Ser Ala Tyr Ala His Lys Asp Lys Lys Asp Ala Lys Lys Pro Glu 25 Leu Ser Ser Gin Leu Val Ala His Lys Asp Lys Lys Asp Ala Lys Lys 40 Pro Lys Asn Ser Val Ala INFORMATION FOR SEQ ID NO:780: SEQUENCE CHARACTERISTICS: LENGTH: 142 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 687 (vi) ORIGINAL 
SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...142 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:780: Met Tyr Lys Leu Gly Ile Phe Leu Leu Ala Thr Leu Leu Ser Ala Asn 1 5 10 Thr Gln Lys Val Ser Asp Ile Ala Lys Asp Ile Gln His Lys Glu Thr 25 Leu Leu Lys Lys Thr His Glu Glu Lys Asn Gln Leu Asn Ser Arg Leu 40 Ser Ser Leu Gly Glu Ala Ile Arg Ser Lys Glu Leu Gln Lys Val Glu 55 Ile Glu Arg Gln Met Val Ala Leu Lys Lys Ser Leu Glu Lys Asn Arg 70 75 Asn Glu Ser Leu Val Gln Glu Lys Val Leu Thr Asn Tyr Arg Lys Ser 90 Leu Asp His Leu Gln Lys Gln Arg Ser Phe Leu Gln Lys Arg Val Phe 100 105 110 Asp Thr Leu Leu Glu Asp Phe Leu Phe Ser Gln Ala Leu Lys Gly Gln 115 120 125 Asn Leu Ala Ser Ser Asn Asp Val Ile Leu Leu Ser Gly Val 130 135 140 INFORMATION FOR SEQ ID NO:781: SEQUENCE CHARACTERISTICS: LENGTH: 197 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...197 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:781: Met Asp Lys Asn Asn Asn Asn Asn Leu Arg Leu Ile Leu Ala Ile Ala 1 5 10 Leu Ser Phe Leu Phe Ile Ala Leu Tyr Ser Tyr Phe Phe Gln Glu Pro 25 Asn Lys Thr Thr Thr Glu Thr Thr Lys Gln Glu Thr Thr Asn Asn His 40 Thr Ala Thr Ser Pro Thr Ala Ser Asn Thr Ile Thr Gln Asp Phe Ser 55 Val Thr Gln Thr Ile Pro Gln Glu Ser Leu Leu Ser Thr Ile Ser Phe 70 Glu His Ala Lys Ile Glu Ile Asp Ser Leu Gly Arg Ile Lys Gln Val 90 Tyr Leu Lys Asp Lys Lys Tyr Leu Thr Pro Lys Glu Lys Gly Phe Leu 100 105 110 Glu His Val Ser His Leu Phe Ser Ser Lys Glu Asn Ser Gln Pro Ser 115 120 125 Leu Lys Glu Leu Pro Leu Leu Ala Ala Asp Lys Leu Lys Pro Leu Glu 130 135 140 Val Arg Phe Leu Asp Pro Thr Leu Asn Asn Lys Ala Phe Asn Thr Pro 145 150 155 160 Tyr Ser Ala Ser Lys Thr Thr Leu Gly Pro Asn Glu Gln Leu Val Leu 165 170 175 Thr Gln Asp Leu Gly Thr Leu Ser Ile Ile Lys Thr Leu Thr Phe Tyr 180 185 190 Asp Asp Leu His Tyr 195
INFORMATION FOR SEQ ID NO:782: SEQUENCE CHARACTERISTICS: LENGTH: 259 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...259 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:782: Met Ala Cys Trp His Lys Arg Leu Ala Val Gly Cys Cys Ile Val Leu 1 5 10 Leu Ser Cys Val Met Ser Ala Asn Asn Val Ser Ile Val Arg Asp Asp 25 Pro Pro Leu Asp Pro Thr Leu Pro Ala Trp Ile Tyr Ser Val Ala Leu 40 Leu Lys Val Tyr Phe Ser Asp Gly Thr Tyr Lys Glu Gly Tyr Ala Thr 55 Leu Leu Glu Asn Gly Arg Tyr Ile Ala Ser Ser Glu Thr Leu Tyr Ser 70 75 Asn Gly Leu Tyr Pro Lys Met Ile Leu Ala Lys Met Gin Asp Ser Ser 90 Ala Lys Glu Leu Ile Cys Ile Ala Ser Leu His Leu Glu Ala Met Asp 100 105 110 Arg Asp Gin Gly Leu Ser Leu Leu Lys Thr Ala Asp Phe Arg Asp Asp 115 120 125 Tyr Cys His Lys Arg Glu Glu Ser Tyr Tyr His Ala Arg Ile Tyr Ala 130 135 140 Lys Tyr Ala Gin Thr Phe His Ser Asn Pro Tyr Thr Asn Gin Lys Thr WO 97/37044 PCTIUS97/05223 145 Pro Thr Ser Ile Lys Phe Gly Gly 210 Ser Ser 225 Lys Ile Asn Ile Ser Gin Leu 195 Arg Thr Ala Pro Asp Thr 180 Ser Pro Leu Arcg Leu 165 Thr Leu Tyr Glu Phe 245 150 Tyr Asp Asp Phe Asn 230 Leu Tyr Ile Ala Ser 215 Gin SeL-r Pro Ser Ser 200 Glu Giu Ala Ala Leu 170 Val Ala 185 Phe Lys Val Gly Ser Leu Leu Lys 250 155 Asn Glu Lys Glu Val1 235 Asn Glu Gly LeU Leu Gly Ser 205 Phe Met 220 Ile Ile Gin Asn Asn Ser 175 Lys Ser 190 Val Leu Gly Met Pro Lys Ile Phe 255 160 Phe Lys Trp Ala Glu 240 Pro INFORMATION FOR SEQ ID NO:783: Ci) SEQUENCE
CHARACTERISTICS: LENGTH: 668 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc -feature LOCATION .668 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:783: Met Lys Asn Gin His Lys Asn Pro Leu Thr Lys Ala Leu Met Lys 1 r Thr Tyr Leu Lys Val Arg Arg Lys Lys Giu 145 Pro Leu Giu Leu Asn Asp Ile Gly 130 Giu Tyr Gly Ile Gly Asn Asni Pro Glu Tyr Asn Leu Asp Lys Ala Gin 100 Pro Asn Gin His Leu Ala Arg Thr Ser Asn Ala Lys Phe Ser Asn Val1 70 Asn Phe Ala Glu Leu Leu Pro Leu 55 Phe Ser Pro Lys Glu 135 Phe Ala 40 Leu Lys Ile Leu Ile 120 Val Phe 25 Tyr Asn Val1 Asp Thr 105 Lys Ala 10 Cys Ala Gly Giu Ile 90 Ser Lys Lys Phe Leu Ala Ala 75 Glu Ser Sen Ile Ile Sen Ile His Ile Ala Thr Leu 140 Thr Leu Ile Glu Gly Thn Lys Ile 125 Gly Lys Gly Ile Sen Phe Ser Thr 110 Leu Val Ala Ala Thr Arg Tyr Leu Sen Val Sen Phe Thn Val1 Phe Leu Leu Leu Lys Asn 160 Sen Glu Asn Ile Al a Gin 150 155 Asp Pro Met Tyr Ala Asn Thr Pro Phe Ser Asn Gly Ser Asp Sen WO 97/37044 PCT/US97/05223 690 165 170 175 Phe Tyr Asp Asn Asn Pro Asn Ser Pro Ser Asn Asn Ala Ile Asn Gly 180 185 190 Lys Asp Gly Ala Asn Gly Ser Asn Gly Tyr Gly Ala Asn Gly Asn Asp 195 200 205 Gly Val Asn Gly Ile Ser Gly Ser Asn Gly Ala Asn Gly Ser His Ser 210 215 220 Asn Asn Asn Ala Ile Gly Ser Gly Ile Asp Thr Asp Gly Val Leu Gly 225 230 235 240 Val Asp Gly Val Asn Gly Ser Ser Ser Ser Ser Gly Gly Ser Val Gly 245 250 255 Gly Tyr Glu Asn Asn Phe Thr Asn His Gly Ser Thr Asn Asn Asn Thr 260 265 270 Gly Gly Tyr Asp Asn Phe Asn Asn Gly Ser Ser Ser Gly Gly Ser Leu 275 280 285 Gly Asn Gly Gly Leu Phe Pro Ile Pro Phe Gly Asn Gly Asp Thr Asn 290 295 300 Asn Ser Asn Asn Ser Thr Asn Thr Thr Ser Pro Thr Asn Gly Ser Ser 305 310 315 320 Ser Asn Asn Ala Thr Asn Pro Ser Ser Gin Glu Asn Asn Tyr Ser Ser 325 330 335 Gin Tyr Cys Lys Val Pro Glu Leu Ser Pro Asn Asn Thr Met Lys Leu 340 345 350 Asp Val Ile Ala Lys Asp Gly Ser Cys Ile Ser Met Asn Ala Leu Arg 355 360 365 Asp Asp Thr Lys Cys Ala Tyr Arg Tyr Asp Phe Glu Ala Gly Lys Ala 370 375 380 Ile Lys Gin Thr Gin Tyr Tyr Tyr Val Asp Arg Glu Asn Lys 
Thr Gin 385 390 395 400 Asn Ile Gly Gly Cys Val Asp Leu Gin Gly Ala Gin Tyr Ala Met Gin 405 410 415 Leu Tyr Lys Asp Asp Ser Lys Cys Ala Leu Gin Thr Thr Ser Asp Lys 420 425 430 Gly Tyr Gly Met Gly Lys Thr Gin Thr Phe Gin Thr Glu Ile Val Phe 435 440 445 Arg Gly Met Asp Asn Leu Ile His Val Ala Val Pro Cys Ser Asp Tyr 450 455 460 Ala Arg Val Gin Asp Arg Ile Val Arg Tyr Glu Lys Asn Asp Lys Thr 465 470 475 480 Gin Thr Leu Thr Pro Ile Val Asp Gin Tyr Tyr Asn Asp Pro Asn Asn 485 490 495 Pro Asn Lys Gin Glu Ile Leu Asn Arg Gly Ile Ala Thr Gin Leu Ser 500 505 510 Ser Gin Tyr Gin Glu Phe Ala Cys Gly Gin Trp Glu Tyr Asn Asp Ala 515 520 525 Lys Leu Glu Ala Lys Arg Pro Thr Met Leu Lys Ser Tyr Asn Lys Leu 530 535 540 Asn Gly Glu Trp Val Glu Val Thr Pro Cys Asn Phe Glu Ala Gly Ile 545 550 555 560 Lys Ser Gly Ala Val Val Ser Pro Tyr Val Met Gly Val Pro Ser Ser 565 570 575 Lys Val Leu Ser Asp Ile Thr Thr Ser His Tyr Phe Arg Ile Glu Arg 580 585 590 Lys Asn Tyr Gly Glu Arg Glu Gin Cys Gin Lys Leu Tyr Gly Val Asn 595 600 605 Arg Cys Gin Pro Gin Tyr Ser Ile Leu Ile Leu Val Ser Pro Ile Gly 610 615 620 WO 97/37044 PCT/US97/05223 691 Ala Pro Leu Thr Lys Pro Leu Pro Pro Lys Pro Leu Asn Leu Ile Tyr 625 630 635 640 Ala Gin Pro Lys Ile Met Lys Asn Thr Pro Gin Pro Ile Ile Leu Ser 645 650 655 Pro Leu Lys Pro Pro Ser Thr Gly Leu Lys Ala Phe 660 665 INFORMATION FOR SEQ ID NO:784: SEQUENCE
CHARACTERISTICS: LENGTH: 328 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...328 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:784: Met Thr Tyr Lys Glu Arg Leu Ile His Glu Lys Ile Leu Asn Gin Asn 1 5 10 Asp Lys Gly Phe Lys Thr Glu Leu Arg Ile Leu Ser Val Phe Ile Val 25 Glu Phe Leu Val Asn Ile Leu Gly Phe Met Leu Ala Lys Met Pro His 40 Phe Trp Phe Leu Arg Cys Val Lys Ala Leu Ala Trp Leu Met Lys Thr 55 Phe Asp Arg Arg Arg Tyr Phe Asp Ala Lys Ala Asn Leu Asp Phe Val 70 75 Phe Gly Asp Ser Lys Ser Glu Glu Glu Lys Lys Arg Ile Ile Lys Lys 90 Gly Tyr Glu Asn Phe Ala Phe Ile Ile Leu Glu Thr Ile Arg Val Ile 100 105 110 Phe Ile Pro Lys Asp Glu Tyr Asp Ala Arg Phe Thr Leu Ile Asn Glu 115 120 125 Glu Asn Val Trp Lys Ser Leu Asn Lys Glu Gly Gin Ala Ile Thr Leu 130 135 140 Cys Met His Phe Gly Tyr Trp Glu Ala Val Gly Thr Thr Leu Ala Gin 145 150 155 160 Tyr Tyr Glu Asn Tyr Gly Arg Gly Cys Leu Gly Arg Leu Thr Lys Phe 165 170 175 Ala Pro Ile Asn His Met Ile Met Ser Arg Arg Glu Ala Phe Gly Val 180 185 190 Arg Phe Val Asn Lys Ile Gly Ala Met Lys Glu Leu Ile Lys Met Tyr 195 200 205 Asn Gin Gly Asn Gly Leu Val Gly Ile Leu Val Asp Gin Asn Val Val 210 215 220 Pro Lys Asp Gly Val Val Val Lys Phe Phe Asn Lys Asp Ala Thr Thr 225 230 235 240 WO 97/37044 PCT/US97/05223 692 Thr Thr Ile Ala Ser Ile Leu Ser Arg Arg Tyr Asn Ile Asp Ile 245 250 255 Pro Val Phe Ile Asp Phe Asn Asp Asp Tyr Ser His Tyr Thr Ala 260 265 270 Tyr Tyr Pro Ser Ile Arg Ser Gin Ile Thr Asp Asn Ala Gin Asn 275 280 285 Ile Leu Glu Cys Thr Gin Ala Gin Ala Ser Leu Cys Glu Glu Val 290 295 300 Arg Asn His Pro Glu Ser Tyr Phe Trp Phe His Arg Arg Phe Lys 305 310 315 Thr His Pro Glu Ile Tyr Gin Arg 325 INFORMATION FOR SEQ ID NO:785: SEQUENCE CHARACTERISTICS: LENGTH: 138 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...138 Gin Thr Asp Ile Ser 320 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:785: Met Leu Tyr Leu Arg Lys Glu Asn Gly Val Arg 1 Gly Lys Asp Leu Leu Ser Val Ser Ile Pro Lys Phe Met Phe His Tyr 130 Leu Leu Arg Gly Ser Ala Ile 115 Met Leu Val Thr Glu Lys Lys 100 Lys Ala 5 Ser Val Tyr Ser Phe Tyr Asn Leu 70 His Met Asn Tyr Leu Pro Thr Met Leu Ala Lys 55 Leu Lys Glu Glu Val 135 Asn Gly 40 Arg Leu Gly Lys Gly 120 Met Gly 25 Ser Ala Phe Pro Ala 105 Ala Arg 10 Asp Gly Phe Val Leu 90 Val Pro Phe Asp Ile Ala Lys 75 Glu Asn Ser Thr Leu Ile Phe Leu Asn Gly Asn Leu Arg Gly Thr Lys Arg Cys Phe 125 Ile Ser Leu Tyr Ile Asp Met Lys His Ser His His Gin Lys 110 Gin Ser Leu Ser Ile Ser Ala His Tyr Gly INFORMATION FOR SEQ ID NO:786: SEQUENCE CHARACTERISTICS: LENGTH: 328 amino acids TYPE: amino acid WO 97/37044 PCT/US97/05223 693 TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...328 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:786: Met Arg Lys Val Leu Tyr Ala Leu Met Gly Phe Leu Leu Val Phe Ser 1 5 10 Ala Leu Lys Ala Asp Asp Phe Leu Glu Glu Ala Asn Glu Thr Ala Pro 25 Ala Asn Leu Asn His Pro Met Gln Asp Leu Asn Ala Ile Gin Gly Ser 40 Phe Phe Asp Lys Asn Arg Ser Lys Met Ser Asn Thr Leu Asn Ile Asp 55 Tyr Phe Gin Gly Gin Thr Tyr Lys Ile Arg Leu Arg Tyr Ala Met Ala 70 75 Thr Leu Leu Phe Phe Ser Lys Pro Ile Ser Asp Phe Val Leu Gly Asp 90 Lys Val Gly Phe Asp Ala Lys Ile Leu Glu Ser Asn Asp Arg Ile Leu 100 105 110 Leu Ile Lys Pro Leu Gin Ile Gly Val Asp Ser Asn Ile Ser Val Ile 115 120 125 Asp Ser Glu Gly Lys Ile Phe Ser Phe Tyr Val Phe Ser Thr Thr Phe 130 135 140 Thr Ser Ser Lys His Pro Asn Leu Gin Val Phe Ile Glu Asp Lys Asn 145 150 155 160 Tyr Tyr Thr Asn Ala Phe Ile Lys Pro Gin Lys Glu Asn Gin Glu Asn 165 170 175 Met Ser Glu Asn Ala Pro Lys Asp Ala Gin Lys Asn Asn Lys Pro Leu 180 185 190 Lys Glu Glu Lys Glu Glu 
Thr Lys Glu Lys Glu Glu Glu Thr Ile Ile 195 200 205 Ile Gly Asp Asn Thr Asn Ala Met Lys Ile Ile Lys Lys Asp Ile Gin 210 215 220 Lys Gly Tyr Lys Ala Leu Lys Ser Ser Gin Arg Lys Trp Tyr Cys Leu 225 230 235 240 Trp Ala Cys Ser Lys Lys Ser Lys Leu Ser Leu Met Pro Lys Glu Ile 245 250 255 Phe Asn Asp Lys Gin Phe Thr Tyr Phe Lys Phe Asp Lys Arg Leu Ala 260 265 270 Leu Ser Lys Phe Pro Val Ile Tyr Lys Val Val Asp Gly Tyr Asp Asn 275 280 285 Pro Val Asn Thr Arg Ile Val Gly Asp Tyr Ile Ile Ala Glu Asp Val 290 295 300 Ser Thr Lys Trp Thr Leu Arg Leu Gly Lys Asp Tyr Leu Cys Ile Arg 305 310 315 320 Phe Val Lys Arg Arg Lys Gly Glu WO 97/37044 PCTIS97/05223 694 INFORMATION FOR SEQ ID NO:787: SEQUENCE CHARACTERISTICS: LENGTH: 335 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...335 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:787: Val Met Phe His Lys Ala Leu Ile Thr Phe Ile Val Leu Trp Phe Phe 1 5 10 Leu Asn Gly Leu Gly Ala Tyr Asp Phe Lys His Cys Gin Ala Phe Phe 25 Lys Lys Ala Ser Leu Gin Lys Gly Gly Val Ala Leu Lys Glu Leu Pro 40 Lys Gly Val Tyr Leu Tyr Tyr Ser Lys Thr Tyr Pro Lys His Ala Lys 55 Val Ile Lys Ser Asp Pro Phe Ile Gly Leu Tyr Leu Leu Gin Ser Ala 70 75 Pro Ser Glu Tyr Val Tyr Thr Leu Arg Asp Leu Asp Lys Asp Ala Leu 90 Ile Arg Pro Met Ala Ser Ile Gly Ala Asn Gin Ala Thr Glu Ala Arg 100 105 110 Leu Leu Val Gly Gin Lys Gly Tyr Asp Arg Tyr Ala Gin Ile Ser Gin 115 120 125 Lys Thr Gin Lys Asn Gly Val Ile Ser Asn Ile Cys Tyr Gin Met Leu 130 135 140 Gly Leu Gly Val Gly Gly Asn Gly Phe Ile Glu Thr Lys Phe Ile Lys 145 150 155 160 Arg Phe Leu Asn Gin Gin Glu Pro Tyr Tyr Gly Asp Ile Gly Val Arg 165 170 175 Leu Glu Glu Arg His Lys Arg Leu Val Val Ala Gin Phe Asp Pro Phe 180 185 190 Phe Pro Lys Asn Pro Phe Leu Lys Asn Asp Glu Ile Leu Ala Ile Asn 195 200 205 Asp Tyr Lys Ile His Ser Leu Ala Glu Phe Glu Trp Val Val Ser Asn 210 215 220 Leu Ser 
Tyr Gin Ser Leu Ala Lys Val Lys Ile Lys Arg Asn His Gin 225 230 235 240 Ile Lys Glu Val Thr Leu Lys Val Asn Lys Arg Tyr Gly Gly Phe Leu 245 250 255 Leu Lys Asp Thr Phe Leu Glu Arg Tyr Gly Ile Ala Leu Asp Glu Arg 260 265 270 Phe Ile Ile Thr Lys Ile Gly Ala His Leu Pro Lys Gly Leu Asp Phe 275 280 285 WO 97/37044 PCT/US97/05223 Leu Lys Leu Gly Asp Arg Ile Leu Trp Val Asn His Arg Ser Val Ser 290 295 300 Phe Asn Pro Lys Ala Leu Arg Glu Ala Leu Ser Thr Pro Lys Ile Glu 305 310 315 320 Leu Leu Val Trp Arg Gin Gly Phe Glu Phe Tyr Ile Lys Val Arg 325 330 335 INFORMATION FOR SEQ ID NO:788: SEQUENCE CHARACTERISTICS: LENGTH: 250 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...250 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:788: Met 1 Ala Val Lys Val Leu Gly Val Glu Tyr 145 Glu Tyr Ser Val Val 225 Leu Gly Ser Val Lys Lys Thr Leu Phe Gly Val Leu Cys Ser Leu Ala His Phe Leu Lys Lys Tyr Gly Ser Lys 115 Leu Asn 130 Gly Val Lys Ala Met Tyr Lys Ala 195 Met Gin 210 Glu Asn Leu Gly Phe Gly Ala Cys 100 Asp His Gly Cys Gly 180 Cys Tyr Phe 5 Arg Ile Glu Ala Ile Arg Ala Ala Thr Asp 165 Val Glu Asn Lys Gly Glu Lys Phe 70 Gin Leu Lys Glu Pro 150 Leu Ala Leu Ala Lys 230 Leu Ser Ala 55 Tyr Phe Leu Lys Gly 135 Lys Lys Lys Lys Gin 215 Gly Met Val 40 Cys Glu Tyr Gly Ala 120 Cys Asp Asp Asn Asp 200 Gly Cys Ala 25 Lys Glu Glu Thr Asn 105 Ser Thr Leu Ser Phe 185 Gly Thr Lys 10 Glu Lys Leu Gly Lys 90 Leu Gin Val Arg Pro 170 Lys Arg Ala Ser Pro Gin Lys Lys 75 Gly Tyr Tyr Leu Lys 155 Gly Glu Gly Lys Ser 235 Asp Asp Glu Gly Cys Tyr Tyr Gly 140 Ala Cys Ala Cys Asp 220 Val Leu Ala Phe Gly Val Glu Asn Ser 125 Ser Leu Ile Ile Tyr 205 Glu Lys Cys Lys Ala Phe Gly Leu Gly 110 Lys Leu Asp Asn Val 190 Asn Lys Glu Leu Glu Gin Gly Lys Asn Gin Ser His Leu Ala 175 Arg Leu Gin Ala Gly Leu Ala Cys Asp Asp Gly Cys His Tyr 160 Gly Tyr Gly Ala Cys 240 WO 97/37044 PCT/US97/05223 
696 Asp Ala Leu Lys Glu Leu Lys Ile Glu Leu 245 250 INFORMATION FOR SEQ ID NO:789: SEQUENCE CHARACTERISTICS: LENGTH: 123 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...123 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:789: Met Arg Leu Leu Ile Ala Leu Val Leu Phe Leu Trp Trp Leu Asn Leu 1 5 10 Gly Ala Lys Glu Ala Asp Phe Ile Ser Asp Trp Glu Tyr Gly Leu Ala 25 Leu Tyr Lys Asn Pro Arg Gly Val Ala Cys Ala Lys Cys His Gly Ile 40 Lys Gly Glu Gin Gin Glu Ile Thr Phe Tyr Tyr Glu Lys Gly Glu Lys 55 Lys Ile Leu Tyr Ala Pro Lys Ile Asn His Leu Asp Phe Lys Thr Phe 70 75 Lys Asp Ala Leu Ser Leu Gly Lys Gly Met Met Pro Lys Tyr Asn Leu 90 Asn Leu Glu Glu Ile Gin Ala Ile Tyr Leu Tyr Ile Thr Ser Leu Gly 100 105 110 His Lys Asp Glu Arg Lys Asp Pro Ser Lys Pro 115 120 INFORMATION FOR SEQ ID NO:790: SEQUENCE CHARACTERISTICS: LENGTH: 180 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...180 WO 97/37044 PCT/US97/05223 697 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:790: Met Leu Asn Lys Phe Lys Lys Ile Val Gly Val Gly 1 Cys Glu Arg Ala Ile Tyr Lys Ile Asn 145 Glu Cys Leu Gin Glu Arg Tyr Leu Ala Ile 130 Ala Thr Val Gly Arg Asn Ala Asp Asp Lys 115 Asp Phe Ile Gly Val Asp Val Ile Tyr Lys 100 Asn Asp Glu Leu Phe 180 5 Leu Ala Lys Lys Glu Tyr Gly Lys Asn Lys 165 Gin Leu Ile Ser 70 Ser Pro Asn Ile Asn 150 Ala Ala Asn Ala 55 Val Asn Asn Tyr Val 135 Tyr Lys Lys Ser 40 Ile Ala His Thr Tyr 120 Phe Glu Ser Asn 25 Leu Tyr Ser Asn Lys 105 Gly Leu Val Tyr 10 Ser Ile Ser Arg Asn 90 Val Ile Gly Leu Tyr 170 Leu Ser Phe Gly 75 Lys Cys Met Ser Leu 155 Gin Phe Gly Thr Ile Gin Leu His Ala 140 Lys Lys Val Val Ile His Lys Ser Leu Gin 125 Asn Thr Met Leu Leu Ser Arg Val Thr Lys 110 Lys Trp Asp Leu Val Pro Ser Asp Gin Ile Gly Val 
Ser Asp Glu 175 Gly Tyr Ala Ile Ile Gly Leu Ala Lys Thr 160 Gly INFORMATION FOR SEQ ID NO:791: SEQUENCE CHARACTERISTICS: LENGTH: 140 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...140 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:791: Met Arg Arg Ser Leu Ala Phe Cys Leu Leu Ala Leu Leu Gly Leu Gin 1 5 10 Val Leu Gly Ala Arg Asp Phe Ser Gin Leu Lys Asn Glu Glu Leu Leu 25 Lys Leu Ala Gly Thr Leu Pro Ser Asn Glu Ala Ile Asp Tyr Arg Met 40 Glu Val Ser Lys Arg Leu Lys Ala Leu Ser Ala Glu Asp Ala Lys Lys 55 Phe Arg Ala Asn Phe Ser Arg Ile Ala Arg Lys Asn Leu Ser Lys Met WO 97/37044 PCT/US97/05223 698 70 75 Ser Glu Glu Asp Phe Lys Lys Met Arg Glu Glu Val Arg Lys Glu Leu 90 Glu Glu Lys Thr Lys Gly Leu Ser Ala Glu Glu Ile Lys Ala Lys Gly 100 105 110 Leu Asn Val Ser Val Cys Ser Gly Asp Thr Arg Lys Val Trp Cys Arg 115 120 125 Ala Val Lys Lys Lys Asp Glu His Cys Ser Pro Lys 130 135 140 INFORMATION FOR SEQ ID NO:792: SEQUENCE CHARACTERISTICS: LENGTH: 152 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...152 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:792: Met Lys Lys Ala Leu Lys Ile Leu Ser Val Gly Ala Leu Leu Phe Val 1 5 10 Ala Leu Asn Ala Lys Asp Phe Ser Lys Thr Ser Asp Glu Asp Leu Ala 25 Lys Met Ala Gly Val Val Ala Pro Gin Asp Ile Val Asp Tyr Thr Lys 40 Glu Leu Lys Lys Arg Met Glu Lys Met Pro Glu Asp Lys Arg Lys Ala 55 Phe His Lys Gln Leu His Glu Tyr Ala Thr Lys Asn Thr Asp Lys Met 70 75 Thr Val Ala Asp Phe Glu Ala Arg Gin Lys Ala Ile Lys Glu Ala Leu 90 Lys Lys Gly Asn Met Glu Asp Met Asp Asp Asp Phe Gly Leu Arg Ser 100 105 110 Cys Lys His Gly Lys Lys His Lys His Asp Lys His Gly Lys Lys His 115 120 125 Gly Lys Lys His Asp Lys Asp His Asp Asp Lys Asp His Asp His His 130 135 140 Asp Glu Asp His 
Ser Asp Lys His 145 150
INFORMATION FOR SEQ ID NO:793: SEQUENCE CHARACTERISTICS: LENGTH: 140 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...140 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:793: Met Leu Lys Lys Leu Leu Leu Ile Ser Leu Phe Leu Gly Phe Leu Arg 1 5 10 Ala Glu Gly Glu His Tyr Glu Ile Ile Ala Glu Leu Ser Lys Ala Phe 25 Leu Lys Ala Lys Glu Val Leu Thr Ala Ile Asn Lys Thr Cys Ile Glu 40 Thr Gly His Asp Arg Thr Gln Ile Arg Leu Gln Asn Asp Phe Leu Glu 55 Asn Leu Ser Gln Thr Glu Gln Gln Phe Asp Asp Tyr Phe Glu Lys Asp 70 75 Phe Lys Ser Val Gly Val Leu Lys Thr Leu Leu Lys Asp Ile Gln Ser 90 Leu Glu Lys Thr Ser Asn Lys Leu Val Cys Val Ala Pro Lys Asn Ala 100 105 110 Lys Asn Phe Glu Ile Leu Glu Gly Ala Ile Thr Gln Ile Ile Gly Leu 115 120 125 Glu Glu Gln Met Asn Gln Phe Ile Asn Gly Ala Lys 130 135 140
INFORMATION FOR SEQ ID NO:794: SEQUENCE CHARACTERISTICS: LENGTH: 66 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...66 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:794: Leu Ser Gly Glu Leu Arg Leu Ser Trp Trp Leu Val Ala Arg Cys Lys 1 5 10 Thr Leu Phe Val Leu Phe Cys Glu Gly Ser Ala Phe Ser Ser Leu Leu 25 Ile Ser Ser Ile Leu Phe Ser Leu Leu Ile Arg Ser Phe Thr Ser Leu 40 Val Thr Phe Ser Ser Ser Ser Ser Cys Ser Ile Phe Phe Ile Ser Leu 55 Leu Gly
INFORMATION FOR SEQ ID NO:795: SEQUENCE CHARACTERISTICS: LENGTH: 89 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: Xaa Unknown LOCATION 1...89 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:795: Leu Val Val Ser Gly Ser Leu Gln Asn Ile Val Arg Ser Phe Leu Arg 1 5 10 Arg Ile Ser Val Phe Phe Ile Ile Asp Phe Ile Asn Phe Ile Phe Val 25 Ile Asp Gln Ile Ile His Phe Phe Gly Asp Phe Phe Ile Phe Phe Phe 40 Met Leu Asn Leu Phe Tyr Ile Leu Ile Arg Leu Ile Asp Leu Ile Val 55 Leu Ser Gly Phe Ser Val Val Leu Val Arg Leu Gly Xaa Phe Phe Arg 70 75 Leu Ile Phe Ala Cys Ala Gln His Ala
INFORMATION FOR SEQ ID NO:796: SEQUENCE CHARACTERISTICS: LENGTH: 118 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...118 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:796: Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro 1 5 10 Leu Gln Ala Leu Val Ile Glu Leu Leu Glu Glu Ile Lys Thr Ser Pro 25 His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Glu Pro Arg 40 Gln Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys Lys Leu Thr Leu 55 Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gln Pro Leu Asp Glu 70 75 Lys Leu Ser Leu Glu Thr Thr Leu Ser Pro Asn Arg Pro Thr Ile Pro 90 Arg Asn Thr Gln Ile Val Phe Ser Ser Lys Glu Leu Lys Glu Pro His 100 105 110 Ser Asn Pro Ile Pro Ser 115
INFORMATION FOR SEQ ID NO:797: SEQUENCE CHARACTERISTICS: LENGTH: 245 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...245 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:797: Leu Lys Ala Phe Gin Thr Phe Gly Val Phe Gly Thr Ser Val Ile Thr 1 5 10 Cys Ile Thr Ala Gin Asn Thr Gin Gly Val His Gly Val Tyr Pro Leu 25 Ser Val Glu Ser Val Lys Ala Gln Ile Leu Ala Ile Arg Asp Asp Phe 40 Ser Ile Lys Ala Phe Lys Met Gly Ala Leu Cys Asn Ala Gin Ile Ile 55 Glu Cys Val Ala Asn Ala Leu Glu Thr Cys Asp Phe Gly Leu Cys Val 70 75 Leu Asp Pro Val Met Val Ala Lys Asn Gly Ala Leu Leu Leu Glu Glu 90 Glu Ala Ile Leu Ser Leu Lys Lys Arg Leu Leu Pro Lys Thr Asn Leu 100 105 110 Leu Thr Pro Asn Leu Pro Glu Val Tyr Ala Leu Thr Gly Val Gin Ala 115 120 125 Arg Asp Asp Lys Ser Ala Ser Lys Ala Met Gly Val Leu Arg Asp Leu WO 97/37044 PCT/IUS97/05223 130 135 Gly Val Lys Asn Ala Val Ile Lys Gly Gly His 145; 150 155 Gly Giu Phe Ser Asn Asp Trp Val Phe Leu Giu 165 170 Leu Ser Ala Lys Arg Phe Asn Thr Lys Asn Thr 180 185 Thr Leu Ser Ser Leu Ile Val Gly Leu Leu Ala 195 200 Lys Asn Ala Ile Thr Lys Ala Lys Glu Leu Leu 210 215 Asn Pro Leu Asn Ile Gly His Gly His Gly Pro 225 230 235 Ile Lys Lys His Val 245 INFORMATION FOR SEQ ID NO:798: SEQUENCE
CHARACTERISTICS: LENGTH: 213 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc Ifeature LOCATION 1..-213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:798: 140 Thr Giu Asp Ala His Gly Gin Gly 205 Thr Ile 220 Leu Asn His Giu Thr 190 Leu Ile Leu Phe Phe 175 Gly Asp Ile Trp Gin 160 Thr Cys Leu Gin Sen 240 Val1 1 Arg Tyr Asn Val Thr Ala Leu Ala Leu 145 Leu Glu Lys Gin Asp Lys Asp Leu Phe Leu Leu Gin Val Gly Asn Ile Phe Leu Ile Pro Lys Gin Phe 130 Tyr Ile Ala Val Glu Leu Ser Ser 115 Leu Pro Pro Thr Pro Met Lys Sen 100 Gin Gin Asn Thn Met Ala Lys Giu Gin Val1 Met Leu Lys Gin Pro Lys 70 Tyr Glu Leu Lys Gly Ser Arg Lys 55 Leu Lys Phe Ser Ala 135 Glu Giu Asn 40 Ile Lys Glu Leu Phe 120 Lys Ile 10 Gin Giu 25 Gin Leu Leu Asp Glu Ile Leu Leu 90 Asn Thr 105 Val Val Lys Gin Gly Gly Leu Gly Al a Leu 75 Sen Ala Glu Ser Val1 Lys Asp Leu Sen Glu Asn Asn Val 140 Ile Ile Ile Asp Ala Lys Met Glu 125 Asp Tyr Asn Val1 Asp Lys Leu Leu 110 Arg Phe Gin Leu Leu Asp Ile Leu Lys Tyr Pro Lys Pro Asn His Ser Lys Ala .Gu 150 155Ly Lys Glu Lys Gin Leu Phe Leu Lys Thr Thr LeU Gin Arg Thr Lys Giu WO 97/37044 PCT/US97/05223 703 165 170 175 Val Leu Lys Glu Ala Gin Asn Thr Leu Leu Gly Phe Ser Phe Val Glu 180 185 190 Ile Val Cys Glu Lys Thr Pro Met Leu Phe Ala Phe Glu Asp Arg Leu 195 200 205 Leu Asp Thr Leu Gly 210 INFORMATION FOR SEQ ID NO:799: SEQUENCE
CHARACTERISTICS: LENGTH: 91 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...91 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:799: Met Ser Asp Glu Ile Thr Gln Glu Asn Glu Leu Glu Ile Asn Ser Asn 1 5 10 Asn Gln Asn Gln Glu Pro Lys Glu Val Glu Lys Met Pro Leu Asn Asn 25 Ile Gln Lys Ala Lys Lys Leu Lys Asn His Ala Asn Leu Ile Val Arg 40 Arg Thr Asp Glu Leu Asp Lys Val Ile Asn Lys Arg Glu Ser Leu Gln 55 Arg Glu Phe Lys Arg Arg Ile Lys His Leu Asp Asn Lys Ile Glu Thr 70 75 Leu Ser Asn Asn Ile Glu Glu Leu Lys Arg Lys
INFORMATION FOR SEQ ID NO:800: SEQUENCE CHARACTERISTICS: LENGTH: 172 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature WO 97/37044 PCT/US97/05223 704 LOCATION 1...172 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:800: Met Lys Lys Tyr Gin Asn Lys Arg Lys Leu Gly Lys 1 Phe Lys Asp Phe Asn Lys Ile Lys Gin 145 Lys Ile Lys Pro Lys Gly Tyr Phe Ile 130 Thr Gin Ser Ile Ser Val Asn Asn Lys 115 Ser Glu Ser Asn Ser Lys Ile Asn Lys 100 Val Leu Phe Thr 5 Glu Val Glu Ser Ile Pro Met Glu Asn Thr 165 Lys Glu Leu 55 Asn Gin Tyr Thr Leu 135 Ile Pro Asp Cys 40 Arg Gin Ile Leu Leu 120 Leu Ser Lys Ile 25 Ile Ala His Ala Gin 105 Asn Gin Lys Lys 10 Leu Glu Arg Val Lys 90 Asn Lys Ile Ile Asp 170 Glu Ser Asp Tyr 75 Asn Leu Asn Tyr Leu 155 Asn Asn Leu Phe Lys Thr Ala Val Lys 140 Asn Ala Arg Tyr Leu Tyr Glu Asn Phe Glu 125 Glu Ala lie Leu Leu Glu Leu Ile Leu 110 Glu Gin Leu Asn Lys Ala Ser Met Ala Glu Cys Lys Asn Cys Val Leu Leu Ala Ile Glu Lys Ser Glu 160 INFORMATION FOR SEQ ID NO:801: SEQUENCE CHARACTERISTICS: LENGTH: 320 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...320 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:801: Met Asn Thr Asn Leu Lys Lys Ala Leu Phe Leu Thr Thr Leu Gly Tyr 1 5 10 Leu Gin Ala Asn Glu Gly Phe Leu Ala Ile Met Pro Asp Glu Asn Lys 25 Gin Glu Asp Ile Ser Gin Glu Gin Lys Leu Glu His Lys Met Glu Gin 40 Lys Ile Ala Pro Gin Asn Thr Leu Lys Lys Gin Gin Ser Ala Pro Leu 55 Gin Ser Ala Glu Leu Leu Gin Ser Gin Val Lys Arg Glu Val Lys Glu 70 75 WO 97/37044 PCT/US97/05223 705 Gin Phe Ser Lys Gin 145 His Tyr Asp His Phe 225 Asn Leu Gly Gin Lys 305 Lys Phe Leu Thr 130 His Leu Lys Val Thr 210 Ala Lys His Gly Asn 290 Ala Lys Leu Ser 115 Ser Tyr Tyr Ser Lys 195 Leu Lys Pro Tyr Ala 275 Gly Thr Ser Gly 100 Leu Lys Phe Gly Lys 180 Tyr Gly Ile Lys Tyr 260 Ile Asn Asn Gin Lys Ser Glu Ile Ser Asn Ala Gly Ala 150 Gly Thr 165 Val Ser Leu Phe Leu Ser Asp Pro 230 Asp Ile 245 Leu Asn Asn Tyr Phe Lys Ser Ser 310 
His Ile Asn Asn 135 Ser Pro Thr Asp Val 215 Ile Phe His Gin Gly 295 Tyr Leu Ala Ala 120 Phe Lys Ser Asn Phe 200 Gly Lys Asn His Ser 280 Glu Ile Val Ile 105 Thr Asn Arg Asp Tyr 185 Leu Phe Thr His Gin 265 Val Val Ala Met 90 Thr Gly Asp Asp Met Gly His Gly 155 Val Ser 170 Thr Pro Asn Val Gly Trp Ser His 235 Gly Phe 250 Phe Glu Val Pro Leu Phe Gin Thr Leu Leu 140 Ile Thr Leu Ile Arg 220 Ser Tyr Val Ser Ser 300 Thr Ser Leu 125 Leu Lys Asp Asn Lys 205 Met Glu Pro Asn Lys 285 Asn Lys Leu 110 Thr Ile Ile Ile Phe 190 Lys Gin Phe Thr Tyr 270 Ile Phe Asn Gly Lys Gin Tyr Ser Gly Tyr Ser Ser 160 Lys Ile 175 Gly Ile Asn Arg Tyr Tyr Ile Thr 240 Ile Gly 255 Arg Phe Val Val Leu Thr Ala Phe Asn Tyr Ala Tyr Leu Phe 315 320 INFORMATION FOR SEQ ID NO:802: SEQUENCE CHARACTERISTICS: LENGTH: 486 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...486 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:802: Leu Ala Ile Lys Arg Glu Lys Met Lys Asn Ser Ala Pro Leu Lys Asn 1 5 10 Lys Val Phe Cys Gly Leu Tyr Val Leu Ser Leu Ser Ala Ser Val Gin 25 Ala Phe Asp Tyr Lys Ile Glu Val Leu Ala Glu Ser Phe Ser Lys Val 40 WO 97/37044 PCT/US97/05223 Gly Glu Leu Gly Gly Gly Leu 145 Arg Glu Ser Ala Ser 225 Tyr Ile Pro Val Glu 305 Asp Leu Phe Asn Gin 385 Phe Trp Gly Leu Thr 465 Thr Phe Thr Ser Gly Gly Gly 130 Gly Ala Tyr Asn Lys 210 Ser Ser His Phe Gly 290 Thr Thr Ile Tyr Pro 370 Ala Gly Thr Tyr Gly 450 Pro Leu Asn Phe Lys Thr Ser 115 Lys Ser Asn Arg Ala 195 Ile Trp Pro Leu Phe 275 Tyr Lys Tyr Arg Lys 355 Leu Leu Gly Ser Lys 435 Val Gly Ser Lys Val Ser Ile 100 Val Arg Asp Lys Tyr 180 Pro Lys Gly Arg Val 260 Gin Asp Ala Arg Gin 340 Val Gly Ser Gly Gly 420 Ile Met Ser Ala Lys Lys Ile Asp 55 Ile Ala Arg Thr Leu Gly Ile Ala Gly Ile 165 Lys Tyr Asp Arg Thr 245 Asp Phe Ser Tyr Tyr 325 Arg Trp Ile His Val 405 Ala Ser Thr Lys Lys 485 Ala 70 Lys Gly Tyr Leu Lys 150 Arg Asp Met Lys Ala 
230 Val Tyr Ser Asn Ile 310 Ala Phe Lys Asp Val 390 His Leu Lys His Ala 470 Phe Val Asp Ile Asn Leu 135 Val Arg Ile Ser Asn 215 Phe Ile Thr Pro Pro 295 Leu Val Asp Asn Phe 375 Val Lys Ala Ser Ser 455 Gly Gin Ala Tyr 120 Asp Ile Asn Phe Ser 200 Glu Ala Lys Tyr Gly 280 Asn Leu Lys Tyr Ala 360 Trp Thr Lys Asn Leu 440 Gly Gin Gly Tyr 105 Ile Gly Asp Tyr Ala 185 Tyr Gly Tyr Asn Glu 265 Thr Phe Pro Ala Asn 345 Asn Thr Ala Trp Glu 425 Thr Phe Gly His 90 Asp Gly Thr Ser Leu 170 Ala Thr Ser Gly Gly 250 Arg Tyr Asn Val Gly 330 Glu Ala Asn Asp Leu 410 Ala Ala Thr Asn 75 Val Ser Tyr Ser Ile 155 Met Lys Gin His Glu 235 Arg Lys Tyr Gly His 315 Thr Phe Tyr Ser Ala 395 Trp Ser Ser Val Gly Ile Leu Thr Trp Ile 140 Ala Asn Gly Gly Lys 220 Trp Thr Gly Ser Val 300 Ala Ala Asn Ile Val 380 Val Gly Ala Val Gly 460 ile Tyr Glu Lys Asp 125 His Cys Asn Gly Phe 205 Leu Ile Leu Val Pro 285 Gly Pro Gly Phe Gly 365 Tyr Ser Thr Ala Lys 445 Ser Tyr Ala Gly Phe 110 Gly Glu Gly Ala Arg 190 Glu Trp Tyr Asn Ser 270 Gly Phe Leu Gin Gly 350 Thr Asp Gly Leu Val 430 Leu Tyr Pro Asp Lys Asn Tyr Cys Asn Phe 175 Tyr Ile Trp Asp Tyr 255 Val Val Arg Lys Ser 335 Gly Thr Ile Trp Trp 415 Asn Glu Arg Thr Phe Val Gin Leu Ala Ala 160 Leu Gin Ser Phe Phe 240 Gly Ser Ala Ser Arg 320 Leu Ala Gly Gly Val 400 Arg Val Tyr Pro Leu Tyr Ser Asp Arg Ser His Leu Met Thr 475 480 WO 97/37044 PCTIUS97/05223 707 INFORMATION FOR SEQ ID NO:803: SEQUENCE CHARACTERISTICS: LENGTH: 167 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...167 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:803: Val Cys Ile Ile Phe Asp Ala Asp Ile Lys Lys Glu Asn Gin Glu Ser 1 5 10 Asp Ala Gly Phe Asp Asn Lys Leu Lys His Ile Arg Glu Lys Phe Lys 25 Glu Lys Gly Thr Asp Phe Pro Lys Glu Gin Ile Phe Leu Phe Pro Asn 40 Asn Gin Asp Asp Gly Asp Leu Glu Thr Leu Leu Leu Glu Ile Ala Lys 55 His Asp Glu Phe Leu Lys Cys Phe Glu Gly Tyr Leu Glu Cys Ile Lys 70 75 Ser Lys Glu His Tyr Lys Pro Ile Lys Asn Ile Arg Lys Asn Met Leu 90 Tyr Ala Tyr Leu Glu Ala Leu Gly Leu Glu Asn Leu Thr Lys Thr Asn 100 105 110 Ile Asp Val Phe Asp Ser Lys Gly Lys Ile Lys Ser Arg Tyr Glu Glu 115 120 125 Asn Tyr Lys Lys Leu Thr Glu Glu Val Ile Asp Phe Ser Ser Asn Ser 130 135 140 Leu Ile Pro Leu Lys Asn Phe Leu Gly Gin Phe Ala Glu Asn Lys Gin 145 150 155 160 Lys Thr Asn Pro Lys Ile Phe 165 INFORMATION FOR SEQ ID NO:804: SEQUENCE CHARACTERISTICS: LENGTH: 210 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 708 (ix) FEATURE: NAME/KEY: misc-feature LOCATION .210 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:804: Met Thr Val1 Leu Asn Ile Glu Thr Lys Thr 145 Lys Asp Val Met Val Leu Lys Thr Lys Leu Lys Ile Ile Ser Ser Val Ile Leu Asn Leu Phe Asn Ile Arg Gin 130 Val1 Val1 Tyr Leu Asp Leu Giu Asp Al a Ala 115 Arg Giu Ser Asp Trp Ala Thr Gin Thr Arg 100 Ser Giu Lys Ser Phe 180 Val1 Thr Ala Leu Thr Gin Gly Lys Gly Ile 165 Thr dly Lys Asn Lys 70 Gin Leu Gly Giu Thr 150 Ala Leu *Cys Asn Ala 55 Gly Pro Arg Ser Arg 135 Leu Ala Ser Ser Thr 40 Met Lys Asn Leu Gly 120 Glu Lys Ser Leu Ile 200 Ser 25 Thr Leu His Leu Arg 105 Ile Ser Ala Ile Thr 185 Val 10 Giu Ala Asp Leu Asp 90 Ser Glu Glu Ala Ser 170 Asn Lys Met Ser Ser Ile 75 Met Asn Ala Glu Asp 155 Ser Arg Asn Ala Ile Met Giu Asn Gly Asp Tyr 140 Leu Ser Lys Ala Thr Asn Phe Val1 Leu Arg Ser 125 Asn Ser Arg Thr Ser 205 Tyr Ser Ser Ser Leu Phe 110 Arg Gin Leu Gin Gly 190 Asn Leu Gin Thr Asp Asp Thr Asn Met Asp Ser Arg 175 G1u Lys Ser Asn Asp Pro Val1 Thr Ile Val Thr Gly 160 Leu Glu Arg Trp Ser 195 Phe 210 Asp Val Lys Pro INFORMALTION FOR SEQ ID NO:805: SEQUENCE
CHARACTERISTICS: LENGTH: 184 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGAN~ISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION .184 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:805: Met Lys Ala Phe Leu Lys Ile Cys Met Val Leu Ile Phe Val Gly Val 10 WO 97/37044 PCT/US9705223 Ala His Ala Lys Asn Pro Leu Thr Gin Asn Glu Lys Trp Ala Asn Asp Ile Thr Leu Lys Thr Tyr 130 Lys Asp 145 Ile Asn Glu Asn Leu Pro Leu Lys Pro Lys 115 Arg Asp Pro Ile Gin Leu Trp Glu Leu 100 Gin Leu Met Lys Asp 180 Ser Val Val Val Lys Thr Val Asn Ile 165 Ile Phe Tyr Tyr 70 Val Asp Asp Phe Asn 150 Pro Val Ser Tyr 55 Glu Val Lys Gly Lys 135 Leu Asn Arg Ala 40 Gly Lys Tyr Thr Ser 120 Asp Val Glu Gin Leu Ser His Phe Val Leu Pro Leu Glu Pro 90 Asp Phe 105 Phe Lys Gly Lys Thr Ile Ile Phe 170 Lys Lys Lys Lys 75 Asn Phe Thr Pro Thr 155 Val Glu Gin Ala Lys Leu Thr Thr Phe 140 Phe Phe Glu Val Lys Glu Phe Ile Ile 125 Ser Ser Asn Glu Leu Ala Ile Gin Leu 110 Asn Leu Gin Pro Val Lys Pro Tyr Ala Lys Lys Glu Ala Lys 175 Leu Asn Asn Met Thr Gin Thr Phe Glu 160 Asp INFORMATION FOR SEQ ID NO:806: SEQUENCE CHARACTERISTICS: LENGTH: 185 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...185 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:806: Arg Ile Lys Ala Tyr Phe Leu Arg Phe Ile Ala 5 10 Leu Leu Gly Phe Ser Ala Cys Lys Asn Ser Gin 25 Gin Asn Asn Thr Thr Gin Gin Asp Ser Pro Lys 40 Asp Leu Asn Asn Gin Glu Tyr Thr Ile Met Gly 55 Asn Ile Ser Pro Asp Pro Asn Thr Pro Thr Leu 70 75 Leu Asp Asn Ser Leu Lys Asp Tyr Ala Pro Thr 90 Lys Thr Phe Lys Asp Arg Leu Arg Val Leu Ile 100 105 Met 1 Val Ser Met Leu Ala Lys Leu Lys Thr Asp Leu Phe Leu Val Ser Tyr Leu Ile Asn Leu 110 Phe Gin Thr Asp Leu Val Asn Ile Asp Ala Ser Ser Leu Gin WO 97/37044 PCT/US97/05223 710 Pro Tyr Ser Ser Asp Ala Ile Lys Gly Phe Ile Ala Pro Ser Gin 115 120 125 Asp Leu Met Ile Leu Asn Pro Lys Asp Thr Ala Leu Phe Asp His 130 135 140 Asn His Asp Ala Leu Asn His Ser Phe Asn Met Leu Leu 
Tyr Asp 145 150 155 His Gin Leu Ile Lys Met Tyr Gin Gly Ile Val Pro Ala Glu Met 165 170 175 Gin Phe Asp Ile Ser Asn Leu Lys Asp 180 185 INFORMATION FOR SEQ ID NO:807: SEQUENCE CHARACTERISTICS: LENGTH: 363 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...363 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:807: Thr Leu Lys 160 Leu Met Val Ser Thr Leu Lys Pro Leu Lys Ile Gly 1 Phe Leu Val Ala Asn Gly Asp Pro Ile 145 Trp Pro Pro Ala Gly Lys Glu Ala Ser Thr 130 Pro Ser Leu Ile Gly Thr Lys Ile Asn Cys 115 Asn Ile Asp Ser Phe Asn Gly Pro Phe Ile 100 Glu Met Ile Arg Gly 180 5 Gin Val Tyr Phe Ala Leu Ala Pro Ser Tyr 165 Gly Gly Ala Tyr Glu 70 Asn Tyr Gly Glu Ser 150 Lys His Gly Lys Lys 55 Ala Ala Ala Ala Phe 135 Ala Arg Gin Met Glu 40 Asn Leu Arg Ile Asn 120 Ala Lys Ile Gly Gly 25 Gly Met Asn Lys Asn 105 Ile Lys Ala Pro Phe 185 10 Val Ala Arg Phe lie 90 Asp lie Asp Leu Asp 170 Lys Gly Leu Phe Tyr 75 Cys Tyr Ile Phe Lys 155 Ala Tyr Lys Ile Gly Val Ser Gly Gly Thr Ser 140 Ile Phe Glu His Ser Val Glu Lys Asn Arg Gly 125 Asp Leu Ile Asp Thr Trp Ile Arg Lys Lys Val 110 Ala Val Cys Val Cys 190 Ile Asp Ser Ile Ala Pro Leu Gly Ala Lys Glu 175 Phe Lys Glu Ala Val Leu Leu Arg Leu Leu Arg 160 Gly Lys Glu Glu Phe Gin Leu Glu Asn Leu Val Pro Lys Val Val Glu Ala Ser 200 205 WO 97/37044 PCTIUS97/05223 Lys Glu Trp Gly Asn Ile Pro Ile Ile Ala Ala Gly Gly I)1 n\ 215 Lys 225 Met Ala Ser Arg Val1 305 Ile Leu Val1 Lys Ala AspD Pro I le 290 Ala Ala Tyr His Asp Thr Leu Val1 275 Glu Pro Asp Phe Giu 355 Ile Arg Leu 260 Gly Glu Cys Gly Thr 340 Leu Asp Phe 245 Pro Tyr Gly Asn Leu 325 Gly Ile Thr 230 Leu Thr Pro Asn Arg 310 Gly Ala Lys Met Gly Leu Ala Ala 295 Gly Arg Asn Giu Leu Ser Thr Lys Lys Lys 265 Arg Ala 280 Pro Lys Giu Glu Ser Tyr Gly Tyr 345 Leu Thr 360 Leu Giu 250 Glu Ile Ile Ala Leu 330 Arg Glu Gly 235 Cys Asp Asn Ala Lys 315 Gly Val1 Gly 220 Ala Ser Asp 
Ala Ile Leu Thr Gly 285 Cys Val 300 Lys Val Asn Arg Asp Lys Ile Trp Cly Val Lys Ala 255 Leu Ile 270 Val Ile Ser Asn Gly Tyr Glu Giu 335 Ile Ile 350 Asp Gin 240 Tyr Lys Lys Cys Cys 320 Gly Ser INFORMALTION FOR SEQ ID NO:808: SEQUENCE CHARACTERISTICS: LENGTH: 363 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NME/KEY: misc feature LOCATION .363 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:808: Vai Ser Thr Leu Lys Pro Leu Lys Ile Gly Lys 10 Pro Ile Phe Gin Gly Gly Met Gly Val Gly Ile 25 Ala Gly Asn Val Ala Lys Giu Gly Ala Leu Gly 40 Gly Thr Gly Tyr Tyr Lys Asn Met Arg Phe Val 55 60 Lys Lys Pro Phe Glu Ala Leu Asn Phe Tyr Ser 70 75 Glu Ile Phe Ala Asn Ala Arg Lys Ile Cys Giy 90 Ala Asn Ile Leu Tyr Ala Ile Asn Asp Tyr Gly 100 105 Ser Cys Giu Ala Gly Ala Asn Ile Ile Ile Thr 115 120 Met Phe Leu Val Al a Asn Gly Asp His Ser Val1 Giu Lys Asn Arg Gly 125 Thr Trp Ile Arg Lys Lys Val 110 Ala Ile Asp Ser Ile Ala Pro Leu Gly Lys Glu Ala Val1 Leu Leu Arg Leu WO 97/37044 PCTIUS97/05223 Pro Ile 145 Trp Pro Glu Lys Lys 225 Met Ala Ser Arg Val 305 Ile Leu Val Thr 130 Pro Ser Leu Glu Glu 210 Lys Ala Asp Pro Ile 290 Ala Al a Tyr Hlis Asn Ile Asp Ser Phe 195 Trp Asp Thr Leu Val1 275 Glu Pro Asp Phe Glu 355 Met Pro Glu Phe Ala Lys Asp Phe Ser Asp Val Ala Leu Ile Arg Gly 180 Gin Gly Ile Arg Leu 260 Gly Glu Cys Gly Thr 340 Leu Ser Tyr 165 Gly Leu Asn Asp Phe 245 Pro Tyr Gly Asn Leu 325 Gly Ile Ser 150 Lys His Giu Ile Thr 230 Leu Thr Pro Asn Arg 310 Gly Ala Lys 135 *Ala Arg Gin Asn Pro 215 Met Giy Leu Ala Ala 295 Giy Arg Asn Glu Lys Ile Gly Leu 200 Ile Leu Thr Lys Arg 280 Pro Giu Ser Gly Leu 360 Ala Pro Phe 185 Val1 Ile Ser Lys Lys 26S Ala Lys Giu Tyr Tyr 345 Thr Leu Asp 170 Lys Pro Ala Leu Giu 250 Giu Ile Ile Ala Leu 330 Arg Glu Lys Ala Tyr Lys Ala Gly 235 Cys Asp Asn Ala Lys 315 Gly Val1 Gly 140 Ile Phe Glu Val1 Gly 220 Ala Asp Ile Thr Cys 300 Lys Asri Asp Leu Ile Asp Val1 205 Gly Ser Ala Leu Gly 285 Val Val Arg Lys Cys Val1 Cys 190 Giu Ile Gly Lys Leu 270 Val1 Ser Gly Giu Ile 350 Lys Giu 175 Phe Ala Trp Val1 Ala 255 Ile Ile Asn Tyr Glu 335 Ile Arg 160 Gly Lys Ser Asp Gin 240 Tyr Lys Lys Cys Cys 320 Gly Ser INFORMATION FOR SEQ ID NO:809: SEQUENCE CHARACTERISTICS: LENGTH: 192 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) 
HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Heiicobacter pyloni (ix) FEATURE: NAME/KEY: misc -feature LOCATION i. 192 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:809: Leu Leu Gly Arg Asn Gly Val Thr Leu Asn Ile Arg Gin Val Phe Trp, 1 5 10 Trp Asp Asn Phe Asn Trp Ser Ile Gly Phe Tyr Asn Thr Phe Gly Asn 25 Ser Asp Ala Phe Leu Oily Ser His Thr Met Pro Arg Gly Asn Asn Thr 40 WO 97/37044 PCT/US97/05223 Ser Tyr Ile Ser Asn Glu Ile Ser Val Thr Thr cc Arg His Ala Gly Met Ile Ala Ile Asn Gin Arg 145 Tyr Asn Gly Ile His Lys Phe 130 Ile Phe Tyr Tyr Thr Lys Asn 115 Asn Thr Gly Gin Asp Phe Asn Ala Arg Phe 100 Ala Leu Ala Ser Tyr Tyr Ala Pro 165 Asp Arg 180 Trp 70 Asn Ala Gly Tyr Gly 150 Lys Ser Asp Thr Trp Gin Ala 135 Ala Phe Tyr Asn Phe His Val 120 Phe Arg Asn Met Thr Thr Val 105 Gly Thr Ile Asn Met 185 Ala Phe 90 Phe Arg Glu Asn Pro 170 Thr Tyr 75 Tyr Gly Ala Ser Lys 155 Asp Asn Asp Thr Arg Asn Val 140 Gly Gly Leu Gly Ser Val Glu 125 Leu Tyr Asp Thr Leu Val Ser 110 Tyr Leu Gin Phe Leu 190 Ala Gly His Ser Asn Ala Ser 175 Lys Asp Gly Ala Leu Phe Gly 160 Ala Phe INFORMATION FOR SEQ ID NO:810: Met 1 Ala Thr Gin Ala Ser Phe Asn Gly SEQUENCE CHARACTERISTICS: LENGTH: 354 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...354 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:810: Gin Gin Gin Leu Thr Tyr Leu Asn Ala Gly Asn 5 10 Met Asn Lys Ala Leu Glu Lys Asn Gly Thr Ala 25 Ser Ser Thr Ser Gly Ala Thr Gly Ser Asp Gly 40 Gin Ala Ile Gin Tyr Leu Gin Gly Gin Gin Asn 55 Ala Asn Leu Leu Lys Gin Asp Glu Leu Leu Leu 70 75 Ala Val Ala Ala Asn Ile Gly Asn Lys Glu Phe 90 Thr Gly Leu Val Gin Gly Ile Ile Asp Gin Ser 100 105 Glu Leu Thr Lys Asn Thr Ile Ser Gly Ser Ala 115 120 Ile Asn Ser Asn Gin Ala Asn Ala Val Gin Gly 130 135 140 Val Thr Gin Ile Glu Asn Gin Val 125 Arg Phe Ala Thr Leu Ala Ser Leu 110 Asn Ala Phe Asn Tyr Asn Phe Ala Val Asn Ser Asn Ser Ser Asn Asn Ala Tyr Ala Gin 
WO 97/37044 PCT/US97/05223 Leu 145 Ala Ala Val Tyr Thr 225 Leu Gly Leu His Leu 305 Val Thr Ser Pro Leu Gly Gly Tyr 210 Gin Tyr Phe Arg Phe 290 Asp Val Val Phe Asn Asn Asn Tyr 195 Gly Asn Asn Phe Asp 275 Gin Gly Val Lys Ala Leu Tyr Asn Val Gin Asn Ser 180 Lys Phe Asn Ile Ser 260 Asp Phe Lys Pro Tyr 340 Gin 165 Arg Gin Phe Val Phe 245 Gly Pro Leu Ser Thr 325 Phe Val Arg Ala Thr Phe Phe Ser Tyr 215 Gly Leu 230 Ser Arg Ile Gin Asn Val Phe Asp 295 Asn Arg 310 Ile Tyr Arg Pro Ser Asn Gly 200 Asn Tyr Ser Leu Lys 280 Phe His Asn Tyr Met Ile 185 Lys Gly Thr Tyr Ala 265 Leu Gly Asn Thr Ser 345 Val Pro 170 Leu Lys Ala Tyr Gin 250 Gly His Met Gin Tyr 330 Val Thr 155 Tyr Asn Arg Ser Gly 235 Asn Glu Gly Arg His 315 Tyr Tyr Leu Leu Gly Asn Val 220 Val Arg Thr Lys Met 300 Thr Lys Trp Asp Pro Phe Ile 205 Gly Gly Ser Phe Ile 285 Asn Val Ser Ser Lys Gin Tyr 190 Gly Phe Thr Val Gin 270 Asn Phe Glu Ala Tyr 350 Ile Phe 175 Thr Leu Arg Asp Asp 255 Ser Asn Gly Phe Gly 335 Gly Asn 160 Arg Lys Arg Ser Val 240 Met Thr Thr Lys Gly 320 Thr Tyr INFORMATION FOR SEQ ID NO:811: SEQUENCE CHARACTERISTICS: LENGTH: 748 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...748 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:811: Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe Ile Glu Asp Tyr Lys 5 10 Val Val Ile Phe Ser Phe Ile Gly Leu Ile Ala Leu Phe Phe Leu 25 Lys Phe Ile Lys Thr Gin Lys Lys Val Phe Lys Asp Lys Ala Asn 40 Pro Gin Lys Lys Lys Ser Phe Lys Glu Ile Ile Ile Asp Gly Leu 55 Met 1 Leu Tyr Gin WO 97/37044 PCT/US97/05223 715 Lys Glu Arg Val Lys Thr Phe Gly Phe Trp Leu Gin Ala Ile Leu Leu 70 75 Leu Ser Tyr Ser Phe Ile Thr Ser Gly Leu Phe Phe Leu Ile Leu Leu 90 Gly Asn Phe Tyr Asp Asp Asn Arg Leu Pro Glu Ser Asp Asp Asp Leu 100 105 110 Phe Asp Ile Trp Val Tyr Ala Ile Gin Asp Phe Pro Ala Tyr Tyr Phe 115 120 125 Lys Ala Leu Thr Phe Ser Ser Leu Lys Ile Tyr Gly Phe Asn Ile Ser 130 135 140 Leu Val Val Tyr Ser Ser Ile Leu Cys Ser Tyr Ile Phe Ile Thr Phe 145 150 155 160 Phe Val Trp Phe Leu Lys Tyr Leu Thr Arg Thr Arg Asp Ile Gly Ala 165 170 175 Asn Lys Lys Val Asp Asp Leu Phe Gly Ser Ala Ser Trp Glu Thr Glu 180 185 190 Glu Lys Met Ile Lys Ala Lys Leu Ile Thr Pro Asn Asn Lys Lys Arg 195 200 205 Ala Phe Asp Lys Arg Glu Val Ile Val Gly Arg Arg Gly Leu Gly Asp 210 215 220 Phe Ile Ala Tyr Ala Gly Gin Ala Phe Ile Gly Leu Ile Ala Pro Thr 225 230 235 240 Arg Ser Gly Lys Gly Val Gly Phe Ile Met Pro Asn Met Ile Asn Tyr 245 250 255 Pro Gin Asn Ile Val Val Phe Asp Pro Lys Ala Asp Thr Met Glu Thr 260 265 270 Cys Gly Lys Ile Arg Glu Lys Arg Phe Asn Gin Lys Val Phe Ile Tyr 275 280 285 Glu Pro Phe Ser Leu Lys Thr His Arg Phe Asn Pro Phe Ala Tyr Val 290 295 300 Asp Phe Gly Asn Asp Val Val Leu Thr Glu Asp Ile Leu Ser Gin Ile 305 310 315 320 Asp Thr Arg Leu Lys Gly His Gly Met Val Ala Ser Gly Gly Asp Phe 325 330 335 Ser Thr Gin Ile Phe Gly Leu Ala Lys Leu Val Phe Pro Glu Arg Pro 340 345 350 Asn Glu Lys Asp Pro Phe Phe Ser Asn Gin Ala Arg Asn Leu Phe Val 355 360 365 Ile Asn Cys Asn Ile Tyr Arg Asp Leu Met Trp Thr Lys Lys Gly Leu 370 375 380 Glu Phe Val Lys Arg Lys Lys Ile 
Ile Met Pro Glu Thr Pro Thr Met 385 390 395 400 Phe Phe Ile Gly Ser Met Ala Ser Gly Ile Asn Leu Ile Asp Glu Asp 405 410 415 Thr Asn Met Glu Lys Val Val Ser Leu Met Glu Phe Phe Gly Gly Glu 420 425 430 Glu Asp Lys Ser Gly Asp Asn Leu Arg Ala Leu Ser Pro Ala Thr Arg 435 440 445 Asn Met Trp Asn Asn Phe Lys Thr Met Gly Gly Ala Lys Glu Thr Tyr 450 455 460 Ser Ser Val Gin Gly Val Tyr Thr Ser Ala Phe Ala Pro Tyr Asn Asn 465 470 475 480 Ala Met Ile Arg Asn Phe Thr Ser Ala Asn Asp Phe Asp Phe Arg Arg 485 490 495 Leu Arg Ile Asp Ala Val Ser Ile Gly Val Ile Ala Asn Pro Lys Glu 500 505 510 Ser Thr Ile Val Gly Pro Ile Leu Glu Leu Phe Phe Asn Val Met Ile WO 97/37044 PCTIUS97/05223 Tyr Ser Asn Leu Ile Leu Pro Ile His 530 Cys 545 Phe Phe Tyr Met Leu 625 Ile Arg Ile Tyr Ser 705 Phe Asp Leu Val1 Val1 Gly Tyr 610 Ser Asp Phe Ile Tyr.
690 Leu Tyr Lys Met Lys Phe Arg 595 Tyr Lys Asp Leu Leu 675 Asp Ser Asp Ser Leu Al a Gin 580 Asn Gly Val1 Asn Met 660 Giu Asp Lys Asp Leu Met Val1 565 Ser Gly Ile Leu Thr 645 Thr Asn Pro Lys Leu 725 Val Asp 550 Giy Lys Ala Asn Giy 630 Gly Pro Thr Phe Tyr 710 Gin Pro 535 Glu Phe Ile Met Ala Gin Lys Thr 600 Asn Asp 615 Lys Tyr Lys Thr Asp Giu Leu Lys 680 Phe Thr 695 Lys Leu Ala Ala Val Gly Thr Ala Leu 585 Ile Asn Thr Asn Leu 665 Pro Asp Gly Lys Ser 745 Asp Leu Giu 570 Giu Leu Tyr Arg Thr 650 Met Ile Giu Lys Thr 730 Ser Cys 555 Tyr Asn Asp Tyr Gin 635 Ser Thr Lys Leu Val1 715 Arg Glu 540 Gly Asn Asp Asn Giu 620 Asp Ile Met Cys Ile 700 Pro Gly Leu Tyr Leu Met Arg Pro Pro 590 Leu Ser 605 His Phe Val Ser Ser Asn Gly Asp 670 His Lys 685 Lys Vai Asn Gin Glu Leu Giu Thr 560 Pro Ala 575 Leu Gly Leu Asn Glu Lys Arg Ser 640 Lys Giu 655 Glu Leu Ala Leu Ser Pro Ala Thr 720 Ser Tyr 735 525 Pro Gin Cys Lys Arg Ser INFORMATION FOR SEQ ID NO:812: SEQUENCE CHARACTERISTICS: LENGTH: 34 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION I1.. .34 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:8i2: Leu Sen His Leu Phe Leu Val Lys Ang Pro Tyr Sen Ang Leu Tyn Leu 10 His Asn Gin Ser Asn His Lys Met Ang His His Ser Ile Leu Phe Val 25 Sen Phe WO 97/37044 PCT/US97/05223 717 INFORMATION FOR SEQ ID NO:813: SEQUENCE CHARACTERISTICS: LENGTH: 110 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...110 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:813: Met Glu His His Lys Ala His Thr Thr Ile Gin Ala Leu Gin Ala Lys 1 5 10 Arg Lys Arg Leu Leu Thr Glu Leu Ala Glu Leu Glu Ala Glu Ile Lys 25 Val Ser Ser Glu Arg Arg Ser Ser Phe Asn Val Ser Leu Ser Pro Ser 40 Leu Leu Ala Glu Ile Glu Glu Ile Glu Tyr Glu Glu Lys Met Ser Lys 55 Glu Arg Arg Ile His His Asn Leu Leu Leu Ser Pro Ser Phe Met Ala 70 75 Lys Val Asp Glu Tyr Met Lys Glu Lys Gly Phe Pro Asn Arg Ser Leu 90 Leu Phe Glu Lys Ala Leu Glu Phe Tyr Met Leu Lys His Pro 100 105 110 INFORMATION FOR SEQ ID NO:814: SEQUENCE CHARACTERISTICS: LENGTH: 158 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...158 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:814: Val Met Arg Phe Phe Ser Phe Cys Tyr Phe Leu Phe Tyr Phe Leu Gly 1 5 10 WO 97/37044 WO 9737044PCTIUS97/05223 Val1 S er Ser Leu Asn Leu Val Thr Gly 145 (2) Ser Tyr Val1 Asp Pro Lys Ala Ser 130 Asp Leu Arg Ser Tyr Leu Lys Ile 115 Ile Met His Leu Lys His Leu Giu 100 Thr Leu Leu Ala Lys Pro Cys Gin Lys His Asn Ser Leu Ile Ile Ser 70 Ile Giu Tyr Leu Leu 150 Ser Val1 Val1 55 Ile Lys Arg Asp Lys 135 Asp Pro Asp 40 Ser Ile Leu Val1 His 120 Ala Ile Leu 25 Ser Arg Thr Giu Ile 105 Ser Leu Phe Giu Glu Arg Val Ile Lys Arg Asn 75 Arg Phe 90 Asp Cys Tyr Lys Ser Val Arg Lys 155 Gin Met Thr Leu Leu Leu 
Asn Lys 140 Giu Glu Gly Ala Pro Leu Leu Gly 125 Ala Glu Phe Giu Pro Asp Giu Lys 110 Thr Ser Glu Leu Giu Tyr Leu Ile Ser Thr Leu Ile Tyr Val Lys Ala Gin Thr Giu INFORM4ATION FOR SEQ ID NO:815: SEQUENCE CHARACTERISTICS: LENGTH: 117 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION .117 (xi) SEQUENCE DESCRIPTION: SEQ ID Met Giu Leu Ile Lys Lys Leu Giu Lys Glu Sen Glu 1 5 10 Asp Leu Gin Gin His Ser Asn Giu Leu Phe Lys Met 25 Asn Giu Asp Leu Phe Lys Giu Gin Phe Giu Ile Met 40 Val Giu Ilie Vai Lys Met Met Phe Giu Leu Thr Lys 55 Asp Gly Giu Met Ile Gly Tyr Thr Giu Giu Leu Leu 70 75 Arg Asp Phe Phe Asn Gly Ile Phe Lys Ser Lys Val 90 Pro Ile Phe Cys Gly Asp Val Lys Cys Giu Asp Phe 100 105 Ser Leu Vai Tyr Leu 115 INFORMATION FOR SEQ ID NO:816: Val Leu Phe Lys Thr Ile Asn Leu Ile Lys Thr Phe Pro Ala 110 Lys Ile Ala Lys Leu Lys Leu Lys Asp Trp Phe Val1 Met Arg WO 97/37044 PCT/US97/05223 719 SEQUENCE CHARACTERISTICS: LENGTH: 277 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...277 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:816: Leu Lys Arg Ala Leu Trp Leu Ile Leu Gly Leu Phe Tyr Ala Leu Asn 1 5 10 Ala Glu Ser Phe Lys Asp Val Leu Thr Lys Gly Asp Tyr Thr Phe Phe 25 Asn Lys Lys Val Val Ser Pro Ile Lys Arg Tyr Ala Asp Arg Ser Ala 40 Phe Tyr Leu Gly Leu Gly Tyr Gin Leu Gly Ser Ile Gin His Asn Ser 55 Ser Asn Leu Asn Leu Ser Gin Arg Phe Asn Lys Ser Gin Ile Ile Phe 70 75 Ser Asp Ser Leu Ser Pro Val Phe Lys Asn Ser Tyr Val Ser Asn Gly 90 Leu Gly Val Gin Val Gly Tyr Lys Trp Val Gly Lys His Glu Glu Thr 100 105 110 Lys Trp Phe Gly Phe Arg Trp Gly Leu Phe Tyr Asp Leu Ser Ala Ser 115 120 125 Leu Tyr Gly Ser Gin Glu Ser Gin Ser Ile Ile Ile Ser Thr Tyr Gly 130 135 140 Thr Tyr Met Asp Leu Leu Leu Asn Ala Tyr Asn Gly Asp Lys Phe Phe 145 150 155 160 Ala Gly Phe Asn Leu Gly Ile Ala Phe Ala Gly Val Tyr Asp Arg Leu 165 170 175 Ser Asp Ala Leu Leu Tyr Gin Thr Leu Leu Gin Asn Thr Phe Gly Gly 180 185 190 Lys Val Asn Leu Asn Gly Phe Gin Phe Leu Val Asp Leu Gly Val Arg 195 200 205 Leu Gly Asn Glu His Asn Gin Phe Gly Phe Gly Ile Lys Ile Pro Thr 
210 215 220 Tyr Tyr Phe Asn His Tyr Tyr Ser Met Asn Asn Ile Ser Asn Asn Ser 225 230 235 240 Glu Asn Val Leu Lys Val Leu Arg Phe Leu Glu Tyr Gly Ile Asn Ser 245 250 255 Leu Leu Tyr Gin Val Asp Phe Arg Arg Asn Tyr Ser Val Tyr Phe Asn 260 265 270 Tyr Thr Tyr Ser Phe 275 INFORMATION FOR SEQ ID NO:817: WO 97/37044 PCT/US97/05223 720 SEQUENCE CHARACTERISTICS: LENGTH: 493 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...493 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:817: Met Pro Ala Leu Asn Ser Ser Lys Asn Met Val Val Asn Ile Asn Gin 1 5 10 Thr Phe Thr Lys Asn Pro Thr Thr Glu Tyr Thr Tyr Pro Asp Gly Asn 25 Gly Asn Tyr Tyr Ser Gly Gly Ser Ser Ile Pro Ile Gin Leu Lys Ile 40 Ser Ser Val Asn Asp Ala Glu Asn Leu Leu Gin Gin Ala Ala Thr Ile 55 Ile Asn Val Leu Thr Thr Gln Asn Pro His Val Asn Gly Gly Gly Gly 70 75 Ala Trp Gly Phe Gly Gly Lys Thr Gly Asn Val Met Asp Ile Phe Gly 90 Asp Ser Phe Asn Ala Ile Asn Glu Met Ile Lys Asn Ala Gin Ala Val 100 105 110 Leu Glu Lys Thr Gin Gin Leu Asn Ala Asn Glu Asn Thr Gin Ile Thr 115 120 125 Gin Pro Asp Asn Phe Asn Pro Tyr Thr Ser Lys Asp Thr Gin Phe Ala 130 135 140 Gin Glu Met Leu Asn Arg Ala Asn Ala Gin Ala Glu Ile Leu Ser Leu 145 150 155 160 Ala Gin Gin Val Ala Asp Asn Phe His Ser Ile Gin Gly Pro Ile Gin 165 170 175 Gin Asp Leu Glu Glu Cys Thr Ala Gly Ser Ala Gly Val Ile Asn Asp 180 185 190 Asn Thr Tyr Gly Ser Gly Cys Ala Phe Val Lys Glu Thr Leu Asn Ser 195 200 205 Leu Glu Gin His Thr Ala Tyr Tyr Gly Asn Gin Val Asn Gin Asp Arg 210 215 220 Ala Leu Ser Gin Thr Ile Leu Asn Phe Lys Glu Ala Leu Ser Thr Leu 225 230 235 240 Gly Asn Asp Ser Lys Ala Ile Asn Ser Gly Ile Ser Asn Leu Pro Asn 245 250 255 Ala Lys Ser Leu Gin Asn Met Thr His Ala Thr Gin Asn Pro Asn Ser 260 265 270 Pro Glu Gly Leu Leu Thr Tyr Ser Leu Asp Thr Ser Lys Tyr Asn Gin 275 280 285 Leu Gin Thr Val Ala Gin Glu Leu Gly Lys Asn Pro Phe Arg 
Arg Ile 290 295 300 WO 97/37044 PCT/US97/05223 721 Gly Val Ile Asn Tyr Gin Asn Asn Asn Gly Ala Met Asn Gly Ile Gly 305 310 315 320 Val Gin Ala Gly Tyr Lys Gin Phe Phe Gly Lys Lys Arg Asn Trp Gly 325 330 335 Leu Arg Tyr Tyr Gly Phe Phe Asp Tyr Asn His Ala Tyr Ile Lys Ser 340 345 350 Asn Phe Phe Asn Ser Ala Ser Asp Val Trp Thr Tyr Gly Val Gly Met 355 360 365 Asp Ala Leu Tyr Asn Phe Ile Asn Asp Lys Asn Thr Asn Phe Leu Gly 370 375 380 Lys Asn Asn Lys Leu Ser Val Gly Leu Phe Gly Gly Phe Ala Leu Ala 385 390 395 400 Gly Thr Ser Trp Leu Asn Ser Gin Gin Val Asn Leu Thr Met Met Asn 405 410 415 Gly Ile Tyr Asn Ala Asn Val Ser Ala Ser Asn Phe Gin Phe Leu Phe 420 425 430 Asp Leu Gly Leu Arg Met Asn Leu Ala Arg Pro Lys Lys Lys Asp Ser 435 440 445 Asp His Ala Ala Gin His Gly Met Glu Leu Gly Val Lys Ile Pro Thr 450 455 460 Ile Asn Thr Asp Tyr Tyr Ser Phe Met Gly Ala Glu Leu Lys Tyr Arg 465 470 475 480 Arg Leu Tyr Ser Val Tyr Leu Asn Tyr Val Phe Ala Tyr 485 490 INFORMATION FOR SEQ ID NO:818: SEQUENCE CHARACTERISTICS: LENGTH: 269 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...269 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:818: Val Leu Asn Thr Thr Gin Lys Ser Leu Leu Val Phe Met Gly Val Phe 1 5 10 Phe Leu Ile Phe Gly Val Asp Gin Ala Ile Lys His Ala Ile Leu Glu 25 Gly Phe His Tyr Glu Ser Leu Val Ile Asp Ile Val Leu Val Phe Asn 40 Lys Gly Val Ala Phe Ser Leu Leu Ser Phe Leu Glu Gly Gly Leu Lys 55 Tyr Leu Gin Ile Leu Leu Ile Leu Gly Leu Phe Ile Phe Leu Met Arg 70 75 Gin Lys Glu Leu Phe Lys Ser His Ala Ile Glu Phe Gly Met Val Phe 90 WO 97/37044 PCT/US97/05223 722 Gly Ala Gly Val Ser Asn Val Leu Asn Arg Leu Val Ser Leu Gly Gin 100 105 110 Leu Tyr Ile Leu Tyr Asp Val Leu Ala Ile Arg Arg Glu Lys Met Lys 115 120 125 Asn Ser Ala Pro Leu Lys Asn Lys Val Phe Cys Gly Leu Tyr Val Leu 130 135 140 Ser Leu Ser Ala Ser Val Gin Ala Phe Asp Tyr Lys Ile Glu 
Val Leu
145 150 155 160
Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys Lys Lys Ile Asp Ile
165 170 175
Ala Arg Gly Ile Tyr Pro Thr Glu Thr Phe Val Thr Ala Val Gly Gln
180 185 190
Gly Asn Ile Tyr Ala Asp Phe Leu Ser Lys Ser Leu Lys Asp Gln Gly
195 200 205
His Val Leu Glu Gly Lys Val Gly Gly Thr Ile Gly Gly Ile Ala Tyr
210 215 220
Asp Ser Thr Lys Phe Asn Gln Gly Gly Ser Val Ile Tyr Asn Tyr Ile
225 230 235 240
Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Thr Lys Ala Cys Leu Met Ala
245 250 255
Gln Val Ser Met Thr Ala Arg Leu Ala Leu Met Ala Arg
260 265
INFORMATION FOR SEQ ID NO:819:
SEQUENCE CHARACTERISTICS:
LENGTH: 198 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...198 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:819: Met Leu Val Thr Lys Leu Ala Pro Asp Phe Lys Ala Pro Ala Val Leu 1 5 10 Gly Asn Asn Glu Val Asp Glu His Phe Glu Leu Ser Lys Asn Leu Gly 25 Lys Asn Gly Val Ile Leu Phe Phe Trp Pro Lys Asp Phe Thr Phe Val 40 Cys Pro Thr Glu Ile Ile Ala Phe Asp Lys Arg Val Lys Asp Phe His 55 Glu Lys Gly Phe Asn Val Ile Gly Val Ser Ile Asp Ser Glu Gin Val 70 75 His Phe Ala Trp Lys Asn Thr Pro Val Glu Lys Gly Gly Ile Gly Gin 90 Val Ser Phe Pro Met Val Ala Asp Ile Thr Lys Ser Ile Ser Arg Asp 100 105 110 WO 97/37044 PCT/US97/05223 Tyr Asp Val Leu Phe Glu Glu Ala Ile Ala Leu Arg Gly Ala Phe 115 120 125 Ile Asp Lys Asn Met Lys Val Arg His Ala Val Ile Asn Asp Leu 130 135 140 Leu Gly Arg Asn Ala Asp Glu Met Leu Arg Met Val Asp Ala Leu 145 150 155 His Phe Glu Glu His Gly Glu Val Cys Pro Ala Gly Trp Arg Lys 165 170 175 Asp Lys Gly Met Lys Ala Thr His Gin Gly Val Ala Glu Tyr Leu 180 185 190 Glu Asn Ser Ile Lys Leu 195 INFORMATION FOR SEQ ID NO:820: SEQUENCE CHARACTERISTICS: LENGTH: 215 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:820: Leu Pro Leu 160 Gly Lys Met 1 Phe Ile Glu Pro Gin Ile Tyr Ile Ser 145 Thr Leu Lys Leu Ala Ser Lys Thr Ile Cys Leu Ser Leu Ile Ser Ser Thr Glu Gin Lys Ser Thr Asn Met 130 Tyr Ile Ala Ala Thr Pro Thr Asn Gin 115 Ile Thr Pro Val Gly Ile Ile Ile Leu 100 Glu Gin Asp Lys 5 Glu Phe Ala Thr Leu Thr Ser Asn Asp Gin 165 Ala Glu Thr Pro 70 Lys Phe Ala Phe Gin 150 Ser Phe Thr Thr 55 Gin Asn Tyr Glu Leu 135 Gly Gin Gin Gly 40 Gin Ser Ala Ser Glu 120 Pro Asn Ile Lys 25 Leu Glu Thr Thr Gin 105 Ala Tyr Val Ile Ser 185 10 His Leu Lys Tyr Glu 90 Asn Gly Asn Val Leu 170 Gin Gin Pro Gly 75 Leu Pro Tyr Leu Ser 155 Pro Lys Gly Lys Lys Phe Val Gly Asn 
140 Leu Ala Asp Thr Pro Tyr Ala Tyr Asn 125 Asn Gly Ser Gly Gin Lys Tyr Glu Val 110 Asn Ile Val Leu Phe Thr Pro Ile Asp Thr Ser Glu Ile Phe 175 Phe Gin Lys Ser Asn Ala Leu Leu Glu 160 Asn Asp Pro Gin Leu Asn Ala Asp Gly 180 Asn Asn Ser Lys Pro Thr Pro 190 WO 97/37044 PCT/US97/05223 724 His Asp Phe Leu Met Pro Ala Arg Arg Ile Cys Leu Thr Ser Ser Ala 195 200 205 Arg Leu Gin Pro Ile Phe Lys 210 215 INFORMATION FOR SEQ ID NO:821: SEQUENCE CHARACTERISTICS: LENGTH: 168 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...168 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:821: Met Leu Ser Ser Asn Asp Leu Phe Met Val Val Leu Gly Ala Ile Leu 1 5 10 Leu Val Leu Val Cys Leu Val Gly Tyr Leu Tyr Leu Lys Glu Lys Glu 25 Phe Tyr His Lys Met Arg Arg Leu Glu Lys Thr Leu Asp Glu Ser Tyr 40 Gin Glu Asn Tyr Leu Tyr Ser Lys Arg Leu Arg Glu Leu Glu Gly Arg 55 Leu Glu Gly Leu Ser Leu Glu Lys Ser Ala Lys Glu Asp Ser Ser Leu 70 75 Lys Thr Thr Leu Ser His Leu Tyr Asn Gin Leu Gin Glu Ile Gin Lys 90 Ser Met Asp Lys Glu Arg Asp Tyr Leu Glu Glu Lys Ile Ile Thr Leu 100 105 110 Glu Asn Lys Phe Lys Asp Met Gly His Tyr Ala Ala Ser Asp Glu Val 115 120 125 Asn Glu Lys Gin Val Leu Lys Met Tyr Gin Glu Gly Tyr Ser Val Asp 130 135 140 Ser Ile Ser Lys Glu Phe Lys Val Ser Lys Gly Glu Val Glu Phe Ile 145 150 155 160 Leu Asn Met Ala Gly Leu Lys Trp 165 INFORMATION FOR SEQ ID NO:822: SEQUENCE CHARACTERISTICS: LENGTH: 338 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 725 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...338 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:822: Val Gly Phe Glu Tyr Ser Ile Ser His Ala Val Glu His Asn Asn Pro 1 5 10 Phe Leu Asp Gin Glu Arg Ile Gin Ala Ile Ser Asn Ala Arg Gly Glu 25 Val Cys Gly Leu Asp Arg Val Lys Asn Glu Ile Thr 
Asp Met Pro Asn 40 Thr Phe Asn Tyr Ile Asn Asn Ala Leu Lys Asn Asn Ala Lys Leu Thr 55 Pro Thr Glu Lys Gin Ala Glu Thr Tyr Tyr Leu Gin Ser Thr Leu Gin 70 75 Asn Ile Glu Lys Ile Val Met Leu Ser Gly Gly Val Ala Ser Asn Pro 90 Lys Leu Ala Gin Ala Leu Glu Lys Met Gin Glu Pro Ile Thr Asn Pro 100 105 110 Leu Glu Leu Val Glu Asn Leu Lys Asn Leu Glu Leu Gin Phe Ser Gin 115 120 125 Ser Gin Asn Ser Met Leu Ser Ser Leu Ser Ser Gin Ile Ala Gin Ile 130 135 140 Ser Asn Ser Leu Asn Ala Leu Asp Pro Ser Ser Tyr Ser Lys Asn Val 145 150 155 160 Ser Ser Met Tyr Gly Val Gly Leu Ser Val Gly Tyr Lys His Phe Phe 165 170 175 Thr Lys Lys Lys Asn Gin Gly Phe Arg Tyr Tyr Leu Phe Tyr Asp Tyr 180 185 190 Gly Tyr Thr Asn Phe Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly 195 200 205 Lys Met Asn Asn His Leu Tyr Gly Leu Gly Ile Asp Tyr Leu Phe Asn 210 215 220 Phe Ile Asp Asn Ala Gin Lys His Ser Ser Val Gly Phe Tyr Val Gly 225 230 235 240 Phe Ala Leu Ala Gly Ser Ser Trp Val Gly Ser Gly Leu Gly Met Trp 245 250 255 Val Ser Gin Met Asp Phe Ile Asn Asn Tyr Leu Thr Asp Tyr Arg Ala 260 265 270 Lys Met His Thr Ser Phe Phe Gin Ile Pro Leu Asn Phe Gly Val Arg 275 280 285 Val Asn Val Asp Arg His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro 290 295 300 Leu Ala Val Asn Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Ala 305 310 315 320 Ser Leu Phe Phe Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr 325 330 335 Ser Phe INFORMATION FOR SEQ ID NO:823: WO 97/37044 PCT/US97/05223 SEQUENCE CHARACTERISTICS: LENGTH: 80 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:823: Met Met Phe Glu Leu Thr Lys Lys Thr Lys Phe Asp Gly Glu Met Ile 1 5 10 Gly Tyr Thr Glu Glu Leu Leu Thr Phe Leu Val Arg Asp Phe Phe Asn 25 Gly Ile Phe Lys Ser Lys Val Ile Pro Lys Met Pro Ile Phe Cys Gly 40 Asp Val Lys Cys Glu Asp Phe Asn Ala Leu Arg Ser Leu Val Tyr Leu 55 Ser 
Val Leu Glu Leu Glu Glu Thr Ile Asn Pro Asn Lys Ile Pro Phe
70 75
INFORMATION FOR SEQ ID NO:824:
SEQUENCE CHARACTERISTICS:
LENGTH: 200 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...200
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:824:
Val Gln Leu Leu Lys Asp Asn Lys Glu Val Val Val Leu Asp Thr Asp
1 5 10
Ser Gln Lys Ser Met Glu Thr Phe Ala Thr Ile Arg Ala Glu Lys Glu
25
Arg Pro Thr Phe Ser Leu Phe Asn Arg Ser Ser Gly Phe Ser Asp Thr
40
Leu Lys Gln Met Val Ser Lys Tyr Glu Asn Ile Leu Ile Asp Thr Lys
55
Gly Glu Tyr Ser Lys Glu Thr Gln Lys Ala Met Leu Leu Ser Asn Ile
70 75
Val Leu Val Pro Thr Thr Pro Ser Gln Leu Asp Thr Glu Val Leu Ala
90
Asn Met Leu Glu Arg Ile Glu Gln Leu Gln Glu Leu Asn Glu Asn Leu
100 105 110
Arg Ala Leu Ile Val Ile Asn Arg Met Pro Thr Ile Pro Thr Leu Lys
115 120 125
Glu Arg Gln Ala Leu Ile Glu Phe Ile Lys Glu Asn Asn Pro Ser Asp
130 135 140
Arg Ile Thr Leu Leu Glu Ser Ser Leu Ser Glu Arg Ile Val Tyr Lys
145 150 155 160
Arg Ser Val Ser Glu Gly Leu Gly Val Ile Glu Tyr Ser Asp Lys Lys
165 170 175
Ala Ile Asn Glu Trp Val Asn Phe Tyr Asn Glu Leu Lys Ser His Leu
180 185 190
Glu Lys Glu Lys Ile His Thr Phe
195 200
INFORMATION FOR SEQ ID NO:825:
SEQUENCE CHARACTERISTICS:
LENGTH: 70 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...70
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:825:
Met Arg Ser Asp Val Glu Val Leu Ser Pro Leu His Lys Ile Asp Glu
1 5 10
Lys Tyr Leu Phe His Leu Lys Ile Ala Gly Glu Leu Ala Ser Met Gly
25
Lys Ile Leu Ser Val Tyr Leu Ala His Lys His Ser Ala Tyr Phe Ile
40
Leu Asn Ala Leu Ser Tyr Gly Phe Ser His Gln Asp Arg Ala Ile Ile
55
Cys Leu Leu Gly Ala Ile
INFORMATION FOR SEQ ID NO:826:
SEQUENCE CHARACTERISTICS:
LENGTH: 70 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...70 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:826: Ser Ala Met Met Pro Ser Leu Leu Thr Leu Gin Trp Leu Ser Phe 5 10 Leu Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu 25 Tyr Thr Leu Glu Lys Asn Lys Leu Val Ile His Ser Asn Asp Ala 40 Tyr Leu Ala Lys Glu Met Leu Pro Lys Leu Ile Lys Pro Ile Pro 55 Thr Ile Glu Phe Ala Met 1 Ile Lys Leu Leu (2) Leu 1 Leu Val Ala Leu Glu
INFORMATION FOR SEQ ID NO:827: SEQUENCE CHARACTERISTICS: LENGTH: 478 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...478 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:827: Lys Lys Ile Ala Leu Ile Leu Asp Gly Ile Val 5 10 Asp Leu Val Leu Arg His Tyr Ser Asn His Asn 25 Val Lys Asp Glu Ser Leu Ile Pro Lys Asn Tyr 40 Phe His Cys Phe Asp Ala Thr Ser Ser Phe Arg 55 Asn Asp Glu Val Ser Asp Ala Phe Leu Ile Ile 70 75 Gin Arg Ile Ile His Lys Ile Ile Gln Thr His 90 Ala Phe Pro Leu Gin Phe Lys Tyr Ser Leu Asp Lys Asn Ile Thr Gin Phe Arg Phe Val Phe Val Lys Met WO 97/37044 PCTIUS97/05223 729 Arg Val Val Leu Ser Val Lys Arg Asp Ser Glu Lys Thr Leu Glu Asn 100 105 110 Asn Glu Glu Asn Lys Asp Glu Lys Leu Ile Leu Ile Asp Glu Phe Glu 115 120 125 Val Leu Ala Asn Lys Phe Ile Ser Arg Leu Pro Asn Ile Pro Ser Thr 130 135 140 Pro Arg Glu Phe Gly Leu Gly Lys Gly Glu Ile Met Glu Ile Asp Val 145 150 155 160 Pro Phe Gly Ser Ile Phe Ala Tyr Arg His Ile Gly Ser Ile Arg Gin 165 170 175 Lys Glu Tyr Arg Ile Val Gly Leu Tyr Arg Asn Asp Val Leu Leu Leu 180 185 190 Ser Thr Lys Ser Leu Val Ile Gin Pro Arg Asp Ile Leu Leu Val Ala 195 200 205 Gly Asn Pro Glu Ile Leu Asn Ala Val Tyr His Gin Val Lys Ser Asn 210 215 220 Val Gly Gin Phe Pro Ala Pro Phe Gly Lys Ser Ile Tyr Leu Tyr Ile 225 230 235 240 Asp Met Arg Leu Gin Ser Arg Lys Ala Met Met Arg Asp Val Tyr Gin 245 250 255 Ala Leu Phe Leu His Lys Arg Leu Lys Ser Tyr Lys Leu Tyr Ile Gin 260 265 270 Val Leu His Pro Thr Ser Pro Lys Phe Tyr His Lys Phe Leu Ser Leu 275 280 285 Glu Thr Glu Ser Ile Glu Val Asn Phe Asp Phe Tyr Gly Lys Ser Phe 290 295 300 Ile Gin Lys Leu His Glu Asp His Gin Lys Lys Met Gly Leu Ile Val 305 310 315 320 Val Gly Arg Glu Leu Phe Phe Leu Lys Lys His Arg Arg Ala Leu His 325 330 335 Lys Thr Ala Thr Pro Val Tyr Lys Thr Asn Thr Ser Gly Leu Ser Lys 340 345 350 Thr Thr Gin Ser Ile Val Val Leu Asn Glu 
Ser Leu Ser Ile Asn Glu 355 360 365 Asp Met Ser Ser Val Ile Phe Asp Val Ser Met Gin Met Asp Leu Gly 370 375 380 Leu Leu Leu Tyr Asp Phe Asp Pro Asn Lys His Tyr Lys Asn Glu Ile 385 390 395 400 Val Asn His Tyr Glu Asn Leu Ala Asn Thr Phe Asn Arg Lys Ile Glu 405 410 415 Ile Phe Gin Thr Asp Ile Lys Asn Pro Ile Met Tyr Leu Asn Ser Leu 420 425 430 Arg Asn Pro Ile Leu His Phe Met Pro Phe Glu Glu Cys Ile Thr Gin 435 440 445 Thr Arg Tyr Leu Trp Phe Leu Ser Thr Lys Val Glu Lys Leu Ala Phe 450 455 460 Leu Asn Asp Asp His Pro Gin Ile Phe Ile Pro Val Ala Glu 465 470 475 INFORMATION FOR SEQ ID N0:828: SEQUENCE CHARACTERISTICS: LENGTH: 418 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 730 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...418 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:828: Met Lys Lys Val Tyr Phe Lys Thr Phe Gly Cys Arg Thr Asn Leu Phe 1 5 10 Asp Thr Gin Val Met Gly Glu Asn Leu Lys Asp Phe Ser Ala Thr Leu 25 Glu Glu Gin Glu Ala Asp Ile Ile Ile Ile Asn Ser Cys Thr Val Thr 40 Asn Gly Thr Asp Ser Ala Val Arg Ser Tyr Ala Arg Lys Met Ala Arg 55 Leu Asp Lys Glu Val Leu Phe Thr Gly Cys Gly Val Lys Thr Gln Gly 70 75 Lys Glu Leu Phe Glu Lys Gly Leu Leu Lys GlyVal Phe Gly His Asp 90 Asn Lys Glu Lys Ile Asn Ala Leu Leu Gin Glu Lys Lys Arg Phe Phe 100 105 110 Ile Asp Asp Asn Leu Glu Asn Lys His Leu Asp Thr Thr Met Val Ser 115 120 125 Glu Phe Val Gly Lys Thr Arg Ala Phe Ile Lys Ile Gin Glu Gly Cys 130 135 140 Asp Phe Asp Cys Asn Tyr Cys Ile Ile Pro Ser Val Arg Gly Arg Ala 145 150 155 160 Arg Ser Phe Glu Glu Arg Lys Ile Leu Glu Gin Val Gly Leu Leu Cys 165 170 175 Ser Lys Gly Val Gin Glu Val Val Leu Thr Gly Thr Asn Val Gly Ser 180 185 190 Tyr Gly Lys Asp Arg Gly Ser Asn Ile Ala Arg Leu Ile Lys Lys Leu 195 200 205 Ser Gin Ile Thr Gly Leu Lys Arg Ile Arg Ile Gly Ser Leu Glu Pro 210 215 220 Asn Gin Ile Asn Asp Glu Phe Leu Glu Leu Leu Glu Glu Asp Phe Leu 225 
230 235 240 Glu Lys His Leu His Ile Ala Leu Gln His Ser His Asp Phe Met Leu 245 250 255 Glu Arg Met Asn Arg Arg Asn Arg Thr Lys Ser Asp Arg Glu Leu Leu 260 265 270 Glu Ile Ile Ala Ser Lys Asn Phe Ala Ile Gly Thr Asp Phe Ile Val 275 280 285 Gly His Pro Gly Glu Ser Glu Ser Val Phe Glu Lys Ala Phe Lys Asn 290 295 300 Leu Glu Ser Leu Pro Leu Thr His Ile His Pro Phe Ile Tyr Ser Lys 305 310 315 320 Arg Lys Asp Thr Pro Ser Ser Leu Met Arg Asp Ser Val Ser Leu Glu 325 330 335 Asp Ser Lys Lys Arg Leu Asn Ala Ile Lys Asp Leu Ile Phe His Lys 340 345 350 Asn Lys Ala Phe Arg Gln Leu Gln Leu Lys Leu Asn Thr Pro Leu Lys 355 360 365 Ala Leu Val Glu Ala Gln Lys Asp Gly Glu Phe Lys Ala Leu Asp Gln 370 375 380 Phe Phe Asn Pro Ile Lys Ile Lys Ser Asp Lys Pro Leu Arg Ala Ser 385 390 395 400 Phe Leu Glu Ile Lys Glu Tyr Glu Ile Lys Glu Arg Glu Asn His Ala 405 410 415 Val Phe INFORMATION FOR SEQ ID NO:829: SEQUENCE CHARACTERISTICS: LENGTH: 175 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...175 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:829: Leu Lys Ile Ala Tyr Arg Leu Leu Gly Leu Met Ser Phe Ile Ala Leu 1 5 10 Val Leu Ala Ile Val Leu Ile Ser Ile Leu Pro Leu Gln Lys Thr Glu 25 His His Phe Val Asp Phe Leu Asn Gln Asp Lys His Tyr Ala Ile Ile 40 Gln Arg Ala Asp Lys Ser Ile Ser Ser Asn Glu Ala Leu Ala Arg Ser 55 Leu Ile Gly Ala Tyr Val Leu Asn Arg Glu Ser Ile Asn Arg Ile Asp 70 75 Asp Lys Ser Arg Tyr Glu Leu Val Arg Leu Gln Ser Ser Ser Lys Val 90 Trp Gln Arg Phe Glu Asp Leu Ile Lys Ala Gln Asn Ser Ile Tyr Val 100 105 110 Gln Ser His Leu Glu Arg Glu Val His Ile Val Asn Ile Ala Ile Tyr 115 120 125 Gln Gln Asp Asn Asn Pro Ile Ala Ser Val Ser Ile Ala Ala Lys Leu 130 135 140 Leu Asn Glu Asn Lys Leu Val Tyr Glu Lys Arg Tyr Lys Ile Val Leu 145 150 155 160 Ser Tyr Leu Phe Asp Thr Pro Asp Phe Asp Tyr Ala Ser Met Pro 165 170 175 INFORMATION FOR SEQ ID NO:830: SEQUENCE CHARACTERISTICS: LENGTH: 186 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...186 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:830: Leu Leu Leu Val Ser Arg Phe Leu Asn Ala Ile Asp Pro Phe Asn Leu 1 5 10 Gly Val Leu Leu Ser Arg Phe Gln Ile Lys Asn Gly Cys Ile Tyr Gly 25 Val Cys Ser Tyr Lys Val Ser Lys Phe Thr Pro Gly Tyr Glu Glu Ser 40 Lys Ala Arg Val Leu Asn Ala Leu Asn Ile Leu Ser Lys His Gln Ile 55 Trp Gln Ser Asn Gln Glu Ser Val Thr Lys Val Lys Gly Thr Phe Val 70 75 Phe Ile Leu Glu Asn Asp Leu His Leu Asp Glu Asn Ser Phe Tyr Lys 90 Lys Leu Leu Asn Leu Ile Ile Asp Asn Asp Phe Phe Asn Arg Ser His 100 105 110 Leu Val Thr Pro Ser Asn Gly Thr Asn Ser His Pro Glu Leu His Arg 115 120 125 Ser Ile Thr Pro Arg Glu Ala Ala Arg Ile Gln Ser Phe Ser Asp Asp 130 135 140 Tyr Ile Phe Tyr Gly Asn Lys Thr Ser Val Cys Lys Gln Ile Gly Asn 145 150 155 160 
Ala Val Pro Pro Leu Leu Ala Leu Ala Leu Gly Lys Ala Ile Leu Lys 165 170 175 Ser Ala Arg Asn Asp Thr Asn Pro Ser Arg 180 185 INFORMATION FOR SEQ ID NO:831: SEQUENCE CHARACTERISTICS: LENGTH: 173 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...173 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:831: Met Gly Leu Lys Asn Lys Ile Lys Gly Phe Val Lys Glu Arg Met Pro 1 5 10 Phe Val Met Arg Cys Val Arg Gly Leu Lys Gly Ala Lys Asp Thr His 25 Glu Asn Ala His Asp Arg Asp Ala Tyr Cys Gly Ile Asn Arg Glu Ile 40 Lys Glu Met Leu Glu Ala Lys Lys Leu His Phe Leu Gln Glu Lys Ala 55 Leu Phe Asn His Asp His Gln Glu Ser Val Phe Leu Ala Ile Ala Ser 70 75 Leu Asn Asn Glu Ser Phe Ile Glu Tyr Asn Lys Ser Ile Tyr Lys Asn 90 Ser Ser Leu Asn Tyr Asn Tyr Gly Gly His Leu Glu Asp Arg Val Ile 100 105 110 His Pro Thr Leu Thr Leu Pro Asn Pro Thr His Ser Gly Tyr Phe Asp 115 120 125 Tyr Asp Lys Lys Ser Gln Asn Pro Lys Ser Pro Leu Asn Pro Trp Ala 130 135 140 Phe Ile Arg Val Lys Asn Glu Ile Val Thr Leu Glu Glu Ser Leu Phe 145 150 155 160 Ser Met Leu Pro Ala Val Gln Arg Gly Gly His Trp Phe 165 170 INFORMATION FOR SEQ ID NO:832: SEQUENCE CHARACTERISTICS: LENGTH: 175 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...175 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:832: Leu Trp His Gln Leu Tyr His Tyr Cys Asn Tyr Thr Leu Ser Phe Ile 1 5 10 Pro Lys Asn Glu Trp Val Val Lys Ile Asp Cys Asp His Ile Tyr Asp 25 Ala Lys Lys Leu Tyr Lys Ser Phe Tyr Ile Pro Lys Asn Ile Lys Glu 40 Val Val Met Tyr Ser Arg Ile Asn Phe Val Val Arg Asp Phe Glu Val 55 Phe Val Arg Asn Asp Gly Asp Phe Gly Phe Leu Asp Ala Trp Gly Asp 70 75 His Trp Leu Leu Tyr Asn Asp Cys Glu Pro Phe Glu Ile Trp Arg Tyr 
90 Asn Asp Glu Ser Tyr Glu Val Leu Lys Leu Lys Asp Lys His His Ile 100 105 110 Lys Asp Lys Glu Met Val Gln Trp His Phe Pro Leu Ala Lys Lys Arg 115 120 125 Arg Asn Ala Ile Val Tyr Asp Asp Leu Ile Pro Leu Glu Glu Phe Lys 130 135 140 Lys Arg His Ala Asp Leu Ile Gly Thr Arg Ile Glu Glu Ser Met Leu 145 150 155 160 Asp Glu Lys Arg Ile Leu Glu Val Tyr Gln Lys Phe Arg Leu Pro 165 170 175 INFORMATION FOR SEQ ID NO:833: SEQUENCE CHARACTERISTICS: LENGTH: 357 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:833: Met Gln Lys Pro Gln Asn Lys Pro Ser Ser Ser Gln Gln Ser Pro Gln 1 5 10 Asn Phe Ser Tyr Pro Glu Ser Lys Leu Gly Ser Lys Asn Ser Lys Asn 25 Ser Leu Leu Gln Pro Leu Val Thr Pro Ser Lys Val Ser Pro Thr Asn 40 Glu Val Lys Thr Pro Thr Asn Asp Ala Asn Pro Pro Leu Lys His Ser 55 Ser Gln Asp Gln Glu Asn Asn Leu Phe Val Ala Pro Pro Thr Glu Lys 70 75 Thr Leu Pro Asn Asn Thr Ser Ser Ala Asp Ala Ser Glu Asn Asn Glu 90 Ser Asn Glu Asn Arg Asp Asn Val Glu Lys Gln Ala Ile Arg Asp Pro 100 105 110 Asn Ile Lys Glu Phe Ala Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn 115 120 125 Leu Gln Ala Tyr Arg Pro Ser Ile Leu Lys Arg Val Asp Lys Asp Lys 130 135 140 Glu Ile Thr Thr Asp Ile Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn 145 150 155 160 Lys Ser Gly Lys Ile Ile Thr Pro Tyr Thr Lys Ile Ser Val His Lys 165 170 175 Thr Glu Pro Leu Glu Asp Pro Gln Thr Phe Glu Ala Lys Asn Asn Phe 180 185 190 Ala Ile Leu Gln Ala Arg Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg 195 200 205 Ala Arg Lys Asp Gly Thr Thr Arg Gln Cys Tyr Leu Ile Glu Glu Pro 210 215 220 Leu Lys Gln Ala Trp Glu Ser Glu Tyr Glu Ile Thr Thr Gln Leu Val 225 230 235 240 Lys Ala Ile Tyr Glu Arg Pro Lys Gln Asp Asp Gln Val Glu Pro Thr 245 250 255 Phe Tyr Glu Thr Ser Glu Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu 260 265 270 Ile Thr Arg Asn Glu Leu Asn Leu Asn Glu Lys Phe Met Glu Phe Val 275 280 285 Glu Val Tyr Glu Gly His Tyr Leu Asn Asp Ile Ile Lys Glu Ser Ser 290 295 300 Glu Tyr Lys Glu Trp Val Lys Asn His Val Arg Phe Lys Glu Gly Val 305 310 315 320 Cys Met Ala Leu Glu Ile Glu Glu Gln Pro Arg Ala Lys Ser Thr Pro 325 330 335 Leu Ser Ile Glu Asn Ser Arg Val Val Cys Val Lys Lys Gly Asn Tyr 340 345 350 Leu Phe Asn Glu Val 355 INFORMATION FOR SEQ ID NO:834: SEQUENCE CHARACTERISTICS: LENGTH: 143 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) 
HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...143 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:834: Ile Gln Tyr Pro Lys Ser Ser Phe Phe Gln Lys Arg Gly Arg Asn Gly 1 5 10 Asn His Ser Thr Ile Gln Pro Phe Asn His Ser Thr Ile Gln Pro Phe 25 Asn His Ser Ile Ile Gln Ser Phe Asn His Ser Ile Ile Gln Ser Phe 40 Asn His Ser Ile Ile Gln Ser Phe Asn His Ser Thr Ile Gln Ala Thr 55 Leu Pro Tyr Phe Tyr Asn Tyr Leu Ser Phe Tyr Lys Asn Leu Phe Lys 70 75 Asn Pro Leu Phe Phe Ile Ile Pro Pro Phe Ile Asn Pro Phe Ile Asn 90 Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn 100 105 110 Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn 115 120 125 Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile 130 135 140 INFORMATION FOR SEQ ID NO:835: SEQUENCE CHARACTERISTICS: LENGTH: 242 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...242 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:835: Leu Lys Val Asn Phe Phe Ala Thr Cys Leu Gly Ala Ala Ile Tyr Ser 1 5 10 Asn Ala Ser Leu Asn Ala Ile Lys Leu Leu Arg Lys Glu Asn Leu Glu 25 Val Val Phe Lys Lys Asp Gln Thr Cys Cys Gly Gln Pro Ser Tyr Asn 40 Ser Gly Tyr Tyr Glu Glu Thr Lys Lys Val Val Leu Tyr Asn Ile Lys 55 Leu Tyr Ser Asn Asn Asp Tyr Pro Ile Ile Leu Pro Ser Gly Ser Cys 70 75 Thr Gly Met Met Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala 90 Glu Phe Asn Met Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser 100 105 110 Glu Phe Leu Asp Lys Lys Leu Gln Val Lys Tyr Glu Asp Lys Gly Glu 115 120 125 Pro Leu Lys Ile Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala 130 135 140 Lys Val Ile Asp Ser Ala Lys Asn Leu Ile Arg Gln Leu Lys Asn Val 145 150 155 160 Glu Leu Ile Glu Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly 165 170 175 Thr Phe Ser Val Lys Glu Pro Glu Ile Ser Ala Val Met Val Lys 
Glu 180 185 190 Lys Ile Lys Asp Ile Glu Ser Arg His Val Asp Val Ile Val Ser Ala 195 200 205 Asp Ala Gly Cys Leu Met Asn Ile Ser Thr Ala Met Gln Lys Met Gly 210 215 220 Ser Leu Thr Lys Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu 225 230 235 240 Gly Leu INFORMATION FOR SEQ ID NO:836: SEQUENCE CHARACTERISTICS: LENGTH: 401 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...401 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:836: Met Ile Phe Gly Asp Phe Lys Tyr Gln Lys Ser Val Lys Lys Leu Thr 1 5 10 Ala Thr Asn Leu Asn Glu Leu Lys Asn Ala Leu Asp Phe Ile Ser Gln 25 Asn Arg Gly Lys Gly Tyr Phe Val Gly Tyr Leu Leu Tyr Glu Ala Arg 40 Leu Ala Phe Leu Asp Glu Asn Phe Gln Ser Gln Thr Pro Phe Leu Tyr 55 Phe Glu Gln Phe Leu Glu Arg Lys Lys Tyr Pro Leu Glu Pro Leu Lys 70 75 Glu His Ala Phe Tyr Pro Lys Ile His Ser Ser Leu Asp Gln Lys Thr 90 Tyr Phe Lys Gln Phe Lys Ala Val Lys Glu His Leu Lys Asn Gly Asp 100 105 110 Thr Tyr Gln Val Asn Leu Thr Met Asp Leu Leu Leu Asn Thr Lys Ala 115 120 125 Lys Pro Lys Arg Val Phe Lys Glu Val Ile His Asn Gln Asn Thr Pro 130 135 140 Phe Lys Ala Phe Ile Glu Asn Glu Phe Gly Ser Val Leu Ser Phe Ser 145 150 155 160 Pro Glu Leu Phe Phe Glu Leu Glu Phe Leu Asp Thr Ala Ile Lys Ile 165 170 175 Ile Thr Lys Pro Met Lys Gly Thr Ile Ala Arg Ser Asn Asn Pro Leu 180 185 190 Ile Asp Glu Lys Asn Arg Leu Phe Leu Gln Asn Asp Asp Lys Asn Arg 195 200 205 Ser Glu Asn Val Met Ile Val Asp Leu Leu Arg Asn Asp Leu Ser Arg 210 215 220 Leu Ala Leu Lys Asn Ser Val Lys Val Asn Gln Leu Phe Glu Ile Ile 225 230 235 240 Ser Leu Pro Ser Val Tyr Gln Met Ile Ser Glu Ile Glu Ala Gln Leu 245 250 255 Pro Leu Lys Thr Ser Leu Phe Glu Ile Phe Lys Ala Leu Phe Pro Cys 260 265 270 Gly Ser Val Thr Gly Cys Pro Lys Ile Lys Thr Met Gln Ile Ile Glu 275 280 285 Ser Leu Glu Lys Arg Pro Arg Gly Val 
Tyr Cys Gly Ala Ile Gly Met 290 295 300 Val Gly Gly Lys Lys Ala Leu Phe Ser Val Pro Ile Arg Thr Leu Glu 305 310 315 320 Lys Arg Ala His Glu Asp Phe Leu His Leu Gly Val Gly Ser Gly Val 325 330 335 Thr Tyr Lys Ser Lys Ala Ser Lys Glu Tyr Glu Glu Ser Phe Leu Lys 340 345 350 Ser Phe Phe Val Met Pro Lys Ile Glu Phe Glu Ile Val Glu Thr Met 355 360 365 Arg Val Ile Lys Arg Asp Gln Lys Leu Glu Ile Asn Asn Lys Asn Ala 370 375 380 His Lys Glu Arg Leu Met His Ser Ala Gln Tyr Phe Asn Phe Lys Tyr 385 390 395 400 Arg INFORMATION FOR SEQ ID NO:837: SEQUENCE CHARACTERISTICS: LENGTH: 264 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...264 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:837: Met Lys Ile Ser Val Ser Lys Asn Asp Leu Glu Asn Thr Leu Arg Tyr 1 5 10 Leu Gln Ala Phe Leu Asp Lys Lys Asp Ala Ser Ser Ile Ala Ser His 25 Ile His Leu Glu Val Ile Lys Glu Lys Leu Phe Leu Lys Ala Ser Asp 40 Ser Asp Ile Gly Leu Lys Ser Tyr Ile Ser Thr Gln Ser Thr Asp Lys 55 Glu Gly Val Gly Thr Ile Asn Gly Lys Lys Phe Leu Asp Ile Ile Ser 70 75 Cys Leu Lys Asp Ser Asn Ile Val Leu Glu Thr Lys Asp Asp Ser Leu 90 Val Ile Lys Gln Asn Lys Ser Ser Phe Lys Leu Pro Met Phe Asp Ala 100 105 110 Asp Glu Phe Pro Glu Phe Pro Val Ile Asp Pro Lys Val Ser Leu Glu 115 120 125 Ile Asn Ala Pro Phe Leu Val Asp Ala Phe Lys Lys Ile Ala Pro Val 130 135 140 Ile Glu Gln Thr Ser His Lys Arg Glu Leu Ala Gly Val Leu Met Gln 145 150 155 160 Phe Asn Gln Lys His Gln Thr Leu Ser Val Val Gly Thr Asp Thr Lys 165 170 175 Arg Leu Ser Tyr Thr Gln Leu Glu Lys Ile Ser Ile His Ser Thr Glu 180 185 190 Glu Asp Ile Ser Cys Ile Leu Pro Lys Arg Ala Leu Leu Glu Ile Leu 195 200 205 Lys Leu Phe Tyr Glu Asn Phe Ser Phe Lys Ser Asp Gly Met Leu Ala 210 215 220 Val Val Glu Asn Glu Thr His Ala Phe Phe Thr Lys Leu Ile Asp Gly 225 230 235 240 Asn Tyr Pro Asp Tyr Gln Lys Ile Leu Pro Lys 
Glu Tyr Thr Ser Ser 245 250 255 Phe Thr Leu Gly Lys Glu Glu Phe 260 INFORMATION FOR SEQ ID NO:838: SEQUENCE CHARACTERISTICS: LENGTH: 312 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...312 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:838: Leu Asn Asn Leu Glu Pro Leu Gly Ala Glu Val Phe Val Gly Leu Asp 1 5 10 Gly Ile Asp Ala Met Ile Glu Glu Cys Val Ser Asn Leu Val Ile Asn 25 Ala Ile Val Gly Val Ala Gly Leu Lys Ala Ser Phe Lys Ser Leu Gln 40 Arg Asn Lys Lys Leu Ala Leu Ala Asn Lys Glu Ser Leu Val Ser Ala 55 Gly His Leu Leu Asp Ile Ser Gln Ile Thr Pro Val Asp Ser Glu His 70 75 Phe Gly Leu Trp Ala Leu Leu Gln Asn Lys Thr Leu Lys Pro Lys Ser 90 Leu Ile Ile Ser Ala Ser Gly Gly Ala Phe Arg Asp Thr Pro Leu Asp 100 105 110 Leu Ile Ala Ile Gln Asn Ala Gln Asn Ala Leu Lys His Pro Asn Trp 115 120 125 Ser Met Gly Asp Lys Ile Thr Ile Asp Ser Ala Ser Met Val Asn Lys 130 135 140 Leu Phe Glu Ile Leu Glu Thr Tyr Trp Leu Phe Gly Ala Ser Leu Lys 145 150 155 160 Ile Asp Ala Leu Ile Glu Arg Ser Ser Ile Val His Ala Leu Val Glu 165 170 175 Phe Glu Asp Asn Ser Val Ile Ala His Leu Ala Ser Ala Asp Met Gln 180 185 190 Leu Pro Ile Ser Tyr Ala Ile Asn Pro Lys Leu Ala Ser Leu Ser Ala 195 200 205 Ser Ile Lys Pro Leu Asp Leu Tyr Ala Leu Ser Ala Ile Lys Phe Glu 210 215 220 Pro Ile Ser Val Glu Arg Tyr Thr Leu Trp Arg Tyr Lys Asp Leu Leu 225 230 235 240 Leu Glu Asn Pro Lys Leu Gly Val Val Leu Asn Ala Ser Asn Glu Val 245 250 255 Ala Met Lys Lys Phe Leu Asn Gln Glu Ile Ala Phe Gly Gly Phe Ile 260 265 270 Gln Ile Ile Ser Gln Ala Leu Glu Leu Tyr Ala Lys Lys Ser Phe Lys 275 280 285 Leu Ser Ser Leu Asp Glu Val Leu Ala Leu Asp Lys Glu Val Arg Glu 290 295 300 Arg Phe Gly Ser Val Ala Arg Val 305 310 INFORMATION FOR SEQ ID NO:839: SEQUENCE CHARACTERISTICS: LENGTH: 290 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: 
protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...290 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:839: Met Leu Glu Asn Val Lys Lys Ser Leu Phe Arg Val Leu Cys Leu Gly 1 5 10 Ala Leu Cys Leu Gly Gly Leu Met Ala Glu Gln Asp Pro Lys Glu Leu 25 Val Gly Leu Gly Ala Lys Ser Tyr Lys Glu Gln Asp Phe Thr Gln Ala 40 Lys Lys Tyr Phe Glu Lys Ala Cys Asp Leu Lys Glu Asn Ser Gly Cys 55 Phe Asn Leu Gly Val Leu Tyr Tyr Gln Gly His Gly Val Glu Lys Asn 70 75 Leu Lys Lys Ala Ala Ser Phe Tyr Ser Lys Ala Cys Asp Leu Asn Tyr 90 Ser Asn Gly Cys His Leu Leu Gly Asn Leu Tyr Tyr Ser Gly Gln Gly 100 105 110 Val Ser Gln Asn Thr Asn Lys Ala Leu Gln Tyr Tyr Ser Lys Ala Cys 115 120 125 Asp Leu Lys Tyr Ala Glu Gly Cys Ala Ser Leu Gly Gly Ile Tyr His 130 135 140 Asp Gly Lys Val Val Thr Arg Asp Phe Lys Lys Ala Val Glu Tyr Phe 145 150 155 160 Thr Lys Ala Cys Asp Leu Asn Asp Gly Asp Gly Cys Thr Ile Leu Gly 165 170 175 Ser Leu Tyr Asp Ala Gly Arg Gly Thr Pro Lys Asp Leu Lys Lys Ala 180 185 190 Leu Ala Ser Tyr Asp Lys Ala Cys Asp Leu Lys Asp Ser Pro Gly Cys 195 200 205 Phe Asn Ala Gly Asn Met Tyr His His Gly Glu Gly Ala Ala Lys Asn 210 215 220 Phe Lys Glu Ala Leu Ala Arg Tyr Ser Lys Ala Cys Glu Leu Glu Asn 225 230 235 240 Gly Gly Gly Cys Phe Asn Leu Gly Ala Met Gln Tyr Asn Gly Glu Gly 245 250 255 Ala Thr Arg Asn Glu Lys Gln Ala Ile Glu Asn Phe Lys Lys Gly Cys 260 265 270 Lys Leu Gly Ala Lys Gly Ala Cys Asp Ile Leu Lys Gln Leu Lys Ile 275 280 285 Lys Val 290 INFORMATION FOR SEQ ID NO:840: SEQUENCE CHARACTERISTICS: LENGTH: 237 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:840: Met Lys Ser Asn Lys Lys Ser Asn His Leu Arg Ala Ile Tyr Arg Ala 1 5 10 Leu Val Ile Ala Ile Gly Leu Ala Val Ile Ile Val Phe Asn Tyr Phe 
25 Asn Arg Lys Asn Asn Asn Ala Arg Ser Ser Arg Arg Ala Cys Ser Cys 40 Phe Phe Ser Leu Thr Gly Val Asn Leu Glu Lys Ile Gly Ser Phe Asp 55 Thr Gly Ala Lys Leu Ile Val Leu Asn His Gln Ser Leu Leu Asp Ile 70 75 Ile Tyr Leu Glu Ala Tyr His Pro Ser Asn Ile Cys Trp Ile Ala Lys 90 Lys Glu Leu Gly Glu Ile Pro Phe Tyr Gly His Ala Leu Thr Asp Thr 100 105 110 Gly Met Ile Leu Ile Asp Arg Glu Asp Lys Lys Gly Ile Val Ser Leu 115 120 125 Leu Lys Ala Cys Lys Glu Lys Leu Asp Gln Asn Arg Pro Leu Val Ile 130 135 140 Phe Pro Glu Gly Thr Arg Gly Lys Gly Gly Glu Lys Phe Leu Pro Phe 145 150 155 160 Lys Gln Gly Ala Lys Ile Ile Ala Glu Lys Phe Gln Leu Lys Ile Gln 165 170 175 Pro Met Val Leu Ile Asn Ser Ile Lys Ile Phe Asn Ser Lys Pro Leu 180 185 190 Glu Ala Tyr Lys Ala Arg Thr Arg Leu Val Met Leu Glu Ser Tyr Thr 195 200 205 Pro Asp Phe Ser Ser Pro Thr Trp Tyr Glu Glu Leu Gln Glu Arg Met 210 215 220 Gln Lys Glu Tyr Leu Lys His Tyr His Glu Leu Asn Ala 225 230 235 INFORMATION FOR SEQ ID NO:841: SEQUENCE CHARACTERISTICS: LENGTH: 681 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...681 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:841: Met Gly Phe Glu Lys Ser Ile Leu Asp Asn Leu Asn Gly Ala Gln Lys 1 5 10 Ile Ala Ala Cys His Ile Gln Gly Pro Leu Leu Ile Leu Ala Gly Ala 25 Gly Ser Gly Lys Thr Lys Thr Leu Thr Ser Arg Leu Ala Tyr Leu Ile 40 Gly Ala Cys Gly Val Pro Ser Glu Asn Thr Leu Thr Leu Thr Phe Thr 55 Asn Lys Ala Ser Lys Glu Met Gln Glu Arg Ala Leu Lys Leu Leu Lys 70 75 Asn Gln Ala Leu Ile Pro Pro Leu Leu Cys Thr Phe His Arg Phe Gly 90 Leu Leu Phe Leu Arg Gln His Met Asn Leu Leu Lys Arg Ala Cys Asp 100 105 110 Phe Ser Val Leu Asp Ser Asp Glu Val Lys Thr Leu Cys Lys Gln Leu 115 120 125 Lys Ile Ser Asn Phe Arg Ala Ser Ile Ser Gln Ile Lys Asn Gly Met 130 135 140 Met Asp Leu Ser Val Gln Asp Ser Glu Cys Tyr Lys Ala Tyr Glu Leu 
145 150 155 160 Tyr Gin Asn Ala Leu Lys Lys Asp Asn Leu Val Asp Phe Asp Asp Leu 165 170 175 WO 97/37044 PCTIUS97/05223 Leu Glu Thr His 225 Arg Gly Ile Ile Lys 305 Ile Leu Ala Arg Lys 385 Gly Glu Lys Ile Glu 465 Asp Val Leu Val Val 545 Asn Ile Ser Glu Lys Cys Thr Asn 210 Asn Gly Ala Leu Lys 290 Glu Lys Tyr Leu Ala 370 Asp Leu Gly Leu Gly 450 Arg Asn Lys Asn Ser 530 Phe Gin Thr Tyr Ala 610 Asp Leu Ser 195 Ala Leu Ala Lys Ala 275 Thr Tyr Ala Arg Asn 355 Glu Asp Gly Leu Asn 435 Arg Phe Tyr Glu Glu 515 Cys Val Glu Arg Phe 595 Gin Thr Ser 180 Glu Leu Cys Asp Ile 260 Cys Leu Pro Leu Leu 340 Ile Val Arg Lys Asn 420 Pro Leu Leu Glu His 500 Ser Met Ile Ser Ala 580 Gly Leu Pro Leu Arg Gin Val Ile 245 Val Ala Gin Thr Leu 325 Asn Pro Lys Phe Ile 405 Leu Lys Arg Glu Glu 485 Phe Ala Ser Gly Asp 565 Lys Arg Leu Ile Lys Tyr Leu Val 230 Ser Lys Asn Ser Gin 310 Lys Gly Tyr Asp Phe 390 Thr Glu Asn Glu Glu 470 Arg Lys Leu Va1 Leu 550 Leu Glu Lys Gin Lys Ile His Glu 215 Gly Asn Leu Ser Phe 295 Lys Lys Leu Arg Ala 375 Ile Gin Glu Glu Ala 455 Thr Glu Thr Asp His 535 Glu Glu Glu Ile Gin 615 Va1 Leu Tyr 200 Phe Asp Ile Glu Leu 280 Lys Glu Gly Ser Leu 360 Leu Lys Glu Ala Tyr 440 Phe Asn Gly Asn Val 520 Met Glu Glu Leu Ser 600 Asp Gly Gin 185 Ile Leu Asp Leu Thr 265 Ile Gly Glu Glu Arg 345 Ile Ala Arg Trp Leu 425 Ala Glu Leu Phe Pro 505 His Ser Gly Glu Gin 585 Cys Lys Asp Asp Met Lys Asp Asn 250 Asn Ser Ser Ser Asn 330 Ser Gly Leu Va1 Ile 410 Lys Leu Ile Leu Val 490 Thr Asn Lys Phe Arg 570 Leu Ser Pro Leu Asn Val Gin Gin 235 Phe Tyr His His Leu 315 Leu Ile Ala Met Leu 395 Phe Ile Lys Ser Lys 475 Lys His Thr Gly Phe 555 Arg Ser Pro Pro Ile Glu Asp Leu 220 Ser Ser Arg Asn Lys 300 Asp Glu Glu Val His 380 Asn Ser Gly Lys Val 460 Ser Glu Ser Glu Leu 540 Pro Leu Tyr Ser Lys 620 Lys Lys Glu 205 Ser Ile Lys Ser Gin 285 Ser Val Asn Glu Ser 365 Va1 Lys Leu Ala Phe 445 Glu Tyr Leu Leu Asn 525 Glu His Ala Va1 Val 605 Gin His Leu 190 Tyr Phe Tyr His Ser 270 His Val Ala Ile Ser 350 Phe Val Pro Leu Phe 430 
Thr Lys Glu Leu Leu 510 Ala Phe Arg Tyr Lys 590 Phe Asn Lys Ala Gin Thr Gly Phe 255 Ala Arg Ile Tyr Ala 335 Leu Tyr Ala Pro Asp 415 Lys Ala Phe Lys Ser 495 Asp Gin Lys Gly Val 575 Glu Leu His Ile Lys Asp His Phe 240 Lys Glu His Cys Gin 320 Ile Asn Glu Lys Arg 400 Glu Asp Met Cys Glu 480 Leu Phe Lys His Phe 560 Ala Arg Glu Gin Phe WO 97/37044 PCT/US97/05223 744 625 630 635 640 Gly Thr Gly Arg Val Leu Gly Val Glu Lys Gly Leu Ser Gly Leu Cys 645 650 655 Leu Lys Ile Asn Cys Gly Gly Asn Val Tyr Asp Lys Ile Ser Glu Lys 660 665 670 Phe Val Glu Lys Val Asp Asn Glu Phe 675 680 INFORMATION FOR SEQ ID NO:842: SEQUENCE CHARACTERISTICS: LENGTH: 276 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...276 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:842: Met Lys Thr Ser Asn Thr Lys Thr Pro Lys Pro Val Leu Ile Ala Gly 1 5 10 Pro Cys Val Ile Glu Ser Leu Glu Asn Leu Arg Ser Ile Ala Ile Lys 25 Leu Gin Pro Leu Ala Asn Asn Glu Arg Leu Asp Phe Tyr Phe Lys Ala 40 Ser Phe Asp Lys Ala Asn Arg Thr Ser Leu Glu Ser Tyr Arg Gly Pro 55 Gly Leu Glu Lys Gly Leu Glu Met Leu Gin Thr Ile Lys Asp Glu Phe 70 75 Gly Tyr Lys Ile Leu Thr Asp Val His Glu Ser Tyr Gin Ala Ser Val 90 Ala Ala Lys Val Ala Asp Ile Leu Gin Ile Pro Ala Phe Leu Cys Arg 100 105 110 Gin Thr Asp Leu Ile Val Glu Val Ser Gin Thr Asn Ala Ile Val Asn 115 120 125 Ile Lys Lys Gly Gin Phe Met Asn Pro Lys Asp Met Gin Tyr Ser Val 130 135 140 Leu Lys Ala Leu Lys Thr Arg Asp Ser Ser Ile Gin Ser Pro Thr Tyr 145 150 155 160 Glu Thr Ala Leu Lys Asn Gly Val Trp Leu Cys Glu Arg Gly Ser Ser 165 170 175 Phe Gly Tyr Gly Asn Leu Val Val Asp Met Arg Ser Leu Lys Ile Met 180 185 190 Arg Glu Phe Ala Pro Val Ile Phe Asp Ala Thr His Ser Val Gin Met 195 200 205 Pro Gly Gly Ala Asn Gly Lys Ser Ser Gly Asp Ser Ser Phe Pro Pro 210 215 220 Ile Leu Pro Arg Ala Ala Ala Ala Val Gly Ile Asp Gly Leu Phe Ala WO 97/37044 
225 230 235 240 Glu Thr His Ile Asp Pro Lys Asn Ala Leu Ser Asp Gly Ala Asn Met 245 250 255 Leu Lys Pro Asp Glu Leu Glu His Leu Val Thr Asp Met Leu Lys Ile 260 265 270 Gln Asn Leu Phe 275 INFORMATION FOR SEQ ID NO:843: SEQUENCE CHARACTERISTICS: LENGTH: 436 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...436 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:843: Met Met Lys Phe Phe Leu Leu Lys Lys Phe Ser Glu Phe Leu Asn Thr 1 5 10 Gln Thr His Phe Asn Leu Lys Arg Leu Asn Ala Ser Ser Phe Leu Leu 25 Glu Thr Phe Ser Lys Glu Lys His Ala Phe Val Val Asp Leu Ser Ala 40 Pro Tyr Ile Gly Leu Ser Lys Lys Pro Pro Glu Ser Val Leu Lys Asn 55 Thr Leu Ala Leu Asp Phe Cys Leu Asn Lys Phe Thr Lys Asn Ala Lys 70 75 Ile Leu Gln Ala Asn Val Ile Asp Asn Asp Arg Ile Leu Glu Ile Lys 90 Gly Ala Lys Asp Leu Ala Tyr Lys Ser Glu Thr Phe Ile Leu Arg Leu 100 105 110 Glu Met Ile Pro Lys Lys Ala Asn Leu Met Ile Leu Asp Gln Glu Lys 115 120 125 Cys Val Ile Glu Ala Phe Arg Phe Asn Asp Arg Val Ala Lys Asn Asp 130 135 140 Ile Leu Gly Ala Leu Pro Pro Asn Ile Tyr Glu His Gln Glu Glu Asp 145 150 155 160 Leu Asp Phe Lys Gly Leu Leu Asp Ile Leu Glu Lys Asp Phe Leu Ser 165 170 175 Tyr Gln His Lys Glu Leu Glu His Lys Lys Asn Gln Ile Ile Lys Arg 180 185 190 Leu Asn Ala Gln Lys Glu Arg Leu Lys Glu Lys Leu Glu Lys Leu Glu 195 200 205 Asp Pro Lys Thr Leu Gln Leu Glu Ala Lys Glu Leu Gln Thr Gln Ala 210 215 220 Ser Leu Leu Leu Thr Tyr Gln His Leu Ile Asn Arg Arg Glu Asn Arg 225 230 235 240 Val Ile Leu Lys Asp Phe Glu Asp Lys Glu Cys Met Ile Glu Ile Asp 245 250 255 Lys Ser Met Pro Leu Asn Ala Phe Ile Asn Lys Lys Phe Thr Leu Ser 260 265 270 Lys Lys Lys Lys Gln Lys Ser Gln Phe Leu Tyr Leu Glu Glu Glu Asn 275 280 285 Leu Lys Glu Lys Ile Ala Phe Lys Glu Asn Gln Ile Asn Tyr Val Arg 290 295 300 Asp Ala Ala Glu Glu Ser Val Leu Glu Met 
Phe Met Pro Val Lys Asn 305 310 315 320 Ser Lys Ile Lys Arg Pro Met Asn Gly Tyr Glu Val Leu Tyr Tyr Lys 325 330 335 Asp Phe Lys Ile Gly Leu Gly Lys Asn Gln Lys Glu Asn Ile Lys Leu 340 345 350 Leu Gln Asp Ala Arg Ala Asn Asp Leu Trp Met His Val Arg Asp Ile 355 360 365 Pro Gly Ser His Leu Ile Val Phe Cys Gln Lys Asn Thr Pro Lys Asp 370 375 380 Glu Val Ile Met Glu Leu Ala Lys Met Leu Ile Lys Met Gln Lys Asp 385 390 395 400 Ala Phe Asn Gly Tyr Glu Ile Asp Tyr Thr Gln Arg Lys Phe Val Lys 405 410 415 Ile Ile Lys Gly Ala His Val Ile Tyr Ser Lys Tyr Arg Thr Ile Ser 420 425 430 Leu Lys Asp Thr 435 INFORMATION FOR SEQ ID NO:844: SEQUENCE CHARACTERISTICS: LENGTH: 231 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:844: Met Phe Lys Lys Ile Ile Phe Leu Cys Val Phe Leu Ile Gly Gly Phe 1 5 10 Val Ile Pro Pro Leu Glu Ala Met Pro Ile Leu Arg Asn Lys Thr Pro 25 Lys Lys Asn Tyr Gln Glu Ala His Glu Lys Leu Tyr Arg Ser Ile Ile 40 Asn Arg Gln Lys Leu Thr Arg Lys Lys Ser Gly Trp Tyr Phe Leu Gly 55 Gly Val Gly Ala Val Glu Ala Ile Lys Asp Tyr Gln Gly Lys Glu Met 70 75 Lys Asp Trp Ile Ala Thr Leu Asn Leu Lys Thr Gly Val Gln Ser Phe 90 Phe Lys Lys Tyr Ile Gly Ile Arg Gly Val Phe Ala Trp Asp Leu Gly 100 105 110 Ser Gly Lys Val Asn Tyr Gln Ser Tyr Lys Asp Pro Thr Asn Ser Phe 115 120 125 Phe Thr Met Leu Ala Val Gly Leu Asp Val Ile Met Glu Phe Pro Leu 130 135 140 Gly Ser Tyr Lys His Tyr Leu Gly Ala Phe Gly Gly Ala Arg Gly Ala 145 150 155 160 Leu Val Val Tyr Thr Asp Lys Gln Asn Phe Lys Phe Phe Lys His Ser 165 170 175 Val Val Ser Gly Gly Leu Ala Ile Ser Gly Gly Val Met Leu Thr Leu 180 185 190 Phe Leu Arg His Arg Ile Glu Leu Gly Phe Lys Ile Leu Pro Thr Ala 195 200 205 Arg Leu Leu Ser Ser Ser Ser Arg Phe Glu Thr Ser Pro Leu Phe Tyr 210 215 220 Ala Ala Tyr Ser Tyr Lys Phe 225 230 
INFORMATION FOR SEQ ID NO:845: SEQUENCE CHARACTERISTICS: LENGTH: 660 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein -(iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...660 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:845: Met Leu Lys Lys Ile Phe Tyr Gly Phe Ile Val Leu Phe Leu Ile Ile 1 5 10 Val Gly Leu Leu Ala Val Leu Val Ala Gin Val Trp Val Thr Thr Asp 25 Lys Asp Ile Ala Lys Ile Lys Asp Tyr Arg Pro Ser Val Ala Ser Gin 40 Ile Leu Asp Arg Lys Gly Arg Leu Ile Ala Asn Ile Tyr Asp Lys Glu 55 Phe Arg Phe Tyr Ala Arg Phe Glu Glu Ile Pro Pro Arg Phe Val Glu 70 75 Ser Leu Leu Ala Val Glu Asp Thr Leu Phe Phe Glu His Gly Gly Ile 90 Asn Leu Asp Ala Val Met Arg Ala Met Ile Lys Asn Ala Lys Ser Gly 100 105 110 Arg Tyr Thr Glu Gly Gly Ser Thr Leu Thr Gin Gin Leu Val Lys Asn WO 97/37044 PCT/US97/05223 Met Ile 145 Glu Lys Leu Tyr Ile 225 Lys Gin Gin Asp Gin 305 Ser Glu Lys Ser Ser 385 Asn Tyr His Glu Lys 465 Ala Pro Phe Leu Leu 545 Asn Val 130 Ile Arg Thr Lys Asp 210 Leu Ser Asn Leu Leu 290 Lys Asn Thr Lys Ala 370 Thr Tyr Thr Ser Lys 450 Asp Ala Met Thr Thr 530 Ala Asn 115 Leu Ser Tyr Ala Glu 195 Pro Arg Ala lie Asp 275 Asp Ile Asp Ser Ser 355 Ile Thr Ser Arg Leu 435 Ile Leu Glu Leu Pro 515 Leu Arg Asn Thr Ile Leu Ser 180 Ile Thr Arg Leu Ala 260 Gly Tyr Leu Lys Thr 340 Ala Lys Ser Lys Lys 420 Asn Tyr Ser Lys Ile 500 Ile Ser Ile Ile Arg Glu Arg Ile 150 Asn Gin 165 Leu Gly Thr Met Lys Asn Leu Tyr 230 Asn Glu 245 Pro Tyr Leu Lys Gin Arg Glu Lys 310 Asp Glu 325 Gly Lys Phe Asn Pro Phe Lys Ile 390 Asn Ser 405 Phe Leu Leu Ala Gin Ser Ile Val 470 Tyr Ser 485 Glu Ser Glu Thr Ala Leu Lys Gly 550 Asp Ala 565 Lys 135 Glu Thr Tyr Leu Leu 215 Ser Val Val Thr Leu 295 Ile Asp Ile Arg Val 375 Pro Val Gly Thr Leu 455 Leu Leu Ile Lys Met 535 Leu Trp 120 Thr Lys Phe Phe Val 200 Glu Leu Pro Val Gin 280 Ala Ala Asn Leu Ala 360 Tyr Asp Gin Leu Ile 440 Ser Gly Phe Thr Lys 520 Asp Glu Phe Leu Thr Val Leu Phe Gly 
170 Lys Lys 185 Ala Leu Phe Ser Gly Trp Ile Val 250 Asp Glu 265 Gly Tyr Leu Glu Lys Glu Leu Asn 330 Ala Leu 345 Thr Gln Gin Ile Thr Ala Asn His 410 Val Thr 425 Asn Leu Asp Met Ser Phe Ser Asn 490 Asn Gin 505 Ile Thr Ala Val Ile Ala Ile Gly 570 Arg Ser 155 His Pro Pro Leu Ile 235 Tyr Val Thr Ser Lys 315 Ala Val Ala Ala Arg 395 Ala Leu Ser Gly Ala 475 Tyr Gin Ser Glu Gly 555 Phe Lys 140 Lys Gly Leu Arg Ser 220 Ser Asn Leu Ile Leu 300 Pro Ser Gly Lys Phe 380 Asn Trp Gin Asp Phe 460 Ile Gly Asn Lys Asn 540 Lys Thr 125 Leu Glu Tyr Asp Ala 205 Arg Ser Gin Lys Lys 285 Arg Lys Met Gly Arg 365 Asp Phe His Glu Gin 445 Lys Ser Thr Glu Glu 525 Gly Thr Pro Lys Glu Tyr Lys 190 Pro Ala Asn Thr Gin 270 Leu Phe Thr Ile Ile 350 Gin Asn Glu Pro Ala 430 Leu Asn Pro Met Val 510 Gin Thr Gly Thr Glu Ile Gly 175 Leu Ser Asn Glu Ser 255 Leu Thr Gly Asn Val 335 Asp Phe Gly Asn Ser 415 Leu Gly Leu Ile Leu 495 Lys Ala Gly Thr Leu 575 Ala Leu 160 Val Thr Phe Asp Leu 240 Thr Asp Ile His Ala 320 Thr Tyr Gly Tyr Gly 400 Asn Ser Phe Pro Asp 480 Lys Thr Phe Ser Ser 560 Gin WO 97/37044 PCT/US97/05223 749 Ser Val Ile Trp Phe Gly Arg Asp Asp Asn Thr Pro Ile Gly Lys 580 585 590 Ala Thr Gly Gly Val Val Ser Ala Pro Val Tyr Ser Tyr Phe Met 595 600 605 Asn Ile Leu Ala Ile Glu Pro Ser Leu Lys Arg Lys Phe Asp Val 610 615 620 Lys Gly Leu Arg Lys Glu Ile Val Asp Lys Ile Pro Tyr Tyr Ser 625 630 635 Pro Asn Ser Ile Thr Pro Thr Pro Lys Lys Thr Asp Asp Ser Glu 645 650 655 Arg Leu Leu Phe 660 INFORMATION FOR SEQ ID NO:846: SEQUENCE CHARACTERISTICS: LENGTH: 324 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...324 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:846: Gly Arg Pro Ser 640 Glu Met Lys Tyr Leu Trp Leu Phe Leu Ile Tyr Ala Ile 1 Thr Ile Leu Val Asp Gly Leu Phe Pro 145 Gly Asp Glu His Ser Lys Asn Lys Ala 130 Ser Pro Lys Val Glu Gin Lys Lys Lys 115 Ala Ile Gly Thr Arg Val 
Asn Val Ile 100 Thr His Ala Ile 5 Leu Tyr Leu Lys His Ser Phe Asn Trp Thr 165 Asp Ser Ala Glu 70 Leu Arg Asp Met Met 150 Asn Ile Ile Asn 55 Gin Val Leu Tyr Ala 135 Lys Ile Ile Asp 40 Asp Gly Ala Lys Pro 120 Ile Arg Ala Lys 25 Asn Leu Ala Leu Leu 105 Ile Val Leu Leu Thr Asp Lys Ile Val Tyr Val Val Ile Ala 170 Ile Ala Thr Asn 75 Ser Asp Ser Asn Val 155 Asp Gin Asn Ser Tyr Val Val Leu Asp 140 Phe Tyr Gly Lys Tyr Gln Ala Ala Asp Asp 125 Tyr Ser Thr Leu Leu Ala His Glu Val Thr 110 Leu Leu Lys Met Ala Lys Lys Asp Lys Asn Thr Pro Ala Ile 160 Tyr Gin Lys Glu Ile Ile Lys Asn Asn Arg Leu Asn Ile Phe Pro Lys Trp 180 185 WO 97/37044 PCTIUS97/05223 750 Ala Asn Ala Glu Gin Thr Glu Phe Tyr Tyr Thr Gin Tyr Gly Glu Lys 195 200 205 Thr Pro Met Ile Leu Lys Tyr Asn Ile Gin Lys Ala Thr His Glu Asn 210 215 220 Ile Ala Ser Ser Gin Gly Met Ala Val Val Ser Ser Val Ser Ser Asp 225 230 235 240 Gly Ser Lys Ile Leu Met Ser Leu Ala Pro Asp Gly Gin Pro Asp Val 245 250 255 Tyr Leu Tyr Asp Thr His Lys Lys Thr Lys Thr Lys Ile Thr Arg Tyr 260 265 270 Pro Gly Ile Asp Val Ser Gly Val Phe Leu Glu Asp Asp Lys Ser Met 275 280 285 Ala Phe Val Ser Asp Arg Ser Gly Tyr Pro Asn Ile Tyr Met Lys Lys 290 295 300 Leu Gly Leu Lys Glu Ser Ala Glu Gin Leu Leu Tyr Glu Gly Arg Ser 305 310 315 320 Asn Glu Ser Ile INFORMATION FOR SEQ ID N0:847: SEQUENCE CHARACTERISTICS: LENGTH: 1288 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1288 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:847: Met Glu Ile Gin Gin Thr His Arg Lys Ile Asn Arg Pro Leu Val Ser 1 5 10 Leu Val Leu Ala Gly Ala Leu Ile Ser Ala Ile Pro Gin Glu Ser His 25 Ala Ala Phe Phe Thr Thr Val Ile Ile Pro Ala Ile Val Gly Gly Ile 40 Ala Thr Gly Thr Ala Val Gly Thr Val Ser Gly Leu Leu Ser Trp Gly 55 Leu Lys Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys 70 75 Val Trp Arg Ile Gin Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn 
Lys 90 Glu Tyr Asp Leu Tyr Lys Ser Leu Leu Ser Ser Lys Ile Asp Gly Gly 100 105 110 Trp Asp Trp Gly Asn Ala Ala Arg His Tyr Trp Val Lys Gly Gly Gin 115 120 125 Trp Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Lys 130 135 140 WO 97/37044 PCTIUS97/05223 Leu 145 Gin Ser Lys Ser Ser 225 Gly Va1 Tyr His Ser 305 Leu Ser Gin Gin Gly 385 Asp Ala Ala Val Ala 465 Asn Asn Gly Val Asn 545 Gly Asn Va1 Ser Lys Tyr Asn Gly 210 Glu Ala Trp Ser Leu 290 Asn Asn Thr Asn Lys 370 Gly Gly Ala Ser Asp 450 Gly Gly Leu Asn Asn 530 Phe Glu Thr Lys Gly Ala Lys Ile 195 Ala Gly Thr Met Thr 275 Thr Lys Ile Thr Asn 355 Thr Lys Thr His Gly 435 Gly Ser Thr Lys Gly 515 Ile Asn Tyr Val Phe Leu Arg Asn 150 Thr Leu Arg 165 Asp Ser Ala 180 Ser Ile Asp Gly Arg Lys lie Thr Ser 230 Leu Asn Leu 245 Gly Arg Leu 260 Ile Asn Thr Vai Gly Asp Thr His Ile 310 Ile Ala Pro 325 Ser Gin Ser 340 Asn Ser Asn Glu Thr Glu Asp Thr Val 390 Ile Lys Val 405 Leu Asn Ile 420 Arg Thr Leu Pro Leu Arg Ser Ala Asn 470 Ala Thr Phe 485 Vai Asp Ala 500 Gly Phe Asn Asn Lys Leu Ile Asn Glu 550 Thr His Phe 565 Arg Leu Glu 580 Lys Ser Gly Phe Thr Gly Gly Asp Leu Asp Vai Asn Met Leu Asp Asn Ala 215 Ser Ala Gin Ser Gin 295 Gly Pro Gly Thr Pro 375 Val Gly Gly Leu Val 455 Phe Asn His Thr Ile 535 Leu Ser Thr Glu Gly Arg Phe 200 Ser Lys Ser Tyr Lys 280 Asn Thr Glu Thr Glu 360 Thr Asn Gly Lys Val 440 Asn Glu Asn Thr Leu 520 Thr Ile Glu Gly Lys Gin Thr 185 Val Ser Asn Asn Va1 265 Va1 Ala Leu Gly Lys 345 Val Gin Ile Phe Gly 425 Glu Asn Phe Asp Ala.
505 Asp Ala Val Asp Thr 585 Leu Phe 170 Thr Glu Thr Ala Ser 250 Gly Gin Ala Asp Gly 330 Asn Ile Val Phe Lys 410 Gly Asn Gin Lys Ile 490 Asn Phe Ser Lys Ile 570 Arg Val 155 Asn Arg Ile Va1 Glu 235 Val Ala Gly Gin Leu 315 Tyr Asp Asn Ile His 395 Ala Val Leu Val Ala 475 Ser Phe Ser Thr Thr 555 Gly Ser Ile Gly Vai Asn Leu 220 Ile Lys Tyr Glu Ala 300 Trp Lys Lys Pro Asp 380 Leu Ser Asn Thr Gly 460 Gly Leu Lys Gly Asn 540 Asn Ser Ile Asn Asn Asn Asn 205 Thr Ser Leu Leu Va1 285 Gly Gin Asp Lys Pro 365 Gly Asn Leu Leu Gly 445 Gly Val Gly Gly Val 525 Val Gly Gin Phe Asp Ser Phe 190 Arg Leu Leu Asn Ala 270 Asp Ile Ser Lys Glu 350 Asn Pro Thr Thr Ser 430 Asn Tyr Asp Arg Ile 510 Thr Ala Ile Ser Ser 590 Phe Phe 175 Asn Va1 Gin Tyr Gly 255 Pro Phe Ile Ala Pro 335 Ile Asn Phe Lys Thr 415 Asn Ile Ala Thr Phe 495 Asp Asp Va1 Ser Arg 575 Gly Tyr 160 Thr Ala Gly Ala Asp 240 Asn Ser Asn Ala Gly 320 Asn Ser Thr Ala Ala 400 Asn Gin Thr Leu Lys 480 Va1 Thr Lys Lys Val 560 Ile Gly Tyr WO 97/37044 PCT/US97/05223 595 Ser Pro Trp 610 Thr Arg Lys 625 Lys Leu Met Tyr Ser Gin Gin Gly Thr 675 Asn Val Gly 690 Ala Thr Gly 705 Leu Ile Lys Tyr Gly Asn Glu Glu Gin 755 Met Asp Thr 770 Met Ala Ile 785 Tyr Leu Ile Asn Gly Ser Glu Asn Gly 835 Ala His Ser 850 Ser Ala Thr 865 Ile Glu Ser Leu Tyr Thr Leu Ile Asp 915 Thr Ser Thr 930 Leu Asn Asn 945 Leu Ser Leu Ser Arg Lys Ala Leu Lys 995 Leu Tyr Gin 1010 Asn Ala Ile 1025 Tyr Gly Thr Asn Phe Phe Phe 660 Ile Asn Phe Asn Val 740 Phe Cys Gly Gly Lys 820 Gly Ala Pro Val His 900 Ser Gly Val Ser His 980 Gly Phe Gly Ser 600 Tyr Phe Asp Ala Arg 615 Ala Ser Ser Thr Pro 630 Asn Asn Leu Thr Leu 645 Ser Asn Leu Thr Ile 665 Asn Tyr Leu Val Arg 680 Ala Ala Ala Met Met 695 Tyr Lys Pro Leu Ile 710 Thr Glu His Val Leu 725 Ser Thr Gly Thr Asn 745 Lys Glu Arg Leu Ala 760 Val Val Arg Asn Thr 775 Asn Gin Ser Met Val 790 Lys Ala Trp Arg Asn 805 Ile Ser Val Tyr Tyr 825 Asn Thr Thr Asn Leu 840 Asn Tyr Ala Leu Val 855 Asn Leu Val Ala Ile 870 Phe Glu Leu Ala Asn 885 Ser Gly Ala Gin 
Gly 905 His Asp Ala Gly Tyr 920 Glu Ile Thr Lys Gin 935 Ala Ser Leu Glu His 950 Asn Ala Met Ile Leu 965 Thr Asn His Ile Asn 985 Gin Glu Phe Ala Ser 1000 Ala Pro Lys Tyr Glu 1015 Gly Ala Ser Leu Asn 1030 Ala Gly Val Asp Ala 1045 Asn Glu Gly 650 Gin Gly Phe Lys Leu 730 Gly Leu Asp Asn Ile 810 Leu Pro Lys Asn Arg 890 Arg Ala Leu Lys Asn 970 Ser Leu Lys Ser Val Asn 635 Gin Gly Gly Asn Ile 715 Lys Ile Tyr Asp Asn 795 Gly Gly Thr Asn Gin 875 Ser Asp Arg Asn Gin 955 Ser Phe Glu Pro Lys 620 Pro Asn Asp Lys Asn 700 Asn Ala Ser Asn Ile 780 Pro Ile Asn Asn Ala 860 His Lys Leu Gin Ala 940 Ser Arg Ala Ser Thr 605 Asn Val Trp Gly Ala Val Phe Ile 670 Val Ala 685 Asp Ile Ser Ala Lys Ile Asn Val 750 Asn Asn 765 Lys Ala Asp Asn Ser Lys Ser Thr 830 Thr Thr 845 Pro Phe Asp Phe Asp Ile Leu Gin 910 Met Ile 925 Ala Thr Gly Leu Leu Val Gin Arg 990 Ala Ala 1005 Asn Val Glu Thr Met 655 Asn Thr Asp Gin Ile 735 Asn Asn Cys Tyr Thr 815 Pro Asn Ala Gly Asp 895 Thr Asp Asp Gin Asn 975 Leu Glu Trp Ile Ser 640 Asp Asn Leu Ser Asp 720 Gly Leu Arg Gly Lys 800 Ala Thr Asn His Thr 880 Thr Leu Asn Ala Thr 960 Leu Gin Val Ala 1020 Gly Ser Asn Ala 1035 Ser Leu 1040 Phe Leu Asn Gly Asn Val Glu 1050 1055 WO 97/37044 PCT/US97/05223 753 Ala Ile Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser Phe Ser Asn 1060 1065 1070 Gin Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Ala Asn Phe Gly Val 1075 1080 1085 Tyr Ser Arg Phe Phe Ala Asn Gln His Glu Phe Asp Phe Glu Ala Gin 1090 1095 1100 Gly Ala Leu Gly Ser Asp Gin Ser Ser Leu Asn Phe Lys Ser Thr Leu 1105 1110 1115 1120 Leu Gin Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala Thr 1125 1130 1135 Ala Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala Leu 1140 1145 1150 Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu Gly Ser Thr 1155 1160 1165 Asn Phe Lys Ser Asn Ser Gin Ser Gin Val Ala Leu Lys Asn Gly Ala 1170 1175 1180 Ser Ser Gin His Leu Phe Asn Ala Asn Ala Asn Val Glu Ala Arg Tyr 1185 1190 1195 1200 Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Leu His Ala Gly Val Leu Gin 
1205 1210 1215 Glu Phe Ala His Phe Gly Ser Asn Asp Val Ala Ser Leu Asn Thr Phe 1220 1225 1230 Lys Ile Asn Ala Ala Arg Ser Pro Leu Ser Thr Tyr Ala Arg Ala Met 1235 1240 1245 Met Gly Gly Glu Leu Gin Leu Ala Lys Glu Val Phe Leu Asn Leu Gly 1250 1255 1260 Val Val Tyr Leu His Asn Leu Ile Ser Asn Ala Ser His Phe Ala Ser 1265 1270 1275 1280 Asn Leu Gly Met Arg Tyr Ser Phe 1285 INFORMATION FOR SEQ ID NO:848: SEQUENCE CHARACTERISTICS: LENGTH: 385 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...385 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:848: Met Lys Asp Ser Phe Leu Phe Thr Ser Glu Ser Val Thr Glu Gly His 1 5 10 Pro Asp Lys Met Ala Asp Gin Ile Ser Asp Ala Val Leu Asp Tyr Ile 25 Ile Glu Arg Asp Lys Lys Ala Lys Val Ala Cys Glu Thr Leu Val Ser 40 WO 97/37044 PCT/US97/05223 754 Asn Gly Phe Cys Met lie Thr Gly Glu Leu Lys Thr Ser Val Tyr Ala 55 Pro Met Gln Glu Ile Ala Arg Glu Val Val Lys Lys Ile Gly Tyr Thr 70 75 Asp Ala Leu Tyr Gly Phe Asp Tyr Arg Ser Ala Ala Val Leu Asn Gly 90 Ile Gly Glu Gin Ser Pro Asp Ile Asn Gin Gly Val Asp Arg Glu Asp 100 105 110 Gly Glu Ile Gly Ala Gly Asp Gin Gly Leu Met Phe Gly Tyr Ala Cys 115 120 125 Lys Glu Thr Glu Thr Leu Met Pro Leu Pro Ile His Leu Ala His Gin 130 135 140 Leu Ala Phe Ala Leu Ala Gin Lys Arg Lys Asp Asn Thr Leu Pro Phe 145 150 155 160 Leu Arg Pro Asp Gly Lys Ser Gin Val Ser Val Arg Tyr Glu Asn Asn 165 170 175 Lys Pro Val Ser Val Asp Thr Ile Val Ile Ser Thr Gin His Ser Pro 180 185 190 Glu Val Ser Gin Lys His Leu Lys Glu Ala Val Ile Glu Glu Ile Val 195 200 205 Tyr Lys Val Leu Pro Lys Glu Tyr Leu His Asp Asn Ile Lys Phe Phe 210 215 220 Ile Asn Pro Thr Gly Lys Phe Val Ile Gly Gly Pro Gin Gly Asp Ala 225 230 235 240 Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Phe Cys 245 250 255 Pro His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp 260 265 270 Arg Ser Ala Ala Tyr Ala Ala 
Arg Tyr Val Ala Lys Asn Leu Val Ala 275 280 285 Ser Gly Val Cys Asp Lys Ala Thr Val Gin Leu Ala Tyr Ala Ile Gly 290 295 300 Val Ile Glu Pro Val Ser Ile Tyr Val Asn Thr His Asn Thr Ser Lys 305 310 315 320 His Ser Ser Ala Glu Leu Glu Lys Cys Val Lys Ser Val Phe Lys Leu 325 330 335 Thr Pro Lys Gly Ile Ile Glu Ser Leu Asp Leu Leu Arg Pro Ile Tyr 340 345 350 Ser Leu Thr Ser Ala Tyr Gly His Phe Gly Arg Glu Leu Glu Glu Phe 355 360 365 Thr Trp Glu Lys Thr Asn Lys Val Glu Glu Ile Lys Ala Phe Phe Lys 370 375 380 Arg 385 INFORMATION FOR SEQ ID NO:849: SEQUENCE CHARACTERISTICS: LENGTH: 68 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: WO 97/37044 PCT/US97/05223 755 ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:849: Met Lys Phe Leu Asn Gly Leu Ala Gly Asn Leu Leu Ile Val Val Ile 1 5 10 Leu Leu Cys Val Val Val Phe Phe Ala Leu Lys Ala Ile His Ile Gin 25 Lys Glu Gin Ala Thr Asn Tyr Tyr Arg Tyr Lys Asp Ile Asn Ala Leu 40 Glu Ala Lys Asn Thr Gin Asn His Ala Asn Tyr Glu Leu Val Asn Gin 55 Gly Ser Lys Lys INFORMATION FOR SEQ ID NO:850: SEQUENCE CHARACTERISTICS: LENGTH: 502 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...502 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:850: Leu Lys Ile Phe Leu Val Ile Leu Ser Val Phe Phe Phe Asn Gly Cys 1 5 10 Phe Gly Leu Val Tyr Lys Thr Pro Ile Ser Asn Pro Pro Ile Ser Tyr 25 Asp Pro Tyr Thr Thr Thr Ile Gly Ser Leu Tyr Ala Lys Asn Leu Lys 40 Glu Asn Pro Lys His Ser Ala Ala Ile Leu Leu Glu Asp Gly Phe Asp 55 Ala Leu Leu His Arg Val Gly Leu Ile Arg Met Ser Gin Lys Ser Ile 70 75 Asp Met Gin Thr Tyr Ile Tyr Lys Asn Asp Leu Ser Ser Gin Val Ile 90 Ala Lys Glu Leu Leu Asn Ala Ala Asn Arg Gly Val Lys Val Arg Ile 100 105 110 Leu Leu Asp Asp Asn Gly Leu Asp Ser Asp Phe Ser Asp 
Ile Met Leu 115 120 125 Leu Asn Phe His Lys Asn Ile Glu Val Lys Ile Phe Asn Pro Tyr Tyr 130 135 140 WO 97/37044 PCT/US97/05223 756 Ile Arg Asn Lys Gly Leu Arg Tyr Phe Glu Met Leu Ala Asp Tyr Glu 145 150 155 160 Arg Ile Lys Lys Arg Met His Asn Lys Leu Phe Ile Val Asp Asn Phe 165 170 175 Ala Val Ile Ile Gly Gly Arg Asn Ile Gly Asp Asn Tyr Phe Asp Asn 180 185 190 Asp Leu Asp Thr Asn Phe Leu Asp Leu Asp Ala Leu Phe Phe Gly Gly 195 200 205 Val Ala Ser Lys Ala Lys Glu Ser Phe Glu Asn Tyr Trp Arg Phe His 210 215 220 Arg Ser Ile Pro Val Ser Leu Leu Arg Thr His Lys Arg Leu Lys Asn 225 230 235 240 Asn Val Lys Glu Ile Ala Lys Leu His Glu Lys Ile Pro Ile Ser Ala 245 250 255 Glu Asp Ala Asn Glu Phe Glu Lys Lys Val Asn Asp Phe Ile Glu Arg 260 265 270 Phe Gin Lys Tyr Gin Tyr Pro Ile Tyr Tyr Gly Asn Ala Ile Phe Leu 275 280 285 Ala Asp Leu Pro Ala Lys Ile Asp Thr Pro Leu Tyr Ser Pro Ile Lys 290 295 300 Ile Ala Phe Glu Lys Ala Leu Lys Asn Ala Lys Asp Ser Val Phe Ile 305 310 315 320 Ala Ser Ser Tyr Phe Ile Pro Gly Lys Lys Ile Met Lys Ile Phe Lys 325 330 335 Asn Gin Ile Ser Lys Gly Ile Glu Leu Asn Ile Leu Thr Asn Ser Leu 340 345 350 Ser Ser Thr Asp Ala Ile Val Val Tyr Gly Ala Trp Glu Arg Tyr Arg 355 360 365 Asn Lys Leu Val Arg Met Gly Ala Asn Val Tyr Glu Ile Arg Asn Asp 370 375 380 Phe Phe Asn Arg Gin Ile Lys Gly Arg Phe Ser Thr Lys His Ser Leu 385 390 395 400 His Gly Lys Thr Ile Val Phe Asp Asp Ala Leu Thr Leu Leu Gly Ser 405 410 415 Phe Asn Ile Asp Pro Arg Ser Ala Tyr Ile Asn Thr Glu Ser Ala Val 420 425 430 Leu Phe Asp Asn Pro Ser Phe Ala Lys Arg Val Arg Leu Ser Leu Lys 435 440 445 Asp His Ala Gin Gin Ser Trp His Leu Val Leu Tyr Arg His Arg Val 450 455 460 Ile Trp Glu Ala Thr Glu Glu Gly Ile Leu Ile His Glu Lys Asn Ser 465 470 475 480 Pro Asp Thr Ser Phe Phe Leu Arg Leu Ile Lys Glu Trp Ser Lys Val 485 490 495 Leu Pro Glu Arg Glu Leu 500 INFORMATION FOR SEQ ID NO:851: SEQUENCE CHARACTERISTICS: LENGTH: 177 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) 
HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...177 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:851: Met Arg Phe Ser Tyr Ile Glu Pro Arg Ala Lys Tyr Leu Ile Ser Lys 1 5 10 Leu Ser Lys Ile Trp Val Phe Tyr Ile Phe Leu Ser Phe Val Leu Ile 25 Gly Gly Leu Val Trp Phe Met His Asn Ala Ile Lys Arg Ala Gln Asp 40 Asn Ala Ser Ser Leu Thr Ile Gln Glu Arg Leu Tyr Arg His Glu Ile 55 Ser Arg Leu Gln Val Lys Thr Asp Glu Thr Leu Lys Leu Ile Lys Glu 70 75 Ala Lys Lys Arg Leu Asn Tyr Asn Asp Asp Ile Arg Asp Val Leu Gln 90 Gly Leu Leu Asn Ile Val Pro Asp Leu Ile Thr Ile Asn Ser Ile Glu 100 105 110 Ile Asp Gln Gln Ser Val Val Val Ser Gly Lys Thr Pro Ser Lys Glu 115 120 125 Ala Phe Tyr Phe Leu Phe Gln Asn Lys Leu Asn Pro Met Phe Asp Tyr 130 135 140 Ser Arg Ala Glu Phe Phe Pro Leu Ser Asp Gly Trp Phe Asn Phe Val 145 150 155 160 Ser Thr Asn Phe Ser Asn Ser Leu Leu Ile Lys Asn Pro Glu Ser Ile 165 170 175 Lys INFORMATION FOR SEQ ID NO:852: SEQUENCE CHARACTERISTICS: LENGTH: 222 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:852: Met Arg Lys Ile Leu Leu Met Gly Leu Ile Leu Gln Ala Leu Phe Gly 1 5 10 Glu Glu Ala Ala Gln Glu Leu Leu Gln Cys Ser Ala Ile Phe Glu Ser 25 Lys Lys Ala Glu Leu Lys Glu Asp Leu Arg Gln Leu Ser Glu Lys Glu 40 Gln Ser Leu Arg Ile Leu Gln Thr Glu Asn Ala Arg Leu Leu Asp Glu 55 Lys Ser Asp Leu Leu Asn Lys Lys Glu Lys Glu Ile Asp Glu Lys Leu 70 75 Lys Asn Leu Ala Ala Lys Glu Glu Ala Phe Lys Thr Leu Gln Thr Glu 90 Glu Lys Lys Arg Leu Lys Asn Leu Ile Glu Glu Asn Glu Gly Ile Leu 100 105 110 Arg Glu Ile Lys Gln Ala Lys Asp Ser Lys Ile Gly Glu Thr Tyr Ser 115 120 125 Lys Met Lys Asp Ser Lys Ser Ala Leu Ile Leu Glu Asn Leu Pro Thr 130 135 140 Gln Asn Ala Leu Glu Ile 
Leu Met Ala Leu Lys Pro Gln Glu Leu Gly 145 150 155 160 Lys Ile Leu Ala Lys Met Asp Pro Lys Lys Ala Ala Ala Leu Thr Glu 165 170 175 Leu Trp Gln Lys Pro Pro Lys Glu Asn Lys Glu Asn Lys Glu Ser Gln 180 185 190 Lys Thr Thr Asp Pro Thr Pro Pro Thr Pro Pro Thr Pro Pro Lys Glu 195 200 205 Pro Thr Leu Lys Asp Pro Asn Val Lys Glu Pro Thr Gly Val 210 215 220 INFORMATION FOR SEQ ID NO:853: SEQUENCE CHARACTERISTICS: LENGTH: 431 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...431 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:853: Met Asn Ile Gln Thr Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe 1 5 10 Ser Leu Phe Ser Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gln 25 Ile Leu Leu Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Thr Ala 40 Gln Lys Gly Tyr Met Asn Leu Tyr Lys Gln Thr Asn Lys Val Val Tyr 55 Ala Lys Glu Ala Ala Ile Ser Ala Ala Ser Leu Gly Asp Ile Lys Thr 70 75 Ala Met His Leu Ala Met Leu Tyr Gln Lys Ile Thr Asn Asn Arg Asn 90 Asp Val Ser Ile Asn Lys Ile Leu Val Asp Gly Tyr Ala Gln Met Gly 100 105 110 Gln Ile Asp Lys Ala Ile Glu Leu Leu His Lys Ile Arg Lys Glu Glu 115 120 125 Lys Thr Ile Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gln 130 135 140 Lys Arg Leu Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gln 145 150 155 160 Val His Asp Glu Asp Ser Leu Glu Lys Leu Ile Thr Ile Tyr Phe Leu 165 170 175 Gln Asn Arg Lys Lys Glu Gly Leu Asp Leu Leu Gln Ser His Ile Asp 180 185 190 Arg Tyr Gly Cys Ser Glu Gln Leu Cys Gln Lys Ala Leu Asn Thr Phe 195 200 205 Thr Gln Phe Asn Glu Leu Asp Leu Ala Lys Thr Thr Phe Ala Arg Leu 210 215 220 Tyr Glu Lys Asn Pro Ile Val Gln Asn Ala Gln Phe Tyr Ile Gly Val 225 230 235 240 Leu Ile Leu Leu Lys Glu Phe Asp Lys Ala Gln Gln Ile Ala Glu Leu 245 250 255 Phe Pro Phe Asp Arg Arg Leu Leu Leu Asp Leu Tyr Thr Ala Gln Lys 260 265 270 Lys Phe Asp Gln Ala Ser Lys Gln Ala Ser Leu Ile Tyr Gln Glu Arg 275 280 285 Lys Asp Pro Lys Phe Leu Gly Leu Glu Ala Ile Tyr His Tyr Glu Ser 290 295 300 Leu Ser Ala Asn Lys Lys Lys Leu Thr Lys Glu Glu Met Leu Pro Ile 305 310 315 320 Ile Gln Lys Leu Glu Gln Ala Thr Lys Glu Arg Gln Ala Trp Leu Ala 325 330 335 Lys Thr Lys Asp Lys Glu Asp Ala Gln Asp Ala Phe Phe Tyr Asn Phe 340 345 350 Leu Gly Tyr Ser Leu Ile Asp Tyr Asp Met Asp Val Lys Arg Gly Met 355 360 365 Asp Leu Val Arg Lys Ala Leu Ala Leu Asp Ser Asn Ser Val Leu Tyr 370 375 380 Leu Asp 
Ser Leu Ala Trp Gly Tyr Tyr Lys Leu Gly Asn Cys Leu Glu 385 390 395 400 Ala Lys Lys Ile Phe Ser Ser Ile Ala Lys Glu Leu Ile Gln Asn Glu 405 410 415 Pro Glu Leu Lys Glu His Asn Lys Ile Ile Gln Glu Cys Lys Lys 420 425 430 INFORMATION FOR SEQ ID NO:854: SEQUENCE CHARACTERISTICS: LENGTH: 97 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...97 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:854: Met Leu Tyr Ala Ser Lys Ala Arg Leu Phe Leu Gln Ile Lys Gly Lys 1 5 10 Phe Met Leu Arg Ile Leu Ile Pro Leu Leu Ile Ile Val Trp Val Leu 25 Trp Arg Leu Phe Leu Arg Gln Lys Pro His Lys Asp Asp His Arg Asp 40 Asn His Ser Tyr Thr Gln Gln Thr Pro Lys Glu Leu Glu Asp His Met 55 Ile Val Cys Ser Lys Cys Gln Thr Tyr Val Ser Ser Lys Asp Ala Ile 70 75 Tyr Ser Gly Ala Val Ala Tyr Cys Ser Glu Thr Cys Leu Lys Asp Lys 90 Gly INFORMATION FOR SEQ ID NO:855: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:855: Met Lys Asn Ser Ala Pro Leu Lys Asn Lys Val Phe Cys Gly Leu Tyr 1 5 10 Val Leu Ser Leu Ser Ala Ser Val Gln Ala Phe Asp Tyr Lys Ile Glu 25 Val Leu Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys Lys Lys Ile 40 Asp Ile Ala Arg Gly Ile Tyr Pro Thr Glu Thr Phe Val Thr Ala Val 55 Gly Gln Gly Asn Ile Tyr Ala Asp Phe Leu Ser Lys Ser Leu Lys Asp 70 75 Gln Gly His Val Leu Glu Gly Lys Val Gly Gly Thr Ile Gly Gly Ile 90 Ala Tyr Asp Ser Thr Lys Phe Asn Gln Gly Gly Ser Val Ile Tyr Asn 100 105 110 Tyr Ile Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg Ala Leu Leu 115 120 125 Asp Gly Thr Ser Ile His Glu Cys Ala Leu Gly Ser Asp Gly Lys Val 130 135 140 Ile Asp Ser Ile Ala Cys Gly Asn Ala Arg 
Ala Asn Lys Ile Arg Arg 145 150 155 160 Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr Lys Asp Ile 165 170 175 Phe Ala Ala Lys Gly Gly Arg Tyr Gin Ser Asn Ala Pro Tyr Met Ser 180 185 190 Ser Tyr Thr Gin Gly Phe Glu Ile Ser Ala Lys Ile Lys Asp Lys Asn 195 200 205 Glu Gly Ser His Lys Leu Trp Trp Phe Ser Ser Trp Gly Arg Ala Phe 210 215 220 Ala Tyr Gly Glu Trp Ile Tyr Asp Phe Tyr Ser Pro Arg Thr Val Ile 225 230 235 240 Lys Asn Gly Arg Thr Leu Asn Tyr Gly Ile His Leu Val Asp Tyr Thr 245 250 255 Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gin Phe Ser Pro 260 265 270 Gly Thr Tyr Tyr Ser Pro Gly Val Ala Val Gly Tyr Asp Ser Asn Pro 275 280 285 Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala Tyr Ile Leu 290 295 300 Leu Pro Val His Ala Pro Leu Lys Arg Asp Thr Tyr Arg Tyr Ala Val 305 310 315 320 Lys Ala Gly Thr Ala Gly Gin Ser Leu Leu Ile Arg Gin Arg Phe Asp 325 330 335 Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val Trp Lys Asn 340 345 350 Ala Asn Ala Tyr Ile Gly Thr Thr Gly Asn Pro Leu Gly Ile Asp Phe 355 360 365 Trp Thr Asn Ser Val Tyr Asp Ile Gly Gin Ala Leu Ser His Val Val 370 375 380 Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly Val His Lys 385 390 395 400 Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly Ala Leu Ala 405 410 415 Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys Ile Ser Lys Ser 420 425 430 Leu Thr Ala Ser Val Lys Leu Glu Tyr Leu Gly Val Met Thr His Ser 435 440 445 Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser Lys Ala Leu 450 455 460 Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala Lys Phe 465 470 475 INFORMATION FOR SEQ ID NO:856: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCT/US97/05223 762 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:856: Met Lys Asn Ser Ala Pro Leu Lys Asn Lys Val Phe Cys Gly Leu Tyr 1 5 10 Val Leu Ser Leu 
Ser Ala Ser Val Gin Ala Phe Asp Tyr Lys Ile Glu 25 Val Leu Ala Glu Ser Phe Ser Lys Val Gly Phe Asn Lys Lys Lys Ile 40 Asp Ile Ala Arg Gly Ile Tyr Pro Thr Glu Thr Phe Val Thr Ala Val 55 Gly Gin Gly Asn Ile Tyr Ala Asp Phe Leu Ser Lys Ser Leu Lys Asp 70 75 Gin Gly His Val Leu Glu Gly Lys Val Gly Gly Thr Ile Gly Gly Ile 90 Ala Tyr Asp Ser Thr Lys Phe Asn Gin Gly Gly Ser Val Ile Tyr Asn 100 105 110 Tyr Ile Gly Tyr Trp Asp Gly Tyr Leu Gly Gly Lys Arg Ala Leu Leu 115 120 125 Asp Gly Thr Ser Ile His Glu Cys Ala Leu Gly Ser Asp Gly Lys Val 130 135 140 Ile Asp Ser Ile Ala Cys Gly Asn Ala Arg Ala Asn Lys Ile Arg Arg 145 150 155 160 Asn Tyr Leu Met Asn Asn Ala Phe Leu Glu Tyr Arg Tyr Lys Asp Ile 165 170 175 Phe Ala Ala Lys Gly Gly Arg Tyr Gin Ser Asn Ala Pro Tyr Met Ser 180 185 190 Ser Tyr Thr Gin Gly Phe Glu Ile Ser Ala Lys Ile Lys Asp Lys Asn 195 200 205 Glu Gly Ser His Lys Leu Trp Trp Phe Ser Ser Trp Gly Arg Ala Phe 210 215 220 Ala Tyr Gly Glu Trp Ile Tyr Asp Phe Tyr Ser Pro Arg Thr Val Ile 225 230 235 240 Lys Asn Gly Arg Thr Leu Asn Tyr Gly Ile His Leu Val Asp Tyr Thr 245 250 255 Tyr Glu Arg Lys Gly Val Ser Val Ser Pro Phe Phe Gin Phe Ser Pro 260 265 270 Gly Thr Tyr Tyr Ser Pro Gly Val Ala Val Gly Tyr Asp Ser Asn Pro 275 280 285 Asn Phe Asn Gly Val Gly Phe Arg Ser Glu Thr Lys Ala Tyr Ile Leu 290 295 300 Leu Pro Val His Ala Pro Leu Lys Arg Asp Thr Tyr Arg Tyr Ala Val 305 310 315 320 Lys Ala Gly Thr Ala Gly Gin Ser Leu Leu Ile Arg Gin Arg Phe Asp 325 330 335 Tyr Asn Glu Phe Asn Phe Gly Gly Ala Phe Tyr Lys Val Trp Lys Asn 340 345 350 Ala Asn Ala Tyr Ile Gly Thr Thr Gly Asn Pro Leu Gly Ile Asp Phe 355 360 365 Trp Thr Asn Ser Val Tyr Asp Ile Gly Gin Ala Leu Ser His Val Val WO 97/37044 PCT/US97/05223 763 370 375 380 Thr Ala Asp Ala Val Ser Gly Trp Val Phe Gly Gly Gly Val His Lys 385 390 395 400 Lys Trp Leu Trp Gly Thr Leu Trp Arg Trp Thr Ser Gly Ala Leu Ala 405 410 415 Asn Glu Ala Ser Ala Ala Val Asn Val Gly Tyr Lys Ile Ser Lys Ser 420 425 430 Leu Thr Ala Ser Val Lys Leu Glu 
Tyr Leu Gly Val Met Thr His Ser 435 440 445 Gly Phe Thr Val Gly Ser Tyr Arg Pro Thr Pro Gly Ser Lys Ala Leu 450 455 460 Tyr Ser Asp Arg Ser His Leu Met Thr Thr Leu Ser Ala Lys Phe 465 470 475 INFORMATION FOR SEQ ID NO:857: SEQUENCE CHARACTERISTICS: LENGTH: 171 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...171 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:857: Met Gly Leu Lys Asn Leu Ser Thr Leu Leu Val Phe Leu Phe Phe Cys 1 5 10 Leu Gly Cys Val Ser Asn Phe Asn Glu Asp Thr Tyr Thr Leu Asp Leu 25 Val Leu Glu Lys Lys Ile Gln Ala Ser Arg Lys Gly Glu Ile Thr Gln 40 Asp Asn Val Pro Ile Ile Thr Ala Ile Ala Thr His Leu Asn Asp Val 55 Asp Ser Gly Thr Tyr Tyr Asp His Glu Tyr Phe Leu Val Glu Ile Phe 70 75 Thr Gln Asn Asn Asp Trp Ile Asp Asp Gly Tyr Ile Ser Tyr Glu Leu 90 Phe Gly Thr Lys Pro Thr Gly Ser Glu Pro Leu Trp Val Arg Glu Ile 100 105 110 Thr Arg Asp Glu Phe Asp Gly Ile Leu Glu Thr Thr Asn Arg Trp Ser 115 120 125 Arg Ala Phe Leu Ile Ala Phe Asp Lys Leu Asp Tyr Leu Ala Val Gln 130 135 140 Glu Ala Lys Leu Glu Leu Asp Ala Tyr Ser Leu Gly Lys Ile Val Phe 145 150 155 160 Asn Phe Ala Tyr Gln Val Pro Leu Pro Gln Phe 165 170 INFORMATION FOR SEQ ID NO:858: SEQUENCE CHARACTERISTICS: LENGTH: 84 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...84 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:858: Val Asn Lys Lys His Arg Leu Ala Phe Leu Gly Leu Ile Val Gly Val 1 5 10 Leu Phe Phe Phe Ser Ala Cys Gln His Arg Leu His Met Gly Tyr Tyr 25 Ser Glu Val Thr Gly Asp Tyr Leu Phe Asn Tyr Asn Ser Thr Ile Val 40 Val Ala Tyr Asp Arg Ser Asp Ala Met Thr Ser Tyr Tyr Ile Asn Val 55 Ile Val Tyr Glu Leu Gln Lys Leu Gly Phe Tyr Asn Val Phe Thr Gln 70 75 Ala Asn Ser Arg INFORMATION FOR SEQ 
ID NO:859: SEQUENCE CHARACTERISTICS: LENGTH: 252 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:859: Met Gin Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr His Leu Ala Leu Gly 1 5 10 Ile Leu Phe Cys Asn Gly Leu Ser Leu Lys Ala Leu Glu Ile Ala Val 25 WO 97/37044 PCT/US97/05223 765 Lys Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gin Gly Thr Gin Lys 40 Asn Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe 55 Ser Tyr Ser Asn Gly Trp Ser Phe Gly Ile Gly Ala Ile Gly Ala Trp 70 75 Asn Ile Tyr Asn Lys Gin Arg Leu Ala Asn Leu Tyr Ile Ser Leu Gly 90 Asn Phe Phe Gly Asn Pro Asn Asn Val Lys Pro Tyr Leu Ser Ala Gly 100 105 110 Asp Val Ser Asp Ala Tyr Leu Gin Tyr Ala Asn Gin Arg Phe Lys Ile 115 120 125 Ala Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp Ile Gly 130 135 140 Gly Asn Ile Gin Gly Val Ser Val Ala Phe Lys Gin Asn Ser Met Arg 145 150 155 160 Tyr Phe Gly Ile Phe Met Asp Ser Met Leu Tyr Asn Gly His Gin Ile 165 170 175 Asn Lys Glu Gin Gly Asn Arg Ile Ala Thr Ser Leu Asn Ala Leu Ala 180 185 190 Ser Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe 195 200 205 Val Leu Gly Ala Glu Tyr Lys Asn Lys Asn Leu Ile Phe Val Pro Phe 210 215 220 Ile Leu Thr Asp Thr Arg Leu Pro Leu Pro Thr Gin Asn Val Leu Val 225 230 235 240 Gin Val Gly Gly Lys Leu Glu Tyr Arg Arg Phe Phe 245 250 INFORMATION FOR SEQ ID NO:860: SEQUENCE CHARACTERISTICS: LENGTH: 231 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...231 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:860: Met Val Gin Lys Ile Gly Ile Leu Gly Ala Met Arg Glu Glu Ile Thr 1 5 10 Pro Ile Leu Glu Leu Phe Gly Val Asp Phe Glu Glu Ile Pro Leu Gly 25 Gly Asn Val Phe His Lys Gly Val Tyr His Asn Lys Glu Ile Ile Val 
40 Ala Tyr Ser Lys Ile Gly Lys Val His Ser Thr Leu Thr Thr Thr Ser 55 WO 97/37044 PCT/US97/05223 766 Met Ile Leu Ala Phe Gly Val Gin Lys Val Leu Phe Ser Gly Val Ala 70 75 Gly Ser Leu Val Lys Asp Leu Lys Ile Asn Asp Leu Leu Val Ala Thr 90 Gin Leu Val Gin His Asp Val Asp Leu Ser Ala Phe Asp His Pro Leu 100 105 110 Gly Phe Ile Pro Glu Ser Ala Ile Phe Ile Glu Thr Ser Gly Ser Leu 115 120 125 Asn Ala Leu Ala Lys Lys Ile Ala Asn Glu Gin His Ile Ala Leu Lys 130 135 140 Glu Gly Val Ile Ala Ser Gly Asp Gin Phe Val His Ser Lys Glu Arg 145 150 155 160 Lys Glu Phe Leu Val Ser Glu Phe Lys Ala Ser Ala Val Glu Met Glu 165 170 175 Gly Ala Ser Val Ala Phe Val Cys Gln Lys Phe Gly Val Pro Cys Cys 180 185 190 Val Leu Arg Ser Ile Ser Asp Asn Ala Asp Glu Lys Ala Gly Met Ser 195 200 205 Phe Asp Glu Phe Leu Glu Lys Ser Ala His Thr Ser Ala Lys Phe Leu 210 215 220 Lys Ser Met Val Asp Glu Leu 225 230 INFORMATION FOR SEQ ID NO:861: SEQUENCE CHARACTERISTICS: LENGTH: 225 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...225 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:861: Met Arg Ala Thr Ala Ile Lys Ile Phe Ser Phe Ser Ser Ala Leu Ala 1 5 10 Leu Leu Leu Gin Gly Cys Leu Ser Ile Asn Leu Lys Gln Met Leu Pro 25 Glu Ile Arg Thr Tyr Asp Leu Asn Ala Ser Ser Phe Glu Met Thr Gin 40 Cys Pro Lys Pro Leu Thr Glu Val Arg Leu Ile Ser Ile Leu Ser Ala 55 Asp Leu Phe Asn Thr Lys Glu Ile Val Phe Lys Ala Gin Asp Gly Gin 70 75 Ile Thr His Gly Lys His Gin Lys Trp Ile Asp Leu Pro Arg Asn Met 90 Leu Lys Thr Met Phe Met Gin Glu Ala Gin Lys Ala Cys Leu Gly Val 100 105 110 WO 97/37044 PCT/US97/05223 767 Ala Leu Pro Pro Tyr Gly Ala Gly Ala Pro Thr Tyr Ala Val Arg Phe 115 120 125 Thr Ile Leu Ser Phe Ser Leu Leu Glu Lys Glu Asn Ser Thr Tyr Arg 130 135 140 Ala Glu Phe Ala Leu Gly Tyr Asp Ile Ser Val Lys Gly Asp Ser His 145 150 155 160 Ser Gly Val Ile Ile Lys His Glu Asn Ile Ser Ser 
Leu Glu Asn Lys 165 170 175 Thr Thr Lys Thr Ser Lys Asn Gly Ser Gin Asp Phe Gin Glu Ser Ala 180 185 190 Ile Gin Ser Leu Gin His Val Ser Thr Gin Ala Met Gin Glu Ala Ile 195 200 205 Ser Leu Ile Lys Lys Ala Val Glu Ala Gin Ser Val Ser Pro Leu Lys 210 215 220 Lys 225 INFORMATION FOR SEQ ID N0:862: SEQUENCE CHARACTERISTICS: LENGTH: 271 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...271 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:862: Leu Glu Arg His Val Asn Tyr Thr Leu Ile Gly Gly Leu Phe Phe Leu 1 5 10 Cys Leu Val Cys Met Val Gly Phe Ile Leu Trp Leu Gly His Leu Gly 25 Leu Asp Asp Gly Lys Tyr Tyr Glu Tyr Val Val Tyr Thr Asp Lys Asp 40 Leu Gly Gly Ile Ala Thr Asn Ser Pro Ile Asn Tyr Lys Gly Ile Gin 55 Val Gly Asn Val Ile Lys Val Gly Phe Ala Lys Asp Lys Val Gly Val 70 75 Val Arg Leu Asp Leu Met Ile Lys Ser Ser Val Lys Ile Arg Lys Asp 90 Ser Lys Val Ala Val Ser Ser Arg Gly Phe Met Gly Leu Lys Phe Leu 100 105 110 Ala Leu Glu Gin Ser His Asn Glu Glu Phe Tyr Gly Ser Gly Asp Lys 115 120 125 Gly Glu Arg Ile Leu Ile Phe Lys Glu Gly Leu Met Asp Arg Leu Ser 130 135 140 Gly Asp Ala Asn Gin Val Val Gin Glu Val Met Lys Ala Ile Arg Asn 145 150 155 160 WO 97/37044 PCT/US97/05223 768 Val Asn Arg Ile Leu Asp Asp Glu Asn Val Glu Lys Phe Lys His Ile 165 170 175 Leu Ala Ser Val Asp Asp Leu Ile Ala Asn Leu Asp Ser Arg Lys Thr 180 185 190 Gin Phe Asp Ser Leu Ile Asn Asn Ala Asn Asn Leu Val Ser Asn Val 195 200 205 Asn Asn Val Ala Leu Asp Val Asp Lys Arg Val Lys Gin Gly Gin Tyr 210 215 220 Asp Phe Lys Ala Met Phe Thr Pro Leu Ile Met Gin Ala Gin Leu Ser 225 230 235 240 Leu Arg Asn Ile Asp Asn Phe Val Glu Lys Gly Ser Ala Leu Ile Asp 245 250 255 Lys Phe Asp Ala Asn Pro Tyr Lys Thr Ile Phe Gly Glu Arg Lys 260 265 270 INFORMATION FOR SEQ ID NO:863: SEQUENCE CHARACTERISTICS: LENGTH: 350 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: 
protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...350 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:863: Met Lys Lys Thr Ile Leu Leu Ser Ala Leu Leu Ala Phe Ser Cys Ala 1 5 10 His Ala Leu Ser Asp Met Asp Leu Ile Lys Lys Ala Lys Glu Ser Gin 25 Leu Glu Pro Met Pro Met Gly Lys Ala Leu Lys Glu Tyr Gin Ile Lys 40 Lys Thr Arg Asp Val Gly Ile Gly Ala Lys Asn Ser Glu Ile Met Thr 55 Ser Ala Gin Val Glu Leu Gly Lys Met Leu Tyr Phe Asp Pro Arg Ile 70 75 Ser Thr Ser Tyr Leu Val Ser Cys Asn Thr Cys His Asn Leu Gly Leu 90 Gly Gly Val Asp Leu Ile Pro Ser Ala Ile Gly Ser Gin Trp Lys Lys 100 105 110 Asn Pro His Leu Leu Ser Ser Pro Thr Val Tyr Asn Ser Val Phe Asn 115 120 125 Asp Val Gin Phe Trp Asp Gly Arg Val Thr His Leu Asn Glu Gin Ala 130 135 140 Gin Gly Pro Ile Gin Ser Ser Phe Glu Met Gly Ala Asp Pro Lys Val 145 150 155 160 Val Val Glu Lys Ile Asn Ser Ile Pro Gly Tyr Val Lys Leu Phe Arg 165 170 175 WO 97/37044 PCT/US97/05223 769 Lys Ala Tyr Gly Ser Lys Val Lys Ile Asp Phe Lys Leu Ile Ala Asp 180 185 190 Ser Ile Ala Met Phe Glu Ala Thr Leu Ile Thr Pro Ser Arg Tyr Asp 195 200 205 Asp Phe Leu Arg Gly Asn Pro Lys Ala Leu Ser Lys Ala Glu Lys Glu 210 215 220 Gly Leu Asp Leu Phe Ile Ser Lys Gly Cys Val Ala Cys His Asn Gly 225 230 235 240 Ile Asn Leu Gly Gly Thr Met Gin Pro Phe Gly Val Val Lys Pro Tyr 245 250 255 Lys Phe Ala Asn Val Gly Asp Phe Lys Gly Asp Lys Asn Gly Leu Val 260 265 270 Lys Val Pro Thr Leu Arg Asn Ile Thr Glu Thr Met Pro Tyr Phe His 275 280 285 Asn Gly Gin Phe Trp Asp Val Lys Asp Ala Ile Lys Glu Met Gly Ser 290 295 300 Ile Gin Leu Gly Ile Glu Ile Ser Asp Ala Glu Ala Lys Lys Ile Glu 305 310 315 320 Thr Phe Phe Glu Ala Leu Lys Gly Lys Lys Pro Lys Ile Ile Tyr Pro 325 330 335 Glu Leu Pro Val Met Thr Asp Lys Thr Pro Lys Pro Ser Phe 340 345 350 INFORMATION FOR SEQ ID NO:864: SEQUENCE CHARACTERISTICS: LENGTH: 73 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) 
HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...73 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:864: Met Met Asp Leu Glu Ser Leu Arg Gly Phe Ala Tyr Ala Phe Phe Thr 1 5 10 Ile Leu Phe Thr Leu Phe Leu Tyr Ala Tyr Ile Phe Ser Met Tyr Arg 25 Lys Gin Lys Lys Gly Ile Val Asp Tyr Glu Arg Tyr Gly Tyr Leu Ala 40 Leu Asn Asp Ala Leu Glu Asp Glu Leu Ile Glu Pro Arg His Lys Glu 55 Val His Asp Lys Gly Ile Lys Glu Ser INFORMATION FOR SEQ ID NO:865: SEQUENCE CHARACTERISTICS: WO 97/37044 770 LENGTH: 275 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1. 275 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:865: PCT1UJS97/05223 Leu Leu Asp Val Lys Ala Asn Ser Pro Ala Tyr Gln Ala Val Leu Leu 1 Ala Thr Thr Ser Ala As n Phe Leu Ala 145 Ala Ser Ile Al a Lys 225 Asn Lys Glu Leu Ala Phe Tyr Ile Gly Thr Val 130 Thr Gin Ala S er Ile 210 Ile His Thr Lys Asn :ys Asn Tyr Ile Giu Ile 115 Tyr Ile Giu Cys Gly 195 Gin Val1 Sen Arg Leu Ala Gly Asn Glu Asn Gly 100 Asn Pro Thr Leu Pro 180 Asn Gly Ser Thr Ser 260 5 Ala Pro Val1 Pro Lys Ile Gly Trp Val Leu 165 Asn Gly *Met *Giu -Pro 245 Pro Val Gly Pro Gly 70 Ala Pro Asp Ser Pro 150 Lys Phe Thr Ile Asn 230 Thr Ser Gly Sen Gly 55 His Tyr Val1 Lys His 135 Thr Gin Gin Met Ala 215 Thr Gin Gly Leu Asn 40 Gin Gly Gin Leu Arg 120 Gly Thr Ala Asn Cys 200 Asn Gin *Thr *Asp Trp 25 Asn Gly Ile Ser 105 Thr Lys Giu Ser Gly 185 Gly Ala Asn Leu Phe 265 10 ,ln %sn rhr Pro Ile 90 Asn Gly Pro Asn Ile 170 Gly Met Gin Gin Val1 250 Lys Val Ala4 Thr Ile 75 Gin Thr Gly Ile Ile 155 Ile Ser Phe Glu *Asn 235 Leu Pro rhr Asn Thr Ser Lys Thr Giu Sen 140 Asn Ile Gly Lys Ala 220 Ser Leu Ser Ser Gly Ile Thr Ala Thr Pro 125 Thr Thr Thr Tyr Asn 205 Val1 Leu 1Lys Arg Tyr Gly Thr Giu Leu Lys 110 Asn Ser Thr Thr Trp 190 Giu Ala Asp Ala Thr 270 kla Ilie :ys Asn Thr Leu Lys Trp Asn Leu 175 
Ala Ile Gln Ala Cys Ser Phe Gln Asn Tyr Ala Asp Lys Asn Ser 160 Asn Gly Ser Ala *Glu 240 *Ser -Gly
INFORMATION FOR SEQ ID NO:866: SEQUENCE CHARACTERISTICS: LENGTH: 90 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...90 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:866: Met Leu Thr Ile Glu Thr Ser Lys Lys Phe Asp Lys Asp Leu Lys Ile 1 5 10 Leu Val Lys Asn Gly Phe Asp Leu Lys Leu Leu Tyr Lys Val Val Gly 25 Asn Leu Ala Thr Glu Gln Pro Leu Glu Pro Lys Tyr Lys Asp His Pro 40 Leu Lys Gly Ala Leu Lys Asp Phe Arg Glu Cys His Leu Lys Pro Asp 55 Leu Leu Leu Val Tyr Gln Ile Lys Lys Gln Glu Asn Thr Leu Phe Leu 70 75 Val Arg Leu Gly Ser His Ser Glu Leu Phe
INFORMATION FOR SEQ ID NO:867: SEQUENCE CHARACTERISTICS: LENGTH: 100 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...100 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:867: Met Leu Glu Ile Glu Leu Lys Lys Lys Phe Thr Lys Asp Leu Lys Lys 1 5 10 His Ile Leu Asn Gln Lys Ile Glu Leu Glu Val Phe Asp Leu Val Val 25 Glu Asn Leu Arg Asn Gln Ile Pro Leu Asp Lys Arg Phe Lys Asp His 40 Ala Leu Ser Gly Thr Tyr Lys Gly Cys Arg Glu Arg His Ile Lys Pro 55 Asp Val Leu Leu Val Tyr Arg Val Lys Gly Asn Val Leu Thr Leu Val 70 75 Arg Leu Gly Ser His Ser Glu Leu Phe Cys Lys Pro Pro Thr Pro Leu 90 Ile Thr Leu Lys 100
INFORMATION FOR SEQ ID NO:868: SEQUENCE CHARACTERISTICS: LENGTH: 89 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...89 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:868: Leu Val Val Ser Gly Ser Leu Gln Asn Ile Val Arg Ser Phe Leu Arg 1 5 10 Arg Ile Ser Val
Phe Phe Ile Ile Asp Phe Ile Asn Phe Ile Phe Val 25 Ile Asp Gin Ile Ile His Phe Phe Gly Asp Phe Phe Ile Phe Phe Phe 40 Met Leu Asn Leu Phe Tyr Ile Leu Ile Arg Leu Ile Asp Leu Ile Val 55 Leu Ser Gly Phe Ser Val Val Leu Val Arg Leu Gly Arg Phe Phe Arg 70 75 Leu Ile Phe Ala Gly Ala Gin His Ala INFORMATION FOR SEQ ID NO:869: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature WO 97/37044 773 LOCATION .479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:869: PCT[US97/05223 Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro 1 Leu C His I Gin Thr Lys I Arg Ser Lys Ser 145 Val Asn Asn Ser Asn 225 Cys Ser Thr Thr Pro 305 Ser Thr Ser Pro Leu 385 Asn ,In .ys Jal Ile eu %sn %sn Pro 130 Lys rhr Asp Lea Ser 210 Val Gly Ile Pro Pro 290 Gin Ser Arg Glu Lys 370 Al Let Ala I Gly Leu Thr I Ser Thr Pro 115 Ser Leu Pro Ala Phe 195 Ala Glu Lys Leu Cys 275 Tyr Thr Thr Gin Tyr 355 Gin Tyr 1 Asn eu Phr Ay H{is Leu ln 100 Ile Ser Gly Ser Asn 180 Val Asp Lys Trp Lys 260 Asp Thr Phe Glu Cys 340 Gl Asj Sei Gl.
5 Val I Phe L Val 9 Ile Glu Ile Pro Ser Ser Lys 165 Pro Ala Ala Gin Va1 245 Arg Tyr Lys Glu Lys 325 Tyr Ile Asp Ser a Lys le Glu Lea .ys yr jer 10 Phr lal Ser In Lys 150 V~al Pro Pro Ser Ala 230 Tyr Va1 Ser Ile Ala 310 Cys Lea Thr Gir Thr 39 PhE Ala I Asn Thr 2 Thr I Phe Leu Gin 135 Asn Ser Leu Pro Glu 215 Ile Asp Asp Thr Ser 295 Lys Lys Ile Thr Val 375 Arg Met -ys 10 Ile la .Leu Ser Asn 120 Ser Ser Pro Lys Thr 200 Asn Arg Asp Lys Ala 280 Va1 Asn Arg Glu Glr 36C Lyc G 1 1 Lea C 25 Val I Ser I Ile N Ser Ser I 105 Ala I Pro Lys Thr His 185 Glu Asn Asp Glu Asp 265 Glu His Asn Ala iGlu 345 Leu Pro Ser i Phe .0 ;lu .eu ,ro Tal ?ro )0 ys ?ro 31n Asn Asn 170 Ser Lys Glu Pro Asn 250 Lys Asn Lys Phe Arg 330 Pro Val Thr Glu Val Glu Asp His Tyr 75 Asn Glu Met Asn Ser 155 Glu Ser Thr Ser Asn 235 Leu GLu Lys Thr Ala 315 Ala Let Lys PhE Ii 39 Ci' Ile I Ser Lys Gin Arg Lea Gin Phe 140 Lea Val Gin Leu Asn 220 Ile Gin Ile Ser Glu 300 Ile Arg Lys Ala Tyr 380 Thr i Val Lys Lys Lys Pro Pro Lys Lys 125 Ser Lea Lys Asp Pro 205 Glu Lys Ala Thr Cly 285 Pro Leu Lys Glr IlE 36E Cii Arc Tyl
I
Thr S Glu E Leu 9 Leu i Thr Glu 110 Pro Tyr Gin Thr Gin 190 Asn Asn GLu Tyr Thr 270 Lys Leu Gin Asp Ala 350 Tyr i Thr j Asn r Glu er ,ro 'hr ~sp Ile ?ro Mln Pro Pro Pro 175 lu Asn Arg Phe Arg 255 Asp Ile Glu Ala Gly 335 Trp Glu Ser CIt 411 415 Pro Arg Lea Glu Pro His Asn Glu Leu 160 Thr Asn Thr Asp Ala 240 Pro Ile Ile Asp Arg 320 Thr lu Arg Glu Lea 400 r His 405 410 Tyr Lea Asn Asp Ile Ile Lys Glu Ser Ser Glu Tyr Lys Glu Trp Val WO 97/37044 PCT/US97/05223 774 420 425 430 Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu Ile 435 440 445 Glu Glu Gin Pro Arg Ala Lys Ser Thr Pro Leu Ser Ile Glu Asn Ser 450 455 460 Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 465 470 475 INFORMATION FOR SEQ ID NO:870: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:870: Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro 1 5 10 Leu Gin Ala Leu Val Ile Glu Leu Leu Glu Glu Ile Lys Thr Ser Pro 25 His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Glu Pro Arg 40 Gin Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys Lys Leu Thr Leu 55 Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gin Pro Leu Asp Glu 70 75 Lys Leu Ser Leu Glu Thr Thr Leu Ser Pro Asn Arg Pro Thr Ile Pro 90 Arg Asn Thr Gin Ile Val Phe Ser Ser Lys Glu Leu Lys Glu Pro His 100 105 110 Ser Asn Pro Ile Pro Ser Leu Asn Ala Pro Met Gin Lys Pro Gin Asn 115 120 125 Lys Pro Ser Ser Ser Gin Gin Ser Pro Gin Asn Phe Ser Tyr Pro Glu 130 135 140 Ser Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gin Pro Leu 145 150 155 160 Val Thr Pro Ser Lys Val Ser Pro Thr Asn Glu Val Lys Thr Pro Thr 165 170 175 Asn Asp Ala Asn Pro Pro Leu Lys His Ser Ser Gin Asp Gin Glu Asn 180 185 190 Asn Leu Phe Val Ala Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn Thr 195 200 205 Ser Ser Ala Asp Ala Ser Glu Asn Asn Glu 
Ser Asn Glu Asn Arg Asp 210 215 220 Asn Val Glu Lys Gln Ala Ile Arg Asp Pro Asn Ile Lys Glu Phe Ala 225 230 235 240 Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gln Ala Tyr Arg Pro 245 250 255 Ser Ile Leu Lys Arg Val Asp Lys Asp Lys Glu Ile Thr Thr Asp Ile 260 265 270 Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys Ile Ile 275 280 285 Thr Pro Tyr Thr Lys Ile Ser Val His Lys Thr Glu Pro Leu Glu Asp 290 295 300 Pro Gln Thr Phe Glu Ala Lys Asn Asn Phe Ala Ile Leu Gln Ala Arg 305 310 315 320 Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly Thr 325 330 335 Thr Arg Gln Cys Tyr Leu Ile Glu Glu Pro Leu Lys Gln Ala Trp Glu 340 345 350 Ser Glu Tyr Glu Ile Thr Thr Gln Leu Val Lys Ala Ile Tyr Glu Arg 355 360 365 Pro Lys Gln Asp Asp Gln Val Glu Pro Thr Phe Tyr Glu Thr Ser Glu 370 375 380 Leu Ala Tyr Ser Ser Thr Arg Lys Ser Glu Ile Thr Arg Asn Glu Leu 385 390 395 400 Asn Leu Asn Glu Lys Phe Met Glu Phe Val Glu Val Tyr Glu Gly His 405 410 415 Tyr Leu Asn Asp Ile Ile Lys Glu Ser Ser Glu Tyr Lys Glu Trp Val 420 425 430 Lys Asn His Val Arg Phe Lys Glu Gly Val Cys Met Ala Leu Glu Ile 435 440 445 Glu Glu Gln Pro Arg Ala Lys Ser Thr Pro Leu Ser Ile Glu Asn Ser 450 455 460 Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 465 470 475
INFORMATION FOR SEQ ID NO:871: SEQUENCE CHARACTERISTICS: LENGTH: 300 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:871: Met Arg Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu 1 5 10 Ser Ser Val His Ala Ser Asn Ala Leu Ile Leu Gln Thr Asp Phe Ser 25 Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val 40 Asp Ser Asn Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr 55 Asn Ile Trp Glu Gly Ala Tyr Arg Leu Tyr Gln Thr Ala Ser Tyr Trp 70 75 Pro Lys Gly Ser Val
Phe Val Ser Val Val Asp Pro Gly Val Gly Thr 90 Asn Arg Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gln Tyr Phe Val 100 105 110 Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gln Thr Leu Gly Ile 115 120 125 Asp Ser Val Arg Glu Ile Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser 130 135 140 Glu Lys Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly 145 150 155 160 Ala Arg Leu Ala Ser Gly Ala Ile Thr Phe Glu Gln Val Gly Pro Glu 165 170 175 Leu Pro Ile Lys Val Val Glu Ile Pro Tyr Gln Lys Ala Lys Ala Thr 180 185 190 Lys Gly Gly Val Lys Gly Asn Ile Pro Ile Leu Asp Ile Gln Tyr Gly 195 200 205 Asn Val Trp Ser Asn Ile Ser Asp Lys Leu Leu Asn Gln Ala Gly Ile 210 215 220 Lys Arg Asn Asp Thr Val Cys Val Thr Ile Phe Lys Asn Ser Lys Lys 225 230 235 240 Gln Tyr Glu Gly Lys Met Pro Tyr Val Ala Ser Phe Gly Asp Val Pro 245 250 255 Glu Gly Gln Pro Leu Val Tyr Leu Asn Ser Leu Leu Asn Val Ser Val 260 265 270 Ala Leu Asn Met Asp Asn Phe Ala Gln Lys His Gln Ile Lys Ser Gly 275 280 285 Ala Asp Trp Asn Ile Asp Ile Lys Lys Cys Ala Lys 290 295 300
INFORMATION FOR SEQ ID NO:872: SEQUENCE CHARACTERISTICS: LENGTH: 156 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...156 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:872: Leu Thr Lys Lys Phe Met Ser Trp Met Val Val Ile Gly Ala Leu Ile 1 5 10 Cys Val Leu Leu Gly Val Phe Ile Phe Phe Thr Ser Met Ser Val Lys 25 Lys Ser Leu Thr Ala Tyr Leu Asn Ala Tyr Leu Glu Gln Arg Pro Asn 40 Ile Glu Gly Met Gly Ile Ile Gly Val Pro Phe Lys Cys Glu Arg Phe 55 Phe Lys Ile Ala Cys Val Ser Lys Glu Leu Arg Phe Leu Asp Pro Gln 70 75 Asn Ser Pro Ile Met Asp Phe Lys Asn Leu Lys Ile Lys Leu His Ser 90 Leu Asp Lys Ser Ser Leu Thr Leu Ser Ile His Ser Gln Ile Gln Ser 100 105 110 Pro Ile Leu Glu Gln Ser Ile Gln Gln Lys Ile Ser Gln Ile Pro Leu 115 120 125 Lys Asn Leu Asn Ala Leu Leu Glu Lys Phe Lys Pro Thr Arg Leu Asn
130 135 140 Cys Ser Leu Thr Phe Asn Ala Leu Asp Glu Lys Thr 145 150 155
INFORMATION FOR SEQ ID NO:873: SEQUENCE CHARACTERISTICS: LENGTH: 285 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...285 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:873: Met Gln Asp Phe Ile Lys Ile Phe Ile Gln Glu Val Val Ser Thr Leu 1 5 10 Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly Leu Glu Lys Glu Val 25 Ser Asn Asn Glu Glu Ala Ser Leu Ile Ser Thr Pro Tyr Ala Arg Val 40 Lys Ile Ser Ala Ile Glu Lys Asn Glu Ser Pro Ile Glu Leu Leu Ala 55 Pro Val Asp Leu Val Thr Ala Leu Ser Asp Leu Met Leu Gly Gly Glu 70 75 Gly Ala Ser Lys Glu Glu Met Asp Asn Asp Asp Leu Asp Ala Phe Lys 90 Glu Met Ala Ser Asn Ile Phe Gly Ala Ile Ala Thr Ser Leu Lys Ser 100 105 110 Gln Glu Leu Leu Pro Lys Leu Asn Phe Thr Thr Thr Asn Ala Glu Ile 115 120 125 Ala Lys Glu Leu Pro Lys Lys Glu Asp Tyr Ala Lys Ala Met Val Phe 130 135 140 Ser Phe Lys Met Glu Ala Ile Lys Glu Ser Gln Ile Val Leu Leu Ile 145 150 155 160 Thr Ser Ala Phe Glu Gly Gln Phe Glu Lys Thr His Lys Glu Glu Lys 165 170 175 Glu Glu Thr Thr Lys Ser Ala Thr Glu Glu Thr Lys Thr His Asp Ala 180 185 190 Ser Leu Glu Asn Ile Glu Ile Arg Asn Ile Ser Met Leu Leu Asp Val 195 200 205 Lys Leu Asn Val Lys Val Arg Ile Gly Gln Lys Lys Met Ile Leu Lys 210 215 220 Asp Val Val Ser Met Asp Ile Gly Ser Val Val Glu Leu Asp Gln Leu 225 230 235 240 Val Asn Asp Pro Leu Glu Ile Leu Val Asp Asp Lys Val Ile Ala Lys 245 250 255 Gly Glu Val Val Ile Val Asp Gly Asn Phe Gly Ile Gln Ile Thr Asp 260 265 270 Ile Gly Thr Lys Lys Glu Arg Leu Glu Gln Leu Lys Asn 275 280 285
INFORMATION FOR SEQ ID NO:874: SEQUENCE CHARACTERISTICS: LENGTH: 329 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...329 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:874: Met Thr Thr Lys Arg Val Asn Thr Ala Thr Asn Lys Ile Met Thr Leu 1 5 10 Asn Thr Phe Leu Asp Thr Cys Phe Leu Phe Phe Ile Ser Ile Leu Phe 25 Tyr Leu Ser Ile Pro Ile Tyr Pro Asn Lys Val Val Val Val Pro Gln 40 Gly Ser Leu Lys Lys Val Phe Phe Ser Leu Lys Glu Gln Gly Val Asp 55 Ile Asn Ala Leu Asp Leu Leu Leu Leu Arg Leu Met Gly Met Pro Lys 70 75 Lys Gly Tyr Ile Asp Met Gly Asp Gly Ala Leu Arg Lys Gly Asp Phe 90 Leu Val Arg Leu Ile Lys Ala Lys Thr Ala Gln Lys Ser Val Thr Leu 100 105 110 Ile Pro Gly Glu Thr Arg Tyr Phe Phe Thr Gln Ile Leu Ser Glu Thr 115 120 125 Tyr Gln Leu Glu Thr Ser Asp Leu Asn Glu Ala Tyr Glu Ser Ile Ala 130 135 140 Pro Arg Leu Asn Gly Ala Val Ile Glu Asp Gly Val Ile Trp Pro Asp 145 150 155 160 Thr Tyr His Leu Pro Leu Gly Glu Asp Ala Phe Lys Ile Met Gln Thr 165 170 175 Leu Ile Gly Gln Ser Met Lys Lys His Glu Ala Leu Ser Lys Gln Trp 180 185 190 Leu Gly Tyr Tyr His Lys Glu Glu Trp Phe Glu Lys Ile Ile Leu Ala 195 200 205 Ser Ile Val Gln Lys Glu Ala Ala Asn Val Glu Glu Met Pro Leu Ile 210 215 220 Ala Ser Val Ile Phe Asn Arg Leu Lys Lys Gly Met Pro Leu Gln Met 225 230 235 240 Asp Gly Ala Leu Asn Tyr Gln Glu Phe Ser His Ala Lys Val Thr Lys 245 250 255 Glu Arg Ile Lys Thr Asp Asn Thr Pro Tyr Asn Thr Tyr Lys Phe Lys 260 265 270 Gly Leu Pro Lys Asn Pro Val Gly Ser Val Ser Leu Glu Ala Val Arg 275 280 285 Ala Val Val Phe Pro Lys Lys Thr Asp Phe Leu Tyr Phe Val Lys Met 290 295 300 Pro Asp Lys Lys His Ala Phe Ser Ala Thr Tyr Lys Glu His Leu Lys 305 310 315 320 Asn Ile Asn Ile Ser Asn Asn His Phe 325
INFORMATION FOR SEQ ID NO:875: SEQUENCE CHARACTERISTICS: LENGTH: 146 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...146 (xi) SEQUENCE
DESCRIPTION: SEQ ID NO:875: Met Arg Leu Leu Phe Leu Leu Leu Ser Ala Thr Leu Met Leu Leu Ala 1 5 10 Glu Glu Lys Ile Pro Leu Ser Asp Asp Ala Pro Ile Lys Leu Val His 25 Trp Gln Asn Ala Leu Lys Glu Val Gln Pro Asp Ser Asn Ala Pro Ala 40 Thr Pro Pro Ile Lys Ala Val Gln Thr Thr Leu Thr Phe Glu Thr Pro 55 Phe Asn Lys Thr Pro Lys Ile Met Glu Val Glu Gly Gln Lys Val Ile 70 75 Val Leu Lys Asn Ala Gln Leu Asp Ser Lys Lys Thr Met Asp Phe Lys 90 Glu Ala Ser Leu Asn Ala Leu Glu Met Phe Ser Tyr Gln Asn Asp Ile 100 105 110 Tyr Leu Leu Ser Lys Lys Ala Lys Ala Gly Leu Glu Ile Gln Ala Ser 115 120 125 Ser Ser Lys Asp Lys Lys Gln Leu Ala Phe Phe Phe Tyr Pro Lys Val 130 135 140 Phe Ile 145
INFORMATION FOR SEQ ID NO:876: SEQUENCE CHARACTERISTICS: LENGTH: 660 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc -feature LOCATION 1. 660 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:876: Met Val Lys Ile Phe Ser Asn Arg Met Ile 145 Giu Lys Leu Tyr Leu Lys Lys Ile Phe Tyr Gly Phe Ile Val Leu Gly Leu Leu Asp Ile Ala Leu Asp Arg Arg Phe Tyr Leu Leu Ala Leu Asp Ala 100 Tyr Thr Giu 115 Val Leu Thr 130 Ile Sen Ile Arg Tyr Leu Thr Ala Sen 180 Lys Giu Ile 195 Asp Pro Thn 210 Ala Lys Lys Ala Val1 Val Gly Arg Ang Asn 165 Leu Thn Lys Val1 Ile Gly Arg 70 Glu Met Gly Glu Ile 150 Gin Gly Met As n Leu Lys Arg 55 Phe Asp Ang Ser Lys 135 Giu Thn Tyr Leu Leu Val Asp 40 Leu Glu Thr Ala Thr 120 Thr Lys Phe Phe Val 200 Glu Ala 25 Tyn Ile Glu Leu Met 105 Leu Leu Val1 Phe Lys 185 Ala Phe 10 Gin Arg Ala Ile Phe 90 Ile Thr Thr Leu Gly 170 Lys Leu Ser Val1 Pro Asn Pro 75 Phe Lys Gin Arg Ser 155 His Pro Pro Leu Trp Ser Ile Pro Giu Asn Gin Lys 140 Lys Gly Leu Arg Sen Phe Val1 Val1 Tyr Arg His Ala Leu 125 Leu Glu Tyr Asp Ala 205 Ang Leu Thr Ala Asp Phe Gly Lys 110 Val1 Lys Giu Tyr Lys 190 Pro Ile Ile Thr Asp Sen Gin Lys Glu Val Giu Gly Ile Sen Giy Lys Asn Giu Ala Ile Leu 160 Gly Val 175 Leu Thr Ser Phe 215 Ile Leu Arg Arg Leu Tyr Sen Leu Gly Tnp Ile Ser Ser Asn Giu Leu WO 97/37044 PCTIUS97/05223 zz 230 Lys Ser Ala Leu Asn Glu Val Pro Ile Va Gin Gin Asp Gin 305 Ser Glu Lys Ser Ser 385 Asn Tyr His Glu Lys 465 Ala Pro Phe Leu Leu 545 Asn I Ser Ala 9) Asn I 6 Lys G 625 Pro A Arg L As Lei Lei 29] Ly Asr Thi Lys Ala 370 Thr Tyr Thr Ser Lys 450 Asp Ala Met Thr Thr 530 kla ksn lal hr le )10 'ly isn ,eu n I As 27 As ;Ile a AsE Sei Sei IlE Thr Ser Arg Leu 435 Ile Leu Glu Leu Pro 515 Leu Arg Asn Ile Gly 595 Leu Leu Ser Leu e Ala 260 p Gly 5 p Tyr e Leu p Lys Thr 340 Ala Lys Ser Lys Lys 420 Asn Tyr Ser Lys Ile 500 Ile Ser I Ile I Ile I Trp 1 580 Gly Ala I Arg L Ile T 6 Phe 660 24 Pr Lei G1i Gl.
Asj 32 Glj PhE Pro Lys Asn 405 Phe Leu Gin Ile Tyr 485 3iu 3lu Ula ,ys tsp 'he Tal :le lys 'hr '45 5 o Tyr u Lys i Arg i Lys 310 Glu r Lys Asn Phe Ile 390 Ser Leu Ala Ser Vai 470 Ser Ser Thr Leu I Gly 550 Ala Gly Val Glu I 630 Va.
Th Let 29E IlE Asr Ile Arg Val 375 Pro Val Gly Thr Leu 455 Leu Leu Ile Lys Met 535 Leu rrp krg 3er ?ro 515 Ele 1 Val Gin 280 a Ala Ala Asn Leu Ala 360 Tyr Asp Gin Leu Ile 440 Ser Gly Phe Thr Lys 520 Asp Glu Phe Asp I Aa I 600 Ser I Val I Asp 265 Gly Leu Lys Leu Ala 345 Thr Gin Thr Asn Val 425 Asn Asp Ser Ser Asn 505 Ile kia ie C Il ksp 1 585 ?ro ~eu L tsp L 25 Gb Ty Gb.
G11 Asr 33C LeL Gir Ile Ala His 410 Thr Leu Met Phe Asn 490 31n Phr Ial la ;ly ;70 tsn Tal lys ,ys 235 1 Tyr 3 Val r Thr i Ser a Lys 315 Ala Val Ala Ala Arg 395 Ala Leu Ser Gly Ala 475 Tyr Gin Ser Glu Gly 555 Phe Thr Tyr Arg I e Ile P 635 Asr Let IlE Let 300 Prc Ser Gly Lys Phe 380 Asn Trp Gin Asp Phe 460 Ile Gly Asn Lys ksn 540 Lys 'hr ?ro jer .ys 'ro Gin I Lys Lys 285 Arg Lys Met Gly Arg 365 Asp Phe His Glu Gin 445 Lys Ser Thr Glu Glu C 525 Gly 'I Thr C Pro T Ile G 5 Tyr P 605 Phe A Tyr T Th Gir 27( Lei PhE Thr Ile Ile 350 Gin Asn Glu Pro Ala 430 Leu Asn Pro 4et lai 510 lnn 'hr ;ly hr ly he *sp yr Se 25 I Lei a Th Gb Asr Val 335 Asr Phe Gly Asn Ser 415 Leu Gly Leu Ile Leu 495 Lys Ala Gly Thr Leu 575 Lys Met Vai Ser r Thr Asp Ile I His 1 Ala 320 Thr Tyr Gly Tyr Gly 400 Asn Ser Phe Pro Asp 480 Lys Thr Phe Ser Ser 560 Gin Gly Arg Pro Ser 640 Pro Thr Pro Lys Lys Thr Asp Asp Ser Giu Glu 650 Cc INFORMATION FOR SEQ ID NO:877: WO 97/37044 PCT/US97/05223 782 SEQUENCE CHARACTERISTICS: LENGTH: 113 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...113 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:877: Met Ala Lys Met Asn Ala Pro Asp Gly Val Ala Val Trp Val Asn Glu 1 5 10 Asp Arg Cys Lys Gly Cys Asp Ile Cys Val Ser Val Cys Pro Ala Gly 25 Val Leu Gly Met Gly Ile Glu Lys Glu Arg Val Leu Gly Lys Val Ala 40 Lys Val Ala Tyr Pro Glu Ser Cys Ile Gly Cys Val Gin Cys Glu Leu 55 His Cys Pro Asp Phe Ala Ile Tyr Val Ala Asp Arg Lys Asp Phe Lys 70 75 Phe Ala Lys Val Ser Lys Glu Ala Gin Glu Arg Ser Glu Lys Val Lys 90 Ala Asn Lys Tyr Met Leu Leu Glu Glu Thr Ile Leu Glu Gly Arg Gly 100 105 110 Lys INFORMATION FOR SEQ ID NO:878: SEQUENCE CHARACTERISTICS: LENGTH: 339 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature 
LOCATION 1...339 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:878: Met Ile Phe Ile Asp Ala Cys Phe Arg Lys Glu Thr Pro Tyr Thr Pro WO 97/37044 PCT/US97/05223 Ile Trp Met Met Arg Gin Ala Gly Arg Tyr Leu Ser Glu, T C1 r 1 Ser Leu Ala Leu Ile Gin Ser Ala 145 Lys Lys Gly Lys Lys 225 Lys Phe Gly Asp Gly 305 Leu Thr Arg Ala Ala Asn Thr Leu Lys 130 Thr Lys Leu Val Glu 210 Glu Gly Gly Gly Lys 290 Asn Pro Arg Lys Thr l1e Leu Asp Asn 115 Glu Tyr Met Ser Asn 195 Ala Leu Ile Val Lys 275 Asn Gn Arg Arg Lys Glu Leu Glu Leu 100 Tyr Lys Met Leu Leu 180 Ala Tyr Lys Gly Asp 260 Tyr Ala Gly Glu z Ala Val Phe Phe Lys Val Ala Ile Tyr 165 Glu Val Leu Lys Ala 245 Trp Val Leu His Asn 325 Gly Thr SSer 70 Ile Ser Tyr Leu Glu 150 Ser Leu Met Glu Arg 230 Tyr Gly Leu Glu Ile 310 Ala Ser Leu 55 Asp Pro Val Asp Ile 135 Gly Glu Ile Ile Phe 215 Tyr Leu Thr G1n Glu 295 Phe SPhe 40 Gin SIle SLys Glu Thr 120 Gly Glu Pro Glu Phe 200 Ser Ala Asp Pro Gly 280 Gly Asn Leu Pro Leu Lys Ser 105 Ile Phe Gly Glu Tyr 185 Asp Trp His Ser Leu 265 Asn Val Leu Glu Val Val Gly 90 Leu Ser Cys Ser Val 170 Leu Ser Asp Ile Ile 250 Thr Leu Glu Gly I Leu Glu Val 75 Pro Lys Gin Gly Lys 155 Leu Ser Trp Tyr Pro 235 Asp Ala Glu Lys -is 315 SCys Ile Pro His Val Thr Ser 140 Ser Lys Leu Ala Leu 220 Val Gly Ala Pro Ile 300 Gly Lys Leu Leu Phe Gly Arg 125 Pro Tyr Ala Gin Ser 205 Lys Ile Glu Lys Thr 285 Leu vet Asn Gly Glu Leu Val 110 Gin Trp Ala Leu Ile 190 Ala Lys Leu Phe Lys 270 Arg Lys Leu Ser Val Met Glu Tyr Lys Thr Lys Leu 175 Gin Leu Ile Phe Asp 255 Ile Leu Val Pro Asp SAsp SGly Thr Lys Leu Leu Ser 160 Glu Ala Glu Ser Pro 240 Val Leu Tyr Met Asp 320 Lys Tyr Leu Val Gin Leu Val His Ala Lys INFORMATION FOR SEQ ID NO:879: SEQUENCE CHARACTERISTICS: LENGTH: 192 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCTIUS9/05223 784 (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...192 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:879: Val 
Leu Glu Lys Ser Phe Leu Lys Ser Lys Gln Leu Val Leu Cys Gly 1 5 10 Leu Gly Val Leu Met Leu Gln Ala Cys Thr Cys Pro Asn Thr Ser Gln 25 Arg Asn Ser Phe Leu Gln Asp Val Pro Tyr Trp Met Leu Gln Asn Arg 40 Ser Gln Tyr Leu Thr Gln Gly Val Asp Ser Ser His Ile Val Asp Gly 55 Lys Ala Thr Glu Glu Ile Glu Lys Ile Ala Thr Lys Arg Ala Thr Ile 70 75 Arg Val Ala Gln Asn Ile Val His Lys Leu Lys Glu Ala Tyr Leu Ser 90 Lys Ser Asn Arg Ile Lys Gln Lys Ile Thr Asn Glu Met Phe Ile Gln 100 105 110 Met Thr Lys Pro Ile Phe Asp Ser Leu Met Asn Val Asp Arg Leu Gly 115 120 125 Ile Tyr Ile Asn Pro Asn Asn Glu Glu Val Phe Ala Leu Val Arg Ala 130 135 140 Arg Ser Phe Asp Lys Asp Ala Leu Ser Glu Gly Leu His Lys Met Ser 145 150 155 160 Leu Asp Asp Gln Ala Val Ser Ile Leu Val Ser Lys Val Glu Glu Ile 165 170 175 Phe Lys Asp Ser Ile Asn Tyr Gly Asp Val Lys Val Pro Ile Ala Met 180 185 190
INFORMATION FOR SEQ ID NO:880: SEQUENCE CHARACTERISTICS: LENGTH: 110 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...110 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:880: Met Leu Arg Asn Gln Phe Arg Ile Val Phe Val Ser Cys Ile Val Ala 1 5 10 Ser Ser Leu Gln Ala Gln Glu Asn Thr His Thr Leu Gly Lys Val Thr 25 Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr Ile Asp 40 Arg Lys Glu Leu Gln Gln Arg Gln Ser Asn Gln Ile Arg Asp Ile Phe 55 Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met Ala Gln 70 75 Lys Ile Tyr Val Arg Gly Ile Glu Ser Arg Leu Leu Arg Val Thr Ile 90 Asp Gly Val Ala Gln Asn Gly Asn Ile Phe His His Asp Ala 100 105 110
INFORMATION FOR SEQ ID NO:881: SEQUENCE CHARACTERISTICS: LENGTH: 89 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...89 (xi) SEQUENCE DESCRIPTION: SEQ ID
NO:881: Val Met Lys Lys Ile Val Val Ser Leu Cys Val Ala Leu Gly Phe Leu 1 5 10 Ser Ala Asp Pro Ala Gln Ala Asn Lys Ala Ile Ser Asp Ala Asp Leu 25 Ile Glu Glu Ile Arg Asp Leu Lys Lys Ile Ile Ser Ala Gln Asn Thr 40 Glu Ile Asn Gln Leu Arg Lys Val Gln Glu Val Leu Ser Gly Gln Leu 55 Gly Asp Met Arg Lys Asp Ile Leu Ser Thr Arg Asp Tyr Cys Ile Ser 70 75 Leu Arg Pro Tyr Ile Tyr Asn Trp Arg
INFORMATION FOR SEQ ID NO:882: SEQUENCE CHARACTERISTICS: LENGTH: 158 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...158 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:882: Leu Asn Lys Phe Gly Trp Arg Ala Cys Phe Leu Thr Leu Phe Phe Ser 1 5 10 Gly Tyr Ser Lys Lys Ala Pro Gly Thr Ile Gly Ser Leu Val Ala Leu 25 Leu Leu Gly Leu Pro Ile Leu Ile Phe Ser Ala Asn Thr Leu Phe Leu 40 Gly Ala Ile Phe Val Gly Leu Ile Ala Ile Thr Gln Ile Asp Lys Glu 55 Glu Glu Glu Thr Lys Arg His Asp Ser Ser Tyr Ile Val Ile Asp Glu 70 75 Leu Val Gly Met Trp Leu Ala Met Ala Ile Ser Gly Leu Ser Leu Val 90 Gly Val Ile Leu Ser Phe Ile Phe Phe Arg Ile Tyr Asp Ile Thr Lys 100 105 110 Pro Ser Leu Ile Gly Lys Ile Asp Lys Glu Val Lys Gly Gly Leu Gly 115 120 125 Val Val Ala Asp Asp Ala Leu Ala Gly Val Leu Ala Gly Leu Ser Thr 130 135 140 Leu Leu Ala Ile Asn Ile Leu Gly Phe Phe Asn Ile Lys Phe 145 150 155
INFORMATION FOR SEQ ID NO:883: SEQUENCE CHARACTERISTICS: LENGTH: 133 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...133 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:883: Met Glu Leu Ile Lys Lys Leu Glu Lys Glu Ser Glu Val Leu Lys Lys 1 5 10 Asp Leu Gln Gln His Ser Asn Glu Leu Phe Lys Met Leu Ile Ile Asp 25 Asn Glu Asp Leu Phe Lys Glu Gln Phe Glu Ile Met Phe Lys Ala Trp 40 Val Glu Ile Val Lys Met Met Phe Glu Leu
Thr Lys Lys Thr Lys Phe 55 Asp Gly Glu Met Ile Gly Tyr Thr Glu Glu Leu Leu Thr Phe Leu Val 70 75 WO 97/37044 PCT/US97/05223 787 Arg Asp Phe Phe Asn Gly Ile Phe Lys Ser Lys Val 90 Pro Ile Phe Cys Gly Asp Val Lys Cys Glu Asp Phe 100 105 Ser Leu Val Tyr Leu Ser Val Leu Glu Leu Glu Glu 115 120 Asn Lys Ile Pro Phe 130 INFORMATION FOR SEQ ID NO:884: SEQUENCE CHARACTERISTICS: LENGTH: 133 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...133 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:884: Met Glu Leu Ile Lys Lys Leu Glu Lys Glu Ser Glu 1 5 Asp Leu Gin Gin His Ser Asn Glu Leu Phe Lys Met 25 Asn Glu Asp Leu Phe Lys Glu Gin Phe Glu Ile Met 40 Val Glu Ile Val Lys Met Met Phe Glu Leu Thr Lys 55 Asp Gly Glu Met Ile Gly Tyr Thr Glu Glu Leu Leu 70 75 Arg Asp Phe Phe Asn Gly Ile Phe Lys Ser Lys Val Pro Ile Phe Cys Gly Asp Val Lys Cys Glu Asp Phe 100 105 Ser Leu Val Tyr Leu Ser Val Leu Glu Leu Glu Glu 115 120 Asn Lys Ile Pro Phe 130 INFORMATION FOR SEQ ID NO:885: SEQUENCE CHARACTERISTICS: LENGTH: 125 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein Ile Pro Lys Met Asn Ala Leu Arg 110 Thr Ile Asn Pro 125 Val Leu Phe Lys Thr Ile Asn Thr 125 Leu Ile Lys Thr Phe Pro Ala 110 Ile Lys Asp Trp Phe Val Met Arg Pro WO 97/37044 PCT/US97/05223 788 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...125 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:885: Met Lys Lys Leu Ala Ala Leu Phe Leu Val Ser Ala Leu Gly Val Met 1 5 10 Ser Leu Asn Ala Trp Glu Gin Thr Leu Lys Ala Asn Asp Leu Glu Val 25 Lys Ile Lys Ser Val Gly Asn Pro Ile Lys Gly Asp Asn Thr Phe Val 40 Leu Ser Pro Thr Leu Lys Gly Lys Ala Leu Glu Lys Ala Ile Val Arg 55 Val Gin Phe Met Met Pro Glu Met Pro Gly Met Pro Ala Met Lys Glu 70 75 Met Ala Gin Val Ser Glu Lys Asn Gly Leu Tyr Glu Ala Lys Thr Asn 90 Leu Ser Met 
Asn Gly Thr Trp Gln Val Arg Val Asp Ile Lys Ser Lys 100 105 110 Glu Gly Gln Val Tyr Arg Thr Lys Thr Ser Leu Asp Leu 115 120 125
INFORMATION FOR SEQ ID NO:886: SEQUENCE CHARACTERISTICS: LENGTH: 341 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...341 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:886: Met Lys Arg Leu Leu Leu Leu Ala Leu Ala Leu Phe Phe Ser Leu Ser 1 5 10 Cys Thr Asn Ala Gln Glu Ile Lys Glu Thr Gln Glu Thr Lys Lys Thr 25 Lys Glu Thr Lys Ser Gln Thr Arg Phe Asn Ile Ser Thr Thr Lys Val 40 Ile Glu Lys Glu Phe Ser Gln Ser Arg Arg Tyr Tyr Ala Leu Leu Glu 55 Pro Asn Glu Ala Leu Ile Phe Ser Gln Thr Leu Arg Phe Asp Gly Tyr 70 75 Val Glu Lys Leu Tyr Ala Asn Lys Thr Tyr Thr Pro Ile Lys Lys Gly 90 Asp Arg Leu Leu Ser Val Tyr Ser Pro Glu Leu Ala Gly Val Gln Ser 100 105 110 Glu Leu Leu Ser Ser Leu Lys Phe Asn Gln Gln Val Gly Ala Ile Lys 115 120 125 Glu Lys Leu Lys Leu Leu Gly Leu Glu Asn Phe Ser Ile Glu Lys Ile 130 135 140 Ile Ser Ser His Lys Val Gln Asn Glu Ile Thr Ile Tyr Ser Arg Phe 145 150 155 160 Asn Gly Val Ile Phe Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe 165 170 175 Ile Lys Lys Gly Gln Glu Leu Phe Lys Ile Ile Asp Leu Ser Arg Leu 180 185 190 Trp Ala Leu Val Lys Val Asn Gln Glu Asp Leu Glu Phe Leu Lys Asn 195 200 205 Thr His Gln Ala Ile Leu Phe Val Glu Gly Val Lys Gly Lys Gln Ala 210 215 220 Ile Thr Leu Glu Asn Ile Asn Pro Ile Ile Asn Ala Gln Asp Lys Met 225 230 235 240 Leu Glu Ala Arg Phe Asn Val Pro Asn Leu Lys Leu Leu Tyr Tyr Pro 245 250 255 Asn Met Phe Ala Gln Val Glu Ile Phe His Lys Pro Gln Lys Met Lys 260 265 270 Ile Leu Pro Lys Glu Ala Val Leu Ile Lys Gly Gly Lys Ala Ile Val 275 280 285 Phe Lys Lys Asp Asp Phe Gly Leu Ser Pro Leu Glu Ile Lys Ala Val 290 295 300 Arg Leu Ser Asp Gly Ser Tyr Glu Ile Leu Glu Gly Leu Lys Ala Gly 305 310 315 320 Glu Glu Val Ala Asn Asn Ala Leu Phe Val
Leu Asp Ala Asp Ala Gln 325 330 335 Asn Asn Gly Asp Tyr 340
INFORMATION FOR SEQ ID NO:887: SEQUENCE CHARACTERISTICS: LENGTH: 120 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...120 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:887: Met Gly Val Ala Val Val Leu Phe Leu Thr Leu Ile Leu Leu Phe Leu 1 5 10 Val Leu Arg Asp Phe Gly Leu Ala Ser Pro Lys Gln Lys Ile Leu Ala 25 Phe Leu Ile Val Gly Ile Ile Gly Ala Ser Ile Ser Val Tyr Thr Tyr 40 Lys Gln Asn Gln Gln Asn Gln Gln Glu Ile Ala Leu Gln Arg Ala Phe 55 Leu Arg Gly Glu Thr Leu Leu Cys Lys Gly Ile Lys Val Asn Asn Gln 70 75 Thr Phe Asn Leu Val Ser Gly Thr Leu Ser Phe Leu Gly Lys Lys Gln 90 Thr Pro Met Lys Asp Val Leu Val Asp Leu Asp Ser Cys Gln Thr Leu 100 105 110 Gln Lys Asp Pro Leu Ile Gln Pro 115 120
INFORMATION FOR SEQ ID NO:888: SEQUENCE CHARACTERISTICS: LENGTH: 332 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...332 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:888: Met Arg Phe Phe Ile Leu Phe Phe Met Gly Met Leu Gly Val Gly Phe 1 5 10 Ser Gln Thr Glu Leu Asp Leu Lys Asp Leu Glu Lys Lys Pro Ala Gly 25 Ile Val Arg Asp Tyr Tyr Leu Trp Arg Tyr Ile Ser Asp Lys Lys Thr 40 Ser Leu Glu Asn Ala Lys Lys Ala Tyr Glu Leu Thr Gln Asn Lys Asn 55 Ser Ala Leu Gln Lys Ala Met Gln Glu Lys Gly Val Glu Asn Ser Asp 70 75 Lys Ser Pro Asp Ala Lys Met Pro Glu Asp Ile Tyr Cys Lys Gln Ile 90 Thr Leu Glu Ser Met Leu Glu Thr Thr Asp Ala Phe Gln Ala Ser Cys 100 105 110 Ile Ala Ile Ala Leu Lys Ser Lys Ile Arg Asp Phe Asp Lys Ile Pro 115 120 125 Ile Gln Thr Phe Lys Pro Leu Gln Glu Lys Ile Lys Glu Ala Tyr Pro 130 135 140 Ile Leu Tyr Glu Glu Leu Glu Ile Leu Gln Ser Lys Asn Val Ser Ala 145 150 155 160 Ser Leu Phe Lys Ala Asn Ala Gln Val
Phe Ser Ala Leu Phe Asn His 165 170 175 Leu Ser Tyr Glu Lys Lys Leu Gln Ile Phe Glu Lys His Ile Pro Ile 180 185 190 Lys Glu Leu Asn Arg Leu Leu Asp Glu Asp Tyr Pro Ala Phe Asn Arg 195 200 205 Leu Ile Tyr Gln Val Ile Leu Asp Pro Lys Leu Asp His Phe Lys Asp 210 215 220 Ala Leu Ala Lys Ser Asn Ala Thr Gln Ser Asn Ala Gln Thr Phe Phe 225 230 235 240 Ile Leu Gly Ile Asn Glu Ile Leu His Lys Lys Thr Ser Lys Ala Leu 245 250 255 Lys Tyr Phe Glu Arg Ser Glu Ala Val Val Lys Asp Asp Asp Phe Ser 260 265 270 Lys Asp Arg Ala Ile Phe Trp Gln Tyr Leu Ala Ser Lys Lys Lys Lys 275 280 285 Thr Leu Glu His Leu Ser Gln Ser Pro Ala Leu Asn Leu Tyr Ser Leu 290 295 300 Tyr Ala Ser Arg Lys Pro Gln Asn His Ala Gln Leu Ser His His Phe 305 310 315 320 Ser Tyr Pro Lys Phe Lys Pro Arg Glu Pro Ser Phe 325 330
INFORMATION FOR SEQ ID NO:889: SEQUENCE CHARACTERISTICS: LENGTH: 157 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:889: Val His Leu Ala His Tyr Leu Lys Arg Asn Phe Ser Phe Phe Pro Ser 1 5 10 Val Asp Ile Val Asp Gly Gly Thr Met Ala Gln Gln Leu Ile Pro Leu 25 Ile Thr Ser Tyr Glu Lys Val Leu Ile Leu Asp Cys Val Ser Ala Lys 40 Gly Val Glu Ile Gly Ser Val Tyr Ala Phe Asp Phe Lys Asp Ala Pro 55 Lys Glu Ile Thr Trp Ala Gly Ser Ala His Glu Val Glu Met Leu His 70 75 Thr Leu Arg Leu Thr Glu Phe Leu Gly Asp Leu Pro Lys Thr Phe Ile 90 Val Gly Leu Val Pro Phe Val Ile Gly Ser Glu Thr Thr Phe Lys Leu 100 105 110 Ser Ser Glu Met Leu Asn Ala Leu Glu Thr Ala Leu Lys Ala Ile Glu 115 120 125 Thr Gln Leu Asn Ala Trp Gly Val Lys Met Gln Arg Thr Asp His Ile 130 135 140 Ala Leu Asp Cys Ile Ala Glu Leu Ser Tyr Lys Gly Phe 145 150 155
INFORMATION FOR SEQ ID NO:890: SEQUENCE CHARACTERISTICS: LENGTH: 307 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE
TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...307 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:890: Met Lys Lys Val Leu Leu Leu Thr Leu Ser Leu Ser Leu Ser Phe Trp 1 5 10 Leu His Ala Glu Arg Asn Gly Phe Tyr Leu Gly Leu Asn Phe Ala Glu 25 Gly Ser Tyr Ile Gin Gly Gin Gly Ser Ile Gly Glu Lys Ala Ser Ala 40 Glu Asn Ala Leu Asn Gin Ala Ile Asn Asn Ala Lys Asn Ser Leu Phe 55 Pro Asn Thr Lys Ala Ile Arg Asp Val Gin Asn Ala Leu Asn Ala Val 70 75 Lys Asp Ser Asn Lys Ile Ala Asn Arg Phe Ala Gly Asn Gly Gly Ser 90 Gly Gly Ile Phe Asn Glu Leu Ser Leu Gly Tyr Lys Tyr Phe Leu Gly 100 105 110 Lys Lys Arg Ile Ile Gly Phe Arg His Ser Leu Phe Phe Gly Tyr Gin 115 120 125 Leu Gly Gly Val Gly Ser Val Pro Gly Ser Gly Leu Ile Ala Phe Leu 130 135 140 Pro Tyr Gly Phe Asn Thr Asp Leu Leu Ile Asn Trp Thr Asn Asp Lys 145 150 155 160 Arg Ala Ser Gin Glu Tyr Val Glu Arg Arg Val Lys Gly Leu Ser Ile 165 170 175 Phe Tyr Lys Asp Met Thr Gly Arg Thr Leu Asp Ala Asp Thr Leu Lys 180 185 190 Arg Ala Ser Arg His Ile Ile Arg Lys Ser Ser Gly Leu Val Ile Gly 195 200 205 Met Glu Leu Gly Ala Ser Thr Trp Phe Ala Ser Asn Asn Leu Thr Pro 210 215 220 Phe Asn Gin Val Lys Ser Arg Thr Ile Phe Gin Leu Gln Gly Lys Phe 225 230 235 240 Gly Val Arg Phe Ser Ser Asp Glu Tyr Asp Ile Asp Arg Tyr Gly Asp WO 97/37044 PCT/US97/05223 Glu Ala Tyr Asn 305 (2) 245 Asn Tyr Leu Gly Gly 260 Phe Lys Val Asn Tyr 275 Lys Arg Val Val Ser 290 Lys His 250 255 Ser Ser Val Glu Leu Gly Val Lys Val Pro 265 270 Tyr Ser Asp Asp Tyr Gly Asp Lys Leu Asp 280 285 Val Tyr Leu Asn Tyr Thr Tyr Asn Phe Lys 295 300 INFORMATION FOR SEQ ID NO:891: SEQUENCE CHARACTERISTICS: LENGTH: 578 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...578 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:891: Met Ser Lys Lys Ile Val Val Asp Pro Ile Thr 
Arg 1 Leu Arg Phe Ser Arg Asp Cys Thr Leu Gly Asn Met His Gly Ile Gin 130 Thr Gly 145 Ala Lys Lys Thr Tyr Leu Ile Ser Pro Tyr Ile Ala Leu 115 Ala Ala Ser Tyr Lys Glu Ser Arg Ser Thr Leu 100 Asp Ala Gly Gly Arg 180 Leu 5 Val Thr Asp His Pro Leu Trp Lys Glu Ser 165 Leu Leu Ile Leu Ala Tyr 70 Pro Phe Cys Leu Leu 150 Leu Asn Glu Val Phe Gly 55 Lys Leu His Asp Ser 135 Lys Gly Pro Ile Asp Arg 40 Phe Ala Asn Asp Ile 120 Phe Ala Pro Glu Gin Asp 25 Gly Ile Gly Ala His 105 Leu Lys Val Phe Gin 185 Arg 10 Asp Asn Leu Glu Ala Gin Val Thr 75 Gin Leu 90 Val Val Ser Thr Tyr Ser Gin Lys 155 Ser Asn 170 Asn Leu Glu Ala Val Thr Arg Ala Val His Leu Pro 140 Arg Gly Ile Ala Ile Ile Ile Ile Val Arg Phe Lys 125 Tyr Leu Tyr Val Lys Glu Thr Ile Cys Glu Ser Tyr 110 Ala Pro Asn Tyr Leu 190 Met Gly Asp Lys Gly Asn Leu Thr Asp Ile Asp Gly 175 Ser Thr His Ala Gly Val Ala Met Leu Pro Asn Phe 160 His His Ala 195 200 205 Ile Phe Gly Ala Lys Gin Pro His Pro Gin Ser Leu Thr Val Gly Gly WO 97/37044 PCT/US97/05223 794 210 215 220 Val Thr Ser Val Met Asp Ile Leu Asp Pro Thr Arg Leu Ala Glu Trp 225 230 235 240 Lys Ser Lys Phe Glu Val Val Ala Asn Phe Ile Asn His Ala Tyr Tyr 245 250 255 Pro Asp Leu Val Met Ala Gly Glu Met Phe Ala Asn Glu Pro Ser Val 260 265 270 Ile Lys Gly Cys Gly Leu Arg Asn Phe Ile Ala Tyr Glu Glu Val Leu 275 280 285 Leu Gly Lys Asp Lys Tyr Leu Leu Ser Ser Gly Val Val Leu Asp Gly 290 295 300 Asp Ile Ser Lys Leu His Pro Ile Asp Glu Ser Leu Ile Lys Glu Glu 305 310 315 320 Val Thr His Ser Trp Tyr Gln Tyr Glu Asp Thr Lys Glu Val Gin Leu 325 330 335 His Pro Tyr Asp Gly Gin Thr Asn Pro His Tyr Thr Gly Leu Lys Asp 340 345 350 Gly Glu Ser Val Gly Ile Glu Asn Lys Ile Ile Pro Ala Lys Val Leu 355 360 365 Asp Thr Lys Asp Lys Tyr Ser Trp Ile Lys Ser Pro Arg Tyr Asp Ser 370 375 380 Lys Pro Met Glu Val Gly Pro Leu Ser Ser Val Val Val Gly Leu Ala 385 390 395 400 Ala Lys Asn Pro Tyr Val Thr Glu Val Ala Thr Lys Phe Leu Lys Asp 405 410 415 Thr Lys Leu Pro Leu Glu Ala Leu Phe Ser Thr Leu Gly Arg Thr Ala 420 
425 430 Ala Arg Cys Ile Glu Ala Lys Thr Ile Ala Asp Asn Gly Leu Leu Ala 435 440 445 Phe Asp Ala Leu Val Glu Asn Leu Lys Ser Asp Gln Ser Thr Cys Ala 450 455 460 Pro Tyr His Ile Asp Lys Asn Gln Glu Tyr Lys Gly Arg Tyr Ile Gly 465 470 475 480 Gln Val Pro Arg Gly Met Leu Ser His Trp Val Arg Ile Lys Asn Gly 485 490 495 Val Val Glu Asn Tyr Gln Ala Val Val Pro Ser Thr Trp Asn Ala Gly 500 505 510 Pro Arg Asp Ser Lys Asn Gln Arg Gly Ala Tyr Glu Met Ser Leu Ile 515 520 525 Gly Thr Lys Ile Ala Asp Leu Thr Gln Pro Leu Glu Ile Ile Arg Thr 530 535 540 Ile His Ser Phe Asp Pro Cys Ile Ala Cys Ser Val His Val Met Asp 545 550 555 560 Phe Lys Gly Gln Ser Leu Asn Glu Phe Lys Val Glu Pro Asn Phe Ala 565 570 575 Lys Phe INFORMATION FOR SEQ ID NO:892: SEQUENCE CHARACTERISTICS: LENGTH: 68 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:892: Met Asn Ser Ser Asn Leu Lys Asn Trp Leu Phe Pro Thr Ile Cys Phe 1 5 10 Phe Leu Phe Cys Tyr Ile Leu Ile Phe Leu Ile Phe Phe Met Phe Lys 25 Asn Leu Gln Ser Gln Ser Phe Gly Ser Val Ala Glu Thr Gly Lys Lys 40 Pro Ile Thr Thr Thr Lys Lys Phe Gly Lys Glu Leu Gln Lys Gln Ile 55 Ser Lys Ile His INFORMATION FOR SEQ ID NO:893: SEQUENCE CHARACTERISTICS: LENGTH: 53 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...53 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:893: Val Val Leu Pro Lys Glu Thr Leu Ser Ser Ile Ala Lys Arg Tyr Gln 1 5 10 Val Ser Ile Ser Ser Ile Gln Leu Ala Asn Asn Leu Lys Asp Ser Asn 25 Ile Phe Ile His Gln Arg Leu Ile Ile Pro Thr Asn Lys Lys Leu Leu 40 Ala Thr Arg Glu Phe INFORMATION FOR SEQ ID NO:894: SEQUENCE CHARACTERISTICS: LENGTH: 352 amino acids TYPE: amino acid TOPOLOGY: linear WO
97/37044 PCT/US97/05223 796 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...352 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:894: Leu Tyr Phe Leu Asn Gly Gly Tyr Asn Arg Leu His 1 Arg Ser Ser Lys Pro Ala Ile Leu Lys 145 Gin Lys Thr Gly Leu 225 Lys Lys Pro Lys Arg 305 Ala Ile Leu Asn Leu Ala Phe Lys Lys 130 Ala Glu Ala Leu Val 210 Asp Tyr Glu Glu Asn 290 Ala Phe Val Gly Gin Pro Met Lys Pro 115 Lys Val Lys Leu Asp 195 Glu Ser Val Asn Asp 275 Lys Pro Lys Met Val Glu Val Phe Ser 100 Met Leu Glu Thr Glu 180 Phe Leu Asp Lys Pro 260 Val Gin Leu Gly 5 Leu Leu Ile Ser Asn Asp Ser Ser His Ile 165 Val Ile Phe Ile Phe 245 Glu Leu Val Ile Val 325 Ile Leu Gin Lys 70 Thr Ile Ser Pro Ala 150 Val Asp Lys His Leu 230 Gly Ile Asn Tyr Ser 310 Asp Ala Val Val 55 Ile Trp Val Asp Asp 135 Lys Glu Ala Glu Lys 215 Glu Arg Ile Asn Lys 295 Leu Ile Arg Ser 40 Lys Ile Asp Lys His 120 Leu Lys Val Ser Arg 200 Ala Lys Ala Phe Pro 280 Leu Phe Asn Phe 25 Ser Asp Tyr Arg Ala 105 Val Val Phe Met Lys 185 Leu Asn Gly Asp Ile 265 Lys Pro Ile Ala 10 Lys Leu Tyr Leu Val 90 Thr Ala Val Gly Glu 170 Lys Lys Lys Gly Ile 250 Trp Phe Thr Ala Ile Lys Leu Phe Gly 75 Val Leu Ala Thr Ile 155 Asp Leu Asn Ile Ile 235 Ser Trp Ser Met Leu 315 Ile Ala Gly Gly Ser Gly Lys Leu Phe 140 Ser Ile Ala Val Ser 220 Asp Val Ile Thr Asp 300 Lys Lys Lys Leu Val Glu Phe Ile Asp Asn 125 Val Phe Asp Lys Lys 205 Gly Asn Glu Ser Ile 285 Ile Ala Asp Ser Ile Ala Gin Ala Ser Pro 110 Val Gly Leu Ala Met 190 Lys His Phe Lys Pro 270 Lys Gly His Tyr Tyr Ser Asn Thr Glu Asp Glu Glu Asn Ser Gin 175 Gln Lys Gin Gly Ile 255 Leu Ala Gly Pro Tyr 335 Glu Tyr Ala lie Val Tyr Arg Leu Pro Phe 160 Ala Glu Lys Ala Leu 240 Val Ser Ile Pro Glu 320 Lys 330 Val Val Phe Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His WO 97/37044 PCT/US97/05223 797 INFORMATION FOR SEQ ID NO:895: SEQUENCE CHARACTERISTICS: LENGTH: 197 amino acids TYPE: amino acid TOPOLOGY: linear 
(ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...197 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:895: Met Lys Cys Ser Ser Phe Thr Ser Asn Ser Val Leu Asn Phe Phe Val 1 Val Leu Gin Pro Glu Asn Ile Glu Asp Ser Ser Pro Val Thr Ile Tyr 130 Leu Thr 145 His Ala Leu Asp Phe Lys Ser Phe Thr Ser Phe Lys Gly Lys Lys lie Glu Ala 100 Tyr Lys 115 Asn His Ser Lys Leu Ala Glu Ile 180 Gly Gly 195 5 Ile Val Ala Lys Lys Lys Arg Lys Asp Ile 165 Lys Phe Thr Val Phe Ala 70 Arg Arg Ser Glu Ser 150 Ile Gin Ile Ser Gin 55 Leu Tyr Gin Asp Gin 135 Lys Glu Ser Gly Lys 40 Ile Gin Asp Gin Asp 120 Asn Val Ala Gin Leu 25 Glu Asn Tyr Glu Asp 105 Ser Phe Glu Gin Lys 185 10 Val Asn Asp Asp Asp 90 Leu Ser Lys Gly Ser 170 Glu Phe Ile Lys Asp 75 Thr Tyr Phe Gly Leu 155 Ile Lys Phe Pro Ile His Ile Phe Trp Lys 140 Asp Gin Lys Phe Lys Leu Glu Glu Phe Ser 125 Gly Ile Ala Lys Leu Ile Asp Ile Ser Pro 110 Glu Arg Ser Asn Phe 190 Arg Glu Leu Phe Val Asn Thr Phe Tyr Leu 175 Pro Ser Leu Ser Phe Glu Gly Gly Ile Ser 160 Phe Thr INFORMATION FOR SEQ ID NO:896: SEQUENCE CHARACTERISTICS: LENGTH: 185 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 798 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...185 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:896: Met Arg Trp Trp Cys Phe Leu Val Cys Cys Leu Ser Val Leu Ser Val 1 5 10 Met Asp Ala Lys Lys Leu Glu Asn Lys Gly Leu Lys Lys Glu Arg Glu 25 Leu Leu Glu Ile Thr Gly Asn Gin Phe Val Ala Asn Asp Lys Thr Lys 40 Thr Ala Val Ile Gin Gly Asn Val Gin Ile Lys Lys Gly Lys Asp Arg 55 Leu Phe Ala Asp Lys Val Ser Val Phe Leu Asn Asp Lys Arg Lys Pro 70 75 Glu Arg Tyr Glu Ala Thr Gly Asn Thr His Phe Asn Ile Phe Thr Glu 90 Asp Asn Arg Glu Ile Ser Gly Ser Thr Asp Lys Leu Ile Tyr Asn Ala 100 105 110 Leu Asn Gly Glu Tyr Lys Leu Leu Gin Asn Ala Val Val Arg Glu Val 115 120 125 
Gly Lys Ser Asn Val Ile Thr Gly Asp Glu Ile Ile Leu Asn Lys Thr 130 135 140 Lys Gly Tyr Ala Asp Val Leu Gly Ser Ala Lys Arg Pro Ala Lys Phe 145 150 155 160 Val Phe Asp Met Glu Asp Ile Asn Glu Glu Asn Arg Lys Ala Lys Leu 165 170 175 Lys Lys Lys Gly Ala Lys Glu Lys Pro 180 185 INFORMATION FOR SEQ ID NO:897: SEQUENCE CHARACTERISTICS: LENGTH: 211 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...211 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:897: WO 97/37044 PCT/US97/05223 Leu 1 Thr Glu Val Glu Phe Trp Pro Leu Asn 145 Asn Glu Met Ile Ile Lys Val Ile Ile Asn Lys Met Phe 130 Met Ala His Lys Lys 210 Gin Leu Asp Leu Ile Val Asn Val 115 Glu Lys Asp Gly Ala 195 Leu Gin Ala Glu Phe Ala Ile Thr 100 Ala Glu Val Glu Glu 180 Thr Asn 5 Pro His Phe Phe Gly Pro Asp Ala Arg Met 165 Val His Phe Asp Phe Trp Asp 70 Val Val Ile Ile His 150 Leu Cys Gin Lys Phe Glu Pro 55 Lys Ser Glu Thr Ala 135 Ala Arg Pro Gly Thr Lys Leu 40 Lys Arg Ile Lys Lys 120 Leu Val Met Ala Val 200 Leu Ala 25 Ser Asp Val Asp Gly 105 Ser Arg Ile Val Gly 185 Ala Lys Glu Leu Tyr Met 10 Pro Lys Phe Lys Ser 90 Gly Ile Gly Asn Asp 170 Trp Glu Ala Asn Thr Asp 75 Glu Ile Ser Ala Asp 155 Ala Arg Tyr Val Leu Phe Phe Gin Gly Arg Phe 140 Leu Leu Lys Leu Leu Gly Val His Val Gin Asp 125 Leu Pro Leu Gly Lys 205 Gly Lys Cys Glu His Val 110 Tyr Ile Leu His Asp 190 Glu Leu Asn Asn Pro Lys Phe Ser Asp Asp Gly Phe 175 Lys Asn Val Asn Gly Thr Gly Ala Phe Val Lys Arg 160 Glu Gly Ser INFORMATION FOR SEQ ID NO:898: SEQUENCE CHARACTERISTICS: LENGTH: 196 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...196 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:898: Val Pro Leu Trp Ile Met Val Val Gly Ala Ala Gly Ile Ala Leu Gly 1 5 10 Leu Ser Leu Tyr Gly Pro Lys Leu 
Ile Lys Thr Val Gly Ser Glu Ile 25 Thr Glu Leu Asp Lys Met Gln Ala Phe Cys Ile Ala Leu Ser Ala Val 40 Ile Thr Val Leu Leu Ala Ser Gln Leu Gly Leu Pro Val Ser Ser Thr 55 His Ile Val Val Gly Ala Val Phe Gly Val Gly Phe Leu Arg Glu Arg 70 75 Leu Arg Glu Gln Ser Arg Arg Arg Phe Ala Arg Ile Arg Asp Asn Ile 90 Val Ala Ala His Phe Gly Glu Asp Leu Glu Glu Ile Glu Gly Phe Leu 100 105 110 Glu Arg Phe Asp Lys Ala Asn Leu Lys Glu Lys Ser Leu Met Leu Glu 115 120 125 Ser Leu Lys Lys Ser Lys Asn Thr Ala Ile Ala Leu Glu Leu Lys Lys 130 135 140 Lys Glu Lys Lys Ser Leu Lys Lys Val Tyr Lys Glu Glu Val Ile Lys 145 150 155 160 Arg Ser Ile Leu Lys Lys Ile Val Thr Ala Trp Leu Val Thr Val Pro 165 170 175 Val Ser Ala Leu Leu Gly Ala Leu Leu Phe Val Ala Leu Gly Phe Ile 180 185 190 Glu Lys Tyr Phe 195 INFORMATION FOR SEQ ID NO:899: SEQUENCE CHARACTERISTICS: LENGTH: 174 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...174 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:899: Met Ser Asn Ser Met Leu Asp Lys Asn Lys Ala Ile Leu Thr Gly Gly 1 5 10 Gly Ala Leu Leu Leu Gly Leu Ile Val Leu Phe Tyr Leu Ala Tyr Arg 25 Pro Lys Ala Glu Val Leu Gln Gly Phe Leu Glu Ala Arg Glu Tyr Ser 40 Val Ser Ser Lys Val Pro Gly Arg Ile Glu Lys Val Phe Val Lys Lys 55 Gly Asp Arg Ile Lys Lys Gly Asp Leu Val Phe Ser Ile Ser Ser Pro 70 75 Glu Leu Glu Ala Lys Leu Ala Gln Ala Glu Ala Gly His Lys Ala Ala 90 Lys Ala Val Ser Asp Glu Val Lys Arg Gly Ser Arg Asp Glu Thr Ile 100 105 110 Asn Ser Ala Arg Asp Val Trp Gln Ala Ala Lys Ser Gln Ala Asn Leu 115 120 125 Ala Lys Glu Thr Tyr Lys Arg Val Gln Asp Leu Tyr Asp Asn Gly Val 130 135 140 Ala Ser Leu Gln Lys Arg Asp Glu Ala Tyr Ala Ala Met Lys Ala Pro 145 150 155 160 Asn Thr Thr Arg Ala Arg Leu Thr Lys Ser Ile Lys Trp Leu 165 170 INFORMATION FOR SEQ ID NO:900: SEQUENCE
CHARACTERISTICS: LENGTH: 148 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...148 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:900: Met Glu Phe Leu Ser Gly Tyr Phe Leu Trp Val Lys Ala Phe His Val 1 5 10 Ile Ala Val Ile Ser Trp Met Ala Ala Leu Phe Tyr Leu Pro Arg Leu 25 Phe Val Tyr His Ala Glu Asn Ala His Lys Lys Glu Phe Val Gly Val 40 Val Gln Ile Gln Glu Lys Lys Leu Tyr Ser Phe Ile Ala Ser Pro Ala 55 Met Gly Phe Thr Leu Ile Thr Gly Ile Leu Met Leu Leu Ile Ala Pro 70 75 Glu Met Phe Lys Ser Gly Gly Trp Leu His Ala Lys Leu Ala Leu Val 90 Val Leu Leu Leu Ile Tyr His Phe Tyr Cys Lys Lys Cys Met Arg Glu 100 105 110 Leu Glu Lys Asp Pro Thr Gly Lys Asn Ala Arg Phe Tyr Arg Val Phe 115 120 125 Asn Glu Ile Pro Thr Ile Leu Met Ile Leu Ile Val Ile Leu Val Val 130 135 140 Val Lys Pro Phe 145 INFORMATION FOR SEQ ID NO:901: SEQUENCE CHARACTERISTICS: LENGTH: 145 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...145 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:901: Met Lys Pro Ile Phe Leu Leu Ile Phe Leu Leu Leu Ala Ser Leu Ile 1 5 10 Ala Arg Glu Lys Asp Ala Ser Ser Asn Leu Phe Asp Leu Ile Asp Lys 25 Gly Ile Asn Arg Glu Gln Glu Leu Lys Glu Gln Glu Gln Lys Thr Arg 40 Leu Lys Leu Ala Gln Ser Pro Leu Val Ala Leu Glu Ile Val Pro Gln 55 Glu Thr Pro Tyr Leu Glu Trp Gln Gly Ala Arg Glu Ser Tyr Tyr Leu 70 75 Lys Val Ser Ala Val Val Glu Ser Val Val Ile Leu Lys Ile Asp Ile 90 Asn Gln Gly Arg Ser Cys Ser Leu Tyr Pro Thr Pro Lys Ser Val Ser 100 105 110 Leu Val Arg Asn Gln Ser Val Ala Tyr Glu Ile Leu Cys Glu Asn Gln 115 120 125 Pro Leu Trp Ile Glu Val Ser Thr Asn Leu Gly Lys Arg Thr Phe Gln 130 135 140 Phe 145 INFORMATION FOR SEQ ID NO:902: SEQUENCE CHARACTERISTICS: LENGTH: 257 amino
acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...257 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:902: Met Ala Lys Lys Asn Lys Pro Thr Glu Cys Pro Ala Gly Glu Lys Trp 1 5 10 Ala Val Pro Tyr Ala Asp Phe Leu Ser Leu Leu Leu Ala Leu Phe Ile 25 Ala Leu Tyr Ala Ile Ser Ala Val Asn Lys Ser Lys Val Glu Ala Leu 40 Lys Thr Glu Phe Ile Lys Ile Phe Asn Tyr Ala Pro Lys Pro Glu Ala 55 Met Gln Pro Val Val Val Ile Pro Pro Asp Ser Gly Lys Glu Glu Glu 70 75 Gln Met Ala Ser Glu Ser Ser Lys Pro Ala Ser Gln Asn Thr Glu Thr 90 Lys Ala Thr Ile Ala Arg Lys Gly Glu Gly Ser Val Leu Glu Gln Ile 100 105 110 Asp Gln Gly Ser Val Leu Lys Leu Pro Ser Ser Leu Leu Phe Glu Asn 115 120 125 Ala Thr Ser Asp Ala Ile Asn Gln Asp Met Met Leu Tyr Ile Glu Arg 130 135 140 Ile Ala Lys Ile Ile Gln Lys Leu Pro Lys Arg Val His Ile Asn Val 145 150 155 160 Arg Gly Phe Thr Asp Asn Thr Pro Leu Asn Lys Thr Arg Phe Lys Ser 165 170 175 His Tyr Glu Leu Ala Ala Asn Arg Ala Tyr Arg Val Met Lys Val Leu 180 185 190 Ile Gln Tyr Gly Val Asp Pro Asn Gln Leu Ser Phe Ser Ser Tyr Gly 195 200 205 Ser Thr Asn Pro Ile Ala Pro Asn Asp Ser Leu Glu Asn Arg Met Lys 210 215 220 Asn Asn Arg Val Glu Ile Phe Phe Ser Thr Asp Ala Asn Asp Leu Ser 225 230 235 240 Lys Ile His Ser Ile Leu Asp Glu Glu Phe Asn Pro His Lys Gln Gln 245 250 255 Glu INFORMATION FOR SEQ ID NO:903: SEQUENCE CHARACTERISTICS: LENGTH: 798 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...798 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:903: Met Arg Lys Val Ile Ile Met Asn Gly Tyr Leu Arg Val Lys Thr Pro 1 5 10 Tyr Phe Leu Ala Ser Val Val Leu Thr Phe Trp Thr Phe Asn Ser Phe 25 Met Ser Ala Lys Asp Lys His His Phe Leu Lys Lys Val Thr Thr Thr 40 Giu Gin Lys Phe Ser Ser Ser Ala Pro Ile Ser Trp Gin Ser Glu Glu 55 Val Arg Asn Ser Thr Ser Ser Arg Thr Val Ile Ser Asn Lys Glu Leu WO 97/37044 PCT/US97/05223 804 70 75 Lys Lys Thr Gly Asn Leu Asn Ile Glu Asn Ala Leu Gin Asn Val Pro 90 Gly Ile Gin Ile Arg Asp Ala Thr Gly Thr Gly Val Leu Pro Lys Ile 100 105 110 Ser Val Arg Gly Phe Gly Gly Gly Gly Asn Gly His Ser Asn Thr Asn 115 120 125 Met Ile Leu Val Asn Gly Ile Pro Ile Tyr Gly Ala Pro Tyr Ser Asn 130 135 140 Ile Glu Leu Ala Ile Phe Pro Val Thr Phe Gin Ser Val Asp Arg Ile 145 150 155 160 Asp Val Ile Lys Gly Gly Thr Ser Val Gin Tyr Gly Pro Asn Thr Phe 165 170 175 Gly Gly Val Val Asn Val Ile Thr Lys Glu Ile Pro Lys Glu Trp Glu 180 185 190 Asn Gin Ala Ala Glu Arg Ile Thr Phe Trp Gly Arg Ser Ser Asn Gly 195 200 205 Asn Phe Val Asp Pro Lys Glu Lys Gly Lys Pro Leu Ala Gin Thr Leu 210 215 220 Gly Asn Gin Met Leu Phe Asn Thr Tyr Gly Arg Thr Ala Gly Met Leu 225 230 235 240 Gly Lys Tyr Ile Gly Ile Ser Ala Gin Gly Asn Trp Ile Asn Gly Gin 245 250 255 Gly Phe Arg Gin Asn Ser Pro Thr Lys Val Gin Asn Tyr Leu Leu Asp 260 265 270 Ala Ile Tyr Lys Ile Asn Ala Thr Asn Thr Phe Lys Ala Tyr Tyr Gin 275 280 285 Tyr Tyr Gin Tyr Asn Ser Tyr His Pro Gly Thr Leu Ser Ala Gin Asp 290 295 300 Tyr Ala Tyr Asn Arg Phe Ile Asn Glu Arg Pro Asp Asn Gin Asp Gly 305 310 315 320 Gly Arg Ala Lys Arg Phe Gly Ile Val Tyr Gin Asn Tyr Phe Gly Asp 325 330 335 Pro Asp Arg Lys Val Gly Gly Asp Phe Lys Phe Thr Tyr Phe Thr His 340 345 350 Asp Met Ser Arg Asp Phe Gly Phe Ser Asn Gin Tyr Gin Ser Val Tyr 355 360 365 Met Ser Gly Gin Asn Lys Ile Leu Pro Phe Lys Gly Lys Gly Glu Ile 370 375 380 Ser Ala Lys Asn Pro Asn Cys Gly 
Leu Tyr Ser Tyr Ser Asp Thr Asn 385 390 395 400 Ser Pro Cys Trp Gin Phe Phe Asp Asn Ile Arg Arg Ser Val Val Asn 405 410 415 Ala Phe Glu Pro Lys Leu Asn Leu Ile Val Asn Thr Gly Lys Val Lys 420 425 430 Gin Thr Phe Asn Met Gly Met Arg Phe Leu Thr Glu Asp Leu Tyr Arg 435 440 445 Arg Ser Thr Thr Arg Lys Asn Pro Ser Met Pro Asn Asn Gly Ser Gly 450 455 460 Phe Asp Ala Gly Thr Ser Leu Asn Asn Phe Asn Asn Tyr Thr Ala Val 465 470 475 480 Tyr Ala Ser Asp Glu Ile Asn Phe Asn Asn Gly Met Leu Thr Ile Thr 485 490 495 Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro 500 505 510 Pro Phe Lys Val Gly Gin Thr Pro Lys Thr Thr Lys Glu Arg Tyr Asn 515 520 525 WO 97/37044 PCT/US97/05223 Gin Trp 530 Leu Phe 545 Asn Ile Asn Val Phe Asn Arg Tyr 610 Glu Leu 625 Ala Tyr Asn Pro Leu Pro Tyr Ala 690 Tyr Ser 705 Ile Lys Tyr Trp Asn Gin Lys Tyr 770 Ala Pro 785 Asn Tyr Gly Met Ala 595 Gly Glu Thr Ala Phe 675 Lys Asp Asn Val Ser 755 Trp Pro Pro Phe Asn Glu 580 Asn Asp Leu Phe Asn 660 Val Thr Val Gly Trp 740 Val Phe Arg Ala Asn Phe 565 Gly Tyr Asn Tyr Ile 645 Pro Ser Thr Leu Ala 725 Asn Asn Ser Ser Val Tyr 550 Val Gly Phe Arg Tyr 630 Asp Lys Pro Ile Asn 710 Ile Leu Ala Gly Ile 790 Asn Val 535 Gin Arg Gly Thr Ser Arg Val Ile 600 Glu Pro 615 Thr Pro Ala Asn Gly Pro His Gin 680 Gly Leu 695 Thr Val Val Thr Gin Ile Ser Leu 760 Ile Gly 775 Thr Ala Gly Ser Ser Tyr 585 Phe Val Ile Ile Lys 665 Phe Ser Pro Lys Ser 745 Gin Thr Tyr Tyr Tyr Thr 570 Tyr Ala Asn Arg Thr 650 Lys Ile Ser Phe Thr 730 Thr Ile Ser Val Lys Ile 555 Asp Phe Asn Ala Gly 635 Ser Asp Leu Phe Thr 715 Ala Thr Asn Pro Ser 795 Pro 540 Pro Tyr Asn His Arg 620 Leu His Ile Asp Phe 700 Glu Gly Leu Asn Asn 780 Tyr Ile Pro Phe His Tyr 605 Ser Asn Thr Phe Ala 685 Tyr Tyr Met Trp Ile 765 Gly Asn Lys Gin Gin Gin 590 Phe Gin Phe Met Gly 670 Ser Ser Ala Thr Glu 750 Phe Lys Phe Glu Phe Ile 575 Val Thr Gly His Val 655 Lys Tyr Arg Pro Pro 735 Arg Asn Glu Leu Ser 560 Phe Ser Gly Val Ala 640 Thr Lys Thr Ala Thr 720 Tyr Lys Met Ala INFORMATION FOR 
SEQ ID NO:904: SEQUENCE CHARACTERISTICS: LENGTH: 798 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...798 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:904: Met Arg Lys Val Ile Ile Met Asn Gly Tyr Leu Arg Val Lys Thr Pro 1 5 10 WO 97/37044 PTU9/52 PCTIUS97/05223 Tyr Met Glu Val Lys Gly Ser Met Ile 145 Asp Gly Asn Asn Gly 225 Gly Gly Ala Tyr Tyr 305 Gly Pro Asp Met Ser 385 Ser Ala Gin Arg Phe Phe Ser Gin Arg Lys Ile Val Ile 130 Glu Val1 Gly Gin Phe 210 Asn Lys Phe Ile Tyr 290 Ala Arg Asp Met Ser 370 Ala Pro Phe Thr Ser 450 Asp Leu Ala Lys Asn Thr Gin Arg 115 Leu Ile Vai Ala Vali Gin Tyr Arg Tyr 275 Gin Tyr Ala Arg Ser 355 Gly Lys Cys Glu Phe 435 Thr Ala Ala Lys Phe Ser Giy Ile 100 Giy Vali Aia Lys Val1 180 Ala Asp Met Ile Gin 260 Lys Tyr Asn Lys Lys 340 Arg Gin Asn Trp Pro 420 Asn Thr Gly Ser Val Asp Lys Ser Ser Thr Ser 70 Asn Leu Arg Asp Phe Gly Asn Giy Ile Phe 150 Gly Gly 165 Asn Val Giu Arg Pro Lys Leu Phe 230 Gly Ile 245 Asn Ser Ile Asn Asn Ser Arg Phe 310 Arg Phe 325 Val Gly Asp Phe Asn Lys Pro Asn 390 Gin Phe 405 Lys Leu Met Gly Arg Lys Thr Ser ValI His Ser 55 Ser Asn Ala Giy Ile 135 Pro Thr Ile Ile Glu 215 Asn Ser Pro Ala Tyr 295 Ile Gly Gly Giy Ile 375 Cys Phe Asnr Met Asn 455 Leu Leu His 40 Ala Arg Ile Thr Giy 120 Pro Val Ser Thr Thr 200 Lys Thr Ala Thr Thr 280 His Asn Ile Asp Phe 360 Leu Gly Asp Leu Arg 440 Pro Asn Thr Phe 25 Phe Leu Pro Ile Thr Val Giu Asn 90 Gly Thr 105 Gly Asn Ile Tyr Thr Phe Val Gin 170 Lys Giu 185 Phe Trp Gly Lys Tyr Gly Gin Gly 250 Lys Val 265 Asn Thr Pro Gly Giu Arg Val Tyr 330 Phe Lys 345 Ser Asn Pro Phe Leu Tyr Asn Ile 410 Ile Val 425 Phe Leu Ser Met Asn Phe Trp Lys Ser Ile 75 Ala Gly Gly Gly Gin i55 Tyr Ile Giy Pro Arg 235 Asn Gin Phe Thr Pro 315 Gin Phe Gin Lys Ser 395 Arg Asn Thr Pro Asn Thr Lys Trp Ser Leu Val1 His Ala 140 Ser Giy Pro Arg Leu 220 Thr Trp Asn Lys Leu 300 Asp Asn Thr Tyr 
Giy 380 Tyr Arg Thr Giu Asn 460 Asn Phe Vali Gin Asn Gin Leu Ser 125 Pro Val Pro Lys Ser 205 Ala Ala Ile Tyr Aia 285 Ser Asn Tyr Tyr Gin 365 Lys Ser Ser Gly Asp 445 Asn Tyr Asn Thr Ser Lys Asn Pro Asn Tyr Asp Asn Giu 190 Ser Gin Gly Asn Leu 270 Tyr Ala Gin Phe Phe 350 Ser Gly Asp Val1 Lys 430 Leu Gly Thr Ser Thr Glu Giu Val1 Lys Thr Ser Arg Thr 175 Trp Asn Thr Met Gly 255 Leu Tyr Gin Asp Gly 335 Thr Val Giu Thr Val1 415 Val1 Tyr Ser Ala Phe Thr Giu Leu Pro Ile As n Asn Ile 160 Phe Glu Gly Leu Leu 240 Gin Asp Gin Asp Gly 320 Asp His Tyr Ile Asn 400 Asn Lys Arg Gly Vali WO 97/37044 PCT/US97/05223 807 465 470 475 480 Tyr Ala Ser Asp Glu Ile Asn Phe Asn Asn Gly Met Leu Thr Ile Thr 485 490 495 Pro Gly Leu Arg Tyr Thr Phe Leu Asn Tyr Glu Lys Lys Asp Ala Pro 500 505 510 Pro Phe Lys Val Gly Gin Thr Pro Lys Thr Thr Lys Glu Arg Tyr Asn 515 520 525 Gin Trp Asn Pro Ala Val Asn Val Gly Tyr Lys Pro Ile Lys Glu Leu 530 535 540 Leu Phe Tyr Phe Asn Tyr Gin Arg Ser Tyr Ile Pro Pro Gin Phe Ser 545 550 555 560 Asn Ile Gly Asn Phe Val Gly Thr Ser Thr Asp Tyr Phe Gin Ile Phe 565 570 575 Asn Val Met Glu Gly Gly Ser Arg Tyr Tyr Phe Asn His Gin Val Ser 580 585 590 Phe Asn Ala Asn Tyr Phe Val Ile Phe Ala Asn His Tyr Phe Thr Gly 595 600 605 Arg Tyr Gly Asp Asn Arg Glu Pro Val Asn Ala Arg Ser Gin Gly Val 610 615 620 Glu Leu Glu Leu Tyr Tyr Thr Pro Ile Arg Gly Leu Asn Phe His Ala 625 630 635 640 Ala Tyr Thr Phe Ile Asp Ala Asn Ile Thr Ser His Thr Met Val Thr 645 650 655 Asn Pro Ala Asn Pro Lys Gly Pro Lys Lys Asp Ile Phe Gly Lys Lys 660 665 670 Leu Pro Phe Val Ser Pro His Gin Phe Ile Leu Asp Ala Ser Tyr Thr 675 680 685 Tyr Ala Lys Thr Thr Ile Gly Leu Ser Ser Phe Phe Tyr Ser Arg Ala 690 695 700 Tyr Ser Asp Val Leu Asn Thr Val Pro Phe Thr Glu Tyr Ala Pro Thr 705 710 715 720 Ile Lys Asn Gly Ala Ile Val Thr Lys Thr Ala Gly Met Thr Pro Tyr 725 730 735 Tyr Trp Val Trp Asn Leu Gin Ile Ser Thr Thr Leu Trp Glu Arg Lys 740 745 750 Asn Gin Ser Val Asn Ala Ser Leu Gin Ile Asn Asn Ile Phe Asn Met 755 760 
765 Lys Tyr Trp Phe Ser Gly Ile Gly Thr Ser Pro Asn Gly Lys Glu Ala 770 775 780 Ala Pro Pro Arg Ser Ile Thr Ala Tyr Val Ser Tyr Asn Phe 785 790 795 INFORMATION FOR SEQ ID NO:905: SEQUENCE CHARACTERISTICS: LENGTH: 277 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: WO 97/37044 PCT/US97/05223 808 NAME/KEY: misc_feature LOCATION 1...277 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:905: Met Lys Thr Asn Gly Leu Phe Lys Met Trp Gly Leu 1 Ile Asp Phe Gin Gly Val Thr Met Ile 145 Ala Phe Ile His Ile 225 Asn Tyr Glu Ala Ala Ser Gly Asp Glu Arg Lys 130 Glu Asp Glu Ala Pro 210 Ala Asn Gin Ile Leu Leu Asp Tyr Glu Phe Thr 115 Val Glu Phe Gin Leu 195 Glu Pro Glu Glu Ile 275 Val Glu Lys Asp Asn Leu 100 Lys Ala Leu Tyr Asn 180 Ala Phe Ala Ile Thr 260 Phe 5 Phe Val Pro Val Lys Lys Glu Leu Lys Phe 165 Thr His Lys Ile Asp 245 Leu Glu Asn Ile Pro Val 70 Ile Ala Arg Gly Asp 150 Thr Glu Asp Leu Lys 230 Ser Glu Ala Lys Phe 55 Ile Glu Asn Glu Val 135 Lys Lys Thr Asn Gly 215 Lys Leu Pro Cys Gin 40 Gly Ala Phe Lys Lys 120 Ile Glu Asn Phe Thr 200 Ile Gly Ile Ile 10 Ser Asp 25 Arg Gly Ser Val Lys Arg Ile Pro 90 Val Asp 105 Val Val Ser Lys Leu Ile Tyr Pro 170 Leu Ala 185 Leu Leu Thr Ser Asn Pro Ser Ser 250 Tyr Gly 265 Ser Val Asp Met 75 Val Ile Asp Asp Val 155 Asn Leu Leu Leu Lys 235 Asp His Leu Ser Ala Glu Ile Phe Gly 140 Asn Ile Leu Ala Gly 220 Leu Phe Phe Lys Lys Lys Leu Ala Met Ala 125 Val Lys Lys Asn Trp 205 Asp Leu Leu Leu Glu Val Gly Asp Ser Ala 110 Asn Ile Gly Leu Asn 190 Val Lys Glu Lys Val Lys Gly Asn Leu Ala Asn Pro Lys Thr Leu 175 Lys Lys Asp Trp Glu Leu Lys Val Tyr Leu Arg Phe Tyr Asn Thr 160 Lys Ala Gin Val Leu 240 Ala Asp Gly Ile 255 Lys Pro Glu 270 INFORMATION FOR SEQ ID NO:906: SEQUENCE CHARACTERISTICS: LENGTH: 207 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) 
FEATURE: WO 97/37044 PCT/US97/05223 809 NAME/KEY: misc_feature LOCATION 1...207 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:906: Leu Asp Asn Phe Ile Ser Gin Tyr Ala Gin Arg Leu Ile Val Thr Asn 1 5 10 Leu Ser Gin Ala Ile Arg Ile Tyr Gly Tyr Glu Val Gly Gly Thr Phe 25 Arg Tyr Lys Gly Val Ser Leu Asn Val Gly Ile Ser Arg Thr Trp Pro 40 Thr Thr Arg Gly Tyr Leu Met Ala Asp Ser Tyr Glu Leu Ala Ala Ser 55 Thr Gly Asn Val Phe Ile Ile Lys Leu Asp Tyr Thr Ile Pro Lys Thr 70 75 Gly Ile Asn Leu Ala Trp Leu Ser Arg Phe Val Thr Gly Leu Asp Tyr 90 Cys Gly Phe Asp Ile Tyr Leu Pro Asp Tyr Gly Thr Ala Glu Lys Pro 100 105 110 Lys Thr Pro Thr Asp Leu Ala Lys Cys Gly Ser Gin Leu Gly Leu Val 115 120 125 His Met His Lys Pro Gly Tyr Gly Val Ser Asn Phe Tyr Ile Asn Trp 130 135 140 Ser Pro Lys Thr Lys Ser His Trp Lys Gly Leu Leu Leu Ser Ala Val 145 150 155 160 Phe Asn Asn Val Phe Asn Lys Phe Tyr Val Asp Gin Thr Ser Pro Tyr 165 170 175 Val Met Ser Pro Asp Met Pro Gly Thr Asp Ala Ile Lys Arg Ala Ile 180 185 190 Ala Glu Pro Gly Phe Asn Ala Arg Phe Glu Val Ala Tyr Lys Trp 195 200 205 INFORMATION FOR SEQ ID NO:907: SEQUENCE CHARACTERISTICS: LENGTH: 244 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...244 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:907: Met Lys Lys Ala Gly Phe Leu Phe Leu Ala Ala Met Ala Ile Ile Val 1 5 10 Val Ser Leu Asn Ala Lys Asp Pro Asn Val Leu Arg Lys Ile Val Phe 25 Glu Lys Cys Leu Pro Asn Tyr Glu Lys Asn Gin Asn Pro Ser Pro Cys WO 97/37044 PCT/US97/05223 Ile Gly Glu Ser Pro Gin Lys 145 Leu Glu Val Gin Leu 225 Ala Glu Pro Asn Trp Asp Asn 130 Gin Ser Ser Pro Gin 210 Thr Ile Val Leu Pro Gin Tyr 115 His Leu Gly Glu Asn 195 Ser Leu Leu Lys Gin Leu Ala 100 Ala Phe Asp Gly Leu 180 Ala Asp Asn Arg Pro Tyr Leu Arg Ile His Asn Leu 165 Ala His Asn Arg Asp Leu 70 Leu Asp Ser Ile Asn 150 Asn Gin Lys Ser Ala 230 Ala 55 Leu Asp Phe Leu His 135 Leu Gly 
Lys Arg Phe 215 Ser 40 Gly Met Pro Met Thr 120 Ile Lys His Ser Met 200 Val Ala Tyr Pro Ser Ser 105 Ile Ser Asn Lys Pro 185 Gly Leu Glu Val Thr Thr 90 Lys Asn Cys Ile Tyr 170 Phe Asp Leu Glu Val Thr 75 Pro Lys Ser Ile Asn 155 Leu Val Tyr Ala Ile 235 Leu His Asn Tyr Lys Ser 140 Ser Ala Met Gly Thr 220 Gin Lys Ile Phe Gly Lys 125 Leu Arg Arg Leu Leu 205 Gin Asp Asp Ser Phe Lys 110 Gly Asp Trp Arg Ala 190 Ala Phe His Ile Gly Tyr Pro Arg Val Ser Val 175 Lys Val Asn Glu Asn Ile Leu Ile Ser Arg Pro 160 Thr Glu Val Pro Cys 240 INFORMATION FOR SEQ ID NO:908: SEQUENCE CHARACTERISTICS: LENGTH: 228 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...228 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:908: Met Val Phe Asp Arg Thr Thr Ser Val Arg Glu Lys Lys Ala Ala Lys 1 5 10 Ala Leu Gly Ile Val Gly Ile Val Phe Phe Ile Leu Phe Gly Ile Val 25 Ile Ser Gly Val Ala Ser Gin Lys Glu Trp Val Gin Gin Leu Asp Leu 40 Phe Phe Ile Asp Leu Ile Arg Asn Pro Ala Pro Ile Gin Gly Ser Thr 55 Trp Leu Ser Phe Val Phe Phe Ser Thr Trp Phe Ala Gin Ser Lys Leu WO 97/37044 PCT/US97/05223 811 70 75 Thr Thr Pro Ile Ala Leu Leu Ile Gly Leu Trp Phe Gly Phe Gin Lys 90 Arg Ile Ala Leu Gly Val Trp Phe Phe Phe Ser Ile Leu Leu Gly Glu 100 105 110 Phe Thr Leu Lys Ser Leu Lys Leu Leu Val Ala Arg Pro Arg Pro Val 115 120 125 Thr Asn Gly Glu Leu Val Phe Ala His Gly Phe Ser Phe Pro Ser Gly 130 135 140 His Ala Leu Ala Ser Ala Leu Phe Tyr Gly Ser Leu Ala Leu Leu Leu 145 150 155 160 Cys Tyr Ser Asn Ala Asn Asn Arg Ile Lys Thr Ile Gly Ala Ile Ile 165 170 175 Leu Leu Phe Trp Ile Phe Leu Met Ala Tyr Asp Arg Val Tyr Leu Gly 180 185 190 Val His Tyr Pro Ser Asp Val Leu Gly Gly Phe Leu Leu Gly Ile Ala 195 200 205 Trp Ser Cys Cys Ser Leu Ala Leu Tyr Leu Gly Phe Leu Lys Arg Pro 210 215 220 Tyr Lys Ala Ala 225 INFORMATION FOR SEQ ID NO:909: SEQUENCE CHARACTERISTICS: LENGTH: 157 amino acids TYPE: 
amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...157 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:909: Met Lys Lys Phe Gly Leu Gly Val Tyr Leu Leu Leu Leu Gly Ile Leu 1 5 10 Gly Gly Ser Leu Ile Ile Leu Gly Ala Ile Val Ala Pro Ile Val Phe 25 Lys Ala Ser Ser Ile Leu Pro Glu Leu Asn Leu Thr Pro Phe Glu Ser 40 Gly Lys Leu Met Ala Gln Ile Phe Val Arg Phe Asn Tyr Val Leu Gly 55 Ala Ile Gly Phe Val Val Leu Leu Tyr Glu Ile Ile Ser Phe Ile Tyr 70 75 Tyr Lys Arg Ser Leu Val Tyr Leu Ile Leu Gly Val Ala Ile Gly Ala 90 Leu Cys Leu Leu Phe Val Phe Tyr Tyr Thr Pro Tyr Ile Leu Asn Ala 100 105 110 Gln Lys Val Gly Glu Val Ala Leu Gln Ser Ala Glu Phe Ala Arg Ser 115 120 125 His Ala Gln Ser Glu Trp Leu Phe Lys Glu Leu Phe Val Leu Val Cys 130 135 140 Ala Leu Phe Phe Trp Arg Leu Phe Gly Lys Asn Ala Leu 145 150 155 INFORMATION FOR SEQ ID NO:910: SEQUENCE CHARACTERISTICS: LENGTH: 110 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...110 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:910: Leu Ile Asn Leu Leu Pro His His Tyr His Lys Phe Pro Pro Asn Ile 1 5 10 Asn Pro Ser Leu Ile Ser Leu Lys Asp Arg Phe Leu Pro His Glu Lys 25 His Ser Gln Lys Val Lys Lys Glu Cys Val Asn Leu Phe Glu Val Leu 40 Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His Leu Lys Ile 55 Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val Tyr Leu Ala 70 75 His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser Tyr Gly Phe 90 Ser His Gln Asp Arg Ala Ile Ile Cys Leu Leu Gly Ala Ile 100 105 110 INFORMATION FOR SEQ ID NO:911: SEQUENCE CHARACTERISTICS: LENGTH: 215 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY:
misc_feature WO 97/37044 PCT/US97/05223 813 LOCATION 1...215 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:911: Met Lys Ser Phe Leu Lys Leu Phe Ala Gin Pro Leu 1 Ala Val Thr Thr Leu Glu Glu Lys Thr 145 Gly Ser Trp Lys Phe Lys Glu Thr Asp Ile Ile Gin 130 Pro Val Leu Ala Leu 210 Met Lys Arg Thr Thr Lys Lys 115 Asn Leu Asn Ile Glu 195 Leu Leu Asp Gin Ala Ala Gin 100 Gin Ser Met Val Lys 180 Ile Lys 5 Leu Ser Asn Thr Thr Glu Glu Val Gly Arg 165 Asn Glu Lys Tyr Ala Ser Glu 70 Gin Ile Thr Ser Lys 150 Ala Lys Phe Ala Ala Pro Thr 55 Gin Lys Lys Lys Pro 135 Lys Phe Ser Ser Glu 215 Leu Met 40 Phe Asn Gin Gin Gin 120 Val Pro Pro Val His 200 Ala 25 Ser Ser Pro Glu Glu 105 Glu Gin Leu Ser Lys 185 Glu 10 His Pro Pro Thr Ile 90 Ile Gin Asn Glu Thr 170 Val Thr Ala Asn Lys Lys 75 Lys Lys Glu Asp Tyr 155 Lys Leu Lys Val Val Glu Asp Gin Gin Lys Gin 140 Lys Gly Glu Gly Leu Leu Glu Glu Thr Glu Glu Glu 125 Lys Val Lys Ile Tyr 205 Val Gly Lys Ala Val lie Ile 110 Asn Thr Ala Ile Gin 190 Val Val Phe Ser Asn Pro Lys Lys Lys Pro Val Ile 175 Asn Phe Leu Tyr Glu Ala Pro Gin Gin Pro Thr Ser 160 Gly Asp Leu INFORMATION FOR SEQ ID NO:912: SEQUENCE CHARACTERISTICS: LENGTH: 79 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...79 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:912: Met Trp Pro Val Ala Leu Lys Gin Pro Asn Arg Val Ser His His Phe 1 5 10 Tyr Ile Met Ala Met Leu Phe Ile Leu Phe Asp Val Glu Ile Val Phe 25 WO 97/37044 PCT/US97/05223 814 Met Phe Pro Trp Ala Ile Asp Phe Lys Lys Leu Gly Leu Phe Gly Leu 40 Val Glu Met Leu Gly Phe Val Phe Phe Leu Ala Ile Gly Phe Ile Tyr 55 Ala Leu Lys Arg Asn Ala Leu Ser Trp Gin Lys Leu Glu Val Lys 70 INFORMATION FOR SEQ ID NO:913: SEQUENCE CHARACTERISTICS: LENGTH: 416 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: 
NAME/KEY: misc_feature LOCATION 1...416 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:913: Met Cys Gly Met Phe Lys Asn Glu Ile Ser Ala Ile Gln Gly Met Ile 1 5 10 Ala Asn Ala Gln Glu Ala Val Ala Gln Ala Lys Ile Val Ser Glu Asn 25 Thr Gln Asn Gln Asn Ser Leu Asp Ala Gly Lys Pro Phe Asn Pro Tyr 40 Thr Asp Ala Ser Phe Ala Glu Ser Met Leu Lys Asn Ala Gln Ala Gln 55 Ala Glu Ile Leu Asn Gln Ala Glu Gln Val Val Lys Asn Phe Glu Lys 70 75 Ile Pro Thr Ala Phe Val Asn Asp Ser Leu Gly Val Cys Tyr Glu Val 90 Gln Gly Gly Glu Arg Arg Gly Thr Asn Pro Gly Gln Thr Thr Ser Asn 100 105 110 Thr Trp Gly Ala Gly Cys Ala Tyr Val Gly Gln Thr Ile Thr Asn Leu 115 120 125 Lys Asn Ser Ile Ala His Phe Gly Thr Gln Glu Gln Gln Ile Gln Gln 130 135 140 Ala Glu Asn Ile Ala Asp Thr Leu Val Asn Phe Lys Ser Arg Tyr Ser 145 150 155 160 Glu Leu Gly Asn Thr Tyr Asn Ser Ile Thr Thr Ala Leu Ser Asn Ile 165 170 175 Pro Asn Ala Gln Ser Leu Gln Asn Ala Val Ser Lys Lys Asn Asn Pro 180 185 190 Tyr Ser Pro Gln Gly Ile Asp Thr Asn Tyr Tyr Leu Asn Gln Asn Ser 195 200 205 Tyr Asn Gln Ile Gln Thr Ile Asn Gln Glu Leu Gly Arg Asn Pro Phe 210 215 220 Arg Lys Val Gly Ile Val Ser Ser Gln Thr Asn Asn Gly Ala Met Asn 225 230 235 240 Gly Ile Gly Ile Gln Val Gly Tyr Lys Gln Phe Phe Gly Gln Lys Arg 245 250 255 Lys Trp Gly Ala Arg Tyr Tyr Gly Phe Phe Asp Tyr Asn His Ala Phe 260 265 270 Ile Lys Ser Ser Phe Phe Asn Ser Ala Ser Asp Val Trp Thr Tyr Gly 275 280 285 Phe Gly Ala Asp Ala Leu Tyr Asn Phe Ile Asn Asp Lys Ala Thr Asn 290 295 300 Phe Leu Gly Lys Asn Asn Lys Leu Ser Val Gly Leu Phe Gly Gly Ile 305 310 315 320 Ala Leu Ala Gly Thr Ser Trp Leu Asn Ser Glu Tyr Val Asn Leu Ala 325 330 335 Thr Met Asn Asn Val Tyr Asn Ala Lys Met Asn Val Ala Asn Phe Gln 340 345 350 Phe Leu Phe Asn Met Gly Val Arg Met Asn Leu Ala Arg Pro Lys Lys 355 360 365 Lys Asp Ser Asp His Ala Ala Gln His Gly Ile Glu Leu Gly Leu Lys 370 375 380 Ile Pro Thr Ile Asn Thr Asn Tyr Tyr Ser Phe Met Gly Ala Glu Leu 385 390 395 400 Lys Tyr Arg Arg
Leu Tyr Ser Val Tyr Leu Asn Tyr Val Phe Ala Tyr 405 410 415 INFORMATION FOR SEQ ID NO:914: SEQUENCE CHARACTERISTICS: LENGTH: 114 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...114 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:914: Met Ala Thr Ile Gln Pro Phe Asn His Ser Thr Ile Gln Pro Phe Asn 1 5 10 His Ser Thr Ile Gln Pro Phe Asn His Ser Ile Ile Gln Ser Phe Asn 25 His Ser Thr Ile Gln Ala Thr Leu Pro Tyr Phe Tyr Asn Tyr Leu Ser 40 Phe Tyr Lys Asn Leu Phe Lys Asn Pro Leu Phe Phe Ile Ile Pro Pro 55 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 70 75 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 90 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 100 105 110 Phe Ile INFORMATION FOR SEQ ID NO:915: SEQUENCE CHARACTERISTICS: LENGTH: 126 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...126 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:915: Leu Val Ser Glu Tyr Trp Leu Lys Phe Phe Thr Arg Ser Phe Ser Lys 1 5 10 Ser Thr Met Leu Phe Lys Thr Leu Leu Arg Ser Phe Phe Thr Phe Pro 25 Val Glu Leu Ser Glu Asn Ile Thr Leu Gly Ser Thr Val Val Leu Ile 40 Val Ala Glu Ala Val Ser Ala Leu Asn Lys Lys Val Val Ala Ala Lys 55 Lys Asn Lys Ile Arg Phe Thr Pro Asn Ser Tyr Ile His Asn Arg Asn 70 75 Lys Asn Arg Arg Tyr Ser Ser Leu Ser Pro Leu Leu Lys Ser Ser Ser 90 Ile Tyr Lys Asn Pro Pro Arg Ile Gln Ala Ile Leu Ile Ile Leu Lys 100 105 110 Tyr Arg Leu Ser Lys Gly Ile Tyr His Leu Gly Met Ile Leu 115 120 125 INFORMATION FOR SEQ ID NO:916: SEQUENCE CHARACTERISTICS: LENGTH: 286 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE:
NAME/KEY: misc_feature LOCATION 1...286 WO 97/37044 PCT/US97/05223 817 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:916: Met Lys Lys Ser Val Ile Val Gly Ala Ile Ser Leu 1 Leu Pro Gin Cys Pro Thr Phe Phe Lys 145 Ser Asn Gly Lys Gin 225 Phe Ala His Leu Thr Leu Asn Thr Arg Gly Gly 130 Tyr Gin Pro Val Asp 210 Val Glu Ser Arg Ser Lys Gly Gly Ala Gly Phe 115 Met Tyr Ser Ala Ala 195 Leu Leu Ile Ala Pro 275 Ala Lys Met Asn Ser Tyr 100 Val Arg Thr Phe Ile 180 Ile Ala Val Gly Asp 260 Tyr 5 Glu Gly Leu Gin Asn Lys Val Tyr Tyr Met 165 Phe Gly Glu Asn Leu 245 Asn Ala Thr Glu Ser Ser 70 Pro Gly Gly Tyr Asn 150 Phe Asn Gly Glu Gly 230 Lys Val Phe Pro Arg Thr 55 Gly Thr Leu Tyr Gly 135 Asp Gly Arg Thr Tyr 215 Gly Ile Pro Tyr Lys Asn 40 Thr Ala Gly Ser Lys 120 Phe Tyr Tyr Glu Ser 200 Arg Ile Gin Glu Trp 280 Gin 25 Ala Ala Tyr Gly Asn 105 His Phe Gly Gly Asn 185 Trp Gly Arg Thr Gly 265 Arg 10 Glu Ala Gin Gly Leu 90 Gin Phe Asp Met Ala 170 Leu Gly Ser Leu Ile 250 Thr Tyr Lys Phe Asn Ser 75 Thr Gin Phe Phe Arg 155 Gly His Pro Phe Gly 235 Arg Thr Ile Ala Ile Cys Asn His Tyr Lys Ala 140 Asp Thr Leu Thr His 220 Thr Asn Tyr Val Ala Met Ile Lys Gly Ile Ser His Thr Pro Gly Ala Ala Ile 110 Lys Ala 125 Ser Ser Ala Arg Asp Val Gly Phe 190 Asn Tyr 205 Pro Ser Lys His Asn Tyr Arg Phe 270 Ser Phe 285 Thr Thr Asp Gly Asn Leu Asn Pro Tyr Lys Leu 175 Phe Tyr Asn Gin Tyr 255 Thr Ser Ser Tyr Asn Met Gly Gly Gin Tyr Gly 160 Phe Leu Phe Phe Gly 240 Thr Phe INFORMATION FOR SEQ ID NO:917: SEQUENCE CHARACTERISTICS: LENGTH: 285 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...285 WO 97/37044 WO 9737044PCTIUS97/05223 818 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:9i7: Val Lys Arg Ile Leu Phe Phe Leu Ala Ala Thr 1 Ala Met Val1 Asn Gly Cys Arg Ser Leu 145 Ile Ala Ala Ser Asp 225 Lys Arg Phe Giu Phe Leu Gin Ala Tyr Giu Leu 130 His Ser Ile Aia Pro 210 Leu Glu Arg Glu Thr Ser 
Lys Tyr Phe Tyr Lys 115 Pro Giu Met Val1 Phe 195 Giu Asp Pro Leu Lys 275 Ala Giu Ser Ser Phe Glii 100 Phe Leu Gin Val1 Leu 180 Gin Phe Pro Glu Ser 260 Gin 5 Ser Ser Met Giu Lys His Tyr Phe Phe Arg 165 Lys Lys Ile Met Thr 245 Lys Ile Ala Ser Val1 Thr 70 Giy Arg Lys Asn Gly 150 Val1 Lys Arg Ser Thr 230 Ser Lys Ser Thr Thr Asp Lys Ser Asn His Trp 135 Asp Ser Met Ser Ser 215 Asn Ser Giu Asp Ile Giy 40 Leu Met Leu Giy Val1 120 Leu Met Gin Glu Ser 200 Ser Ala Lys Lys Ser 280 Asn 25 Asn Giu Ser Glu Lys 105 Leu Tyr Tyr Lys Giu 185 Gly Lys Asn Lys Gin 265 Ser 10 Thr Vali Lys Lys Asp 90 Val1 Lys Lys Asp Glu 170 Gin Giu Thr Thr Giu 250 Gin Lys Thr Lys Giu Gly 75 Cys Ser Asp Gly Gly 155 Lys Ala Leu Gin Leu 235 Lys Gin Ser Thr Vali Lys Arg Asp Vali Phe Leu Ser 140 Tyr Ala Glu Giu Asn 220 Lys Lys Ala Glu Phe Asp Asp Val1 Leu Glii Vai Gly 125 Asp Ile Arg Lys Ser 205 Ser Glu Pro Leu Lys 285 Leu Pro Arg Lys Ser Gin Val1 110 Thr Phe Lys Lys Asp 190 His Ser Thr Lys Gin 270 Leu Asn Lys Asn Ala Lys Asn Glu Giy Tyr Val 175 Thr Thr Asn Ala Lys 255 Gin Arg Va 1 Arg Phe Phe Ile Asp Leu Ala Leu 160 Asp Lys Asp Pro Ser 240 Lys Glii INFORMATION FOR SEQ ID NO:9i8: SEQUENCE CHARACTERISTICS: LENGTH: 75 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1 WO 97/37044 PCT/US97/05223 819 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:918: Met Pro Lys Glu Asn Thr Thr His Glu Asp Ala Leu Ala Lys Cys Phe 1 5 10 Ala Pro Ala Ile Phe Lys Ala Ile Met Leu Asn Lys Leu Ile Leu Ala 25 Val Val Ala Ala Pro Ile Ile Lys Met Leu Lys Leu Ser Lys Glu Ile 40 Trp Trp Lys Lys Met Gly Lys Lys His Ala Ile Asp Phe Lys Met Arg 55 Ile Lys Ile Leu Ala Ser Leu Ala Pro Ser Phe 70 INFORMATION FOR SEQ ID NO:919: SEQUENCE CHARACTERISTICS: LENGTH: 134 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori 
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...134 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:919: Met Cys Gln Ile Gln Cys Leu Leu Ile Leu Leu Ser Ile Asn Ile Val 1 5 10 Ser Ala Ile Ile Val Tyr Phe Phe Gln Ala Phe Gln Gly Val Leu Asn 25 Phe Glu Gly Gly Phe Leu Gly Phe Phe Ile Val Ala Leu Ser Ser Tyr 40 Tyr Gly Val Lys Lys Arg Leu Asp Leu Arg Lys Gln Asp Ser Lys Glu 55 Lys Glu Glu Lys Gln Lys Phe Gln Lys Phe Ala Leu Gly Leu Glu Met 70 75 Ser Phe Asn Val Trp Arg Leu Gly Gly Tyr Gly Val Leu Leu Gly Ile 90 Leu Gly Val Leu Leu Phe Leu His Leu Phe Asn Gly Leu Pro Phe Leu 100 105 110 Ile Gly Val Phe Val Ser Ser Leu Ser Ser Ala Leu Leu Arg Phe Leu 115 120 125 Asn Asn Asn Gly Lys Phe 130 INFORMATION FOR SEQ ID NO:920: SEQUENCE CHARACTERISTICS: LENGTH: 92 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...92 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:920: Met Ala Cys Glu Phe Leu Lys Lys Pro Lys Tyr Tyr Lys Phe Ile Glu 1 5 10 Gly Ala Asn Tyr Leu Ser Leu Gly Leu Ser Met Val Val Ala Ile Leu 25 Met Gly Val Ala Ile Gly Tyr Gly Leu Lys Lys Leu Thr His Ile Ser 40 Trp Leu Phe Trp Leu Gly Val Ile Trp Gly Val Leu Ala Ser Phe Leu 55 Asn Val Tyr Lys Ala Tyr Lys Asn Met Gln Lys Asp Tyr Glu Glu Leu 70 75 Ala Lys Asp Pro Lys Tyr Thr Gln Asn Lys Thr Lys INFORMATION FOR SEQ ID NO:921: SEQUENCE CHARACTERISTICS: LENGTH: 80 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...80 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:921: Met Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe Ile Glu Asp Tyr Lys 1 5 10 Leu Val Val Ile Phe Ser Phe Ile Gly Leu Ile Ala Leu Phe Phe Leu 25 Tyr Lys Phe Ile Lys Thr Gln Lys Lys Val Phe Lys Asp Lys Ala Asn 40 Gln Pro Gln Lys Lys Lys Ser Phe Lys Glu Ile Ile Ile Asp Gly Leu
55 Lys Glu Arg Val Lys Thr Phe Gly Phe Arg Leu Gln Ala Ile Leu Leu 70 75 INFORMATION FOR SEQ ID NO:922: SEQUENCE CHARACTERISTICS: LENGTH: 363 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...363 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:922: Met Ala Ile Asp Leu Ala Glu Val Thr Gly Ala Lys Ala Ala Gln Glu 1 5 10 Arg Lys Lys Glu Gln Pro Thr Ile Ala Asn Gly Leu Asp Lys Asn Ala 25 Phe Met Lys Leu Phe Leu Glu Gln Leu Lys Asn Gln Asp Pro Thr Ala 40 Pro Met Glu Thr Asp Lys Ile Ile Thr Gln Thr Ala Gln Leu Thr Gln 55 Val Glu Met Gln Glu Glu Asn Lys Lys Thr Met Gln Glu Val Ala Ser 70 75 Ala Met Lys Ser Asn Lys Glu Thr Asn Glu Ser Leu Lys Asp Phe Gln 90 Gly Ala Leu Lys Asp Thr Met Glu Asn Leu Asn Lys Gly Met Asp Asp 100 105 110 Ser Leu Lys Ala Asn Asn Ala Leu Arg Glu Val Ser Ala Leu Asn Ser 115 120 125 Val Ser Met Ile Gly Lys Ile Ala Glu Thr Asp Val Ser Gly Ala Asn 130 135 140 Phe Asp Gly Asn Asn Lys Leu Ser Phe Ser Leu Phe Phe Asp Glu Lys 145 150 155 160 Ile Asp Ala Ser Lys Gly Val Pro Ala Ile Gln Ile Leu Asn Glu Asn 165 170 175 Asn Glu Leu Val Lys Thr Ile Pro Leu Lys Asp Tyr Asn Gly Gln Lys 180 185 190 Gly Tyr Ile Asn Phe Glu Trp Asp Gly Thr Asn Glu Lys Gly Glu Lys 195 200 205 Val Pro Lys Gly Asn Tyr Lys Ile Lys Ala Glu Tyr Asn Leu Asp Ser 210 215 220 Gln Ser Lys Gln Tyr Leu Gln Thr Arg Ile Gly Arg Gly Glu Val Glu 225 230 235 240 Ser Val Ile Phe Asp Lys Gly Lys Pro Met Leu Arg Met Gly Glu Met 245 250 255 Ile Leu Pro Ile Asp Ser Ala Ile Glu Phe Tyr Lys Pro Asp Gln Lys 260 265 270 Pro Leu Asp Gln Lys Leu Ser Asp Gln Lys Pro Gln Lys Leu Ser Glu 275 280 285 Gln Lys Ala Leu Glu Gln Lys Ile Ser Glu Gln Lys Pro Gln Glu Pro 290 295 300 Leu Asp Gln Lys Leu Ser Asp Gln Lys Pro Gln Lys Pro Leu Glu Gln 305 310 315 320 Lys Ile Ser Glu Gln Lys Pro Leu Asp Gln Lys Leu Ser Asp Gln Lys 325
330 335 Pro Gin Lys Pro Leu Glu Gin Lys Ile Ser Glu Gin Lys Pro Leu Asp 340 345 350 Gin Lys Pro Gin Thr Pro Pro Lys Glu Thr Ala 355 360 INFORMATION FOR SEQ ID NO:923: SEQUENCE CHARACTERISTICS: LENGTH: 217 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...217 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:923: Leu Lys His Leu Ala Pro Leu Ile His Ile Pro Phe 1 Leu Glu Lys Leu Gin Trp Ser Gly Ile 145 Tyr Tyr Gly Asn Pro Lys Phe Lys Ala His 130 Lys Trp Asn Thr Pro Lys Asp Gly Glu Gly 115 Asn Asp Lys Lys Ala Thr Ser Phe Ser Ser 100 Ile Asn Asp Thr Gly 180 5 Leu Lys Pro Asn Lys Cys Tyr Ser Ala Arg 165 Ser Ser Thr Val Ala 70 Glu Ala His Pro Phe 150 Tyr Arg Met Glu Thr 55 Lys Asn Gly Ala Phe 135 Ala His Trp Phe Pro 40 Asn Gin Leu Thr Tyr 120 Leu Ser Asp Glu Leu 25 Lys Val Lys Gly Tyr 105 Ile Arg Glu Asn Lys 185 10 Ser Pro Met Glu Tyr 90 Lys Pro Asn Val Leu 170 Asn Leu Ala Met Val 75 Glu Ile Ser Val Ala 155 Lys Glu Asn Lys Thr Leu Met Asn Val Met 140 Leu Asp Lys Lys Leu Gly Asn Lys Ala Phe Leu 125 Gly Lys Met Ala Ala Asn Val Cys Ala Gly Ser 110 Lys Glu Glu Ile Asn 190 Leu Ala Lys Asp Ala Ile Asn Ser Leu Leu Lys 175 Ala Trp Glu Asn Asn Tyr Ala Pro Tyr Leu Leu 160 Ser Asp WO 97/37044 WO 9737044PCTIUS97/05223 823 Ala Glu Lys Tyr Tyr Giu Gu Ile Gin Asp Arg Ile Arg Arg Leu Lys 195 200 205 Giu Ser Lys Ile Phe Glu Leu Ala Val 210 215 INFORMATION FOR SEQ ID NO:924: SEQUENCE CHARACTERISTICS: LENGTH: 366 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .366 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:924: Met Gin Phe Gin Lys Thr Leu Leu Ser Leu Ser Leu 1 Tyr Tyr Giu Asn Ile Gin Ile Ala Giu 145 Met Asn Gly Asn Phe 225 Cys Ser Arg Gin Asn Ala Val1 Leu 130 Asn Leu Ala Val1 Gin 210 Gly Ile Ile Ile Val Asn Glu Met 
115 Giu Leu Ser Leu Gly 195 Gly Phe Ala Ser Gin Lys Ala Thr 100 Leu Lys Lys Ser Asp 180 Leu Phe Val1 5 Giu His Thr Asn Leu Tyr Ser Met Asn Leu 165 Pro Ser Arg Gly Giu Ala Ile Giu 70 Lys Tyr Gly Gin Leu 150 Ser Ser Val Tyr Asn 230 Asn Val1 Ser 55 Ile Asn Leu Gly Giu 135 Giu Ser Ser Gly Tyr 215 Giy Gly Giu 40 Asn Thr Asn Gin Val 120 Pro Leu Gin Tyr Tyr 200 Leu Phe Ala 25 His Ala Asn Ala Sen 105 Ala Ile Gin Ile Ser 185 Lys Phe Asp 10 Tyr Asn Gin Met Lys 90 Thn Ser Thr Phe Ala 170 Lys His Tyr Gly Ala Asn Asn Pro 75 Leu Leu Asn Asn Ser 155 Gin Asn Phe Asp Leu 235 Ser Pro Lys Asn Thn Gin Pro Pro 140 Gin Ile Val1 Phe Tyr 220 Gly Leu Val Phe Ile Thr Pro Asn Lys 125 Leu Ser Ser Ser Thr 205 Gly Lys Phe Gly Leu Tyr Phe Thr Ile 110 Leu Giu Gin Asn Ser 190 Lys Tyr Met Leu Phe Asn Lys Asn Giu Giu Ala Leu Asn Ser 175 Met Lys Thr Asn Ser Glu Gin Leu Tyr Lys Lys Gin Val Sen 160 Leu Tyr Lys Asn Asn 240 His Leu Tyr Gly Leu Gly Ile Asp Tyr Leu Phe Asn Phe Ile Asp Asn 250 255 WO 97/37044 PCT/US97/05223 824 Ala Gin Lys His Ser Ser Val Gly Phe Tyr Val Gly Phe Ala Leu Ala 260 265 270 Gly Ser Ser Trp Val Gly Ser Gly Leu Gly Met Trp Val Ser Gin Met 275 280 285 Asp Phe Ile Asn Asn Tyr Leu Thr Asp Tyr Arg Ala Lys Met His Thr 290 295 300 Ser Phe Phe Gin Ile Pro Leu Asn Phe Gly Val Arg Val Asn Val Asp 305 310 315 320 Arg His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro Leu Ala Val Asn 325 330 335 Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Ala Ser Leu Phe Phe 340 345 350 Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe 355 360 365 INFORMATION FOR SEQ ID NO:925: SEQUENCE CHARACTERISTICS: LENGTH: 366 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...366 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:925: Met Gin Phe Gin Lys Thr Leu Leu Ser Leu Ser Leu Leu Phe Leu Ser 1 5 10 Tyr Cys Ile Ala Glu Glu Asn Gly Ala Tyr Ala Ser Val Gly Phe Glu 25 Tyr Ser Ile Ser 
His Ala Val Glu His Asn Asn Pro Phe Leu Asn Gln 40 Glu Arg Ile Gln Thr Ile Ser Asn Ala Gln Asn Lys Ile Tyr Lys Leu 55 Asn Gln Val Lys Asn Glu Ile Thr Asn Met Pro Asn Thr Phe Asn Tyr 70 75 Ile Asn Asn Ala Leu Lys Asn Asn Ala Lys Leu Thr Pro Thr Glu Lys 90 Gln Ala Glu Thr Tyr Tyr Leu Gln Ser Thr Leu Gln Asn Ile Glu Lys 100 105 110 Ile Val Met Leu Ser Gly Gly Val Ala Ser Asn Pro Lys Leu Ala Gln 115 120 125 Ala Leu Glu Lys Met Gln Glu Pro Ile Thr Asn Pro Leu Glu Leu Val 130 135 140 Glu Asn Leu Lys Asn Leu Glu Leu Gln Phe Ser Gln Ser Gln Asn Ser 145 150 155 160 Met Leu Ser Ser Leu Ser Ser Gln Ile Ala Gln Ile Ser Asn Ser Leu 165 170 175 Asn Ala Leu Asp Pro Ser Ser Tyr Ser Lys Asn Val Ser Ser Met Tyr 180 185 190 Gly Val Gly Leu Ser Val Gly Tyr Lys His Phe Phe Thr Lys Lys Lys 195 200 205 Asn Gln Gly Phe Arg Tyr Tyr Leu Phe Tyr Asp Tyr Gly Tyr Thr Asn 210 215 220 Phe Gly Phe Val Gly Asn Gly Phe Asp Gly Leu Gly Lys Met Asn Asn 225 230 235 240 His Leu Tyr Gly Leu Gly Ile Asp Tyr Leu Phe Asn Phe Ile Asp Asn 245 250 255 Ala Gln Lys His Ser Ser Val Gly Phe Tyr Val Gly Phe Ala Leu Ala 260 265 270 Gly Ser Ser Trp Val Gly Ser Gly Leu Gly Met Trp Val Ser Gln Met 275 280 285 Asp Phe Ile Asn Asn Tyr Leu Thr Asp Tyr Arg Ala Lys Met His Thr 290 295 300 Ser Phe Phe Gln Ile Pro Leu Asn Phe Gly Val Arg Val Asn Val Asp 305 310 315 320 Arg His Asn Gly Phe Glu Met Gly Leu Lys Ile Pro Leu Ala Val Asn 325 330 335 Ser Phe Tyr Glu Thr His Gly Lys Gly Leu Asn Ala Ser Leu Phe Phe 340 345 350 Lys Arg Leu Val Met Phe Asn Val Ser Tyr Val Tyr Ser Phe 355 360 365 INFORMATION FOR SEQ ID NO:926: SEQUENCE CHARACTERISTICS: LENGTH: 174 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...174 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:926: Leu Asn Thr Met Asn Ser Val Leu Glu Cys Lys Glu Leu Ala Leu Tyr 1 5 10 Gly Gly Ser Phe Asp Pro Leu His
Lys Ala His Leu Ala Ile Ile Glu 25 Gin Thr Leu Glu Leu Leu Pro Phe Val Gin Leu Ile Val Leu Pro Ala 40 Tyr Gln Asn Pro Phe Lys Lys Pro Cys Phe Leu Asp Ala Lys Thr Arg 55 Phe Lys Glu Leu Glu Arg Ala Leu Lys Gly Met Pro Arg Val Leu Leu 70 75 Ser Asp Phe Glu Ile Lys Gin Glu Arg Ala Val Pro Thr Ile Glu Ser 90 WO 97/37044 PCT/US97/05223 826 Val Leu His Phe Gin Lys Leu Tyr Arg Pro Lys Thr Leu Tyr Leu Val 100 105 110 Ile Gly Ala Asp Cys Leu Arg His Leu Ser Ser Trp Thr Asn Ala Lys 115 120 125 Glu Leu Leu Lys Arg Val Glu Leu Val Val Phe Glu Arg Ile Gly Tyr 130 135 140 Glu Glu Ile Gin Phe Lys Gly Arg Tyr His Pro Leu Lys Gly Ile Asp 145 150 155 160 Ala Pro Ile Ser Ser Ser Ala Ile Arg Ala Ser Leu Gly Val 165 170 INFORMATION FOR SEQ ID NO:927: SEQUENCE CHARACTERISTICS: LENGTH: 376 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...376 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:927: Met Arg Phe Phe Cys Phe Phe Leu Phe Phe Leu 1 Gin Ser Arg Ile Ala Ala Ser Gin Ser 145 Lys Ala Gin Ile Asn Ile Lys Ser Leu Met Glu 130 Asn Glu Leu Gin Met Glu Tyr Lys Gin Glu Glu 115 His Ala Pro Asn Ser 195 Met Gin Gin Ala Ile Leu 100 His Cys Leu Lys Ala 180 His 5 Thr Leu Asn Asn Arg Glu Glu Pro Glu Leu 165 Leu Asn Asp Asp Leu 55 Thr Met Leu Leu Leu 135 Gin His Asp Thr Ser Met 40 Ser Leu Asp Gin Ile 120 Ser Glu Leu Gin Leu 200 Gin 25 Leu Asn Thr Thr Ala 105 Lys Gly Lys Leu Val 185 Glu 10 Thr Tyr Asn Ser Asp 90 Leu Glu Val Asn Ser 170 Leu Tyr Asn Lys Gin Gin 75 Ala Glu Ser Lys Asn 155 Arg Glu Asn Thr Ala Leu Asp Arg Leu Lys Gin Asn 140 Ala Leu Asn Ala Phe Lys Asn Gin Arg Leu His Ala 125 Leu Leu Asp Glu Leu 205 Ser Leu Glu Leu Phe Lys Leu 110 Leu Glu Phe Leu Ile 190 Met Asn Ser Ser Lys Phe Gin Lys Phe Glu Leu Met 175 Gin Asn Ala Arg Leu Glu Asn Ser Glu Leu Ala Leu 160 Ser Asp His WO 97/37044 PCT/US97/05223 827 Asp Phe Gin Ala Tyr Lys Ala Met Arg Leu Lys Lys Ile 
Lys Asn Lys 210 215 220 Leu Gln Ser Gln Ile Gln Ala Lys Glu Asp Ala Leu Lys Thr Phe Leu 225 230 235 240 Pro Leu Glu Lys Arg Leu Glu Thr Leu Lys Ser Arg Phe Leu Cys Asp 245 250 255 Lys Glu Asn Leu Lys Ser Cys Ala Asn Gln Leu Gln Gln Arg Tyr Gln 260 265 270 Asn Ala Leu Ile Glu Arg Asp Lys Glu Leu Lys Asn Ala Lys Asn Asn 275 280 285 Lys Glu Lys His Ala Leu Ile Leu Ala Asn Tyr Glu His Thr Leu Lys 290 295 300 Thr Leu Asn Ile Glu Phe Leu Ser Glu Leu Ser Glu Gln Met Ala Phe 305 310 315 320 Leu Asn Glu Thr Met Ala Leu Asn Ala Gln Val Leu Ala Leu Leu Ala 325 330 335 Gln Gln Gln Thr Lys Lys Pro Phe Asn Val Ser Asp Gly Leu Ser Gly 340 345 350 Gly Lys Ala Leu Ile Lys Asn Ile Arg Leu Asp Pro His Gly Phe Pro 355 360 365 Ser Phe Lys Asn Phe Lys Gln Glu 370 375 INFORMATION FOR SEQ ID NO:928: SEQUENCE CHARACTERISTICS: LENGTH: 229 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...229 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:928: Val Lys Gly Glu Lys Asn Ala Trp Tyr Leu Gly Ile Ser Tyr Gln Val 1 5 10 Gly Gln Ala Ser Gln Ser Val Lys Asn Pro Pro Lys Ser Ser Glu Phe 25 Asn Tyr Pro Lys Phe Pro Val Gly Lys Thr Asp Tyr Leu Ala Val Met 40 Gln Gly Leu Gly Leu Thr Val Gly Tyr Lys Gln Phe Phe Gly Glu Lys 55 Arg Trp Phe Gly Ala Arg Tyr Tyr Gly Phe Met Asp Tyr Gly His Ala 70 75 Val Phe Gly Ala Asn Ala Leu Thr Ser Asp Asn Gly Gly Val Cys Lys 90 Leu Asn Glu Pro Cys Ala Thr Lys Val Gly Thr Met Gly Asn Leu Ser 100 105 110 Asp Met Phe Thr Tyr Gly Val Gly Ile Asp Thr Leu Tyr Asn Val Ile 115 120 125 Asn Lys Glu Asp Ala Ser Phe Gly Phe Phe Phe Gly Ala Gln Ile Ala 130 135 140 Gly Asn Ser Trp Gly Asn Thr Thr Gly Ala Phe Leu Glu Thr Lys Ser 145 150 155 160 Pro Tyr Lys His Thr Ser Tyr Ser Leu Asp Pro Ala Ile Phe Gln Phe 165 170 175 Leu Phe Asn Leu Gly Ile Arg Thr His Ile Gly Gln His Gln Glu Phe 180 185 190 Asp Phe Gly Val Lys Ile Pro
Thr Ile Asn Val Tyr Tyr Phe Asn His 195 200 205 Gly Asn Leu Ser Phe Thr Tyr Arg Arg Gln Tyr Ser Leu Tyr Val Gly 210 215 220 Tyr Arg Tyr Asn Phe 225 INFORMATION FOR SEQ ID NO:929: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:929: Leu Lys Asn His Ser Phe Lys Lys Thr Ile Ala Leu Ser Leu Leu Ala 1 5 10 Ser Met Ser Leu Cys Asn Ala Glu Glu Asp Gly Ala Phe Phe Val Ile 25 Asp Tyr Gln Thr Ser Leu Ala Arg Gln Glu Leu Lys Asn Pro Gly Phe 40 Thr Gln Ala Gln Glu Leu Lys Gln Leu Ile Arg Asp Gly Ala Val Arg 55 Leu Gln Thr Ser Ala Ile Pro Leu Ser Tyr Tyr Leu Asp Ile Leu Gly 70 75 Asn Lys Thr Lys Thr Leu Leu Ser Glu Ser Leu Lys Asn Asn Pro Gln 90 Gln Pro Asn Gly Gln Pro Asn Gln Ala Leu Val Asn Leu Glu Gln Ser 100 105 110 Leu Gly Ile Leu Gly Lys Leu Leu Asp Leu Ser Gln Gln Tyr Ala Ser 115 120 125 Glu Gly Val Ile Lys Pro Leu Val Val Asp Val Gly Lys Glu Gln Ile 130 135 140 Gly Ile Thr Asp Ser Met Leu Leu Val Ala Gln Asn Ile Val Leu Ala 150 155 160 Leu Gly Gln Val Asp Leu Ser Lys Ile Gln Gln Asn Asn Gly Asn Gln 165 170 175 Gln Leu Tyr Glu Asn Ile Met Lys Val Met Leu Leu Gly Thr Gly Gly 180 185 190 Thr Asn Gly Ala Tyr Asn Gly Val Ser Val Gly Asp Ile Ala Thr Gly 195 200 205 Met Gln Asn Phe Pro Ser Gln Thr Gly Leu Ile Gly Ala Asn Ser Thr 210 215 220 Val Ser Glu Leu Asn Ala Leu Ile Lys Ser Gly Ile Ser Leu Asp Arg 225 230 235 240 Glu Thr Leu Gly Leu Gly Ser Phe Ile Glu Lys Asn Ile Cys Ser Gly 245 250 255 Ala Ser Ser Cys Phe Ser Gly Asn Gln Leu Ile Tyr Lys Lys Gly Leu 260 265 270 Asp Arg Thr Ile Asn Ile Ile Asn Ala Val Leu Gly Gln Phe Glu Ser 275 280 285 Ser Ala Ser Ser Leu Tyr Lys Ile Ser Tyr Ile Pro Asn Leu Phe Ser 290 295 300 Leu Lys Asp Tyr Gln Ser Ala Ser Met Asn Gly Phe Gly Ala Lys Met 305 310 315 320 Gly Tyr Lys Gln Phe Phe
Thr His Lys Lys Asn Ile Gly Leu Arg Tyr 325 330 335 Tyr Gly Phe Leu Asp Tyr Gly Tyr Ala Asn Phe Gly Asp Thr Asn Leu 340 345 350 Lys Val Gly Ala Asn Leu Val Thr Tyr Gly Val Gly Thr Asp Phe Leu 355 360 365 Tyr Asn Val Tyr Glu Arg Ser Arg Arg Arg Glu Arg Thr Thr Ile Gly 370 375 380 Leu Phe Phe Gly Ala Gln Ile Ala Gly Gln Thr Trp Ser Thr Asn Val 385 390 395 400 Thr Asn Leu Leu Ser Gly Gln Arg Pro Asp Val Lys Ser Ser Ser Phe 405 410 415 Gln Phe Leu Phe Asp Leu Gly Val Arg Thr Asn Phe Ala Lys Thr Asn 420 425 430 Phe Asn Lys His Arg Leu Asp Gln Gly Ile Glu Phe Gly Val Lys Ile 435 440 445 Pro Val Ile Ala His Lys Tyr Phe Ala Thr Gln Gly Ser Ser Ala Ser 450 455 460 Tyr Met Arg Asn Phe Ser Phe Tyr Val Gly Tyr Ser Val Gly Phe 465 470 475 INFORMATION FOR SEQ ID NO:930: SEQUENCE CHARACTERISTICS: LENGTH: 179 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...179 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:930: Met Ala Ser Leu Ala Phe Val Gln Ala Phe Leu Glu Ser Phe Lys Gly 1 5 10 Phe Leu Ser Gln Ala Thr Leu Ile Ser Val Leu Ile Ala Ser Val Leu 25 Ile Leu Phe Cys Ala Ile Leu Leu Leu Leu Ala Leu Leu Leu Arg Asn 40 Arg Trp Ala Ser Tyr Ile Thr Thr Ala Ala Phe Leu Gly Ala Phe Leu 55 Ser Met Pro Phe Val Leu Asn Val Leu Leu Thr Gln Ala Ile Tyr Pro 70 75 Ile Glu Thr Arg Ile Leu His Ala Asn Pro Leu Ser Tyr Ser Asn Ala 90 Phe Ser Leu Gln Val Gly Val Lys Asn Ile Ser Lys Phe Ser Leu Asn 100 105 110 Lys Cys Val Leu Arg Leu Glu Val Leu Lys Asn Pro His Asn Phe Val 115 120 125 Glu Glu Arg Ala Phe Lys Trp Phe Val Lys Lys Ser Tyr Glu Lys Thr 130 135 140 Phe Lys Glu Lys Ile Leu Pro Glu Glu Ser Lys Val Phe Ser Phe Phe 145 150 155 160 Ile Asp Asp Tyr Pro Tyr Ser Lys Thr Ala Pro Tyr Gln Val Ser Leu 165 170 175 Phe Cys Leu INFORMATION FOR SEQ ID NO:931: SEQUENCE CHARACTERISTICS: LENGTH: 129 amino acids TYPE: amino acid TOPOLOGY:
linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...129 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:931: Leu Gly Glu Leu Leu Gln Ala Ser Ala Pro Tyr Ala Ile Ala Pro Val 1 5 10 Glu Leu Ile Val Val Phe Tyr Lys Asn Glu Tyr Lys Arg Lys Lys Gln 25 Thr Ser Thr Met Ser Arg Glu Glu Phe Leu Leu Tyr Thr Asn Gly Leu 40 Trp Asn Phe Ser Gly Glu Ser Lys Lys Arg Leu Lys His Pro Ala Pro 55 Phe Pro Arg Glu Leu Pro Arg Arg Cys Ile Gln Leu Phe Ser Phe Leu 70 75 Glu Asp Thr Ile Phe Asp Pro Phe Ser Gly Ser Gly Thr Thr Ile Leu 90 Glu Ala Asn Ala Leu Gly Arg Phe Ser Val Gly Leu Glu Ile Glu Lys 100 105 110 Glu Tyr Cys Glu Leu Ser Lys Lys Arg Ile Leu Glu Ser Leu Ser Leu 115 120 125 Val INFORMATION FOR SEQ ID NO:932: SEQUENCE CHARACTERISTICS: LENGTH: 327 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...327 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:932: Val Leu Lys Gly Leu Lys Lys Ala Phe Lys Glu Arg Phe Cys Ser Gln 1 5 10 Val Tyr Ile Ser Phe Asn Val Asp His Asn Leu Leu Ser Ala Gln Val 25 Leu Arg Val Lys Asn His Arg Ile Lys Glu Lys Phe Phe Lys Thr Phe 40 Glu Thr Lys Val Glu Thr Lys Asn Gly Glu Val Pro Ile Gln Ala Leu 55 Lys Ile Ala Arg Thr Tyr Ser Gln Lys Tyr Pro Tyr Thr Tyr Phe Ser 70 75 Ala Met Ser Lys Ala Lys Glu Val Leu Cys Glu Lys Gln Ala Phe Glu 90 Gln Ile Lys Gln Glu Asn Gln Asp Tyr Gln Ala Cys Glu Val Asn Gln 100 105 110 Lys Tyr Cys Val Tyr Val Glu Ser Lys Asp Phe Leu Lys Asp Phe Lys 115 120 125 Arg Phe Lys Ile Gln Asp Val Asp Phe Leu Phe Ser Pro Phe Ser Leu 130 135 140 Ile Tyr Asp Phe Val Arg Asp Asn Leu Glu Asn Lys Pro Leu Leu Tyr 145 150 155 160 Leu Leu Leu Glu Arg Ser Arg Phe Tyr Phe Leu Ile Ala Asp Lys Lys 165 170 175 Glu Ile Phe Leu Ala Lys Ser Val Phe Leu Glu Glu Gln Pro Glu Glu 180 185 190 Phe Ile Glu Ser Lys Glu Glu Asp Ser Met Gly Met Asp Asn Glu Ala 195 200 205 Val Asp Leu Phe Leu Ser Glu Ile Gln Glu Asp Ile Asp Ser Leu Glu 210 215 220 Glu Ala Ile Gly Leu Asp Ser Ser Lys Asp Asn Ser Glu Lys Ile Thr 225 230 235 240 Glu Asp Ala Tyr Ser Leu Ile Glu Gly Met Thr Asn Ile Pro Leu Ile 245 250 255 Ala Asp Val Leu Gln Glu Gly Leu Arg Gly Val Tyr His Ser Arg Glu 260 265 270 Ile Asp Phe Val Glu Lys Val Val Val Leu Asp Ser Cys Gln Ile His 275 280 285 His Lys Ala Leu Met His Leu Gln Glu Thr Leu Met Ile Glu Val Asp 290 295 300 Arg Leu Asp Phe Ser Leu Val Glu Arg Leu Asn Val Leu Ala Arg Met 305 310 315 320 Glu Asn Glu Lys His Ala Phe 325 INFORMATION FOR SEQ ID NO:933: SEQUENCE CHARACTERISTICS: LENGTH: 431 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...431 (xi) SEQUENCE DESCRIPTION: SEQ
ID NO:933: Met Asn Ile Gin Thr Lys Lys Arg Phe Leu Ala Asn Leu Leu Leu Phe 1 5 10 Ser Leu Phe Ser Cys Leu Lys Ala Glu Thr Leu Ser Glu Asp His Gin 25 Ile Leu Leu Ser Ser Asp Ala Phe His Arg Gly Asp Phe Ala Thr Ala 40 Gin Lys Gly Tyr Met Asn Leu Tyr Lys Gin Thr Asn Lys Val Val Tyr 55 Ala Lys Glu Ala Ala Ile Ser Ala Ala Ser Leu Gly Asp Ile Lys Thr 70 75 Ala Met His Leu Ala Met Leu Tyr Gin Lys Ile Thr Asn Asn Arg Asn 90 Asp Val Ser Ile Asn Lys Ile Leu Val Asp Gly Tyr Ala Gin Met Gly 100 105 110 Gin Ile Asp Lys Ala Ile Glu Leu Leu His Lys Ile Arg Lys Glu Glu 115 120 125 Lys Thr Ile Ala Thr Asp Asn Val Leu Gly Thr Leu Tyr Leu Thr Gin 130 135 140 Lys Arg Leu Asp Lys Ala Phe Pro Leu Leu Asn Lys Phe Tyr Asn Gin 145 150 155 160 WO 97/37044 PCT/US97/05223 Val Gin Arg Thr Tyr 225 Leu Phe Lys Lys Leu 305 Ile Lys Leu Asp Leu 385 Ala Pro His Asn Tyr Gin 210 Glu Ile Pro Phe Asp 290 Ser Gin Thr Gly Leu 370 Asp Lys Glu Asp Arg Gly 195 Phe Lys Leu Phe Asp 275 Pro Ala Lys Lys Tyr 355 Val Ser Lys Leu Glu Asp 165 Lys Lys 180 Cys Ser Asn Glu Asn Pro Leu Lys 245 Asp Arg 260 Gin Ala Lys Phe Asn Lys Leu Glu 325 Asp Lys 340 Ser Leu Arg Lys Leu Ala Ile Phe 405 Lys Glu 420 Ser Leu Glu Lys Glu Glu Leu Ile 230 Glu Arg Ser Leu Lys 310 Gin Glu Ile Ala Trp 390 Ser His Gly Gin Asp 215 Val Phe Leu Lys Gly 295 Lys Ala Asp Asp Leu 375 Gly Ser Asn Leu Leu 200 Leu Gin Asp Leu Gin 280 Leu Leu Thr Ala Tyr 360 Ala Tyr Ile Lys Asp 185 Cys Ala Asn Lys Leu 265 Ala Glu Thr Lys Gin 345 Asp Leu Tyr Ala Ile 425 Leu 170 Leu Gin Lys Ala Ala 250 Asp Ser Ala Lys Glu 330 Asp Met Asp Lys Lys 410 Ile Ile Leu Lys Thr Gin 235 Gin Leu Leu Ile Glu 315 Arg Ala Asp Ser Leu 395 Glu Gin Thr Gin Ala Thr 220 Phe Gin Tyr Ile Tyr 300 Glu Gin Phe Val Asn 380 Gly Leu Glu Ile Ser Leu 205 Phe Tyr Ile Thr Tyr 285 His Met Ala Phe Lys 365 Ser Asn Ile Cys Tyr His 190 Asn Ala Ile Ala Ala 270 Gin Tyr Leu Trp Tyr 350 Arg Val Cys Gin Lys 430 Phe 175 Ile Thr Arg Gly Glu 255 Gin Glu Glu Pro Leu 335 Asn Gly Leu Leu Asn 415 Lys Leu Asp Phe Leu Val 
240 Leu Lys Arg Ser Ile 320 Ala Phe Met Tyr Glu 400 Glu
INFORMATION FOR SEQ ID NO:934: SEQUENCE CHARACTERISTICS: LENGTH: 414 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...414 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:934:
Leu Ile Gln Lys Asn Lys Ser Leu Ser Ile Phe Leu Ile Ser Asn Ser 1 5 10 Val Val Phe Leu Gly Lys Ile Ile Leu His Lys Val Phe Ile Met Glu 25 Ala Leu Glu Cys Leu Lys Arg Ile Glu Lys Glu Ser Ile Gln Thr Ile 40 Tyr Ile Asp Pro Pro Tyr Asn Thr Lys Ser Ser Asn Phe Glu Tyr Glu 55 Asp Ala His Ala Asp Tyr Glu Lys Trp Ile Glu Glu His Leu Ile Leu 70 75 Ala Lys Ala Val Leu Lys Gln Ser Gly Cys Ile Phe Ile Ser Met Asp 90 Asp Asn Lys Met Ala Glu Val Lys Ile Ile Ala Asn Glu Ile Phe Gly 100 105 110 Thr Arg Asn Phe Leu Gly Thr Phe Ile Thr Lys Gln Ala Thr Arg Ser 115 120 125 Asn Ala Lys His Ile Asn Ile Thr His Glu Tyr Val Leu Ser Tyr Ala 130 135 140 Lys Asn Lys Ala Phe Ala Pro Gly Phe Lys Ile Leu Arg Thr Leu Leu 145 150 155 160 Pro Ile Tyr Ala Lys Ala Leu Lys Asp Leu Met Arg Thr Ile Lys Asn 165 170 175 Val Phe Lys Gln Lys Gly Gln Ala Gln Ala Gln Leu Ile Leu Lys Glu 180 185 190 Gln Ile Lys Glu Leu Ser Gln Lys Glu His Phe Asn Phe Leu Lys Asn 195 200 205 Tyr Asn Leu Val Asp Glu Lys Gly Glu Ile Tyr Phe Ala Lys Asp Leu 210 215 220 Ser Thr Pro Ser Asn Pro Arg Ser Val Ala Ile Gln Glu Ile Asn Leu 225 230 235 240 Phe Leu Glu Pro Leu Lys Ser Arg Gly Trp Ser Ser Asp Glu Lys Leu 245 250 255 Lys Glu Leu Tyr Tyr Gln Asn Arg Leu Ile Phe Lys Asn Asn Arg Pro 260 265 270 Tyr Glu Lys Tyr Tyr Leu Lys Glu Ser Gln Asp Asn Cys Leu Ser Val 275 280 285 Leu Asp Phe Tyr Ser Arg Gln Gly Thr Lys Asp Leu Glu Lys Leu Gly 290 295 300 Leu Lys Gly Leu Phe Lys Thr Pro Lys Pro Val Ala Leu Ile Lys Tyr 305 310 315 320 Leu Leu Leu Cys Ser Thr Pro Lys Asp Ser Ile Ile Leu Asp Phe Phe 325 330 335 Ala Gly Ser Gly Thr Thr Ala Gln Ala
Val Ile Glu Val Asn Lys Asp 340 345 350 Tyr Tyr Leu Asn Trp Ser Phe Tyr Leu Cys Gln Lys Glu Glu Lys Ile 355 360 365 Lys Asn Asn Pro Gln Ala Val Ser Ile Leu Lys Asn Lys Gly Tyr Lys 370 375 380 Asn Thr Ile Ser Asp Ile Met Leu Leu Arg Leu Glu Lys Ile Ile Lys 385 390 395 400 Arg Ser Glu Tyr Glu Ile Leu Lys Thr Lys Ser Ile Leu Phe 405 410
INFORMATION FOR SEQ ID NO:935: SEQUENCE CHARACTERISTICS: LENGTH: 166 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...166 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:935:
Leu Glu Ile Asp Lys Thr Glu Cys Ser Thr Leu Leu Ala Ser Ile Gln 1 5 10 Lys Gln Gln Leu Val Ile Pro Val Val Gly Asn Phe Ser Ala Gly Lys 25 Ser Thr Leu Leu Asn Arg Phe Leu Gly Ser Ser Val Leu Pro Thr Gly 40 Ile Thr Pro Glu Thr Ser Leu Ala Thr Glu Leu His Tyr Ser Ala Asn 55 Glu Arg Ile Glu Ala Phe Ser Ser Asn Asp Glu Lys Thr Glu Ser Phe 70 75 Glu Leu Asn Glu Gln Ser Phe Glu Ala Ile Lys Glu Asn Ala Ala Lys 90 Tyr Ser Tyr Leu Lys Val Tyr Leu Asn Asn Glu Ala Leu Lys Asp Ser 100 105 110 Ala Pro Leu Val Phe Val Asp Met Pro Gly Phe Asp Ser Pro Ile Ser 115 120 125 Ser His Thr His Ala Ile Leu Glu Tyr Leu Glu Arg Gly Val His Phe 130 135 140 Val Ile Leu Thr Ser Val Glu Glu Gly Asn Leu Thr Lys Arg Met Val 145 150 155 160 Arg Glu Leu Lys Thr Phe 165
INFORMATION FOR SEQ ID NO:936: SEQUENCE CHARACTERISTICS: LENGTH: 251 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...251 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:936:
Met Lys Leu Pro Val Val Glu Ser Phe Phe Ser Leu Gln Gly Glu Gly 1 5 10 Lys Arg Ile Gly Lys Pro Ser Leu Phe Leu Arg Leu Gly Gly Cys Asn 25 Leu Ser Cys Lys Gly Phe Asn Cys Lys Thr Leu Phe Asn Asp Glu Ile 40 Leu Thr Gly Cys Asp
Ser Leu Tyr Ala Val His Pro Lys Phe Lys Thr 55 Ser Trp Asp Tyr Tyr Asn Glu Pro Lys Pro Leu Ile Glu Arg Leu Val 70 75 Asn Leu Ala Pro Asn Tyr Lys Asp Phe Asp Phe Ile Leu Thr Gly Gly 90 Glu Pro Ser Leu Tyr Phe Asn Asn Pro Ile Leu Leu Ser Val Leu Glu 100 105 110 His Phe Tyr His Lys Lys Ile Pro Leu Phe Val Glu Ser Asn Gly Ser 115 120 125 Ile Phe Phe Glu Phe Ser Pro Ile Leu Lys Glu Leu His Phe Thr Leu 130 135 140 Ser Val Lys Leu Ser Phe Ser Leu Glu Gin Glu Ser Lys Arg Ile Asn 145 150 155 160 Leu Lys Ala Leu Gin Asn Ile Leu Asn Asn Ala Lys Ser Val His Phe 165 170 175 Lys Phe Val Leu Glu Ser Gin Asn Ala Ala His Ser Ile Ala Glu Ile 180 185 190 Gin Ser Leu Leu Lys Gin Leu Ser Leu Lys Asn Asn Glu Ile Phe Leu 195 200 205 Met Pro Leu Gly Thr Thr Asn Asn Glu Leu Asp Lys Asn Leu Lys Thr 210 215 220 Leu Ala Pro Leu Ala Ile Glu His Gly Phe Arg Leu Ser Asp Arg Leu 225 230 235 240 His Ile Arg Leu Trp Asp Asn Lys Lys Gly Phe 245 250 INFORMATION FOR SEQ ID NO:937: SEQUENCE CHARACTERISTICS: LENGTH: 381 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...381 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:937: Met Asn Leu Asn Phe Met Pro Leu Leu His Ala Tyr Asn His Val Ser 1 5 10 Ile Asp Phe His Phe Asn Ser Ser Ala Arg Asp Phe Cys Val His Glu 25 WO 97/37044 PCTIUS97/05223 Val Pro Leu Ty Val Arg Lys Se Gin Ile Leu Gi Asp Lys Asn Al Ala Pro Leu Le i0 Lys Ile Leu Se 115 Leu Lys Gly As 130 Asn Ala Gin Ly 145 Met Pro Asn Ty His Lys Giu G1 18( Gin Lys Leu Asi 195 Asn Ser Leu Let 210 Phe Ser Val Lys 225 Val His Ser Asr Lys Ile Leu Gil 260 Phe Asp Ala Leu 275 Giu Ala Val Pro 290 Lys Asn Leu Ser 305 Ser Gly His Ala Val Giu Asn Ile 340 Leu Giu Phe Tyr 355 Glu Ile Lys His 370 INFORMAtTION r Giu Phe Ser Asn Thr Gly Giu His Ala Val Ile Gin a r
"I
7Gly Val Leu 85 Giu *Leu *Arg Thr Phe 165 Leu Ala Ser Glu Al a 245 Gly Glu Thr Leu Lys 325 Thr Leu Giu Let Lys 70 Thr Lys Asn Phe Giu 150 Gly Lys Phe Lys Ser 230 Leu Asp Leu Gly Glu 310 Thr Ser Pro Lys 1Ser 55 Ile Thr Asn Tyr Phe 135 Gin Ser Ile Leu Arg 215 Leu Lys Val Giu Leu 295 Ile Leu Gin Lys GlyC 375 Thr Ala Gin Thr His 120 Met Val1 Gin Leu Ile 200 Leu Giu Ala Met Lys 280 Leu .,lu Pyr ~60 Let *Gl.
PhE Ser 105 His Arg Leu Arg Gin 185 Ser Giu Phe Leu Arg 265 Giu Asp Lys Ser Ile 345 Ser Asn 2 Giu Leu Ile 90 *Asn Asn *Phe Giu Phe 170 Asn Ser Ile Phe Lys 250 His Ser Gly Gly Arg .z 330 Lys C Tyr P Asn A~ Met G1~ 75 Sei PhE Lys Lys Gin 155 Gly Giu Tyr Ser Lys 235 Asn Tyr 31u ys ?he 1i5 ~rg flu lia Lsp tLet iTyi Let Gir Ile Lys 140 Ile Lys Thr Gin Lys 220 Gin Gin Pro Arg Lys 300 Gin Phe Lys Ser Giu 380 2 Gin 7Ala aPro Giu Lys 125 Met Ala Phe Lys Ser 205 Ile Lys Ala Tyr PheI 285 AlaI His2 Phe Ala C 3 Ala L 365 Phe Ile Gly Lys Arg 110 Leu Thr Gin As n Phe 190 Tyr Ile Asn His 31
Y
270 eu ~sn 'rp fin ~eu *PhE *Leu Lys Asn Gly Pro Phe Asp 175 Ala Leu Ser Leu Pro 255 Lys Asn Tyr Leu Val1 335 Phe Leu Ser 1Lys *Tyr Leu *His Leu Gly 160 Asn His Phe Ala Ser 240 Phe Phe Lys Ala Leu 320 Phe Giu Lys FOR SEQ ID NO:938: SEQUENCE CHARACTERISTICS: LENGTH: 697 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: WO 97/37044 WO 9737044PCT/US97/05223 838 ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .697 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:938: Leu Gin Asn Phe Val Phe Asn Lys Lys Trp Leu Ile 1 Leu Phe Asn Asn Giu Asp Gin Asn Thr 145 Asn Ala Ile Lys Val1 225 Gin Lys Ala Tyr Phe 305 Pro Gin Ile Asn Pro Met Ser so Aia Ala Pro Asn Ala 130 Gin Sen Leu Thr Ser 210 Leu Thr Ser Giy Asn 290 Ser Leu Tyr Ile Phe 370 Leu Giy Gly Ile Met Ser Asn 115 Thr Leu Lys Pro Asn 195 Phe Gin Ile Ile Ile 275 Lys Giy Lys Thr Aia 355 Ala Phe Val1 Leu Ala Gly Lys 100 Asn Phe Ile Val1 Giu 180 Ala Ser His Cys Ala 260 Leu Ala Pro Ala Tyr 340 Asn Asn 5 Phe Ser Asn Leu Gin Arg Gly Asp Gly Phe 165 Gly Leu Thr Leu Sen 245 Gin Gly Pro Giy Leu 325 His Gly Lys Leu Tyr Ala Glu 70 Gin Cys Asp Met Giu i50 Asn Leu Thr Pro Leu 230 Thr Asn Gly Asn Tyr 310 Pro Pro Ile Ala As n Gin Ser 55 Sen Thr Leu Thr Gin 135 Thr Val1 Ala Thr Ser 215 Gin Gin Ala Leu Gly 295 Tyr Ala Ser Thr Ala 375 Pro Thr 40 Gin Ala Arg Leu Gly 120 Ser Leu Lys Asn Leu 200 Asn Asp Asn Gin Ala 280 Ser Thr Gly Ser Al a 360 Lys Leu 25 Sen Asp Aia Val1 Tyr 105 Asn Leu Ile Phe Thr 185 Trp Thr Gly Gln Asn 265 Asn Asp Lys Ala Ala 345 Sen Leu Met Leu Ala Val1 Leu 90 Ala Asn Val Arg Gly 170 Met Tyr Ser Leu Cys 250 Ile Glu Ser Asn Thr 330 Val1 Met Ile Ala Ala Ser Pro 75 Met Gly Pro Asn Asn 155 Asn Asp Asn Val Ala 235 Thr Phe Lys Gin Asp 315 Ile Tyr I le Gly Giu Val1 Thr Leu Gin Gly Pro Asn 140 Pro Gin Ala Gin Asn 220 Thr Ala Gin Gin Gin 300 Asn Gly Tyr Phe Thr Tyr Asp Gin Tyr Ala Met Tyr Arg 125 Leu Giu Sen Leu Thr 205 Phe Ala Thr Ala Phe 285 Giy Thr Ser Leu Sen 365 Sen Ser Asp Arg Ile Tyr Leu Gin 110 Giy Asn Asn Thn Asn 190 Leu Ser Asn Asn Leu 270 Giy Tyr Thr Giy Ala 350 Giy Ser Ser Gly Val1 Arg Tyr Cys Asn Asn Lys Leu Val1 175 Asn Thr Pro Asn Giu 255 Met Phe Gln Gin Asn 335 Asp Met Tyr Leu Phe Asp Gin Leu Pro Giy Vali Leu Pro 160 Ile Asp Asn Gin Asn 240 Ala Gin Thr Sen Ala 320 Gly Ser Gin Asn 380 Gin Met 
Gin Asp Val Ile Asn Tyr Gly Giu Ser Leu Leu Ser Asn Thr WO 97/37044 PCT/US97/05223 839 385 390 395 400 Val Ala Tyr Gly Asp Phe Ile Thr Asn Trp Val Ala Pro Tyr Leu Asp 405 410 415 Leu Asn Asn Lys Gly Leu Asn Phe Leu Pro Asn Tyr Gly Gly Gin Leu 420 425 430 Asn Gly Ala Asn Asn Gin Thr Pro Gin Leu Thr Pro Gin Gin Ala Gin 435 440 445 Gin Glu Gin Lys Val Ile Met Asn Gin Leu Glu Gin Ala Thr Asn Ala 450 455 460 Pro Thr Pro Ala Gin Ile Asn Arg Ile Leu Ala Asn Pro Tyr Ser Pro 465 470 475 480 Thr Ala Lys Thr Leu Met Ala Tyr Gly Leu Tyr Arg Ser Lys Ala Val 485 490 495 Ile Gly Gly Val Ile Asp Glu Met Gin Thr Lys Val Asn Gln Val Tyr 500 505 510 Gin Met Gly Phe Ala Arg Asn Phe Leu Glu His Asn Ser Asn Ser Asn 515 520 525 Asn Met Asn Gly Phe Gly Val Lys Met Gly Tyr Lys Gin Phe Phe Gly 530 535 540 Lys Lys Arg Met Phe Gly Leu Arg Tyr Tyr Gly Phe Tyr Asp Phe Gly 545 550 555 560 Tyr Ala Gin Phe Gly Thr Glu Ser Ser Leu Val Lys Ala Thr Leu Ser 565 570 575 Ser Tyr Gly Ala Gly Thr Asp Phe Leu Tyr Asn Val Phe Thr Arg Lys 580 585 590 Arg Gly Thr Glu Ala Ile Asp Ile Gly Phe Phe Ala Gly Ile Gin Leu 595 600 605 Ala Gly Gin Thr Trp Lys Thr Asn Phe Leu Asp Gin Val Asp Gly Asn 610 615 620 His Leu Lys Pro Lys Asp Thr Ser Phe Gin Phe Leu Phe Asp Leu Gly 625 630 635 640 Ile Arg Thr Asn Phe Ser Lys Ile Ala His Gin Lys Arg Ser Arg Phe 645 650 655 Ser Gin Gly Ile Glu Phe Gly Leu Lys Ile Pro Val Leu Tyr His Thr 660 665 670 Tyr Tyr Gin Ser Glu Gly Val Thr Ala Lys Tyr Arg Arg Ala Phe Ser 675 680 685 Phe Tyr Val Gly Tyr Asn Ile Gly Phe 690 695 INFORMATION FOR SEQ ID NO:939: SEQUENCE CHARACTERISTICS: LENGTH: 697 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...697 WO 97/37044 PCT1UJS97/05223 840 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:939: Leu Gin Asn Phe Vai Phe Asn Lys Lys Trp Leu Ile Tyr Ser Sea 1 Leu Phe Asn Asn Giu Asp Gin Asn Thr 145 Asn Ala Ile Lys Val 225 
Gin Lys Ala Tyr Phe 305 ProI Gin Ile Asn 1 Gin D~ 385 Val P~ Leu P Prc Met Ser Ala Ala Pro Asn Ala 130 Gin Ser Leu Thr Ser 210 Leu Thr Ser Gly lksn 290 Ser Leu I'yr Ile 'he let lia ~sn Lei Gl Gl 11E *Met S e1 *Asr 115 Thr Leu Lys Pro Asn 195 Phe Gin Ile Ile Ile 275 Lys Giy Lys Thr Ala 355 Ala Gin Tyr Asn uPhe {Val {Leu SAla Lys 100 Asn Phe Val1 Glu 180 Ala Ser His Cys Ala 260 Leu Ala Pro Ala Tyr 340 Asn Asn Asp Gly Lys C 420 5 Ph Sea Asr Lei- Gir Arg GiV Asp Gly Phe 165 Gly Leu Thr Leu Ser 245 Gln
G
1 y Pro
G
1 y Leu 325 iis 31
Y
,ys l ~sp 105 fly E? Leu Tyr 1Ala Glu 70 Gin Cys Asp Met Giu 150 Asn Leu Thr Pro Leti 230 Thr Asn GlyI Asn Tyr 310 Pro Pro Ala Ile A 390 Phe I Leu A Asi Cl' Sea 55 Sea Thr Leu Thr Gin 135 Thr Val Ala Thr Ser 215 Gln Glm ka Leu ~95 ['yr la ~er 'hr lia ~75 sn le sn n Prc -1 Thi 40 *Ala *Arc Leu *Gly 120 Ser *Leu Lys Asn Leu 200 Asn Asp Asn Gin Ala 280 Ser Thr Giy Ser Ala 360 Lys Tyr Thr Phe DLeu 25 Ser Asp Ala Val Tyr 105 Asn Leu Ile Phe Thr 185 Trp Thr Gly Gin Asn 265 Asn Asp Lys Ala Ala 345 Ser I LeuI Gly C Asn 'I 4 Leu P 425 10 Met LeL Ala Val1 Leu 90 Ala Asn Val Arg Gly 170 Met Tyr Ser Leu Cys 250 Ile Glu Ser ksn Phr 330 Ia 1 4e t le liu ~rp 110 ~ro Al~ Al Sea *Prc 75 *Met Gil, Pro Asn Asn 155 Asia Asp Asn Val Ala 235 Thr Phe Lys Gin Asp 315 Ile Tyr Ile Gly Ser 395 Val Asn
"GB
Va :Thi Lei- Gir Gl) Pro Asia 140 Pro Gin Ala Gin Asn 220 Thr Ala Gin Gin Gin 300 Asn Gly Tyr Phe Thr 380 Leu Ala Tyr u As 1 i] Ty: Al Met Tya Arc 125 Lei- Git Ser Lea Thr 205 Phe Ala Thr Ala Phe 285 Gly Thr Ser Leu Ser 365 Ser Leti Pro Gly p Asp ri Arg r Ile a Tyr t Leu Gin 110 Gly 1Asia IAsn Thr *Asn 190 *Leu Ser Asn Asn Leti 270 Gly Tyr Thr Gly Ala 350 Gly Ser Ser I Tyr I Gly C 430 G1l Val Arc Tyr Cys Asa Asia Lys Leti Val 175 Asn Thr Pro Asia Giti 255 Met Phe 3mn 3mn A.sn 335 ksp ['yr isn ~eu 115 fin r Leu IPhe Asp Gin *Leu Pro Gly Val Leu Pro 160 Ile Asp Asn Gin Asn 240 Ala Gin Thr Ser Ala 320 Gly Ser Gin Asn Thr 400 Asp Leu WO 97/37044 PCTfUS97/052 2 3 Asn Giy Ala Asn Asn Gin Thr Pro Gin Leu Thr Pro Gin Gin Ala Gin Gin Glu Gin Ly: 450 Pro Thr Pro Ah 465 Thr Ala Lys Thi Ile Giy Giy Val Gin Met Giy PhE 515 Asn Met Asn Gl 530 Lys Lys Arg Met 545 Tyr Ala Gin Phe Ser Tyr Gly Ala 580 Arg Gly Thr Giu 595 Ala Gly Gin Thr 610 His Leu Lys Pro 625 Ile Arg Thr Asn Ser Gin Gly Ile 660 Tyr Tyr Gin Ser 675 Phe Tyr Val Gly 690
INFORMATION
3Val Ile Met 455 a Gin Ile Asn 470 :Leu Met Ala 485 Sle Asp Glu Ala Arg Asn Phe Gly Vai 53S Phe Gly Leu 550 Gly Thr Giu 565 Gly Thr Asp Ala Ile Asp Trp Lys Thr 615 LYS Asp Thr 630 Phe Ser Lys 645 Glu Phe Gly Glu Gly Vail Tyr Asn Ile Asn 445 Gin Leu Giu Gln Ala Thr Asn Ala Arg Tyr Met Phe 520 Lys Arg Ser Phe Ile 600 Asn Ser Ile Ile Gly Gin S05 Leu Met Tyr Ser Leu 585 Gly Phe Phe Ala LeL Leu 490 Thr Glu Gly Tyr Leu 570 Tyr Phe Leu Gln H{is 650 Ilie aAla 475 1Tyr Lys His Tyr Gly 555 Val1 Asn Phe Asp Phe 635 Gin Pro 460 Asn Arg Val1 Asn Lys 540 Phe Lys Vali Ala Gin 620 Leu Lys Val Pro Ser Asn Ser 525 Gin Tyr Ala Phe Gly 605 Val1 Phe Arg Leu Tyr Lys Gin 510 Asn Phe Asp Thr Thr 590 Ile Asp Asp Ser Ser Ala 495 Val1 Ser Phe Phe Leu 575 Arg Gin Gly Leu Arg 655 Pro 480 Val1 Tyr Asn Gly Gly 560 Ser Lys Leu Asn Gly 640 Phe 665 rhr Ala 680 'ly Phe Lys Tyr Arg 670 Ala Phe Ser Arg 685 695 FOR SEQ ID NO:940: SEQUENCE
CHARACTERISTICS: LENGTH: 1237 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc -feature LOCATION 1. i237 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:940: Met Ile Lys Lys Ala Lys Lys Phe Ile Pro Leu Phe Leu Ile Giy Sen 10 WO 97/37044 WO 9737044PCTIUS97/05223 Leu Gly Asni Thr Gly Asn Gin Asp Asn 145 Ile Lys Gin Ser Thr 225 Giy Giy Ser Thr Cys 305 Pro Gin Asn Aia Aia 385 Leu Gin Leu Thr Vali Leu Gly Ile Thr Tyr Ile Ile Pro 130 Thr Ser Aia Ile Leu 210 Thr Val Gin Asn Leu 290 Aia Leu Ala Asn Leu 370 Leu Lys Giy Ser Asn 450 Gin Aia Thr Ile Giy Giy Leu Ser 115 Leu Gin Ala Gin Giu 195 Leu Tyr Phe Val S er 275 Gin Asn Ala Ile Aia 355 Asn Lys Asn Gin Ser 435 Asn Thr Giu Gin Asn Leu Vai Asn 100 Ser Lys Ser Vali Leu 180 His Ser Vali Pro Val1 260 As n Giy Gin Aia Aia 340 Ile Phe Gin Thr Asn 420 Asp Thr Ile Asp Asn Giy Trp Gin Phe Ser Ile Ile Thr 70 Ser Asn Gin Ile Ile Ser Ala His Ala Phe 150 Asn Ser 165 Gin Asn Ser Ile Asn Leu Ser Ala 230 Thr Thr 245 Phe Tyr Asn Gin Glu Leu Ile Gin 310 Thr Pro 325 Gin Lys Asn Asn Gin Ala Ile Ser 390 Ser Asn 405 Ile Ser Ala Ser Asn Ser Asn Gly Ile Thr 55 Leu Thr Giy Gin Ser 135 Asp Leu Thr Thr Thr 215 Leu Thr Pro Gin Ser 295 Cys Thr Leu Thr Tyr 375 Trp Asn Ala Ser Phe 455 Lys Asn 40 Gin Ser Val1 Lys Gin 120 Ser Gin Asn Ala Lys 200 Asp Val1 Ser Thr Gin 280 Thr Leu Ser Gin Thr 360 Gin Ile Tyr Tyr Giy 440 Asp Giu Tyr Met 25 Asn Lys Ser Ala Ser Gin Gly Asn 90 Arg Lys 105 Ile Ile Gin Ile Gly Ile Pro Ser 170 Gin Ser 185 Thr Thr Ala Val Asn Ala Thr His 250 Asn Ser 265 Tyr Asn Asn Asn Giu Gin Thr Asn 330 Ser Val 345 Tyr Asn Ser Thr Ser Phe Gin Ile 410 Asp Cys 425 Ile Ser Asn Ser Gin Ile Ser Val Gin Leu Ile Asn Ser Val 75 Gin Leu Asp Phe Giy Leu Thr Ala 140 Ala Leu 155 Asn Asn Met Thr Ser Thr Asn Ala 220 Leu Asn 235 Val Val Leu Leu Asn Thr Gin Asn 300 Phe Ile 315 Gin Ala Ala Ile Leu Asn Ile Giu 380 Ser Giu 395 Gly Thr Thr Ser Cys Ser Leu Val 460 Giy Val Giy Tyr Leu Giu Ile Ala Ile Asp Giu Gly Tyr Ser Lys Gly 125 Lys Leu Ser Ser Ser Gin Giu Leu 190 Thr Tyr 205 Ser Ser Thr Leu Leu 
Asn Giy Ser 270 Leu Leu 285 Asn Pro Gin Asn Asn Gin Asn Ala 350 Asn Leu 365 Gin Tyr Pro Lys Vai Thr Aia Thr 430 Ala Thr 445 Ala Thr Asn Ser Gin Asn Giy Ala Ile Ser Ser Leu Asn Glu 175 Leu Ala Asn Gly Pro 255 Thr Met Asn Leu Gin 335 Leu His Asn Asn Asn 415 Gly Ser Ser Phe Ile Gin Pro Leu Ser Arg Ser Ser Ile 160 Val1 Gin Gin Asn Val1 240 Pro Ser Asn Gly Thr 320 Val Asp Asn Asn Leu 400 Asp Ser Ser Lys Asn WO 97/37044 WO 9737044PCTIUS97/05223 465 Leu Gilu Ser Giy Thr 545 Met Ser Asn Ala Tyr 625 Asn Leu Ile Thr Tyr 705 Ala Gly Gly Ser Tyr 785 Ser Thr Cys Thr Asn 865 Lys Leu Val Asn Gly Asn 530 Asn Val1 Gly Thr Phe 610 Asn Asn Ile Asn Asn 690 Gin Leu Ser Leu Asn 770 Gin Ser Pro Tyr Asp 850 Ile Gin Cys Se r Leu Thr 515 Ala Thr Asn Pro Val1 595 Gin Thr Asn Asn Gin 675 Asn Gin Gly Asn Leu 755 Giy Met Asn Cys Giu 835 Ser Ile Phe Gly Gin Gin 500 Ser Gin Gin As n Thr 580 Leu Asn Ser Gin Thr 660 Ser Ala Trp Tyr Ile 740 Asn Gly Leu Ser Asn 820 Pro Asn Ala Phe Asn 900 Val 485 Lys Pro Leu Ala Giu 565 Thr Gin Gin Asn Asp 645 Ile Gin Cys Ser Gin 725 Thr Gin Ser Thr Ser 805 Ser Asn Leu Ser Giu 885 Gly 470 Trp Asn Cys Gin Lys 550 Giu Gin Asn Giu Pro 630 Leu Asn Gin Ala Asp 710 Thr Tyr Ile Ser Asp 790 Asn Thr Lys Gin Ser 870 Ala Ser Ser Ala Asn Asn 535 Ser Giu Ser Val Asn 615 Asn Arg Gin Thr Ser 695 Ser Gin Asn Ile Gly 775 Ala Ser Asn Gin Lys 855 Gly Leu Ser Val Lys Ser 520 Ile Asn Ala Ser Ser 600 As n Gly Ile Gin Gin 680 Giy Lys Ala Val1 Thr 760 Asn Ser Ser Gly Gin 840 Val1 Asn Lys Giy Tyr Ile 505 Ser Leu Ala Lys Asn 585 Asn Ile Asn Gin Val1 665 Gin Met Aia Thr Gin 745 Asn Giy Asp Asn Ser 825 Asn Tyr Asn Ser Ser 905 Asn 490 Leu Ser S er Ser Thr 570 Ser Phe Gin Gin Leu 650 Pro Thr Gly Tyr Thr 730 Gin Leu Thr Giy Ser 810 Asn Ala Asn Lys Asn 890 Ser 475 Ser Cys Gly Pro Lys 555 Thr Thr Gin Ala Ser 635 Arg Thr Ser Ser Tyr 715 Gin Ile Lys Ser Lys 795 Gly Giy Thr Asp Gly 875 Ser Ser Leu Lys Asn Asn Gly Leu 525 Thr Asn 540 Leu Lys Asn Phe Val Met Gin Ser 605 Trp Ala 620 Gin Asn Ala Asn Asp 
Met Gly Ser 685 Ser Giy 700 Ser Giy Asn Gly Thr Leu Ser Vai 765 Gin Ile 780 Leu Giy Asn Asn Thr Ser Thr Ala 845 Ala Gin 860 Val Giu Ser Ser Thr Cys Thr Gly 510 Ser Giy Aia Asn Giy 590 Ile Asn Leu Phe Asn 670 Aia Asn Leu Ser Thr 750 Asn Asn Thr Asn Giy 830 Thr Lys Asn Leu Ser 910 480 Ser Giu 495 Ser Gin Ile Ser Thr Thr Met Val 560 Gin Ser 575 Aia Leu Gin Ser Ala Leu Thr Thr 640 Tyr Gin 655 Aia Leu Ser Thr Trp Cys Gin Ser 720 Ser Gly 735 Ser Gly Giy Gly Thr Ala Tyr Asn 800 Gly Tyr 815 Ser Asn Thr Thr Ile Ala Gly Leu 880 Ser Asn 895 Gly Giy Leu Ile Asn Leu Leu Gly Ala Ile Pro Thr Asn Gly Val Ser Asp Thr 915 920 925 WO 97/37044 PCT/US97/05223 844 Asn Asn Leu Ile Asn Leu Leu Thr Glu Phe Ile Lys Thr Ala Gly Phe 930 935 940 Ile Gin Asn Asn Asp Ser Asn Val Ser Thr Ser Leu Thr Ser Ala Phe 945 950 955 960 Gin Ala Ile Thr Ser Ala Ile Ser Gin Gly Phe Gin Ala Leu Gin Asn 965 970 975 Asp Ile Ser Pro Asn Ala Ile Leu Thr Leu Leu Gin Glu Ile Thr Ser 980 985 990 Asn Thr Thr Thr Ile Gin Ser Phe Ser Gin Thr Leu Arg Gin Leu Leu 995 1000 1005 Gly Asp Lys Thr Phe Phe Met Val Gin Gin Lys Leu Ile Asp Ala Met 1010 1015 1020 Ile Asn Ala Arg Asn Gin Val Gin Asn Ala Gin Asn Gin Ala Asn Asn 1025 1030 1035 1040 Tyr Gly Ser Gin Pro Val Leu Ser Gin Tyr Ala Ala Gly Lys Ser Thr 1045 1050 1055 Gin His Gly Met Ser Asn Gly Leu Gly Val Gly Ile Gly Tyr Lys Tyr 1060 1065 1070 Phe Phe Gly Lys Ala Arg Lys Leu Gly Leu Arg His Tyr Phe Phe Phe 1075 1080 1085 Asp Tyr Gly Phe Ser Glu Ile Gly Leu Ala Asn Gin Ser Val Lys Ala 1090 1095 1100 Asn Ile Phe Ala Tyr Gly Val Gly Thr Asp Phe Leu Trp Asn Leu Phe 1105 1110 1115 1120 Arg Arg Thr Tyr Asn Thr Lys Ala Leu Asn Phe Gly Leu Phe Ala Gly 1125 1130 1135 Val Gln Leu Gly Gly Ala Thr Trp Leu Ser Ser Leu Arg Gin Gin Ile 1140 1145 1150 Ile Asp Asn Trp Gly Asn Ala Asn Asp Ile His Ser Thr Asn Phe Gin 1155 1160 1165 Val Ala Leu Asn Phe Gly Val Arg Thr Asn Phe Ala Glu Phe Lys Arg 1170 1175 1180 Phe Ala Lys Lys Phe His Asn Gin Gly Val Ile Ser Gin Lys Ser Val 1185 1190 1195 1200 
Glu Phe Gly Ile Lys Val Pro Leu Ile Asn Gin Ala Tyr Leu Asn Ser 1205 1210 1215 Ala Gly Ala Asp Val Ser Tyr Arg Arg Leu Tyr Thr Phe Tyr Ile Asn 1220 1225 1230 Tyr Ile Met Gly Phe 1235 INFORMATION FOR SEQ ID NO:941: SEQUENCE CHARACTERISTICS: LENGTH: 171 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature WO 97/37044 PCT/US97/05223 845 LOCATION 1...171 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:941: Met Glu Lys Leu Pro Lys Lys Arg Val Ser Lys Thr Lys Ser Gin Lys 1 5 10 Leu Ile His Ser Leu Thr Thr Gin Lys Asn Arg Ala Phe Leu Gln Lys 25 Thr Ser Ala Asn Glu Met Leu Leu Glu Leu Glu Lys Gly Ala Phe Lys 40 Lys Asn Glu Ala Tyr Phe Ile Ser Asp Glu Glu Asp Lys Asn Tyr Val 55 Leu Val Pro Asp Asn Val Ile Ser Leu Leu Ala Glu Asn Ala Arg Lys 70 75 Ala Phe Glu Ala Arg Leu Arg Ala Glu Leu Glu Arg Asp Ile Ile Thr 90 Gin Ala Pro Ile Asp Phe Glu Asp Val Arg Glu Val Ser Leu Gln Leu 100 105 110 Leu Glu Asn Leu Arg Gln Lys Asp Gly Asn Leu Pro Asn Ile Asn Thr 115 120 125 Leu Asn Phe Val Lys Gin Ile Lys Lys Glu His Pro Asn Leu Phe Phe 130 135 140 Asn Phe Asp Asn Met Phe Lys Gin Pro Pro Phe Asn Glu Asn Asn Phe 145 150 155 160 Glu Asn Phe Asp Asn Ser Asp Glu Glu Asn Phe 160 165 170 INFORMATION FOR SEQ ID NO:942: SEQUENCE
CHARACTERISTICS: LENGTH: 435 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...435 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:942: Met Lys Lys Ile Leu Ile Thr Leu Ile Thr Leu Leu Leu Gly Val Phe 1 5 10 Met Gly Leu Gin Ala Ser Ala Leu Thr His Gin Glu Ile Asn Gln Ala 25 Lys Val Pro Val Ile Tyr Glu Glu Asn His Leu Leu Pro Met Gly Phe 40 Ile His Leu Ala Phe Arg Gly Gly Gly Ser Leu Ser Asp Lys Asn Gin 55 Leu Gly Leu Ala Lys Leu Phe Ala Gln Val Leu Asn Glu Gly Thr Lys 70 75 WO 97/37044 PCT/US97/05223 Giu Ser Phe Leu Arg 145 Ala Asn Asp Val Asn 225 Tyr Thr Leu Gly Leu 305 Phe Val Thr Pro Tyr 385 Gin His Lys Leu Leu Leu Lys 130 Met Lys Ala Leu Val 210 Asn Phe Glu Lys Gly 290 Ala Ala Ala Gin Leu 370 Phe Ile Thr Asp SGly SAsn Lys 115 Ser Leu Leu Ala Lys 195 Leu Ala Glu Gin Gin 275 Phe Tyr Ser Leu Gin 355 Arg Tyr Gin Glu Lys 435 Ala Val Val Asp 100 Glu Tyr Pro Asn Ala Gin Thr Leu 165 Leu Gly 180 Gin Gin Gly Gly Leu Asn Ala Ser 245 Ala Phe 260 Asp Leu Gly Ser Ser Val Gly Tyr 325 Val Lys 340 Glu Leu Asn Glu Leu Gly Lys Met 405 Ile Asn 420 Gly Phe Ala Gin Leu Leu Glu Gin Lys Ala Ile Thr Glu Phe Leu 150 Lys Thr Phe Asp Phe 230 Asp Val Ala Arg Tyr 310 Leu Lys Asp Thr Leu 390 Ser Asp Ser Asp Thr 135 Leu Gin Lys Ala Leu 215 Leu Lys Tyr Lys Leu 295 Ile Gin Ile Asp Ile 375 Pro Leu Leu Ala Glu 120 Gin Gin Glu Glu Lys 200 Lys Pro Lys Phe Ser 280 Met Arg Thr Ile Ala 360 Ser Leu Lys Thr Glu 105 Ala Asn Lys Leu Ser 185 Val Val Gin Ser Gly 265 Lys Glu Ser Lys Lys 345 Lys Ser Asp Glu Phe 425 Asp lie Ala Glu Phe 170 Leu Phe Asn Gly Glu 250 Val Val Lys Asn Leu 330 Glu Lys Arg Phe Ile 410 Ala Leu Met Leu Ser 155 Ala Gin Glu Gin Lys 235 Lys Pro Met Ile Phe 315 Ser Phe Phe Leu Asn 395 Asn Ile Gin Arg Glu 140 Asp Asn Lys Leu Thr 220 Ala Val Phe Met Arg 300 Ser Thr Ile Leu Asn 380 Gin Asp Val lie Leu 125 Lys Phe Thr Ile Asn 205 Leu Tyr Leu Lys Phe 285 Val Lys Gin Glu Leu 365 Thr Thr Phe Ser Thr 110 Lys Val Asp Pro Lys 190 Lys Asn Glu Tyr Ile 270 Val Gin Val Ala Lys 350 Gly Thr Leu Ile Asn 430 Leu Glu 
Lys Tyr Leu 175 Leu Leu Arg Glu Lys 255 Lys Leu Glu Ala Lys 335 Gly Ser Tyr Leu Lys 415 Lys Glu Leu Thr Leu 160 Ala Asp Val Leu Pro 240 Asp Asp Gly Gly His 320 Ser Met Glu Asn Asn 400 Glu Lys INFORMATION FOR SEQ ID NO:943: SEQUENCE CHARACTERISTICS: LENGTH: 614 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCTIUS97/05223 847 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1 614 (xi) SEQUENCE DESCRIPTION: SEQ 1D NO:943: Leu Ala Ser Pro Lys Glu Thr Pro Lys Giu Ala Gin Lys Asn Giu Ala 1 Gin Lys Asn Ile Val Lys Giu Phe Leu 145 Lys Asp Lys Met Leu 225 Tyr Thr Arg Lys Ile 305 Asp Giu Met Asn Ser Giu Asp Tyr Ala Lys Asp 130 Glu Vali Sen Arg Gly 210 Giu Leu His Ile Thr 290 Giu Lys Lys Val1 Glu Ile Ile Thr Aia Arg Asp 115 Giu Giy Sen Ile Arg 195 Trp Tyr Asp Asp Ser 275 Leu His Gly Asn His Thr Ser Ala Ala Thr Ile 100 Gly Gin Gin Giu Tyr 180 Val Met Asp Ala Ala 260 Asp Giu Leu Tyr Gly 340 Ile 5 Ser Tyr Lys Val1 Phe Ala Leu Lys Gly Gly 165 Ile Ile Trp Ser His 245 Lys Ile Lys Arg Ala 325 Leu Asn Gin Val1 Ile Leu 70 Giu Gly Lys Leu Tyr 150 Ala Lys Giu Gly Leu 230 Ile Leu Leu Ala Ala 310 Phe Val1 Asp *Ser Gly Arg 55 Aia As n Val1 Ser Giu 135 Tyr Leu Gin Sen Leu 215 Arg Sen His Ile Leu 295 Asp Ala Lys Val1 Asn Leu 40 Val1 Leu Gly Giu Gin 120 His Gly Leu Ser Leu 200 Asn Ile Sen Tyr Giu 280 Lys Ala Val1 Val1 Ile Gin 25 *Sen Gly Phe Ile Ile 105 Met Ala Ser Ile Ile 185 Ser Asp Gin Pro Lys 265 Ile Val1 Gin Val Ile 345 Ile 10 Thr Tyr Asp Asn Leu 90 Lys Gly Lys Val1 Val1 170 Tyr Ala Gly Asp Phe 250 Val1 Asp Lys Ile Lys 330 Tyr Sen Pro Met Met Gin 75 Giu Gly Ile Thn Val 155 Phe Giu Asn Lys Val1 235 Leu Lys Asn Ang Leu 315 Pro Arg Gly Lys Ser Val Gly Phe Tyr Lys Ala 140 Giu Asp Gly Lys Leu 220 Tyr Lys Giu Pro Lys 300 Lys Asp Ile Asn Glu Asp Asp Tyr His Gly Lys 125 Leu Val Vai Ser Gin 205 Arg Met Thr Gly Vai 285 Asp Thr Leu Giu Gin Met Met Sen Phe Phe Thr 110 Gly Lys Arg Asn Asp 
190 Arg Leu Ang Asp Ile 270 Vai Val1 Glu Asp Val 350 Lys Leu Lys Lys Asp Giu Asp Thr Thr Arg 175 Lys Asp Asp Arg Phe 255 Gin Pro Phe Ile Lys 335 Gly Val1 Ala Lys Asp Glu Lys Thr Ala Giu 160 Gly Leu Phe Gin Giy 240 Sen Tyr Leu Asn Ala 320 Asp Asp 355 360 36S Tr e Asp Arg Ile Ile Arg Arg Glu Leu Leu Leu Gly Pro Lys Asp Lys Tyr WO 97/37044 PCT/US97/05223 370 Asn Leu 385 375 Asn Thr Lys Leu Arg Phe Met Phe Val Ala 465 Gly Ile Arg Val Leu 545 Tyr Pro Val Asn Phe Asp Gly Ser 450 Asn Ala Phe Ile Gly 530 Asn Tyr Ala Pro Lys 610 Ser Leu Leu 435 Glu Ile Gly Asp Ser 515 Arg Val Ser Ser Glu 595 Arg Lys Leu 420 Gly Arg Ala Arg Ser 500 Tyr Met Thr Ser Val 580 Ser Tyr Val 405 Va 1 Tyr Asn Thr Met 485 Trp Gin Leu Lys Val 565 Ile Cys Leu 390 Lys Ile Ser Val Gly Ser Leu Phe 455 Gly Gly 470 Phe Ala Tyr Ser Tyr Ile Gly Asn 535 Leu Leu 550 Asn Glu Ile Asn Ser Ser Gly Ser Glu Asn Glu Glu Lys 410 Glu Glu Gly 425 Tyr Gly Gly 440 Gly Thr Gly Gly Arg Ser Gly Asn Leu 490 Ser Thr Ile 505 Gin Gin Gly 520 Arg Thr His Gly Phe Ser Val Ala Ser 570 Arg Leu Ser 585 Pro Gly Ala 600 Ser Leu Arg Arg Leu 395 Arg Arg Leu Gin Tyr 475 Ser Asn Gly Val Ser 555 Pro Gly Ile Val Thr Met Ser 460 Pro Leu Leu Gly Ser 540 Pro Arg Gly Thr Asn Gly Leu 445 Met Gly Thr Tyr Phe 525 Leu Leu Gin Arg Ile 605 Ser Ser 415 Gin Leu 430 Asn Gly Ser Leu Met Pro Asn Pro 495 Ala Asp 510 Gly Val Gly Tyr Tyr Asn Cys Ser 575 Thr Pro 590 Phe Thr Gly 400 Leu Gin Ser Tyr Lys 480 Arg Tyr Asn Asn Arg 560 Thr Leu Arg INFORMATION FOR SEQ ID NO:944: SEQUENCE CHARACTERISTICS: LENGTH: 376 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...376 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:944: Leu Cys Glu Gly Pro Asn Val Met Ser Gly Trp Lys Pro His Ile Lys 1 5 10 His Gly Val Tyr Arg Asn Trp Asn Asn Trp Arg Asn Asn Tyr Thr Ala 25 Val Tyr Leu Ser Asp Arg Ile Glu Ala Trp Asp Gly Arg Phe Phe Ile WO 97/37044 
PCTIUS97/05223 Val Pro Giy Leu Arg Tyr Ala Phe Val Ser Met Gly Pro Gin Lys 145 Ala Val Tyr Asp Val 225 Phe Gly Ser Asn Met 305 Gin Ser Ile Thr Asn Asn Asp Pro His 130 Phe Thr Arg Arg Thr 210 Leu Gin Ile Ala Tyr 290 Thr Val Leu Gly Ala 370 Trr As r His Gin 115 Phe Ser Gly Pro Pro 195 Arg Lys Phe Ser Ala 275 Giu Gin Ser Gin Ser 355 Tyr Met Trp Asn 100 Leu Asp Phe Gin Ile 180 Ile Val1 Gly Ile Ser 260 Gly Ser His Gin Ile 340 Ser Leu Gin Met Val1 Asp Thr Asn Tyr 165 Asn Arg Thr Thr Phe 245 Tyr Gly Val1 Giu Ile 325 Asn Pro Asn Ile 70 Pro Leu Val Val1 Ala 150 Ser Gly Gly Ser Ser 230 Asp Phe Tyr Leu Gly 310 Phe Aksf kia I'yr 55 Pro Glu Ser Thr Thr Tyr Leu Ser 120 Giu Ala 135 Asp Tyr Val Tyr Tyr Ser Leu Gin 200 His Gly 215 Tyr Asn Ala Arg Tyr Ser Tyr Gly 280 Asn Ser 295 Leu Leu Trp Glu Ile Phe Gly Leu 360 Thr Phe Lys Asn Phe 105 Tyr Gly Phe Thr Gin 185 Phe Pro Lys Tyr Arg 265 Met Gly Pro Asn Asn 345 Gin Gin Asp Ile 90 Asn Gly Ala Arg Ser 170 Gly His Leu His Asn 250 Ala Gin Tyr Trp Gly 330 Met Pro Tyr Leu 75 Gly Tyr Gly Arg Ile 155 Gly Val1 Ala Thr Phe 235 Trp Tyr Tyr Gin Tyr 315 Arg Lys Aia Asn Arg Phe Gin Ala Tyr 140 Trp Pro Glu Ala Asp 220 Pro Arg Ser Tyr Cys 300 Trp His Tyr Pro Asn Lys Ile Arg Giu 125 Thr Ala Met Leu Phe 205 Leu Phe Lys Gly Ser 285 Glu Val Arg Tyr Gly 365 Giu Ile Pro Ser 110 Tyr Tyr Arg Lys Giu 190 Asn Asn Val1 Thr Ile 270 Gly Ala rrp Val1 Phe 350 %rg Asn Lys Val1 Phe Phe Lys Asp Giy 175 Leu Tyr Gly Ser Thr 255 Ser Gly Trp Asn Thr 335 Thr Ser Ala His Gin Val1 Thr Asp Phe 160 As n Tyr Ile Asp Pro 240 Ile Asn Asn Cys Ile 320 ?31y Gly Val1 INFORMATION FOR SEQ ID NO:945: SEQUENCE CHARACTERISTICS: LENGTH: 621 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori WO 97/37044 PCT/US97/05223 850 (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...621 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:945: Leu His Ala Glu Asp Asn Gly Phe Phe Val Ser Ala Gly Tyr Gin Ile 1 5 10 Gly Glu Ala Val Gin Met Val Lys Asn Thr Gly Glu Leu Lys Asn Leu 25 Asn Asp Lys Tyr Glu Gin Leu Ser Gin Ser Leu Ala Gin Leu Ala Ser 40 Leu Lys Lys Ser Ile Gin Thr Ala Asn Asn Ile Gin Ala Val Asn Asn 55 Ala Leu Ser Asp Leu Lys Ser Phe Ala Ser Asn Asn His Thr Asn Lys 70 75 Glu Thr Ser Pro Ile Tyr Asn Thr Ala Gin Ala Val Ile Thr Ser Val 90 Leu Ala Phe Trp Ser Leu Tyr Ala Gly Asn Ala Leu Ser Phe His Val 100 105 110 Thr Gly Leu Asn Asp Gly Ser Asn Ser Pro Leu Gly Arg Ile His Arg 115 120 125 Asp Gly Asn Cys Thr Gly Leu Gin Gin Cys Phe Met Ser Lys Glu Thr 130 135 140 Tyr Asp Lys Met Lys Thr Leu Ala Glu Asn Leu Gin Lys Ala Gin Gly 145 150 155 160 Asn Leu Cys Ala Leu Ser Glu Cys Ser Ser Asn Gin Ser Asn Gly Gly 165 170 175 Lys Thr Ser Met Thr Thr Ala Leu Gin Thr Ala Gin Gin Leu Met Asp 180 185 190 Leu Ile Glu Gin Thr Lys Val Ser Met Val Trp Lys Asn Ile Val Ile 195 200 205 Ala Gly Val Thr Asn Lys Pro Asn Gly Ala Gly Ala Ile Thr Ser Thr 210 215 220 Gly His Val Thr Asp Tyr Ala Val Phe Asn Asn Ile Lys Ala Met Leu 225 230 235 240 Pro Ile Leu Gin Gin Ala Leu Thr Leu Ser Gin Ser Asn His Thr Leu 245 250 255 Ser Thr Gin Leu Gin Ala Arg Ala Met Gly Ser Gin Thr Asn Arg Glu 260 265 270 Phe Ala Lys Asp Ile Tyr Ala Leu Ala Gin Asn Gin Lys Gin Ile Leu 275 280 285 Ser Asn Ala Ser Ser Ile Phe Asn Leu Phe Asn Ser Ile Pro Lys Asp 290 295 300 Gin Leu Lys Tyr Leu Glu Asn Ala Tyr Leu Lys Val Pro His Leu Gly 305 310 315 320 Lys Thr Pro Thr Asn Pro Tyr Arg Gln Asn Val Asn Leu Asn Lys Glu 325 330 335 Ile Asn Ala Val Gin Asp Asn Val Ala Asn Tyr Gly Asn Arg Leu Asp 340 345 350 Ser Ala Leu Ser Val Ala Lys Asp Val Tyr Asn Leu Lys Ser Asn Gin 355 360 365 Thr Glu Ile Val Thr Thr Tyr Asn Asp Ala Lys Asn Leu Ser Glu Glu 370 375 380 Ile Ser Lys Leu Pro Tyr Asn Gin 
Val Asn Val Thr Asn Ile Val Met 385 390 395 400 WO 97/37044 PCTIUS97/05223 Ser Pro Lys Asp Gin Ser Asn Leu 420 Lys Lys Val Gly 435 Gly Leu Gly Val 450 Arg Trp Gly Leu 465 Ile Lys Ser Ser Gly Gly Ser Asp 500 Lys Asn Asn Lys 515 Gly Thr Thr Trp 530 Asn Pro Tyr Ser 545 Asn Leu Gly Leu Glu Arg Ser Ala 580 Ile Asn Thr Asn 595 Arg Leu Tyr Ser 610
INFORMATION
Ser 405 Asn Met Gin Arg Phe 485 Leu Leu Leu Ala Arg 565 Gin Tyr Val Thr Gin Ile Val Tyr 470 Phe Leu Ser Asn Lys 550 Thr His Tyr Tyr Ala Ala Ser Gly 455 Tyr Asn Val Val Ser 535 Val Asn Gly Ser Leu 615 Gly Leu Ser 440 Tyr Gly Ser Asn Gly 520 Gin Asn Leu Val Phe 600 Asn Gin Ala 425 Gin Lys Phe Ser Phe 505 Leu Tyr Ala Ala Glu 585 Leu Tyr Tyr Gin Ile Asn Pro Glu Gin 410 Ala Asn Gin Phe Ser 490 Ile Phe Met Ser Thr 570 Leu Gly Met Asn Phe Asp 475 Asp Asn Gly Asn Asn 555 Ala Gly Thr 415 Ser Asn Phe 460 Tyr Ile Asp Gly Leu 540 Phe Lys Ile Lys Asn Asn 430 Gly Ala 445 Gly Glu Asn His Trp Thr Ser Ile 510 Ile Gin 525 Thr Ala Gin Phe Lys Lys Lys Ile 590 Leu Glu 605 Pro Leu Ser Gly Tyr 495 Thr Leu Phe Leu Asp 575 Pro Tyr Phe Asn Lys Tyr 480 Gly Arg Ala Asn Phe 560 Ser Thr Arg Val Phe Ala Tyr 620 FOR SEQ ID NO:946: SEQUENCE CHARACTERISTICS: LENGTH: 163 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...163 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:946: Met Arg Glu Ile Leu Thr Asn Arg Phe Phe Pro Ser Leu Phe Lys 5 10 Arg Leu Asp Phe Ser Asn Arg Val Val Leu Gly Leu Gly Ser Asn 25 Lys Asn Pro Leu Lys Ile Leu Lys Ser Cys Phe Leu Tyr Phe Lys 40 His Ser Lys Ile Gly Lys Ile Phe Ser Ser Pro Ile Tyr Ile Asn 55 Met 1 Lys Leu Asn WO 97/37044 PCTIUS97/05223 852 Pro Pro Phe Gly Tyr Thr Asn Gin Pro Asn Phe Tyr Asn Ala Thr Ile 70 75 Ile Leu Lys Thr Ser Leu Gly Leu Arg His Phe Phe Ala Leu Val Phe 90 Tyr Ile Glu Arg Arg Phe Gly Arg Ala Arg Lys Arg Asp Phe Lys Asp 100 105 110 Ala Pro Arg Thr Leu Asp Ile Asp Ile Ile Ala Phe Asn Gin Val Ile 115 120 125 Leu Arg Gin Asn Asp Leu Thr Leu Pro His Pro Lys Trp Ser Glu Arg 130 135 140 Asp Ser Val Leu Val Pro Leu Thr Leu Gin Gin Ile Leu Phe Lys Arg 145 150 155 160 Glu Glu Trp INFORMATION FOR SEQ ID N0:947: SEQUENCE CHARACTERISTICS: LENGTH: 285 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein 
(iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...285 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:947: Met Gin Asp Phe Ile Lys Ile Phe Ile Gin Glu Val Val Ser Thr Leu 1 5 10 Glu Gly Leu Val Gly Lys Ala Pro Ser Val Gly Leu Glu Lys Glu Val 25 Ser Asn Asn Glu Glu Ala Ser Leu Ile Ser Thr Pro Tyr Ala Arg Val 40 Lys Ile Ser Ala Ile Glu Lys Asn Glu Ser Pro Ile Glu Leu Leu Ala 55 Pro Val Asp Leu Val Thr Ala Leu Ser Asp Leu Met Leu Gly Gly Glu 70 75 Gly Ala Ser Lys Glu Glu Met Asp Asn Asp Asp Leu Asp Ala Phe Lys 90 Glu Met Ala Ser Asn Ile Phe Gly Ala Ile Ala Thr Ser Leu Lys Ser 100 105 110 Gin Glu Leu Leu Pro Lys Leu Asn Phe Thr Thr Thr Asn Ala Glu Ile 115 120 125 Ala Lys Glu Leu Pro Lys Lys Glu Asp Tyr Ala Lys Ala Met Val Phe 130 135 140 Ser Phe Lys Met Glu Ala Ile Lys Glu Ser Gin Ile Val Leu Leu Ile 145 150 155 160 Thr Ser Ala Phe Glu Gly Gin Phe Glu Lys Thr His Lys Glu Glu Lys 165 170 175 WO 97/37044 PCT/US97/052 2 3 853 Glu Glu Thr Thr Lys Ser Ala Thr Glu Glu Thr Lys Thr His Asp Ala 180 185 190 Ser Leu Glu Asn Iie Glu Ile Arg Asn Ile Ser Met Leu Leu Asp Val 195 200 205 Lys Leu Asn Val Lys Val Arg Ile Gly Gin Lys Lys Met Ile Leu Lys 210 215 220 Asp Val Val Ser Met Asp Ile Gly Ser Val Val Glu Leu Asp Gin Leu 225 230 235 240 Val Asn Asp Pro Leu Glu Ile Leu Val Asp Asp Lys Val Ile Ala Lys 245 250 255 Gly Glu Val Val Ile Val Asp Gly Asn Phe Gly Ile Gin Ile Thr Asp 260 265 270 Ile Gly Thr Lys Lys Glu Arg Leu Glu Gin Leu Lys Asn 275 280 285 INFORMATION FOR SEQ ID NO:948: SEQUENCE CHARACTERISTICS: LENGTH: 170 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...170 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:948: Met Lys Ile Leu Val Ile Gin Gly Pro Asn Leu Asn Met Leu Gly His 1 5 10 Arg Asp Pro Arg Leu Tyr Gly Met Val Thr Leu Asp Gin Ile His Glu 25 Ile Met Gin Thr Phe Val Lys Gin Gly Asn Leu Asp Val Glu Leu Glu 40 Phe Phe Gin Thr Asn Phe Glu Gly Glu Ile Ile Asp Lys Ile Gin Glu 55 Ser Val Gly Ser Asp Tyr Glu Gly Ile Ile Ile Asn Pro Gly Ala Phe 70 75 Ser His Thr Ser Ile Ala Ile Ala Asp Ala Ile Met Leu Ala Gly Lys 90 Pro Val Ile Glu Val His Leu Thr Asn Ile Gin Ala Arg Glu Glu Phe 100 105 110 Arg Lys Asn Ser Tyr Thr Gly Ala Ala Cys Gly Gly Val Ile Met Gly 115 120 125 Phe Gly Pro Leu Gly Tyr Asn Met Ala Leu Met Ala Met Val Asn Ile 130 135 140 Leu Ala Glu Met Lys Ala Phe Gin Glu Ala Gin Lys Asn Asn Pro Asn 145 150 155 160 Asn Pro Asn Asn Pro Ile Asn Asn Gin Lys 165 170 WO 97/37044 PCT/US97/05223 854 INFORMATION FOR SEQ ID NO:949: SEQUENCE CHARACTERISTICS: LENGTH: 357 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:949: Met Arg Gly Leu Glu Arg Glu Ser His Phe Thr Leu Asn Glu Asn Ala 1 5 10 Met Phe Phe Glu Cys Ala Tyr Ser Cys Asp Asn Ala Leu Phe Leu Gin 25 Leu Asp Asp Arg Ser Phe Phe Ile Thr Asp Ser Arg Tyr Thr Gin Glu 40 Ala Lys Glu Ser Leu Gin Pro Lys Lys Gly Val Leu Val Glu Val Ile 55 Glu Ser Ser Asp Leu Val Gin Ser Ala Ile Asp Leu Ile Thr Lys Ser 70 75 Ser Val Lys Lys Leu Phe Phe Asp Pro Asn Gin Val Asn Leu Gln Thr 90 Tyr Lys Arg Leu Asp Ser Ala Val Gly Asn Lys Val Val Leu Glu Gly 100 105 110 Val Pro Ser Tyr His Arg Gln Lys Arg Ile Ile Lys Asn Glu His Glu 115 120 125 Ile Gin Leu Leu Lys Lys Ser Gin Ala Leu Asn Val Glu Ala Phe Glu 130 135 140 Asn Phe Ala Glu Tyr Val Lys Lys Ile Phe Asp Glu Lys Glu Ser Leu 145 150 155 160 Ser Glu Arg Tyr Leu Gin 
His Lys Val Lys Asp Phe Leu Thr Lys Glu 165 170 175 Gly Val Tyr Asp Leu Ser Phe Glu Pro Ile Leu Ala Leu Asn Ala Asn 180 185 190 Ala Ser Lys Pro His Ala Leu Pro Ser Ala Lys Asp Phe Leu Lys Ala 195 200 205 Asp His Ser Ile Leu Leu Asp Met Gly Ile Lys Tyr Glu Arg Tyr Cys 210 215 220 Ser Asp Arg Thr Arg Thr Ala Phe Phe Asp Pro Lys Asp Phe Val Phe 225 230 235 240 Lys Arg Glu Gin Ser Phe Lys Asp Lys Glu Arg Gin Lys Ile Tyr Asp 245 250 255 Ile Val Lys Glu Ala Gin Glu Lys Ala Ile Ser Gly Ile Arg Ala Gly 260 265 270 Met Thr Gly Lys Glu Ala Asp Ser Leu Ala Arg Gly Val Ile Ser Asp 275 280 285 WO 97/37044 PCT/US97/05223 855 Tyr Gly Tyr Gly Gln Tyr Phe Thr His Ser Thr Gly His Gly Ile 290 295 300 Leu Asp Ile His Glu Leu Pro Tyr Ile Ser Ser Arg Ser Glu Thr 305. 310 315 Leu Glu Glu Gly Met Val Phe Ser Val Glu Pro Gly Ile Tyr Ile 325 330 335 Gly Phe Phe Gly Val Arg Ile Glu Asp Leu Val Val Ile Lys Asn 340 345 350 Arg Ala Glu Leu Leu 355 INFORMATION FOR SEQ ID NO:950: SEQUENCE CHARACTERISTICS: LENGTH: 385 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...385 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:950: Asp Ile 320 Pro Ser Met Lys Asp Ser Phe Leu Phe Thr Ser Glu Ser Val 1 Pro Ile Asn Pro Asp Ile Gly Lys Leu 145 Leu Lys Glu Asp Glu Gly Met Al-a Gly Glu Glu 130 Ala Arg Pro Val Lys Arg Phe Gin Leu Glu Ile 115 Thr Phe Pro Val Ser 195 Met Asp Cys Glu Tyr Gin 100 Gly Glu Ala Asp Ser 180 Gin 5 Ala Lys Met Ile Gly Ser Ala Thr Leu Gly 165 Val Lys Asp Lys Ile Ala 70 Phe Pro Gly Leu Ala 150 Lys Asp His Gin Ala Thr 55 Arg Asp Asp Asp Met 135 Gin Ser Thr Leu Ile Lys 40 Gly Glu Tyr Ile Gin 120 Pro Lys Gin Ile Lys 200 Ser Val Glu Val Arg Asn 105 Gly Leu Arg Val Val 185 Glu 10 Asp Ala Leu Val Ser 90 Gin Leu Pro Lys Ser 170 Ile Ala Ala Cys Lys Lys 75 Ala Gly Met Ile Asp 155 Val Ser Val Val Glu Thr Lys Ala Val Phe His 140 Asn Arg Thr Ile Thr Leu Thr Ser Ile 
Val Asp Gly 125 Leu Thr Tyr Gin Glu 205 Gly Tyr Val Tyr Tyr Asn Glu Ala His Pro Asn 175 Ser Ile His Ile Ser Ala Thr Gly Asp Cys Gin Phe 160 Asn Pro Val WO 97/37044 PCT/US97/05223 856 Tyr Lys Val Leu Pro Lys Glu Tyr Leu His Asp Asn Ile Lys Phe Phe 210 215 220 Ile Asn Pro Thr Gly Lys Phe Val Ile Gly Gly Pro Gin Gly Asp Ala 225 230 235 240 Gly Leu Thr Gly Arg Lys Ile Ile Val Asp Thr Tyr Gly Gly Phe Cys 245 250 255 Pro His Gly Gly Gly Ala Phe Ser Gly Lys Asp Pro Ser Lys Val Asp 260 265 270 Arg Ser Ala Ala Tyr Ala Ala Arg Tyr Val Ala Lys Asn Leu Val Ala 275 280 285 Ser Gly Val Cys Asp Lys Ala Thr Val Gin Leu Ala Tyr Ala Ile Gly 290 295 300 Val Ile Glu Pro Val Ser Ile Tyr Val Asn Thr His Asn Thr Ser Lys 305 310 315 320 His Ser Ser Ala Glu Leu Glu Lys Cys Val Lys Ser Val Phe Lys Leu 325 330 335 Thr Pro Lys Gly Ile Ile Glu Ser Leu Asp Leu Leu Arg Pro Ile Tyr 340 345 350 Ser Leu Thr Ser Ala Tyr Gly His Phe Gly Arg Glu Leu Glu Glu Phe 355 360 365 Thr Trp Glu Lys Thr Asn Lys Val Glu Glu Ile Lys Ala Phe Phe Lys 370 375 380 Arg 385 INFORMATION FOR SEQ ID NO:951: SEQUENCE CHARACTERISTICS: LENGTH: 218 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...218 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:951: Val Leu Asp Ser Phe Glu Ile Leu Lys Ala Leu Lys Ser Leu Asp Leu 1 5 10 Leu Lys Asn Ala Pro Ala Trp Trp Trp Pro Asn Ala Leu Lys Phe Glu 25 Ala Leu Leu Gly Ala Val Leu Thr Gin Asn Thr Lys Phe Glu Ala Val 40 Leu Lys Ser Leu Glu Asn Leu Lys Asn Ala Phe Ile Leu Glu Asn Asp 55 Asp Glu Ile Asn Leu Lys Lys Ile Ala Tyr Ile Glu Phe Ser Lys Leu 70 75 Ala Glu Cys Val Arg Pro Ser Gly Phe Tyr Asn Gin Lys Ala Lys Arg 90 WO 97/37044 PCT/US97/05223 857 Leu Ile Asp Leu Ser Lys Asn Ile Leu Lys Asp Phe Gin Ser Phe Glu 100 105 110 Asn Phe Lys Gin Glu Val Thr Lys Glu Trp Leu Leu Asp Gin Lys Gly 115 120 125 Ile Gly Lys Glu Ser Ala Asp Ala Ile Leu Cys Tyr Val Cys 
Ala Lys 130 135 140 Glu Val Met Val Val Asp Lys Tyr Ser Tyr Leu Phe Leu Lys Lys Leu 145 150 155 160 Gly Ile Glu Ile Glu Asp Tyr Asp Glu Leu Gln His Phe Phe Glu Lys 165 170 175 Gly Val Gln Glu Asn Leu Asn Ser Ala Leu Ala Leu Tyr Glu Asn Thr 180 185 190 Ile Ser Leu Ala Gln Leu Tyr Ala Arg Phe His Gly Lys Ile Val Glu 195 200 205 Phe Ser Lys Gln Lys Leu Glu Leu Lys Leu 210 215 INFORMATION FOR SEQ ID NO:952: SEQUENCE CHARACTERISTICS: LENGTH: 132 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...132 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:952: Met Lys Gin Leu Phe Leu Ile Ile Gly Ala Pro Gly Ser Gly Lys Thr 1 5 10 Thr Asp Ala Glu Leu Ile Ala Lys Asn Asn Ser Glu Thr Ile Ala His 25 Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu Ser Ala Lys Lys Thr Glu 40 Arg Gly Leu Leu Ile Glu Lys Phe Thr Ser Gin Gly Glu Leu Val Pro 55 Leu Glu Ile Val Val Glu Thr Ile Leu Ser Ala Ile Lys Ser Ser Ser 70 75 Lys Gly Ile Ile Leu Ile Asp Gly Tyr Pro Arg Ser Val Glu Gin Met 90 Gin Ala Leu Asp Lys Glu Leu Asn Ala Pro Asn Glu Val Ile Leu Lys 100 105 110 Ser Val Ile Glu Val Glu Val Ser Glu Asn Thr Ala Lys Glu Arg Val 115 120 125 Leu Gly Gin Phe 130 INFORMATION FOR SEQ ID NO:953: WO 97/37044 PCT/US97/05223 858 SEQUENCE CHARACTERISTICS: LENGTH: 153 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...153 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:953: Met His Tyr Ser Tyr Glu Ala Phe Leu Lys Asp Ser Leu Glu Leu Ala 1 5 10 Lys Gin Val Glu Arg Leu Cys Gly Ile Pro Glu Ala Leu Val Cys Val 25 Met Arg Gly Gly Met Thr Leu Val His Phe Leu Ser Leu His Trp Asn 40 Leu Arg Glu Val Tyr Gly Ile Asn Ala Ile Ser Tyr Asp Thr Thr Lys 55 Arg Gin Asn Ala Leu Lys Ile Glu Asn Thr Pro Thr Ile Lys Asp His 70 75 Leu Lys Thr Ile Leu Val Val Asp Glu Ile Val Asp Ser Gly Asn Ser 90 Leu Glu Ala Val Leu Lys Val Leu Gln Asp Lys His Pro Asp Lys Lys 100 105 110 Phe Tyr Ser Ala Ser Leu Phe Gin Lys Thr Ser Ala Lys Tyr Lys Ala 115 120 125 Asp Ala Phe Leu Lys Asp Ala Pro Glu Trp Ile Asp Phe Phe Trp Glu 130 135 140 Val Asp Leu Lys Asn Leu Lys Ser His 145 150 INFORMATION FOR SEQ ID NO:954: SEQUENCE CHARACTERISTICS: LENGTH: 235 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: 
misc_feature LOCATION 1...235 WO 97/37044 PCTIUS97/05223 859 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:954: Leu Asn Gly Gly Asn Leu Giu Asp Gin Arg 145 Gly Phe Ile Ile Asn 225 (2) Ser Giu Pro Ile Asp Arg Phe Thr Arg Ile Arg Trp Leu Asp Gly Gin Arg Gin Ala Cys Asn 130 Leu Asp Lys Giu Ala 210 Giu *Phe *Val Ile Gin Asp Phe Met 115 Phe Asn Lys Gly Leu 195 Ser Arg Glu *Gly Thr Sle Leu Leu 100 Asp Ala Pro Phe Asp 180 Gly Glu Leu Lys Gly Ile Gly Tyr Asn Asp Tyr Lys Gly 165 Phe Ser Val1 Arg Ile Phe Ile Ser 70 Lys Ser Leu Gly His 150 Arg Lys Phe Val Arg 230 Arg Ala Asp 55 Giu Gly Phe Pro Lys 135 Ile Lys Val1 Asn Gin 215 krg Gin Leu 40 Lys Arg Ile Asn Ile 120 Phe Gin Phe Val Ala 200 Asp Ile Gin 25 Asp Asp Ile Gin Phe 105 Lys Ile Val1 Arg Phe 185 Vai Ile Giy i0 Arg Ser Met Gly Ala 90 Arg Thr Ser Gly Asp 170 Ser Thr Ile Arg Val1 Leu Phe Giu 75 Leu Asp Ser Ser Ser 155 Phe Pro Ala Lys Phe 235 Leu Tyr Asp Ser Asn Tyr Leu Met 140 Val Leu Giu Ser Asn 220 Ile Arg Val1 Lys Cys Asp Ala 125 Giy Trp Lys Val Phe 205 Glu Cys Val Thr Val1 Arg Tyr 110 Ile Ser Giu Lys Pro 190 Gly Arg Phe Gl1y Gly Asn Leu Ile Ile Lys Ala Ser Arg 175 His Leu Gin Lys Vali Ile Gin Val1 Asp Leu Cys Lys Tyr 160 Arg Cys Gin Trp INFORMATION FOR SEQ ID NO:955: SEQUENCE
CHARACTERISTICS: LENGTH: 439 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...439 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:955: Met Gln Val Lys Glu Asn Lys Gln Leu Cys Leu Ile Ser Leu Gly Cys 1 5 10 Ser Lys Asn Leu Val Asp Ser Glu Val Met Leu Gly Lys Leu Tyr Asn Tyr Thr Leu Thr Cys Gly Leu Asn Ala Thr 25 Asn Asp Ala Lys Lys Ala Asp Val Ile Leu Ile Asn Gly Glu Leu Glu Val 145 Pro Lys Ile Gly Ser 225 Ile Pro Ser Lys C Asn C 305 Phe A Ala T Lys A Ala L 3 Glu T 385 Gly G Gly H Ala L' Cy Va Il Hi 13 Ly Se Gl Al Le 21( Al.
Ile 1u Ser Glu ;lu ksp yr .la eu 70 yr lu is ys 'S Leu il Asp e Ala 115 s Tyr 0 s Ile r Phe u Val a Gin 195 u Ile 0 i Arg I Ala Gin Gin 275 Ser I Ser C Arg I Ser L 3 Leu A 355 Leu A Phe T Ile L Tyr T 4 Val L 435
I
c
G
I
I
H
2 e h 2 e Phe Ile Glu Ala Lys Asp 70 Ser Glu Arg Ile Phe Thr 100 Lys Lys Gin Asn Ala Arg Ser Glu Gly 150 Lys Gly Lys 165 ;lu Asp Leu sp Ser Ser 1n Leu Ile le Leu Tyr 230 le Glu Asp 245 is Ile Ser 60 la His His I he Ile Arg S 2 lu Phe Glu G 310 eu Asn Ile P 325 eu Glu Lys V 10 sn Lys Ile A sn Lys Pro I 3 rr Lys Ala A: 390 !u Ile Asn A! 405 ir Ile Val P: 0 u Ser Pro PI Ser 55 Lys Tyr Gly Asn Ile 135 Cys Leu Ala Ser Arg 215 Leu Ser I Asp S Jeu L 2 er T 95 lu L he A al P la L 3 le L 75 rg A sp S ro St he 4C Al Ly Ly Va G1 12 Ii, Asi Gl Leu PhE 20( Ala ryr Pro Ser ,ys 80 'hr eu la ro eu 60 ys sp er er 's Glu G: s Asp G; 9 1 Gly A 105 n Phe SE 0 e Thr G1 n Gin Ly n Ser Ar 17 u Lys G1 185 Leu Ty Ile As Pro Se: Ile Ph 25( Met Lei 265 Leu Lei Ile liE Ser Ala Phe Ser 330 Lys Lys 345 Lys His Ala Leu Leu Arg Glu Leu 410 a 1
I
r r e 0 p r e u u e y Al 75 u Il p Ty r G1 y Se s Cys 155 SGlu Tyr Asp SLys Ser 235 Gin Lys Asn Val Phe 315 Ala Ile Gin Val Trp 395 Thr a Ile e Lys r Asp u Gin r Ser 140 Ser SLeu Lys Lys Gin 220 Thr Asn Lys b Ala Gly H 300 Leu A Glu G Ile A Asn H 3 Glu H 380 Ala P Thr P: Le Gl Ly Va: 12! Va: PhE Asr Asp Gly 205 Gln Thr Tyr Met let is Lsp lu sn is is ro ro Lys Gin Glu Ser Ile Gin Thr Ile u Il u Le s Ii 11 1 Ph 1 Hi e Cy Se: SMet 19( SGlr Ala Leu Phe Arg 270 Lys Pro Glu Asn Ala 350 Ser Lys Glu Leu e Ala u Ile e Asp 0 e Leu s Ala s Ala r Ile 175 t Thr 0 n Lys Leu SGlu Asp 255 Arg I Gin X Glu C Phe A 3 Thr H 335 Arg I Phe L Glu G Val A 4 Lys P: Ser Pro Ile Ser Tyr Ile 160 Leu Phe Asp Lys Leu 240 Mlet Asn al ;lu Lrg [is le ys ly sp 00 ro Ala 425 Phe Lys Asp Asn 415 Ile Leu Leu 430 INFORMATION FOR SEQ ID NO:956: SEQUENCE CHARACTERISTICS: WO 97/37044 PCT/US97/05223 861 LENGTH: 667 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...667 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:956: Lys Lys Pro Phe Tyr Ser Leu Ser Leu Ala Ser Ser Leu Leu Asn Ala 1 5 10 Glu Asp Asn Gly Phe Phe Ile Ser Ala Gly Tyr Gin Ile Gly Glu Ala 25 Ala Gin Met Val Lys Asn Thr Gly Glu Leu Lys Lys Leu Ser Asp Thr 40 Tyr Glu Asn Leu Ser Asn Leu Leu Thr Asn Phe Asn Asn Leu Asn Gln 55 Ala Val Thr Asn Ala Ser Ser Pro Ser Glu Ile Asn Ala Ala Ile Asp 70 75 Asn Leu Lys Ala Asn Thr Gin Gly Leu Ile Gly Glu Lys Thr Asn Ser 90 Pro Ala Tyr Gln Ala Val Tyr Leu Ala Leu Asn Ala Ala Val Gly Leu 100 105 110 Trp Asn Val Ile Ala Tyr Asn Val Gln Cys Gly Pro Gly Asn Ser Gly 115 120 125 Gin Gin Ser Val Thr Phe Glu Gly Gin Pro Gly His Asn Ser Ser Ser 130 135 140 Ile Asn Cys Asn Leu Thr Gly Tyr Asn Asn Gly Val Ser Gly Pro Leu 145 150 155 160 Ser Ile Glu Asn Phe Lys Lys Leu Asn Gin Ala Tyr Gin Thr Ile Gin 165 170 175 Gin Ala Leu Lys Gin Asp Ser Gly Phe Pro Val Leu Asp Ser Ala Gly 180 185 
190 Lys Gin Val Thr Ile Thr Ile Thr Thr Gin Thr Asn Gly Ala Asn Lys 195 200 205 Ser Glu Thr Thr Thr Thr Thr Thr Thr Thr Asn Asp Ala Gin Thr Leu 210 215 220 Leu Gin Glu Ala Ser Lys Met Ile Ser Val Leu Thr Thr Asn Cys Pro 225 230 235 240 Trp Val Asn His Asn Gin Gly Gin Asn Gly Gly Ala Pro Trp Gly Leu 245 250 255 Asp Thr Ala Gly Asn Val Cys Gin Val Phe Ala Thr Glu Phe Ser Ala 260 265 270 Val Thr Ser Met Ile Lys Asn Ala Gin Glu Ile Val Thr Gin Ala Gin 275 280 285 Ser Leu Asn Gin Gin Asn Asn Gin Asn Ala Pro Gin Asp Phe Asn Pro 290 295 300 Tyr Thr Ser Ala Asp Arg Ala Phe Ala Gin Asn Met Leu Asn His Ala 305 310 315 320 WO 97/37044 PCT/US97/05223 862 Gin Ala Gin Ala Lys Ile Leu Glu Leu Ala Asp Gin Met Lys Lys Asp 325 330 335 Leu Asn Thr Ile Pro Ser Gin Phe Ile Thr Asn Tyr Leu Ala Ala Cys 340 345 350 His Asn Gly Gly Gly Thr Leu Pro Asp Ala Gly Val Thr Asn Asn Thr 355 360 365 Trp Gly Ala Gly Cys Ala Tyr Val Glu Glu Thr Ile Thr Ala Leu Asn 370 375 380 S Asn Ser Leu Ala His Phe Gly Thr Gin Ala Glu Gin Ile Lys Gin Ser 385 390 395 400 Glu Leu Leu Ala Arg Thr Ile Leu Asp Phe Arg Gly Ser Leu Ser Asn 405 410 415 Leu Asn Asn Thr Tyr Asn Ser Ile Thr Thr Thr Ala Ser Asn Thr Pro 420 425 430 Asn Ser Pro Phe Leu Lys Asn Leu Ile Ser Gin Ser Thr Asn Pro Asn 435 440 445 Asn Pro Gly Gly Leu Gin Ala Val Tyr Gin Val Asn Gin Ser Ala Tyr 450 455 460 Ser Gin Leu Leu Ser Ala Thr Gin Glu Leu Gly His Asn Pro Phe Arg 465 470 475 480 Arg Val Gly Leu Ile Ser Ser Gin Thr Asn Asn Gly Ala Met Asn Gly 485 490 495 Ile Gly Val Gin Val Gly Tyr Lys Gin Phe Phe Gly Glu Lys Arg Arg 500 505 510 Trp Gly Leu Arg Tyr Tyr Gly Phe Phe Asp Tyr Asn His Ala Tyr Ile 515 520 525 Lys Ser Ser Phe Phe Asn Ser Ala Ser Asp Val Phe Thr Tyr Gly Val 530 535 540 Gly Thr Asp Val Leu Tyr Asn Phe Ile Asn Asp Lys Thr Thr Lys Asn 545 550 555 560 Ser Lys Ile Ser Phe Gly Val Phe Gly Gly Ile Ala Leu Ala Gly Thr 565 570 575 Ser Trp Leu Asn Ser Gin Tyr Val Asn Leu Ala Thr Phe Asn Asn Phe 580 585 590 Tyr Ser Ala Lys Met Asn Val Ala Asn Phe 
Gln Phe Leu Phe Asn Leu 595 600 605 Gly Leu Arg Met Asn Leu Ala Lys Asn Lys Lys Lys Ala Ser Asp His 610 615 620 Ala Ala Gln His Gly Val Glu Leu Gly Val Lys Ile Pro Thr Ile Asn 625 630 635 640 Thr Asn Tyr Tyr Ser Leu Leu Gly Thr Gln Leu Gln Tyr Arg Arg Leu 645 650 655 Tyr Ser Val Tyr Leu Asn Tyr Val Phe Ala Tyr 660 665 INFORMATION FOR SEQ ID NO:957: SEQUENCE CHARACTERISTICS: LENGTH: 87 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...87 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:957: Met Phe Ile Leu Phe Lys Phe Gly Arg Val Leu Gly Lys Ala Tyr Ser 1 5 10 Leu Tyr Leu Tyr Ile Tyr Glu Ser Leu Ile Cys Gln Ala Phe Gly Leu 25 Ser Leu Ser Cys Asn Asn Ser Met Leu Phe Ser Thr Phe Leu Ile Asn 40 Leu Pro Leu Pro His Asn Glu Ser Leu Cys Cys Cys Arg Asp Ile Leu 55 Ala Tyr Ser Asn Ser Ser Ser Leu Lys Thr Tyr Ser Leu Glu Ser Asn 70 75 Phe Ser Phe Asn Ser Leu Phe INFORMATION FOR SEQ ID NO:958: SEQUENCE
CHARACTERISTICS: LENGTH: 134 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...134 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:958: Met Phe Met Ala Ser Ile Arg Thr Leu Thr His Met Thr Asn Ala Asp 1 5 10 Gly Thr Ile Thr Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn Val Gly 25 Ile Asn Pro Asn Ser Val Tyr Thr Thr Gly Lys Leu Asn Ala Lys Val 40 Asn His Thr Ile Phe Gln Phe Leu Val Asn Val Gly Ile Arg Thr Asn 55 Ile Phe Glu His His Gly Ile Glu Phe Gly Ile Lys Ile Pro Thr Leu 70 75 Pro Asn Tyr Phe Phe Lys Gly Ser Thr Thr Ile Arg Ala Lys Lys Gln 90 Gly Pro Leu Glu Asn Gly Asn Pro Thr Thr Ile Thr Gly Ala Glu Thr 100 105 110 Asn Phe Ser Leu Thr Gln Thr Leu Arg Arg Gln Tyr Ser Met Tyr Leu 115 120 125 Arg Tyr Val Tyr Thr Phe 130 INFORMATION FOR SEQ ID NO:959: SEQUENCE
CHARACTERISTICS: LENGTH: 278 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc -feature LOCATION .278 (Xi) SEQUENCE DESCRIPTION: SEQ ID NO:959: Leu Tyr Asp Thr Phe Met Arg Phe Asp Ser 145 Gin Asp Phe Ala Leu 225 Ile Asn Lys Leu Leu Lys Lys Gly Phe Leu Ala Phe Ph Leu Leu His Asn Ser Asn Phe Tyr 130 Leu Arg Phe Val1 Lys 210 %.sn 3iu *Ar Gil Prc IlE Trp Ile Pro 115 Ala Arg Lys Leu Gly 195 Asp Val1 Phe gAla {Tyr Pro Asp Leu Pro 100 Lys Tyr Ala Giu Tyr 180 Giu Ser Gly Gly Tyr 260 Asj Gir Cys Lys Gin Phe Lys Gly Ser Thr 165 Lys Thr Leu Tyr kla 245 Asp Arg Phe Sen 70 Ser Ala Giu Phe Ser 150 Phe Arg Trp Asn Arg 230 Arg- Leu Phe Thr 55 Ser Arg Giu His Phe 135 Gin Ile la Phe rhr 215 T'yr Ile *Val Leu 40 Ala His Giu Val Tyr 120 Leu Ile Asn Phe Tyr 200 Tyr Arg Pro *Thn 25 Ala *Ser Tyr Lys Ser 105 Gly Lys Pro Ala Giy 185 Giu Arg Phe Phe Lys 265 10 Tyr Lys Lys Tyr Phe 90 Leu Phe Asn Lys Ile 170 Thr Thr Pro Ser Leu *Thr Lys Lys Gly 75 Giu Ile Arg Lys Ser 155 Phe Leu Lys Asn Arg 235 Ile Ile Cys Pro Thr Asn Tyr Phe Gly 140 Tyr Tyr Ile Ile M4et 220 Tyr ksn Ala Leu Lys Sen His Gly Tyr 125 Ala Ang Gly Leu Phe 205 Phe ys sp Leu Lys IAng Arg Val Ser Tyr 110 Val Leu Giu Val Gly 190 Lys Gin Asn 9I Tyr P Ser Giu Gly Lys Val1 Lys Lys Ser Gly Lys Gly 175 lai 31n Ia 1 'rp ~he *Val Giu *Lys Pro Gin Tyr Gin Leu Asp Leu 160 Ala Asn Tnp Met Ala 240 Lys Thn Pro Leu Phr Leu His Phe krg Asn Ile Ser Val Tyn Leu 27o WO 97/37044 PCT/US97/05223 865 Thr Ser Thr Tyr Asp Phe 275 INFORMATION FOR SEQ ID NO:960: SEQUENCE CHARACTERISTICS: LENGTH: 499 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...499 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:960: Met Lys Leu Lys Lys Arg Lys Val Ala Ala Thr Leu 1 Thr Glu Ser Asn Thr Arg Lys Lys Ile 145 Tyr Tyr Gly Leu Phe 225 Leu Leu Val Pro Leu Val Tyr Thr Met 130 Gly Pro Lys Arg Phe 210 Leu Phe Pro His Ile Thr Asn Asp Ser 115 Trp Tyr Ala Ala Phe 195 
Gin Leu Pro Leu Gly Asn Gly Val Arg 100 Cys Gin Met Asn Asn 180 Asp Gly Phe Ile 5 Leu Asp Pro Lys Gly Trp Gly Gin Tyr Ala 165 Leu Val Phe Ser Tyr 245 Phe Phe Val Leu 70 Gly Ala Thr Gin Met 150 Tyr Thr Thr Tyr Ser 230 Arg Thr Ile Lys 55 Glu Val Lys Asp Gly 135 Gly Leu Tyr Glu Gly 215 Trp Glu Thr Asn 40 Gly Gly Leu Asp Ser 120 Pro Glu Pro Asp Gin 200 Thr Gly Lys Gly 25 Phe Ile Ser Gly Phe 105 Leu Gly Trp Gly Ser 185 Glu Phe Arg Pro 10 Ser Ser Tyr Val Gly 90 Thr Ser Gly Asn His 170 Asp Gin Lys Gly Trp 250 Leu Lys Pro His 75 Gin Pro Leu Ile Gly 155 Ser Arg Met Leu Ile 235 Gly Gly Val Thr Leu Val Pro Cys Ile 140 Leu Arg Val Asp Thr 220 Ala Ile Leu Ala Gly Glu Gly Tyr Ser Met 125 Asp Phe Arg His Trp 205 Lys Asp His Lys Val Phe Thr Arg Asp Tyr 110 Asn Pro Pro Tyr Met 190 Ile Asn Gly Lys Arg Thr Asn Phe Gly Asn Trp Ala Arg Asn Glu 175 Val Tyr Met Gin Ala 255 Leu Tyr Arg Val Trp Thr Asp Thr Gly Tyr 160 Val Met Gin Lys Trp 240 Gly Ile Ile Tyr Arg Pro Thr Lys Asn Leu Met Ile His Pro Tyr Val Tyr 260 265 270 WO 97/37044 PCT/US97/05223 Leu Thr Tyr 305 Ala Trp His lle Leu 385 Ala Lys Phe Asp Leu 465 Gly Ser Ile Asn 290 Val Pro Arg Ile Gly 370 Asp Gly Gly Thr Tyr 450 Glu Pro Ala Pro 275 Pro Leu Ala Gly Asp 355 Asn Gly Ile Gly Thr 435 Gin Phe Asn Phe Met Glu Tyr Arg Leu 340 Ile Pro Ile Asp Gly 420 Ala Phe Gin Gly Val Gly Thr Phe Asp Tyr 325 Gin Asn Asn Glu Asn 405 Lys Pro Ser Ile Gin 485 Ser Tyr 310 Asn Gly Asn Met Gin 390 Ile His Arg Lys Arg 470 Pro Gly 295 Arg Thr Pro Tyr Asn 375 Trp Thr Gly Ala His 455 Ala Leu Leu Pro Gly 280 Arg Gly Ile Trp Asn Asn Trp Asp Pro 330 Gly Gly Ala 345 Phe Val Val 360 Leu Gly Thr Val Gly Gly Asp Ala Asp 410 Lys Phe Ser 425 Leu Glu Tyr 440 Val Lys Ala Gly Tyr Asn Asn Leu Asn 490 Ala Arg Ala 315 Phe Thr Gly Trp Ile 395 Ala Trp Gly Gly Pro 475 Asn Lys Asn 300 Glu Leu Leu Gly Gly 380 Tyr Phe Ser Ile Leu 460 Gly Gly Ile 285 Lys Tyr Asp Tyr Ala 365 Asn Ser Thr Val Gly 445 Lys Thr Leu Glu Thr Gly Asn Leu 350 Tyr Pro Leu Glu Tyr 430 Met Leu Gly Phe Tyr Thr 
Arg Gly 335 His Leu Val Gly Tyr 415 Gln Tyr Val Phe Glu 495 Asp Phe Tyr 320 Lys His Asn Ala Phe 400 Val Arg Leu Trp Leu 480 Ser INFORMATION FOR SEQ ID NO:961: SEQUENCE CHARACTERISTICS: LENGTH: 68 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...68 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:961: Leu Lys Glu Lys Phe Asp Phe Phe Lys Gly Lys Asn Phe Lys Ile Val 1 5 10 Tyr Cys Ile Gly Glu Asp Leu Thr Thr Arg Glu Lys Gly Phe Arg Ala 25 Val Lys Glu Phe Leu Ser Glu Gln Leu Glu Asn Ile Asp Leu Asn Tyr 40 Ser Asn Leu Ile Val Ala Tyr Glu Pro Ile Trp Ala Ile Gly Thr Lys 50 55 Lys Ala Arg Phe INFORMATION FOR SEQ ID NO:962: SEQUENCE CHARACTERISTICS: LENGTH: 263 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 263 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:962: Met Leu Glu Ile Ile Glu Leu Gin Leu Asn 145 Leu Gin Gly Lys Asp 225 Ser Lys Ser Ala Ile Phe Val Leu Sen Giy Phe Tyr Aia Asp Giu Pro Asn Lys Lys 130 Leu Asp Glu Val1 Asp 210 Tyr Ala Leu Giu Lys Leu Asn Lys Val Lys Glu Asp Phe 100 Asn Giu 115 Gin Asn Asn Gin Leu Gin Trp Tyr 180 Phe Tyr 195 Gly Giu Asn Lys Leu Ile Leu Thn Giu Gin Gin Gin Phe Ile 165 Ala His Phe Sen Leu Leu Giu Giu 70 Pro Giu Giu Giu Thn 150 Pro Gin Lys Asp Val 230 Tyr Leu Asp 55 Lys Glu Lys Giu Asn 135 Gin Lys Ile Ala Tyr 215 Met Gly Asp 40 Leu Gin Giu Thn Gin 120 Gin Lys Gin Tyr Ser 200 Thr Thr Leu 25 Leu Pro Gly Sen Asp 105 Arg Giu Leu Asp Gin 185 Val Ile Leu 10 Leu Asn Ser Asp Leu 90 Lys Ang Met Giu Gly 170 Ile Sen Leu Leu Leu Lys Giu Phe 75 Giu Asn Leu Leu Sen 155 Val1 Leu Val Sen Asp 235 Giu Lys Lys Leu Asp Ala Arg Lys 140 Val1 Asp Tyr Leu Tyr 220 Asp Leu Arg Asp Lys Glu Ile Gin Giu 125 Gly Lys Giu Lys Ile 205 Ser Leu Ala His Giu Asn Pro Phe Lys 110 Gin Leu Asn Lys Gly 190 Met Asp Lys Phe Leu Asn Lys Gin Ala Giu Lys Lys Giu Sen Sen Asp Giu Gin Arg Gin Gin Lys Thn 160 Ala Tyr 175 Tnp Ang Ile Thr Phe Lys Lys Val Asp Phe Pro Pro Tyr Pro Gly Giy Asn Met Ile Sen Ile Lys Val Asn WO 97/37044 PCTIUS97/05223 868 Phe Thr Thr Lys Glu Glu Gin 260 INFORMATION FOR SEQ ID NO:963: SEQUENCE CHARACTERISTICS: LENGTH: 78S amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION .785 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:963: Met Phe Leu Gly Val Gly Gin Val Thr Asn 145 Phe Phe Gly Leu Leu 225 Ala Giu Gin Pro Val Ile Lys Giu Gly Thr Leu Ala Leu Ile Ala Thr Met Phe Giu Ile Giu Leu 130 Gin Leu Thr Val Gly 210 Leu Phe Tyr Asn Val Ala Tyr Pro Val 115 Ser Leu Ala Asp Lys 195 Ser Ser Leu *Leu Phe *Asp Lys Lys Lys Leu Giu Lys Gin Ile Ala 100 Gly Gly Pro Tyr Leu Ser Lys Asp 165 Tyr Gin 180 Gly Ile Leu Giu Pro Lys Ser Lys 245 Arg Gly Phe Sen 70 Asn Leu Phe Lys Asp 150 Cys Gly Gly Lys Met 230 Glu Sen Phe Tyr 55 Gin Ang Glu Giu Thr 135 Lys Val1 Ile Ser Ile 215 Tyr Leu Tyr Pro 40 Lys Thr Lys Trp Ala 120 Arg Ile Giu Val1 Lys 200 Tyr Gin Ala Tyr 25 Thr Asp Lys Asp Leu 105 Asp Ile Ala Lys Gly 185 Asn Giu Ala Thr Phe 10 Met Gly Arg Thr Ala 90 Gin Asp Tyr Leu Tyr 170 Asp Ala Asn Leu Leu 250 Pro Ser *Leu *Lys Lys 75 Pro Lys Val Ser Phe 155 Giy Ser Lys Leu Ile 235 Giu Ser Ala Leu Asn Ang Lys Met Ile Lys 140 Asp Ile Sen Giu Asp 220 Gln Arg Glu Lys Thr Met Ala Giu Gly Ala 125 Asp Gly Leu Asp Leu 205 Leu Asp Gly Asn Asn Gly Pro Giu Met Phe 110 Ser Lys Lys Pro Asn 190 Leu Ala Lys Cys Pro 270 Asp Lys Leu Phe Lys Leu Thr Leu Asp Thr Sen 175 Tyr Gin Lys Gly Ile 255 Leu Thr Pro Val1 Ile Leu Leu Cys Ala Phe Giu 160 Gin Lys Arg Asn Ser 240 Lys Leu Glu Phe Asp Phe Leu Sen Cys Ala 260 WO 97/37044 PCTIUS97/05223 Lys lie Lys Asp Giu Leu Lys Giu Tyr Gly Phe Ile Ser Thr Leu Arg 275 280 Asp Ser 305 Lys Leu Leu Gin Leu 385 Ile Tyr Ala Glu Lys 465 Asn 2 Glu I Val I Phe L Arg G 545 Lys G Lys S His P Phe A 6 Lys I 625 Ser S Gly L Leu L Phe S Ile H-i 705 Glu Lj
LE
29 Th Se
GI
As Gl 37 11 Cli G1I Ph( Tyl rhr kla ~sp ~eu lys .30 ;ln iln er ro sn 10 le er eu iS 's !u Giu Asn Ser '0 r Pro Ile Leu r Arg Met Ile 325 u Lys Leu Glu 340 p Lys Asp Lys 355 y Tyr Phe Leu 0 Phe Leu Gin Gly His Asp 405 1 Val Pro Leu 420 Leu Lys Asn 435 Leu Lys Glu Lys Ser Lys Leu Lys Arg 485 Leu Leu Thr 500 Met Gly Met C 515 Arg Leu Glu C Ile Leu Asp I 5 Leu Gly Glu V 565 His Ser Thr A 580 Ser Ile Pro L 595 Thr Tyr Thr T His Thr Thr P 6 His Ser Pro A 645 Leu Ile Arg L 660 Gly Val Asp T 675 Gin Asp Lys A! Leu Giu Thr S 7: Arg Ser Ile A Pro Phe Ile Va As 31 Va As Ly Pr Asi 391 Lei GlL Prc Asp: la 470 Leu Leu 3lu :ln .eu )50
T
al ,sp eu hr he 30 sn ys yr sp er 10 la 29 p As 0 1 Le n Pr s I o Le 37 -i Al
D
j Ly, 1 As G1i Lei 45 CyE Ala Phe Glu 535 Ile Leu Glu Ile Pro 615 Ile Leu Gly Ser Leu 695 Lys Lys 55 ;n Thr u Glu o Asn e Leu 360 u Glu 5 a Phe s Pro n Ile u Lys 440 2 Ile 1 Lys Glu Arg Gin 520 Phe I Gly Tyr Lys A 5 Leu G 600 Leu L Gin T Gin A Phe I 6 Gin I 680 Met G Ala L Ser I Pr Se Al 34 Al G11 Se: Le Arc 42E Va] Prc Ser Tyr ksp 505 ly ,ys Tal sp isn 85 lu ,eu 'hr sn le 65 le lu eu le 1 Glu o Ala r Ala 330 a Arg 5 a Leu u Ala r Gin i Leu 410 j Ile Gly His Glu I Phe 490 Ile C Phe I Asn C Asp P 5 Lys L 570 Leu L Tyr A Arg L Gly T 6 Ile P 650 Ala S Glu L Ala P1 Phe G1 7: Asn P1 As Le 31 G1 Va Al Le Me 39.
Se Gl Ph 175 llu ;lu ys :lu 'he '55 eu eu rg eu hr 35 ro eU e he Ly L5 le ni Vai Pro 300 u Asp Asn 5 u Pro Leu 1 Phe Met a Phe Leu 365 u Phe Ser 380 t Leu Gin 5 r Phe Leu a Asp Thr Asp Glu 445 a Lys Ile 460 1 Leu Ser I Lys Gly Thr Pro I Ile Asp 7 525 Leu Asn 540 Asn Leu A Gly Leu P Lys Ile L 5 Glu Leu A 605 Lys Asp L 620 Ala Thr G Val Arg S Ser Lys G 6' Arg Leu LE 685 Leu Lys G: 700 Glu Asp L Gly Leu Vz
II
Al Se Ar 35 Le Pr Hi Ly Gl 431 Va Ly .le t 31 ?he la ral rsn 'ro leu sn ys ly er 1 u ly *u 90 .e Leu .a Pro r Met 335 g Leu 0 u Gin o Phe s Ala s Ala 415 Ile 1 Leu Asp Glu Leu 495 Val I Pro Leu C Ser E 5 Lys A 575 Asp L Lys L Asp A Arg L 6 Pro L 655 Tyr C Ala H Arg A Ala L~ 7: Tyr G: Asn Lys 320 Phe Val Asp Ser Cys 400 Lys Leu Lys Phe Leu 480 3lu ys 'yr ;lu >ro ~sn 'ys eu sp eu ys ys is sp (s Ly WO 97/37044 PCT/US97/05223 870 725 730 Met Gly Ser Lys Lys Leu Ser Glu Thr Leu Ser Ile Pro Leu Se Glu 740 745 750 Ala Lys Ser Tyr Ile Glu Ala Tyr Phe Lys Arg Phe Pro Ser Ile Lys 755 760 765 Asp Tyr Leu Asn Gly Met Arg Glu Glu Ile Leu Lys Thr Ser Lys Ala 770 775 780 Phe 785 INFORMATION FOR SEQ ID NO:964: SEQUENCE
CHARACTERISTICS: LENGTH: 55 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...55
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:964:

Val Ser Glu Ser Glu Lys Gln Glu Ile Glu Asn Lys Ile Glu Glu Arg
1               5                   10                  15
Lys Arg Ala Lys Glu Gln Lys Asp Phe Leu Lys Ala Asp Ser Ile Arg
            20                  25                  30
Glu Glu Leu Leu Gln Gln Lys Ile Ala Leu Met Asp Thr Pro Gln Gly
        35                  40                  45
Thr Ile Trp Glu Lys Leu Phe
    50                  55

INFORMATION FOR SEQ ID NO:965:
SEQUENCE CHARACTERISTICS:
LENGTH: 571 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...571 WO 97/37044 871 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:965: PCT/US97/0522 3 Leu Thr Trp Pro Gin Ile Ala Thr Ser Asn 145 Asp Asn Ser Asn Phe 225 Ala 2 Thr I Asn I Thr P 2 Lys A 305 Ser A Pro I Phe S Ser S 3 Val T 385 Pro A Tyr V Pro Ser Val Gin Ile Trp Gly Ser Tyr Thr Asn Ser Phe Ls Gin Ph Th Se Cy Th: As Il.
Th 13( Gi, G1 Phe Asn Thr 110 %sn ksn Isn :le Lsn 90 ~sn la le er er 70 yr sn al e Ser r Asp r Gly s Thr r Ala n Leu e Thr 115 Gin r Lys Lys Asn Asn 195 Ile Asn Ser 2 Gly Ala C 275 Thr 5 Leu P His S Thr L 3 Lys A 355 Glu L Her L Her I Asp M Al Pr Se Gl Asl I1 101 Gl Asi Let PhE Gi4 180 Asn Phe Ser %sn 3er fly jer ~sn er eu 40 .sn ys eu le et a Ser o Asn r Ala y Thr n Leu e Phe -i His Met 1 Leu Ile 165 Gly Gin Asn Ser Leu 1 245 Gin Asn 2 Val I Ala I Val I 325 Val S Leu T Leu V Asn A 3 Ser I 405 Glu L As Va Le Th 70 Ar As Asl As Va 15( PhE Sei PhE Asn Ala 230 3Gl ksn kla ,ys ?ro 10 :le jer rp 'al .sn 90 le ys *n Se 1 Se u As 55 r As g Se: n G1 Al p Asr 13E 1 Tyi Asr Tyr Asn Ala 215 Thr Ile Thr Thr Gly 295 Leu Asn Ser Gin Her 375 Gin Arg Ser r Asn r Her 40 n Gly n Gly r Gly Val Gly 120 S Her Gly 1 Ala Gin Ser 200 Asn Thr Ala C Ala z 2 Phe P 280 Lys V Her P Ile G Her L 3 Leu I 360 Ser A Thr T Arg L Asp A Le 25 Se Hi Th: Asi Asi 10I Ale Glr Thr Gly Phe 185 ly Phe 3er fly ~sn !65 sp 'al he ly ys 45 le la yr eu rg 10 u Val r Asp s Cys r Tyr a Arg 90 D Ser Tyr 1 Asn Thr Gin 170 Ser Ser Asn Phe Asn 2 250 Phe 2 Asn Thr I Gly T 3 Glu A 330 Ala I Asn T Gly A Asn P 3 Gly V 410 Leu T Ile Gly Tyr Asn Ala Thr Th 01 Se 75 Iie Sei Lel PhE 155 Ala Gl4 Phe Asn Val 235 kia ksn 1al eu ~sp ;15 ila :le yr .sn he 95 al yr r Val y Pro r Ala e Gly e Asn r Ser 1 Asn 140 Thr Thr Asp Glu Ser 220 Gly Val Asn Val Asn I 300 Gly '1 Ile Glu T Gin G 3 Gly V 380 Gin G Gly M Tyr G Ala Trp Tyr Thr Ile Ser 125 Gly Asn Phe Ser Ile 205 Thr A~sp Phe Fhr C Phe 1 ~sn I 'hr I hr A 'yr A 3 iy H 'al T lu V et V in A Ph Pr Hi 01' Al il( Met Let Glr GlL Leu 190 Gly Ser The 31y fly ~sn le le ~sn .sn .is yr al al sn e G1 o Ty s Va y G1 a As t Th: I As 17 Asr Ale Phe Thr Asn 255 Ser Ser Thr Val Gly 335 Asp Gly Asp Phe Phe 415 Ala y Asp r Tyr 1 Tyr y Ala n Ala r Phe n Ser 4 Lys 160 1 Thr 1 Phe Lys Asn Asn 240 Ser Val Pro Leu Phe 320 Asn Ala Ala Val Ser 400 Asp Leu 420 sn laLe Gly Phe Met Thr Tyr Met Pro Asn Her Tyr 
Asn Asn Asn Leu Gly Asn WO 97/37044 PCT/US97/05223 Leu Ser 465 Gly Val1 Lys Cys Thr 545 Ala Asn 450 Gly Gin Ser Gly Ile 530 Gly Leu 435 Asn Lys Asn Asp Ala 515 Gly Ser Ile Thr Thr Ser Ala 500 Gly Phe Ile Leu Ile Leu Ala 485 Pro Ser Ile Giu Thr 565 Tyr Phe 470 Ile Gin Asn Thr Ser 550 Gly Tyr 455 Thr Val1 Ser Asp Gly 535 Gly Phe 440 Tyr Lys Phe Asn Ala 520 His Asn Lys Asp Ala Gly Vai 505 Ser Tyr Arg Ala Asn Giu Ala 490 Ile Gly Giu Ile Phe 570 S er Phe 475 Lys Ile His Al a Ser 555 Phe Ile 460 Ser Asn Arg Cys Gin 540 Ser 445 Asp Gin Ile Phe Trp 525 Lys Giy Phe Thr Trp Gly 510 Asn le Gly Tyr Phe Thr 495 Asp Leu Tyr Ala Ala Thr 480 Ser Asn Gin Ile Arg 560 INFORMATION FOR SEQ ID NO:966: Ci) SEQUENCE CHARACTERISTICS:.
LENGTH: 368 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc -feature LOCATION 1. 368 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:966: Met Val Val Leu Gly Ser Thr Gly Ser Ile Gly 1 Ile Asn Val1 Ala Cys Lys Asn Ile Asn Ala Ile Ala Glu Val Ala Lys Thr 130 Lys Lys Ala Ilie Val Ser Ser Giu 115 Pro Thr Lys Leu Leu Phe Asn Phe 100 Ser Val1 Leu 5 Phe Ile Asp Vali Leu Lys Leu Asp Lys Gly Asn Pro Gly 70 Val1 Ser Val1 Ser Pro Val Giu Asn 55 Leu Ile Leu Ser Giu 135 Lys Lys Gin 40 Asp Asp Asn Gin Ala 120 His Ser Ile 25 Ile Leu Gly Ala Arg 105 Gly Phe Leu 10 Giu Lys Asn Ile I le 90 Asn His Gly Ile Ala Val Asn Asp 75 Val Lys Leu Leu Ile Lys Leu Phe Leu Ala Gly Lys Leu Trp 140 Ser Asn Ser Lys Giu Met Val1 Leu Asp 125 Ala Ala Ala Cys Pro Pro Ile Ala Ala 110 Ile Leu Ser Leu Gly Lys Leu Giu Gly Leu Ser Leu Gly Lys Lys Lys Gly Giu Leu Ala Gin Gin Gly WO 97/37044 PCT/US97/052 2 3 873 145 150 155 Ala Phe Arg Asp Thr Pro Leu Asp Leu Ile Ala Ile Gin Asn Ala Gln 165 170 175 Asr Ala Leu Lys His Pro Asn Trp Ser Met Gly Asp Lys Ile Thr Ile 180 185 190 Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu Ile Leu Glu Thr Tyr 195 200 205 Trp Leu Phe Gly Ala Ser Leu Lys Ile Asp Ala Leu Ile Glu Arg Ser 210 215 220 Ser Ile Val His Ala Leu Val Glu Phe Glu Asp Asn Ser Val Ile Ala 225 230 235 240 His Leu Ala Ser Ala Asp Met Gin Leu Pro Ile Ser Tyr Ala Ile Asn 245 250 255 Pro Lys Leu Ala Ser Leu Ser Ala Ser Ile Lys Pro Leu Asp Leu Tyr 260 265 270 Ala Leu Ser Ala Ile Lys Phe Glu Pro Ile Ser Val Glu Arg Tyr Thr 275 280 285 Leu Trp Arg Tyr Lys Asp Leu Leu Leu Glu Asn Pro Lys Leu Gly Val 290 295 300 Val Leu Asn Ala Ser Asn Glu Val Ala Met Lys Lys Phe Leu Asn Gin 305 310 315 320 Glu Ile Ala Phe Gly Gly Phe Ile Gin Ile Ile Ser Gin Ala Leu Glu 325 330 335 Leu Tyr Ala Lys Lys Ser Phe Lys Leu Ser Ser Leu Asp Glu Val Leu 340 345 350 Ala Leu Asp Lys Glu Val Arg Glu Arg Phe Gly Ser Val Ala Arg Val 355 360 365 INFORMATION FOR SEQ ID NO:967: SEQUENCE
CHARACTERISTICS:
LENGTH: 86 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...86
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:967:

Met Lys Thr Thr Glu Asn Thr Asp Glu Thr His Leu Arg Glu Thr Lys
1               5                   10                  15
Asn Lys Leu Gly Arg Lys Pro Lys Ala Asp Ala Asn Lys Lys Thr Arg
            20                  25                  30
Ala Val Ser Leu Tyr Phe Ser Asp Glu Gln Tyr Gln Lys Leu Glu Lys
        35                  40                  45
Met Ala Asn Glu Glu Glu Glu Ser Val Gly Ser Tyr Ile Lys Arg Tyr
    50                  55                  60
Ile Leu Lys Ala Leu Arg Lys Ile Glu Gln Gly Gly Gly Phe Ile Ala
65                  70                  75                  80
Phe Asn Leu Phe Leu Ile
                85

INFORMATION FOR SEQ ID NO:968:
SEQUENCE CHARACTERISTICS:
LENGTH: 803 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...803 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:968: Leu Trp Leu Lys Ser Lys Phe Phe Leu Leu Met Gly Val Gly Leu Leu 1 5 10 Ser His Ser Leu Asn Ala Leu Ser Leu Thr Leu Thr Gin Gly Lys Glu 25 Glu Gly Glu Asp Phe Ser Val Leu Thr Leu Arg Asn Asp Lys Ala Phe 40 Ser Cys Ser Tyr Ala Asn Glu Lys Pro Pro Ser Gly Ile Glu Ala Ser 55 Leu Ser Ile Ile His Ala Lys Arg Pro Ile Glu Cys Val Ile Asp Ser 70 75 Ile Pro Lys Glu Gly Phe Thr Pro Leu Glu Asn Ala Phe Phe Asn Ile 90 Thr Tyr Ser Met Arg Gin Gin Gin Phe Ile Leu His Ile Lys Pro Lys 100 105 110 Val Met Arg Arg Leu Thr Leu Phe Ser Phe Asp Arg Asp Tyr Lys Lys 115 120 125 Ala Ile Pro Leu Phe Val Glu Asn Asp Ala Lys Ala Lys Met Trp Gin 130 135 140 Ile Ile Gly Tyr Asp Gin Lys Ile Pro Phe Leu Ser Glu Lys Asp Asn 145 150 155 160 Ala Gin Lys Gly Leu Asn Phe Pro Ile Ile Ile Lys Asp Ala Gin Thr 165 170 175 Pro Ile Ile Gin Glu Leu Asp Val Asn Asn Lys Pro Leu Leu Thr Thr 180 185 190 Lys Gly Tyr Asp Leu Asn Ala Tyr Leu Glu Ala Lys Lys Gin Met Asp 195 200 205 Ser Gin Ala Tyr Phe Asp Ala Leu Arg Thr Ile Ser Arg Ala Phe Lys 210 215 220 Asn Tyr Pro Gin Thr Ile Phe Lys Lys Asp Leu Tyr Leu Leu Glu Ile 225 230 235 240 Ile Ala Leu Gly Lys Leu Gly Ile Lys Lys Ser Leu Leu Ile Asp Ile 245 250 255 Gly Thr Gin Trp Ile Lys Asn Tyr Pro Thr Asp Pro Asn Ile Pro Glu WO 97/37044 PCTIUS97/05223 Ala Gin Arg 305 Gly Asn Glu Asp Leu 385 Ser Lys Ile Asp Lys 465 Tyr Ala Glu Gin I Cys I 545 Asn P Leu L Lys T Tyr T 6 Ala L 625 Ala P Ala L Arg M Lys S Le Al 29 Ty Se: Ali Al Ly 37( Ala Ala Ala Lys His 450 kia ksp Zeu Jal .ys 530 .ys 'ro ,ys 'hr yr 10 .eu he eu et er u Ty 27 a Va 0 r Al r As 1 Ly G1L 355 5 Val Leu Ile Lys Asp 435 Ala Leu Lys Glu Leu 515 Thr Glu Gin Glu Pro 595 Arg Ile Val Asn Ala 675 Val 260 r Tyr 5 1 Arg a Pro D Leu Asp 340 1 Ile Val Glu Glu 420 Phe Glu Phe Ile Leu I 500 Gly Leu I Ala I Glu C Lys A 580 Ser G Leu G Leu A Leu P 6 Leu T 660 Leu V Lys I Va Ty Le 
Se.
32' Ly Asr Git Lei IlE 405 Glr Lys Leu Ser lie 485 ,ys let isn ~eu ;lu 65 la ;lu Ily .la he 45 yr al le 1 Ala r Tyr u Ala 310 r Asn S Glu i Tyr i Ser 1 Leu 390 Ala Ala Asn Glu Met 470 Gin Ala Gin I Ile I Lys 550 Ile C Gin I Lys L Asp P 6 Gin S 630 Ser A Ala P Tyr P Tyr A 6 Lys Lys 295 Gin Ala Ser Gin Asn 375 Lys His Leu Ala Arg 455 lu Asn 31n -ys jeu 535 yr hin2 le leu 9 he I 15 :er I .sp I he L he L 6 la T 95 Al 28 Ar Me As Al Asi 36( Prc Lei Let Tyr His 440 Ala Gly Phe Leu Asn 520 ila Leu kla Ile Chr 500 ~ys 4 eu 'yr leu 'ys 80 'hr 265 a Leu 0 g Ile t Arg n Met a Ser 345 i Phe Asp Leu 1 Leu Asp 425 Leu Ser Asn Pro Leu 505 Leu I Lys Ser C Phe i Ala I 585 Trp L Asn S Asn L Met G 6 Glu L 665 Leu L Ser L As Le Lei Le.
33( Gli Asr Tyr Lys Leu 410 Leu Tyr Val Thr Asn 490 Phe Pro rhr 'In ~sp 70 .eu eu er lys in 50 ,ys eu eu p Gi u Le i Al, 31 Ph i Ilt Asi Let LyE 395 Asr *Gli Asn Val Gin 475 Ser Asp Lys Pro Ile 555 Cys Asn Tyr Thr Lys 635 Asn His Glu Leu u Asn u Glu 300 a Ile 5 Lys Ala i Ala i Ser 380 Asn Gin Ala Leu Arg 460 Glu Asn Asn Asp Leu C 540 Thr I Leu I Ala I Arg L 6 Leu A 620 Glu P Asn G Phe L Asn G 6 Lys L 700 As 28t Ty: Gli Gli LeL Lys 365 Met Gin Asp Leu Gin 445 Ala Lys lu Lys Ser 525 lu la 'yr leu leu ila he lu ys lu eu 270 n Asn r Lys 1 Ala Ala 1 Asn 350 Tyr His Met Asp Tyr 430 Tyr Arg Ile Ala His 510 Pro Asp Phe 2 Phe 2 Lys A 590 Gly i Ser L Tyr A Lys G 6 Asp A 670 Lys A Gin A Ty: Asi Al Ph 33E Trj LeL Ser Asr Asp 415 Ala Leu Asp Ala 3ml 495 Tyr Leu iis kla la 575 la ~rg lys ly sp sp r Lys n Ser a Glu 320 Ser Ala Ile Glu Glu 400 Leu Arg Gin Glu His 480 Lys Ala Ile Arg Phe 560 Ser Ala Asn Asp Ile 640 Leu Lys Pro Ala 690 Tyr 705 Lys Asp Tyr Ser Tyr Thr Pro Phe Cly Giu Phe Ala Leu Ile Asp 715 720 WO 97/37044 PCTUS97/0522 3 Ala Lys Tyr Lys Asn 785 Lys Tyr Arg Leu Leu Leu Gin 755 Ala Ser 770 Ala Trp Glu Ser Thr Thr 725 Asn Arg 740 Ser Ser Leu Glu Gin Asn Lys Asp Tyr Ser Lys 730 Arg Leu Ser Leu Glu 745 Leu Leu Asp Leu Thr 760 Lys Cys Val Gin Leu 775 Leu Cys Glu Gin Gly 790 Ala Leu Glu Thr Leu Asp 735 Asp His Gin Lys Ala Leu 750 Asn Gin Lys Ala Lys Ser 765 Lys Gin Lys Asp Gin Thr 780 Leu Asn Leu Phe Lys Asn 795 800 INFORMATION FOR SEQ ID NO:969: SEQUENCE CHARACTERISTICS: LENGTH: 265 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...265 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:969: Met 1 Gly Asp Ile Val Lys Lys Ala Asn Glu 145 His Thr Ile Leu Arg Ala Ser Val Leu Ser Ala Leu Leu Leu Val Ala Ala Asn Leu Leu Glu Ile Glu Asp Gly Ser Asp Leu Asn 115 Glu Ile 130 Asn Lys Cys Gin Val Arg Pro Val Ile Asp Asn Asp 100 Ala Pro Asp Lys Met 5 Lys Ser Lys Pro Ser Val Thr Ala Lys Glu 165 Val His Val Pro Asp 70 Val Lys Gin Asp Ile 150 Leu Val Ser Ile Leu 55 Thr Ile Leu Gin Tyr 135 Leu Thr Val Glu 40 Lys Lys Gly Val Asn 120 Ala Tyr Lys 10 Ser Ala Ser 25 Lys Gin Thr Ser Ser Gin Tyr Asn Ile 75 Leu Ser Asn 90 Ala Glu Thr 105 Ser Ala Lys Ile Glu Leu Ile Val Ser 155 Leu Arg Asp 170 Asp Asn Asp Pro Ile Asn Leu Pro 140 Asp His Lys Lys Leu Leu Phe Gin Asn 125 Ser Pro Leu Arg Lys Lys Val Phe Lys 110 Ala Thr Met Lys Gly Met Val Met Val Ser Ile Ile Asn Cys Glu Leu Gin Arg Val Ser Asn Gin Phe Ala Pro 160 Asn Val Gly Trp Leu Gly Val Asn Ser Ala Lys WO 97/37044 PCT/US97/052 2 3 877 Lys Ala Ala Leu Ile Gin Glu Glu Met Ala Lys Ala Arg Ala Arg Gly 195 200 205 Ala Ser Val Glu Asp Lys Ile Ser Ile Leu Glu Lys Ile Tyr Ser Thr 210 215 220 Gin Tyr Asp Ile Asn Ala Gin Lys Glu Pro Glu Asp Leu Arg Thr Lys 225 230 235 240 Val Glu Asn Thr Thr Lys Lys Ile Phe Glu Ser Gly Val Ile Lys Gly 245 250 255 Val Pro Phe Leu Tyr His Tyr Lys Ala 260 265 INFORMATION FOR SEQ ID NO:970: SEQUENCE
CHARACTERISTICS:
LENGTH: 227 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...227
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:970:

Leu Asp Phe Lys Ala Ile Glu His Gly Glu Glu Thr Leu Lys Asn Glu
1               5                   10                  15
Thr Leu Arg Ala Leu Ser Asp Phe Ala Lys Ser Ser Asp Thr His Ile
            20                  25                  30
Val Ala Cys Ser Ile Glu Lys Asn Asn Lys Lys Leu Tyr Asp Ser Ala
        35                  40                  45
Tyr Ile Ile Pro Pro Lys Gly Lys Ile Val Gly Lys His Arg Lys Ile
    50                  55                  60
Tyr Leu Trp Gly Asp Glu Lys Ser Arg Phe Lys Arg Gly Lys Lys Tyr
65                  70                  75                  80
Glu Val Phe Thr Leu Asp Phe Gly Asp Phe Ser Ala Lys Val Gly Leu
                85                  90                  95
Gln Ile Cys Tyr Glu Thr Gly Phe Gly Val Gly Ala Asn Leu Leu Val
            100                 105                 110
Leu Gln Gly Ala Glu Val Leu Ile Tyr Pro Ser Ala Phe Gly Lys Ala
        115                 120                 125
Arg Ala Tyr Asn Trp Asp Leu Leu Ser Lys Ala Arg Ala Leu Glu Asn
    130                 135                 140
Gly Cys Phe Val Cys Ala Cys Asn His Ser Gly Glu Glu Thr Asn Ala
145                 150                 155                 160
Lys Leu Lys Gln Thr Leu Glu Phe Ala Gly Asp Ser Arg Ile Ile Ala
                165                 170                 175
Pro Asn Gly Lys Ile Ile Ala Gln Ala Thr Lys Leu Asn Glu Val Ile
            180                 185                 190
Ile Ala Glu Met Asp Leu Asn Glu Val Ala Leu Gln Arg Gln Lys Ile
        195                 200                 205
Pro Tyr Leu Gln Asp Phe Asp Thr Lys Leu Thr Lys Lys Gly Phe Gly
    210                 215                 220
Lys Leu Thr
225

INFORMATION FOR SEQ ID NO:971:
SEQUENCE CHARACTERISTICS:
LENGTH: 293 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...293
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:971:

Met Ala Lys Glu Ile Leu Val Ala Tyr Gly Val Asp Ile Asp Ala Val
1               5                   10                  15
Ala Gly Trp Leu Gly Ser Tyr Gly Gly Glu Asp Ser Pro Asp Asp Ile
            20                  25                  30
Ser Arg Gly Leu Phe Ala Gly Glu Val Gly Ile Pro Arg Leu Leu Lys
        35                  40                  45
Leu Phe Lys Lys Tyr His Leu Pro Ala Thr Trp Phe Ala Pro Gly His
    50                  55                  60
Ser Ile Glu Thr Phe Pro Glu Pro Met Lys Met Ile Val Asp Ala Gly
65                  70                  75                  80
His Glu Val Gly Ala His Gly Tyr Ser His Glu Asn Pro Ile Ala Met
                85                  90                  95
Thr Ala Lys Gln Glu Glu Asp Val Leu Leu Lys Ser Val Glu Leu Ile
            100                 105                 110
Lys Asp Leu Thr Gly Lys Ala Pro Thr Gly Tyr Val Ala Pro Trp Trp
        115                 120                 125
Glu Phe Ser Asn Ile Thr Asn Glu Leu Leu Leu Lys His Gly Phe Lys
    130                 135                 140
Tyr Asp His Ser Leu Met His Asn Asp Phe Thr Pro Tyr Phe Val Arg
145                 150                 155                 160
Val Gly Asp Ser Trp Ser Lys Ile Asp Tyr Ser Leu Glu Ala Lys Asp
                165                 170                 175
Trp Met Lys Pro Leu Ile Arg Gly Val Glu Thr Asn Leu Val Glu Ile
            180                 185                 190
Pro Ala Asn Trp Tyr Leu Asp Asp Leu Pro Pro Met Met Phe Ile Lys
        195                 200                 205
Lys Ser Pro Asn Ser Phe Gly Phe Val Ser Pro Arg Asp Ile Gly Gln
    210                 215                 220
Met Trp Ile Asp Gln Phe Asp Trp Val Tyr Arg Glu Met Asp Tyr Ala
225                 230                 235                 240
Val Phe Ser Met Thr Ile His Pro Asp Val Ser Ala Arg Pro Gln Val
                245                 250                 255
Leu Leu Met His Glu Lys Ile Ile Glu His Ile Asn Gln His Glu Gly
            260                 265                 270
Val Arg Trp Val Thr Phe Asn Glu Ile Ala Asp Asp Phe Leu Lys Arg
        275                 280                 285
Asn Pro Arg Lys Lys
    290

INFORMATION FOR SEQ ID NO:972:
SEQUENCE CHARACTERISTICS:
LENGTH: 136 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...136
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:972:

Val Cys Gly Val Leu Ser Leu Gln Ala Arg Leu Ser Phe Glu Asn Cys
1               5                   10                  15
Ser Lys Pro Leu Asp Cys Ser Leu Phe Ala Thr Thr Cys Thr Pro Gln
            20                  25                  30
Asn Pro Ile Gly Ser Cys Met Val Ser Ser Gln Gly Gly Val Cys Gly
        35                  40                  45
Val Leu Ser Leu Gln Ala Arg Leu Ser Phe Glu Asn Cys Ser Lys Pro
    50                  55                  60
Leu Asp Cys Ser Leu Phe Ala Thr Thr Cys Thr Pro Gln Asn Pro Ile
65                  70                  75                  80
Gly Ser Cys Met Val Ser Ser Gln Gly Gly Val Arg Gly Val Leu Ser
                85                  90                  95
Leu Gln Ala Arg Leu Ser Phe Lys Asn Cys Phe Leu Val Lys Ser Leu
            100                 105                 110
Lys Pro Ser Gln Leu Thr Leu Met Leu Thr Gly Phe Leu Gly Gly Phe
        115                 120                 125
Ser Phe Leu Thr Thr Gly Phe Trp
    130                 135

INFORMATION FOR SEQ ID NO:973:
SEQUENCE CHARACTERISTICS:
LENGTH: 360 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
WO 97/37044 PCT/US97/05223 880 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...360 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:973: Met Asn Gly Phe Cys Ala Arg Leu Arg Ala Ile Thr Phe Asn Glu Arg 1 5 10 Leu Lys Met Lys Ile Ala Val Leu Leu Ser Gly Gly Val Asp Ser Ser 25 Tyr Ser Ala Tyr Ser Leu Lys Glu Gln Gly His Glu Leu Val Gly Ile 40 Tyr Leu Lys Leu His Ala Ser Glu Lys Lys His Asp Leu Tyr Ile Lys 55 Asn Ala Gin Lys Ala Cys Glu Phe Leu Gly Ile Pro Leu Glu Val Leu 70 75 Asp Phe Gin Lys Asp Phe Lys Ser Ala Val Tyr Asp Glu Phe Ile Asn 90 Ala Tyr Glu Glu Gly Gin Thr Pro Asn Pro Cys Ala Leu Cys Asn Pro 100 105 110 Leu Met Lys Phe Gly Leu Ala Leu Asp His Ala Leu Lys Leu Gly Cys 115 120 125 Glu Lys Ile Ala Thr Gly His Tyr Ala Arg Val Lys Glu Ile Asp Lys 130 135 140 Val Ser Tyr Ile Gin Glu Ala Leu Asp Lys Thr Lys Asp Gin Ser Tyr 145 150 155 160 Phe Leu Tyr Ala Leu Glu His Glu Val Ile Ala Lys Leu Val Phe Pro 165 170 175 Leu Gly Asp Leu Leu Lys Lys Asp Ile Lys Pro Leu Ala Leu Asn Ala 180 185 190 Met Pro Phe Leu Gly Thr Leu Glu Thr Tyr Lys Glu Ser Gin Glu Ile 195 200 205 Cys Phe Val Glu Lys Ser Tyr Ile Asp Thr Leu Lys Lys His Val Glu 210 215 220 Val Glu Lys Glu Gly Val Val Lys Asn Leu Gin Gly Glu Val Ile Gly 225 230 235 240 Thr His Lys Gly Tyr Met Gin Tyr Thr Ile Gly Lys Arg Lys Gly Phe 245 250 255 Ser Val Lys Gly Ala Leu Glu Pro His Phe Val Val Gly Ile Asp Ala 260 265 270 Lys Lys Asn Glu Leu Ile Val Gly Lys Lys Glu Asp Leu Ala Thr His 275 280 285 Ser Leu Lys Ala Lys Asn Lys Ser Leu Thr Lys Asp Phe Lys Asp Gly 290 295 300 Glu Tyr Phe Ile Lys Ala Arg Tyr Arg Ser Val Pro Thr Lys Ala Phe 305 310 315 320 Val Ser Leu Lys Asp Gly Met Ile Glu Val Glu Phe Lys Glu Pro Phe 325 330 335 Tyr Gly Val Ala Lys Gly Gin Ala Leu Val Val Tyr Lys Asp Asp Ile 340 345 350 Leu Leu Gly Gly Gly Val Ile Val 355 360 INFORMATION FOR SEQ ID NO:974: WO 97/37044 PCT/US97/05223 881 SEQUENCE CHARACTERISTICS: LENGTH: 357 amino acids TYPE: amino acid TOPOLOGY: 
linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...357 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:974: Met Lys Ala Ser Ile Tyr Asp Phe Thr Leu Asp Glu Leu Ser Gin Leu 1 5 10 Leu Lys Pro Ser Phe Arg Ala Lys Gin Leu Tyr Leu Trp Leu Tyr Ala 25 Lys Tyr Lys Thr Ser Phe Lys Asp Met Gin Asn Asn Phe Ser Lys Asp 40 Phe Ile Ala Tyr Leu Glu Gin Glu Phe Thr Leu Arg Thr Ile Glu Ile 55 Ala His Val Arg Glu Ser Val Asp Gly Ser Lys Lys Tyr Leu Phe Lys 70 75 Ser Leu Arg Asp Asn His Thr Phe Glu Ala Val Leu Leu Lys Met Lys 90 Asp Lys Lys Ile Asp Glu Glu Thr Asn Ala Val Leu Glu Gly Glu Lys 100 105 110 Tyr Thr Val Cys Val Ser Cys Gin Ile Gly Cys Gin Val Gly Cys Ser 115 120 125 Phe Cys Phe Thr Gin Lys Gly Gly Phe Val Arg Asp Leu Lys Ala Ser 130 135 140 Glu Ile Ile Gin Gin Ala Leu Leu Ile Lys Glu Ala Asn Asn Leu Pro 145 150 155 160 Ile Glu Lys Ala Leu Asn Ile Val Phe Met Gly Met Gly Glu Pro Leu 165 170 175 Asn Asn Leu Asp Glu Val Cys Lys Ala Val Glu Ile Phe Asn Thr Gly 180 185 190 Met Gin Ile Ser Pro Lys Arg Ile Thr Ile Ser Thr Ser Gly Val Ala 195 200 205 Asp Lys Ile Pro Ile Leu Ala Gly Lys Asn Leu Gly Val Gin Leu Ala 210 215 220 Ile Ser Leu His Ala Val Asp Asp Lys Thr Arg Ser Ser Leu Met Pro 225 230 235 240 Leu Asn Lys Lys Tyr Asn Ile Glu Cys Val Leu Asn Glu Val Arg Lys 245 250 255 Trp Pro Leu Glu Gin Arg Lys Arg Val Met Phe Glu Tyr Leu Leu Ile 260 265 270 Lys Asp Leu Asn Asp Ser Leu Asp Cys Ala Lys Lys Leu Leu Lys Leu 275 280 285 Leu Asn Gly Ile Lys Ser Lys Val Asn Leu Ile Leu Phe Asn Pro His 290 295 300 WO 97/37044 PCT/US97/052 2 3 882 Glu Gly Ser Lys Phe Glu Arg Pro Ser Leu Glu Ser Ala Arg Met Phe 305 310 315 320 Ala Asp Phe Leu Asn Ala Lys Gly Leu Leu Cys Thr Ile Arg Glu Ser 325 330 335 Lys Ala Leu Asp Ile Glu Ala Ala Cys Gly Gln Leu Arg Glu Lys Lys 340 345 350 Leu Gin Gin Lys Ile 350 355 INFORMATION FOR SEQ ID NO:975: SEQUENCE
CHARACTERISTICS:
LENGTH: 468 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...468 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:975: Met Ala Glu Trp Lys Thr Asp Thr Glu Glu Val Lys Lys Val Val Gly 1 5 10 Arg Cys Arg Glu Phe Lys Arg Ser Leu Gin Glu Glu Lys Cys Ser Pro 25 Phe Ile Lys Asp Leu Asp Ser Tyr Ala Leu Lys Ile Ile Val Glu Arg 40 Arg Lys Thr Glu Met Gin Leu Glu Lys Ala Ile Gly Glu Leu Lys Lys 55 Ala Lys Ser Asn Glu Asp Asp Ala Lys Val Ala Leu Arg Val Leu Gin 70 75 Gly Ala Ser Val Val Ser Trp Ile Trp Pro Pro Ala Arg Ile Ala Ala 90 Thr Ala Ala Ile Val Ala Ala Glu Ala Val Leu Lys Phe Met Lys Glu 100 105 110 Asp Thr Glu Lys Cys Lys Arg Asn Val Glu Leu Leu Glu Arg Met Leu 115 120 125 Glu Ile Tyr Ser Asn Gin Ala Lys Ala Ser Ala Asn Leu Met Asn Gin 130 135 140 Ala Trp Glu Gly Ile Lys Lys Arg Leu His Phe Tyr Thr Asp Lys His 145 150 155 160 Gin Glu Phe Ile Arg Arg Leu Lys Gin Ala Ser Asp Ala Ile Asp Asn 165 170 175 Glu Tyr Asn Phe Pro Thr Pro Gly Val Leu Leu Glu Tyr Asp Phe Glu 180 185 190 Arg Pro Ala Ile Ser Tyr Thr Pro Lys Lys Ser Val Phe Asn Glu Arg 195 200 205 Leu Lys Asp Leu Arg Glu Asn Phe Ser Ala Ser Leu Tyr Ala Asp Leu 210 215 220 WO 97/37044 PCTIUS97/052 23 LYS Asp Lys Ilie His His Asn Ala Leu Ser Asn Asp Asp Leu Giu 225 23023 Met Ile Ala Phe Arg Giu Gin Giu Phe Giu Lys Ser Leu 245 250 Met Gly Ala Tyr Ser Tyr Asp Giu Asn Pro Asn Asp Giu Gu Asp 255 Leu Asp Arg 240 Trp Arg Met Ala Ile Se, Met Ala Thr Ala Ser Lys Ile 385 Trp Giu Gin *Pro 290 -Lys *Glu Phe Gin Ala 370 Val Arg Asp Ala 275 Ser Asn Lys Asp Lys 355 Leu Asp Trp Leu rhr ValI Arg Ile Asn 340 Ile Glu Giu Aa ksn 420 eu Lys Git Leu Gl Cys Val 310 Lys Giu 325 Leu Glu Asp Pro Tyr Arg Lys Asn 390 Giu Phe 405 Lys Thr Lys Thr 265 1 Gin Giu Leu 280 Val Pro Ser 295 Lys Asn Phe Ser Pro Asn Thr Glu Leu 345 Val Leu Glu 360 Giu Phe Leu 375 Pro Tyr Pro Asp Ser Val Ala Cys Ala Giu Lys Ser Tyr Lys Asp 330 Giu Arg Glu Giu Phe 410 Hjis Asn Giu 315 Ser Arg Asn Ser Giu 395 Ser His Git 300 Ala Asn Ala Glu Arg 380 Val1 Ala Ala 270Gu 
s 2e8G5 s Ser Leu Thr Leu Giu Gly Ala Ile Asn 335 Thr Giu Asn 350 Asn Tyr Thr 365 Lys Glu Ser Ser Phe Asn Ile Val Pro 415 Leu Lys Ala Leu Leu Phe 320 Giu Leu Gin Phe Glu 400 ELeu Thr Ile Trp Ala Leu Met Arg Gin Ser Trp 435 Asn Arg Ser Gin Lys Asp Ser 450 455 Thr Arg met Phe 465 Ser Leu Gly Gly 445 Cys Gly Ile Leu Ile 460 INFORMATION FOR SEQ ID NO:976: Ci) SEQUENCE
CHARACTERISTICS:
LENGTH: 268 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1. 268 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:976: Met LYS Gin Sen Leu Ang Giu Gin Lys Leu Leu Lys Ile Leu Glu Asn 10 Asp Val Leu Thr Ile Leu ASP Ser Phe Ser Asn Tyr Leu Phe Giu Leu 25 WO 97/37044 PCT/US97/05223 Arg Glu Glu Leu Asp Phe Ile Glu Glu Glu Met Glu Gly Glu Ile Thr Glu His Thr His Phe Leu Asn 145 Glu Ile Phe Pro Ile 225 Glu Met SGni Val Glu Phe Lys Ile 130 Asn Thr Asn Val Thr 210 Pro Leu Leu Asr Asn His Val Glu 115 Pro Ser Ile Asn Gin 195 Asn Asn Phe Asp SLeu i Val Leu Lys 100 Ile Leu Phe Arg Asn 180 Ala His Trp Glu Gin 260 Thr Phe Tyr Ser Asn Gin Val Gin 165 Ile Asn Ala Ile Lys 245 Glu Ala Tyr 70 Ser Phe Asp Ser Ser 150 Ser Ser Phe Tyr Gin 230 Tyr Asn( Leu 55 Glu Gly Leu Gly Gly 135 Leu Tyr Glu Leu Asp 215 Ile Phe 31n Tyr Asn Leu Ser Gin 120 Lys Val Arg Asp Glu 200 Phe Asp Gin Asn Asp Val Ile Asn 105 Asp Asn Tyr Ile Met 185 Tyr Thr Met Asn Lys Phe Leu Asp 90 Gin Pro Asp Val Leu 170 Gin Tyr His Ser Ile 250 Asn Ser Asn 75 Ser Asp Gin Ala Tyr 155 Arg Ser Val Leu Val 235 Asp Ser Asn i le Leu Leu Lys Ser 140 Ala Leu Asp Gin Ile 220 Glu Glu Asp Phe Asp Asn Asp Thr 125 Ser Tyr Leu Ile Asn 205 Met Ala Val Leu Asp Ala Phe 110 Leu Phe Phe Glu Glu 190 Lys Asp Lys Thr Glu Val Asn Arg Ser Lys Met Lys 175 Asn Ile Ser Lys Asn 255 Asp Lys Leu Phe Arg Ala Leu 160 Pro Phe Tyr Ile Lys 240 Lys INFORMATION FOR SEQ ID NO:977: SEQUENCE CHARACTERISTICS: LENGTH: 258 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...258 (xi) SEQUENCE DESCRIPTION: SEQ ID N0:977: Leu Leu Ser Asn Lys Thr Leu Pro Lys Arg Phe Ala Thr Tyr Ser Leu 1 5 10 Gin Glu Val Gly Val Ile Phe Leu Thr Thr Gin Ile Leu Ser Ile Met 25 Arg Lys Thr Arg Cys Ser Lys Thr Leu Phe Phe Ile Thr Arg Gly Arg 40 WO 97/37044 PCT/US97/05223 885 Glu Ser Phe Arg Tyr Gin Leu Cys Asp His Tyr Lys Gin Lys Arg His 55 Gin Phe Asp Glu Asp Phe Arg Ser Leu Leu Lys Ala Leu Lys Ile Ala 70 75 Leu Val Glu Lys Tyr Pro Leu Lys Lys Gly Ala Lys Ile Gin Gly Glu 90 His Cys Phe Glu Tyr Glu Ala Asp Asp Ile Ile Ser Phe Tyr Lys Lys 100 105 110 Lys Asp Pro Asn Asn Tyr Val Ile Ala Ser Met Asp Lys Asp Ile Leu 115 120 125 Tyr Ser Asn Arg Gly Ser His Phe Asn Leu Lys Thr Asn Ala Phe Phe 130 135 140 Asn Val Ser Gin Lys Glu Ala His Phe Phe Ala Tyr Tyr Gin Cys Val 145 150 155 160 Val Gly Asp Lys Gly Asp Asn Ile Lys Gly Val Lys Gly Ile Gly Gly 165 170 175 Phe Asn Tyr Lys Asp Phe Leu Asn Glu Asp Ala Lys Glu His Glu Leu 180 185 190 Trp Glu Gin Ile Ile Gin Ala Phe Lys Ile Lys Glu Asp Leu Ser Asp 195 200 205 Ser Glu Ala Lys Glu Lys Ala Leu Leu Asn Met Arg Leu Val Asn Met 210 215 220 His Gin Met Thr His His Gly Val Ile Lys Leu Trp Glu Pro Glu Phe 225 230 235 240 Lys Lys Ala Phe Phe Pro Lys Lys Pro Gin Arg Pro Asp Phe Lys Arg 245 250 255 Ile Ser INFORMATION FOR SEQ ID NO:978: SEQUENCE CHARACTERISTICS: LENGTH: 256 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...256 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:978: Leu Cys Leu Lys Leu Leu Ile Trp Asn Phe Lys Glu His Ser Leu Lys 1 5 10 Val Asn Phe Phe Ala Thr Cys Leu Gly Ala Ala Ile Tyr Ser Asn Ala 25 Ser Leu Asn Ala Ile Lys Leu Leu Arg Lys Glu Asn Leu Glu Val Val 40 Phe Lys Lys Asp Gin Thr Cys Cys Gly Gin Pro Ser Tyr Asn Ser Gly 55 WO 97/37044 PCT/US97/05223 886 Tyr Tyr Glu Glu Thr Lys Lys Val Val Leu Tyr Asn Ile Lys Leu Tyr 70 75 Ser Asn Asn Asp Tyr Pro Ile Ile Leu Pro Ser Gly Ser Cys Thr Gly 90 Met Met Arg His Asp Tyr Leu Glu Leu Phe Glu Gly His Ala Glu Phe 100 105 110 Asn Met Val Lys Asp Phe Cys Ser Arg Val Tyr Glu Leu Ser Glu Phe 115 120 125 Leu Asp Lys Lys Leu Gin Val Lys Tyr Glu Asp Lys Gly Glu Pro Leu 130 135 140 Lys Ile Thr Trp His Ser Asn Cys His Ala Leu Arg Val Ala Lys Val 145 150 155 160 Ile Asp Ser Ala Lys Asn Leu Ile Arg Gin Leu Lys Asn Val Glu Leu 165 170 175 Ile Glu Leu Glu Lys Glu Glu Glu Cys Cys Gly Phe Gly Gly Thr Phe 180 185 190 Ser Val Lys Glu Pro Glu Ile Ser Ala Val Met Val Lys Glu Lys Ile 195 200 205 Lys Asp Ile Glu Ser Arg His Val Asp Val Ile Val Ser Ala Asp Ala 210 215 220 Gly Cys Leu Met Asn Ile Ser Thr Ala Met Gin Lys Met Gly Ser Leu 225 230 235 240 Thr Lys Pro Met His Phe Tyr Asp Phe Leu Ala Ser Arg Leu Gly Leu 245 250 255 INFORMATION FOR SEQ ID N0:979: SEQUENCE CHARACTERISTICS: LENGTH: 469 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...469 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:979: Leu Lys Ile Phe Val Leu Leu Met Ser Val Ile Leu Gly Ile Ser Leu 1 5 10 Thr Gly Cys Ile Gly Tyr Arg Met Asp Leu Glu His Phe Asn Thr Leu 25 Tyr Tyr Glu Glu Ser Pro Lys Lys Ala Tyr Glu Tyr Ser Lys Gln Phe 40 Thr Lys Lys Lys Lys Asn Ala Leu Leu Trp Asp Leu Gin Asn Gly Leu 55 Ser Ala Leu Tyr Ala Arg Asp Tyr Lys Thr Ser Leu Gly Val Leu Asp 70 75 Gin Ala Glu Gin Arg Phe Asp Lys Thr Gln Ser Ala Phe Thr Arg Gly 90 WO 97/37044 PCT/US97/0522 3 Ala Gly Tyr Val Gly Ala Thr Met Ile Asn As 100 Gly Gly Asn Ile Tyr Gl 105 u Asp Arg 145 Val Asn Tyr Ala Asn 225 Pro His Glu Val Asn 305 Ala Tyr Gin Ser I Ile 385 Glu E Leu L Arg G Asn A 4 Leu S 465 Tyr 130 Ala Gin Met Ser Val 210 Lys Phe Phe Phe Ser 290 Phe Ser Ile Ala Leu 1 370 Phe A Ser T ys P lu L 4 .rg H 50 er H 11 Me' Asr Lys Glu Asr 195 Ser Gly Val Thr Lys 275 Val rhr Ile Ile Val 355 ?yr la ryr ro eu 35 is is 5 t Le n Gl s Al Arc 18c SLei Ty Let Ala Trp 260 Ile Ala Leu Asp Thr 340 Ala Ser His Glu Cys 420 Leu Asn Leu u Leu u Arg a Ile 165 g Ser 0 u Asp r Leu 1 Gly Gin 245 Ile Asp Leu Lys Ala 325 Arg Asn Gly Lys Val P 405 Lys A Ser G Ile L Lys Asn Gin 150 Lys Arg Lys Ser Tyr 230 Asp Ile Val Pro Asp 310 Val T Ala ryr Val S 3 Ile T 190 rg A krg S ly P eu T 4 Gly Asp 135 Arg Glu Ala Tyr Gly 215 Leu Leu Ile Pro Lys 295 Gly Val Ile ?yr er 175 yr I ila I er I 'he 4 yr v 55 Va.
12( Sei Arc Ile Glu Glu 200 Leu Asn Val Glu Ile 280 Leu Glu Ala Leu Leu 360 Thr Leu Asp Leu al 140 'al l Le 0 r Al g Al As I Va 18 Al2 SPhe GIu Phe Asp 265 Phe Glu Lys Ser Ser 345 Gly Phe Met Ser Glu 425 Thr Arg u Ile a Lys a Lys p Ser 170 1 Ser Tyr Tyr Tyr Ala Phe 250 Gly Met: Lys Val Glu I 330 Ala Phe Ala F Arg I 3 Ile A 410 Ser P Ala P Ser P As Al Gl 15 Se: GlI G1I Ala Tyr 235 Lys Lys Ile Gly rhr 315 Phe rhr ral %sp le 95 .sp 'ro ro he p Asn Val Arg Ala Tyr 110 n Tyr Tyr Lys Ala Ile 125 a Arg Val Gin Phe Asn 140 u Phe Tyr Tyr Glu Glu 5 160 r Lys Lys His Asn Ile 175 u Ile Leu Asn Asn Thr 190 i Gly Leu Leu Asn Pro 205 Leu Asn Gly Asp Lys 220 Gly Ile Ser Gin Ser 240 Asn Pro Asn Arg Ser 255 SGlu Pro Gin Lys Ser 270 Asp Ser Val Tyr Asn 285 SGlu Ala Phe Tyr Gin 300 Pro Phe Asp Thr Leu 320 Arg Lys Gin Leu Pro 335 Phe Lys Val Gly Met 350 Gly Gly Leu Val Thr 365 Thr Arg Asn Thr Ser 380 Lys Asn Lys Ala Phe 400 Ala Phe Ser Phe Ser 415 Lys Ile Ile Asp Ala 430 Gin Val Phe Cys Ser 445 Lys Asn Gly Phe Val 460 INFORMATION FOR SEQ ID NO:980: SEQUENCE CHARACTERISTICS: LENGTH: 350 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 888 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...350 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:980: Met Lys Gin Ile Lys Ala Leu Ala Leu Phe Ser Gly Gly Leu Asp Ser 1 5 10 Leu Leu Ser Met Lys Leu Leu Ile Asp Gin Gly Ile Glu Val Thr Ala 25 Leu His Phe Asn Ile Gly Phe Gly Gly Asn Lys Asp Lys Arg Glu Tyr 40 Phe Glu Asn Ala Thr Ala Gin Ile Gly Ala Lys Leu Leu Val Cys Asp 55 Ile Arg Glu Gin Phe Phe Asn Asp Val Leu Phe Lys Pro Lys Tyr Gly 70 75 Tyr Gly Lys Tyr Phe Asn Pro Cys Ile Asp Cys His Ala Asn Met Phe 90 Arg Asn Ala Phe Tyr Lys Met Leu Glu Leu Asp Ala Asp Phe Val Leu 100 105 110 Ser Gly Glu Val Leu Gly Gin Arg Pro Lys Ser Gin Arg Lys Glu Ala 115 120 125 Leu Asn Gin Val Arg Lys Leu Val Arg Glu Val Gly Glu Glu Ala Arg 130 135 140 Phe 
Asp Leu Ile Leu Asp Arg Thr Gin Ala Ser Gly Glu Lys Pro Gin 145 150 155 160 Phe Leu Asp Glu Leu Leu Leu Arg Pro Met Ser Ala Lys Leu Leu Glu 165 170 175 Pro Thr Phe Met Glu Lys Lys Gly Trp Val Asp Arg Glu Lys Leu Leu 180 185 190 Asp Val Ser Gly Arg Gly Arg Ala Arg Gin Leu Gin Met Ile Lys Asp 195 200 205 Tyr Gly Leu Lys Tyr Tyr Glu Lys Pro Gly Gly Gly Cys Leu Leu Thr 210 215 220 Asp Ile Gln Val Ser Asn Lys Ile Lys Asn Leu Lys Glu Tyr Arg Glu 225 230 235 240 Met Val Phe Glu Asp Gly Val Ile Val Lys Asn Gly Arg Tyr Phe Val 245 250 255 Leu Pro His Asn Ala Arg Leu Val Val Ala Arg Asn Glu Glu Glu Asn 260 265 270 His Lys Leu Asp Ile Glu His Pro Leu Met Asp Lys Ile Glu Leu Leu 275 280 285 Gly Cys Lys Gly Pro Leu Ser Leu Val Asp Lys Asn Ala Ser Gin Glu 290 295 300 Asp Lys Glu Leu Ala Gly Arg Ile Ala Leu Gly Tyr Ala Lys Thr Leu 305 310 315 320 Lys Asn Gin Ala Tyr Leu Ile Gin Ile Gly Asn Glu Lys Arg Glu Leu 325 330 335 Tyr Pro Leu Asp Lys Glu Asn Ala Arg Glu Tyr Leu Phe Ala 340 345 350 WO 97/37044 PCT/US97/05223 889 INFORMATION FOR SEQ ID NO:981: SEQUENCE CHARACTERISTICS: LENGTH: 405 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...405 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:981: Met Lys Lys Tyr Ser Ala Ile Pro Thr Pro Cys Tyr Ala Leu Glu Ser 1 5 10 Glu Arg Leu Glu Lys Asn Ala Lys Ile Leu Glu Ile Val Arg Gin Gin 25 Ser Gly Ala Lys Val Leu Leu Ala Leu Lys Gly Tyr Ala Phe Trp Arg 40 Glu Phe Gly Ile Leu Arg Gin Lys Leu Asn Gly Cys Cys Ala Ser Gly 55 Leu Tyr Glu Ala Lys Leu Ala Phe Glu Glu Phe Gly Gly Arg Glu Ser 70 75 His Lys Glu Ile Cys Val Tyr Ser Pro Ala Phe Lys Glu Ala Glu Met 90 Ser Ala Ile Leu Pro Leu Ala Thr Ser Ile Ile Phe Asn Ser Phe His 100 105 110 Gin Tyr Ala Thr Tyr Lys Asp Arg Ile Leu Asp Lys Asn Lys Gin Leu 115 120 125 Glu Asn Leu Gly Leu Ser Pro Ile Lys Met Gly Leu Arg Ile Asn Pro 130 135 140 Leu Tyr Ser Glu Val Thr Pro 
Ala Ile Tyr Asn Pro Cys Ser Lys Met 145 150 155 160 Ser Arg Leu Gly Ile Thr Pro Ser Gly Phe Glu Lys Gly Val Lys Glu 165 170 175 His Gly Leu Glu Gly Val Ser Gly Leu His Phe His Thr His Cys Glu 180 185 190 Gin Asn Ala Asp Ala Leu Cys Arg Thr Leu Glu His Val Glu Arg His 195 200 205 Phe Lys Pro Tyr Leu Glu Asn Met Ala Trp Val Asn Phe Gly Gly Gly 210 215 220 His His Ile Thr Arg Ser Asp Tyr Asp Val Asn Leu Leu Ile Gin Thr 225 230 235 240 Ile Lys Asp Phe Lys Glu Arg Tyr His Asn Thr Glu Val Ile Leu Glu 245 250 255 Pro Gly Glu Ala Ile Gly Trp Gin Cys Gly Phe Leu Ile Ala Ser Val 260 265 270 Ile Asp Ile Val Gin Asn Asp Gin Glu Ile Ala Ile Leu Asp Ala Ser 275 280 285 Phe Ser Ala His Met Pro Asp Cys Leu Glu Met Pro Tyr Arg Pro Ser WO 97/3 7044 PCTIUS97/05223 290 Ilie Leu 305 Lys Gly Cys Leu Lys Arg Val. Lys 370 Ile Asp 385 Tyr Lys Lys Clii Ala Gly 355 Asn Ser Asn Val1 Asn Gly 340 Asp Asn Gin Arg Ser Gin 325 Asp Lys Ser Gly Asn Val1 310 Gly Phe Ile Phe Phe 390 295 Giu Ala Met Val1 Asn 375 Lys Asn Phe Gly Phe 360 Gly Ile Asp Ser Ser 345 Gin Val1 Leu Glu Tyr 330 Phe Asp Pro Lys Glu 315 Phe Ser Met Leu Ser 395 300 Ile Leu Phe Leu Pro 380 Phe Ile Gly Glu His 365 Ser Ser Giu Giy Thr 350 Tyr Leu Tyr Val1 Pro 335 Pro Thr Ala Giu Giu 320 Thr Leu Ile Lys Asp 400 405 INFORMATION FOR SEQ ID NO:982: SEQUENCE CHARACTERISTICS: LENGTH: 321 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc -feature LOCATION .321 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:982: Met Ala Asp Ser Leu Ala Gly Ile Asp Gin Val Thr Ser Leu His Lys 1 Asn Leu Gly Leu Trp Arg Phe Leu Sen 145 Asn Tyr Asn Ile Phe Ile Sen Ser 130 Ala Giu Ala Leu Ile Tyr Giu Arg 115 Lys Gly Leu Val1 Thn Ile Tyr Lys 100 Tnp Lys Asn Gin Asn Ile Arg Asp Giu Thr Trp Asn Leu Val Ile Glu 70 Ser Lys Ile Thn Lys
ISO
Leu Phe Ser 55 Leu Gin Gly Gly Giu 135 Leu Cys Lys 40 His Thr Asn Giu Val1 120 Met Val1 Phe 25 Ile Giu Ile Lys Asp 105 Ang Glu Sen 10 Arg Arg Asn Pro Se r 90 Asp Ile Gin Arg Leu Glu Asn Leu 75 Lys Ile Tyr Sen Thr Gly Val1 Ser Ile Asp Val Glu Ala 140 Arg Lys Val Leu Asp Leu Met Ala 125 Gly Asn Lys Val1 Met Arg Ile 110 Asp Leu Lys Tyr Giu Lys Pro Cys Arg Gly Asp His Gly Lys Tyr Giu Ile Gly Gly 160 Phe 1y55e s Ang Leu Val Gin Vai Val Asp Ile Giu Lys Met Leu Ile Asp Val WO 97/37044 PCTIUS97/0j5223 Pro Trp Ile Ile His Ser 195 Leu Lys Thr 165 Asp Giu Giu 180 Asn 170 Lys His Asn Asp Leu Glu Thr Gin Cys Val Leu 200 Leu Leu Ala Asp Asp 175 Leu Ser Lys 190 Pro Ser Val Lys His Ile Met Gin Met 210 Asp Phe Ile 215 Thr Asp Lys Leu Gly 220 Leu Ile Asn Gly 225 Thr Lys 230 Ile Leu Leu Giu His Phe Asn Pro Thr Asp Val Ser Asn 245 Phe Gly Leu Ile Ile 250 Gin Asp Leu Giu 240Pr MetPr Giu Ala Ser Thr Ser Lys 275 Asn Glu Asp Giu Val Ile Lys 265 Asn Vai Lys Asn Pro Ile Val Val1 280 Leu Ser Ser met Ser 285 Phe Asn Pro Leu 270 Giy Ser Ser Ile Ser Lys Met Ala Arg 290 Ser Asn Ser 295 Gin Lys Ala Asp Asp 300 Gin Pro Lys Asp 305 Ala Ile 310 Arg Val Val Lys 315 Phe Leu Glu Leu 320 INFORMATION FOR SEQ ID NO:983: SEQUENCE
CHARACTERISTICS: LENGTH: 735 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...735 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:983:
ATGCAATTTC
TCTTTTAGTA
AGTCATGCCG
AACGCTCAAA
AACACCTTTA
ATGCAAGCCG
CTTAGCGGTG
CCCATTACTA
CAATCTCAAjA
TTGAACGCGC
TTGAGCGTAG
TTATTCTATG
AAAAAACCTT
TCGCTGAAGA
TTGAGCACAA
ACCAAATCTA
ACTACACCAA
AACAATACTA
GCGTTGCGTC
ACCCTTTAGA
ACAGCATGCT
TTGATCCCAG
GGTATAAGCA
ACTATGGTTA
ATCTTCCTTA
AAATGGGGCG
TAATCCCTTT
TAAACTCAAT
CAACGCTTTG
CCTCCAATCC
TAACCCTAAA
ATTGGTAGAA
TTCTTCTTTG
CTCTTATTCT
TTTCTTTACC
CACTAATTTT
TCTTTATTTT
TATGCGAGCG
TTAAATCAAG
CAAATTGAAA
AAAAACAATG
ACCCTTCAAA
CTAGTCCAAG
AACTTAAAAA
TCTTCTCAGA
AAAAACGTTT
AAGAAAAAAA
GGTTTTGTGG
TATCTTTATC
TTTATTCTTA
TGGGTTTTGA
ATATTCCATT
AACGCATCCA
AACGATTTCT
ATGAAATCAC
AAACATGCAA
CTAAATTAAC
CCCCACTGA
ACATTGAAAA
AATAGTCATG
CGTTAGAAAA
AATGCAAGAA
ATTTAGAATT
ACAATTCAGC
TCGCTCAAAT
TTCAAATTCT
CAAGCATGTA
TGGGGTAGGT
ATCAAGGGTT
TCGTTATTAC
GTAATGGCTT TGATGGTTTA 120 180 240 300 360 420 480 540 600 660 720 GGCAAAATGA
ATAAC
INFORMATION FOR SEQ ID NO:984: SEQUENCE CHARACTERISTICS: LENGTH: 1899 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1899 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:984:
ATGAAACcA~p
CTTGGAGGGP
GACAATTTC'I
AGCAATAATC
GAGGGCAATA
TTGTTAGACG
ATGTTAGGGT
CGCATGCAAA
AACGCTGAAA
GAAGTAGTAG
AAAATCCCTA
AAGGCGGTGG
GAAATGTTTG
CAAGCCCCTA
GGAGGCATGA
ATGGATGGTT
GAAATCTTGG
AAGCCTGATT
GCTAATGATG
TTGGCGAATA
AAACAACAGC
AGGCGTATCA
ATTTCTGAAA
ATGGCGGCTT
CACGAACTCA
TTAGAAGAAA
GGCATGGTGA
CGGAACGCCT
GAAGAAATGG
ACCTTAAGCG
GTCATTACAG
GAAAGCCGTT
LCGAACGAACC
LTTTTACTCA'l
TAGCTTCTAC
AAGTGGAAAA
ATCGTGTGAT
AGAAAAAAAT
GGCTCATGCC
AAAATATGGG
AACCCAATGT
AAATCGTGGA
AAGGTGTGCT
CTGGCGAAGC
TGGGCTTAGG
GCATCATTTT
TAAGTGGGAA
TTGGGAGCGA
ATCCGGCTTT
TTAATGGCAG
TGAATTTGCA
TCATCAATGA
ATTTAAAAGA
GTCCTAAAGA
TGACTAAAGG
TAGGCTACAC
TCGCTGAAAT
TTTCTACCGG
GTTACTACGG
TTTTAGGAGG
ATCTTTTCAT
ACTATAGAGA
GCGAAAGGGT
TGATCCCTTT
TAAAAAACCT
TTTTTTCCTA
CACTAAAAAT
TGTGAGTATC
TTATATCGCT
CAATTATTCT
TATTTTAGTG
TGGGGGTATT
GCGTTTTZXAT
TTTCTTAAJA
GTTAGTAGGG
GCATGTGCCG
GGCAAGCAGG
TATTGATGAA
TGATGAGAGA
AAACGCACCT
AATGCGTCCA
GGTGGAAATC
AGAAGTCGCC
AGCCGCGCTT
AGCGGTTGAA
AAAGAAAATC
GAGCACTAGG
CCTTAACACG
TGATGTGCTT
TGCGAGCAAC
CATGAGCAGT
CGGTTATGGG
TAAAAACCTA
GGCGATTGAA
GCGTGAAATC
AGAAGAGCALA(
TTTTTTCAAA
CGCTCTTTCA
GTGAGCTACC
GGTCAAACTT
AAACGAGTGC
GGTTTTAGCG
ATTTTAGGGC
TTTGGCATGG
GACATGGCGG
TACCCCGAAC
CCTCCAGGKA
TTTTTCTCTA
GTTAGGGATT
ATTGATGCCA
GAGCAAACCC
GTCATTGTTT
GGGCGCTTTG
TTAAAAGTGC
AAACTCACCG
TTAGCGGGAC
AGAGGGATTG
GTCGCCTACC
GTGAATAAAG
CCTGAAGAAA
TTAGGCGGGA
GATTTAGAAA
GTCAGCGGGC
AGCAGTAGGG
:TAGAAGAGC
kTCATGGTCA %TCAGCGAAT2 3CGAGTTAA
GTCCCATTGT
ATTCTGATGG
ATGAAATCAA
TGATCAAAGC
CTGATCTAAC
AATCTAACTT
TATGGATGTT
GGAGCCCGAA
GCAATGAAGA
GATACGCCA
CCGGTAAAAC
TGGGAGGGAG
TGTTTGAAAC
TAGGCAAGAG
TAAACCAGCT
TAGCCGCAAC
ACAGGCAGGT
ATATTAAAGG
CAGGGCTTGC
GAAACAACCA
CTGGGCTAGA
A~TGAAGCGG
1CTCTATCAT
ACAAATACTT
GGGCGGCTGA
GAGCGACTGA
TTATGGTGTT
ATTTAGCGA
3CTATCAGCA
"AGAATTGTT
kCGAAGCCGC
TCTTGCGGTT
CAGTTTTTCG
ACAGCTCATC
CAGCCATAAA
CTTAGTGCCT
TTTTACCGAC
TATGGCAAAC
AAAACTCATT
AGCCAAGA
TTTAGGGGCT
CCTTTTAGCA
CAGTTTCATT
CGCTAAAAAA
CAGAGCGGCT
CTTAGCCGAA
GAACCGCCCT
TTTAGTGGAT
CGTGAAACTC
AGGGGCGGAT
AAAAGAAGTC
AAAGAAAAGC
GCATGCCGTG
TCCAAGGGGC
GATGCAAAAA
AGAAGTCTTT
TATTATTAAA
AGAAAAGCAA
AAAAAC COCA 1'GTCAAACAA~
TGACAAAGAA
CAACAATTTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1899 INFORMATION FOR SEQ ID NO:985: SEQUENCE CHARACTERISTICS: LENGTH: 3867 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...3867 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:985:
ATGGAAATAC
GGAGCGTTGA
ATTCCAGCCA
CTTAGTTGGG
GTTTGGCGCA
TACAAATCCC
CATTATTGGG
GGGACTTATA
CAAAAAGCCA
AGCGCTGATC
GTAGAAATCA
ACTTTGCAAG
GGCGCCACGC
CGTTTGCAAT
GTTCAAGGGG
GGCATTATCG
TTAAATATCA
CAAAGTGGCA
GTCATTAACC
GGGCCTTTTG
GATGGCACGA
AATATCGGCA
GAAAATCTAA
GGCTATGCTT
AACGGCACAG
GATGCTCATA
GATTTTAGTG
GTGGCCGTTA
GGGGAATACA
TTGGAAACTG
CTAGTTATCA
AATGTTGAAA
AAGCTCATGT
AACAAACACA
TTAGCGCCAT
TTGTTGGGGG
GACTCAAACA
TTCAAGCAGG
TTTTATCCAG
TCAAAGGCGG
AACTATCAGG
CTTTGCGTTT
GCACCACGAG
ATAATCGTGT
CTTCAGAAGG
TCAATTTGGC
ACGTGGGAGC
AAGTGGATTT
CTAGCAATAA
TTGCCCCTCC
CTAAAAACGA
CACCCAATAA
CTGGCGGCAA
TTAAAGTGGG
AAGGCGGTGT
CCGGGAATAT
TGGCAGGATC
CCACTTTCAA
CAGCTAATTT
OTGTTACAGA
AAAACTTCAA
CTCATTTTAG
GCACTAGGTC
ATGATTTTTA
TCACCAGAAA
TTAATAATCT
CCGCAAAATC
ACCGCAAGAG
TATCGCCACA
AGCCGAAGAA
AAAAGGCTTT
TAAGATTGAT
GCAATGGAAC
GCTTAGAAAC
GGGCCAATTC
AGTGAATTTC
GGGTTCTGGA
GATCACTAGC
TTCAAACAGC
GTATTTAGCC
TAACCATCTC
GACTCATATT
AGAAGGTGGC
CAAGAAAGAG
CACGCAAAAA
AGACACGGTT
AGGGTTTAAA
CAATCTGTCC
CACCOTTOAT
AAGCGCGAAT
TAACGATATT
TAAAGGTATT
CAAAGTCAAT
CATTAATGAA
CGAAGATATA
AATCTTTTCT
CTATAGCCCT
ATTCGCTTCT
AACCTTGGGT
AATCGCCCTT
AGTCATGCCG
GGCACTGCTG
GCGAATAAAA
AATGAATTTC
GGAGGTTGGG
AAGCTTGAAG
TTTACTGGTG
AATGGCAATT
AACGCTAAAA
GCCGGGAGAA
AGTAAAAATG
GTTAAATTAA
CCTTCATACA
ACTGTGGGGG
GGCACACTOG
TACAAGGATA
ATCAGTCAAA
ACAGAAACTG
GTCAATATTT
GCTTCTCTTA
AATCAAGCGA
GGGCCTTTAA
TTTGAGTTTA
AGTTTGGGAA
GATACGTA
ATCAACAAGC
TTGATTGTTA
GGCAGTCAAT
GGGGGTGTCA
TGGAATTATT
TCAACCCCAG
CAAAATGCGG
TAGTTTCTCT
CCTTTTTCAC
TAGGAACGGT
CCCCAGATAA
CTAACAAGGA
ACTGGGGGAA
TGGATATGAA
GGGATTTAGA
CTTTCACAAG
ATATTTCAAT
AAGCCAGCTC
CGGAAATTTC
ATGGTAATGT
GCACGATCAA
ATCAAAACGC
ATTTGTGGCA
AACCTAATAG
ATAACAATAG
AACCCACGCA
TCCACTTAAA
CCACGAATGC
GCGGGCGCAC
GAGTGAATAA
AGGCTOOTGT
GATTTGTGAA
ATGGTGGTTT
TCATCACAGC
AAACCAATGG
CGCGTATCAA
AATTTAAAAG
TTGACGCTAG
AAAACCCTTG
TCATGGACTA
CGTTTTAGCA
GACCGTGATC
CTCAGGGCTT
ACCCGATAAA
ATACGACTTA
CGCCGCTAGG
AGACGCTGTA
CGTGAATATG
CTATAAGGAT
TGATAATTTT
TACGGTTTTG
TCTTTATGAT
GTGGATGGGC
CACTTCAAAA
CGCTCAAGCG
AAGCGCCGGG
TACCACTTCT
CAACACAGAG
AGTCATTGAT
CACTAAAGCC
GGCTCATTTG
CCTTTTAGTG
TCAAGTGGGT
GGATACTAAA
TTTAAAGGTG
CAACACCTTA
TTCCACTAAT
GATAAGTGTG
TACCGTGCGT
CGGTGAAAAA
GAATGTTAAA
GGGCACATCA
TAGTCAATTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 TCAAATTTAA CCATTCAGGG GGATTTTATC
AACAATCAAG
CGAGGCGGGA
GATATAGACA
CTCATTAAA
TCTACAGGTA
GCCCTTTATA
AAAGCATGCG
TATCTTATCG
ATTTCGGTGT
TTACCCACAA
CCTTTCGCTC
ATTGAGAGCG
TCAGGCGCGC
TATGCCAGAC
GCCACTGACG
TTGAGCTTGA
ACCAACCATA
TCTTTAGAGA
AATGTTTGGG
TATGGCACA
GGTTTTGGAA
GCCAATAACG
TTTGAAGCTC
TTACAAGATT
TATGGTTATG
AGCTATAACC
AAAAATGGCG
TATTATGGGG
TTTGGATCGA
TTAAGCACCT
TTGAATTTGG
AATTTAGGAA
ALAGTGGCAAxC
GCGCGACCGG
ATACAGAGCA
CCAATGGCAT
ACAATAATAA
GTATGGCTAT
GTAAGGCATG
ATTATTTAGG
ACACCACTAA
ACAGCGCCAC
TGTTTGAATT
AAGGCAGGGA
AALATGATTGA
CTTTAAACAA
GTAATGCGAT
TTAACTCGTT
GCGCGGCAGA
CTAACGCTAT
GCGCCGGCGT
GCTATGGTTA
CTAATTTTGG
AAGGGGCGCT
TGAATCAAAG
ACTTCGCGTT
ATTTAGGTTC
CGAGCAGTCA
ACACTTCATA
ATGATGTGGC
ATGCAAGAGC
GCGTGGTTTA
TGAGGTATAG
CTTAAATGTA
ATTTTACAAA
TGTTTTATTG
TAGTAATGTT
CCGCATGGAT
CGGCAATCAA
GAGAAATATA
CAATTCTACG
TAATGCGCAT
TCCTAATTTA
GGCTAACCGC
TCTCTTGCAA
TAACACAAGC
CGTAGCCAGT
GATTTTAAAT
CGCTCAACGC
AGTGTTGTAT
TGGGGGAGCG
AGACGCTTTC
TAGCTCCTTT
CGTGTATAGC
AGGGAGCGAT
CTATAATTAC
TTTTAGGAAC
AACCAACTTT
GCATTTATTC
CTTTTATTTG
GTCTTTAAAC
GATGATGGGT
TTTGCACAAT
TTTCTAA
GGCAATGCAG
CCGCTCATCA
AAAGCGAAAA
AATCTAGAAG
ACTTGTGTGG
AGCATGGTGA
GGCATCAGTA
CCTACTGAGA
TCTGCTAACT
GTCGCTATCA
TCTAAAGATA
ACTTTATTGA
ACCGGTGAAA
TTAGAGCATA
TCTCGTTTAG
TTACAAGCTT
CAATTTGCCC
AGCTTGAATA
CTTAACGGGA
AGCAATCAAG
CGTTTTTTTG
CAATCAAGCT
TTAGCCTATA
GCTTTAGTGT
AAAAGCAATA
AACGCTAACG
CATGCGGGAG
ACCTTTAAAA
GGGGAATTGC
TTGATTTCCA
GCACTATCAA
CAGCTATGAT
AGATTAACAG
TCATTGGTTA
AGCAATTCAA
TGCGAAATAC
ACAACCCTGA
AAACGGCTAA
ATGGTGGCA\
ACGCTCTCGT
ATCAGCATGA
TTGACACGCT
TTGATAGCCA
TCACCAAGCA
AACAAAGCGG
TCAATCTCTC
TAAAAGGCCA
CTAAATATGA
GCGGCTCTA\
ATGTGGAAGC
CGAACTCTCT
CCAACCAGCA
TGAATTTCAA
GCGCCACAGC
TAAAACCAAG
GCCAATCACA
CTAACGTGGA
TTTTACAAGA
TCAATGCCGC
AATTGGCTAA
ACGCAAGCCA
CTATCTGGTC
GTTTAATAAT
CGCTCAAGAT
TGGTAATGTT
AGAGCGCCTA
TGATGACATT
CAATTACAAG
CGGCTCTAAA
TACCACCAC
GAAGAACGCT
TTTTGGCACT
CTATACTCAT
TGATGCGGGT
ATTGAATGCG
CTTACAAACC
TAGGAAGCAC
AGAATTCGCT
AAAACCTACC
CGCTTCATTG
CATTGTGGGC
TALACTCTGGG
TGAATTTGAC
AAGCACTCTA
AAGAGCGAGT
CGTGGGCGTG
AGTGGCTTTA
AGCGCGTTAT
GTTCGCTCAC
rCGCAGTCCT
AGAAGTGTTT
TTCGCTTCC
2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3660 3720 3780 3840 3867 INFORMATION FOR SEQ ID NO:986: SEQUENCE CHARACTERISTICS: LENGTH: 4455 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...4455 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:986:
GGGGCGGTCT CTTTACGAGG GCAGTTTAAT TTAAGCAATA ATTCTTCTTT AGATTTTCAA
GGCTCTAGCC
AGCCCCATC;
CTCTTAAACC
GATCAAGcGG
AATCGTGTGI
TTCTTTGGC.P
ACTAACCCTC
GTTACGCTC'I
TTTAACTACC
CAAGGCGTGT
CAAGCCAGCG
ATTTCGCAAA
GGCTATAATT
ATCAAAAAGA
GCGCTAAACC
ATTGATAACG
GCGACTAAAA
CAAAAGCCTT
CAGCTTTTGG
GAAATTTATC
AGCGTAACTT
CAAACCGATG
GCCGGGCTCG
TTAGGGAATT
CCTGAGCAAA
ATGAACGATA
TGGACCGGGC
AAGCTTATAG
ATCACCGGCT
ATTGTCAAGC
GGCGAATTTT
ATTAAAAGCG
GGTTTGGGGG
AGTCAAGTGT
AATTCCACTT
ATTTTTGCCG
TCTAATCAAG
AACGCCACTA
GTGCAACAAA
AACGCTTCTT
AGCTTGGTGG
GTTGATTTTT
GCTCAATTCC
AGCTTTGTAA
ATAGAGGTGC
GGGTTAGAAG
TCTCAAACGC
CAAACGCCTT
AGCCGTTATA
ACCTTAATCA
GGCCAACGGG
AACAACGCCT
TCTTATGGTA
ATTCAAGGAC
TGGCTTTCAA
GAAAACCACT
TTGATTTCTA
CTATCACCT(
CTTTCCATCZ
CTAACA.ACAC
GCTTGAATAq
ATAACATCAI
TGCGCATTA.
TCAATAACGC
CTCAAATCCC
AAAAGGTTTP
TTTATCTCAC
GCAGCAATAA
CCTATAACGC
TCAGTAATAT
TATTAGGGAA
AGCTTACCAA
CAAACAATTC
TAGGGCAAAC
GCGATTACAC
AGTCCATCTC
TTACCGGCAC
TTAACAGCCA
GGATTTTTAG
CTAATATTTT
TGATAGTAAA
AAAATCAAAC
GCGGTTTGAA
TAGTGGGGGG
GCAGCATGTC
TTATTTCCGC
CGAGCGACGC
TAGGCCA.AGA
TTTTAGACAA
ATTTGATCCC
GGCAAAAAGG
TCTCTAACGC
GAAACAACCA
TTTCTAACAT
ATAACAATGT
GCGATCCAAC
CTAGTAATGC
TTACGGCGAA
CAAAAATTAA
AAGCCAACAA
CGAATAACTT
TTCAAAATTT
TAGGGGGGGC
CTGTAAATCC
TTATGAATGT
TTGATTACAA
ATATCAACGG
TTTTATTACA
CTCAAAACAA
ATAAAGTGAT
AAAGCGCACT
CATTGATGAT
CTTTAAATGA
AC CCTAATTT
TAACACGGCC
SAGCCCTTGAC
3TAGCGTGCTC 7CGCTAACATI
TCAAGCGGAC
TGATGGGAT'I
CCTAAAAATC
GGGCATTAAA
TAACAACC'
GAGCAGCGTG
CACCACGAAA,4
GCAAGGCAAC
CAAAGCGTTA
TGATTTTTCG
GCTCATCACG
GGTCGTGCAA
AGACACCAAT
TGATATTGTG
GGCTGATTTG
TTTAGGGAGC
AACTTCGCTC
CATGCTGGGT
GGGCGAAGTG
TACGCTAGGG
CCTAAGCCAG
CACGGCGATT
ATTAGCCGGA
CATCAATGAT
TAACGATATA
TTTAAAAAAC
CACGCTCAAT
AGTCTTAGCG
TAATCTTGGT
GGATTTTAGT
TAATGGAGGC
CATCGCTTTC
TAACGTCACC
TTCCGTGTCT
GACAGCTACC
GTCAAACAAC
TGGTTTCAAT
AGGCTCTGCA
CCTCACGATT
AAACATTCAA
AGTGATCGCT
ATTGAATAAT
GCTCATTCAA
CAGCGTGGCT
TATCAACCCT
AAACCACATA
AGATAAGGGG
CATTTTAAGC
GGACTTTACC
CAATCAAATT
GGAGACTAAA
AATCTTAGGC
TAGAAATAAC
3TTTAATTTCI
ATTAAAGTGC
AATTTAAkAAP
GATTTACTAAP
7ATGAATGGTP
TATGACGCTP
ACCGAGAGCT
AACACGCTCT
AATGGCGTGT
AAAGGCTATT
AATAACAATC
CCTATCAGCG
*GGGCAAATGG
*CTTTCAAGTT
CCTAGCGATT
AATTTCAATA
AGTGCGGTGG
TGCCAAAAAT
GGCTATATTG
GGGAACGCAT
ATTCTCAACC
CAAGAGGGCA
GCAATGCAAT
AGTGATAGCG
CTTTTGGGGC
AAGGATTTGA
CTGGGGGGCA
TTATTGAGTA
GGGCAAGTCA
GATGTAGCCG
TCTTTAGAAA
GCTAAAGGAT
AAAAAGGGGA
TTCAACGCGC
ACGCTCAGTT
ACTAACCATT
ATGCTTAACG
CAAGGCAATC
GCCACAAACC
GCGCCAATCG
TTTTCAGGCA
AACGTTAAAA
TCCAACCAAG
GGAGCGTTTA
TCAAACGCTT
TTGGGAGCGA
GTAGGGGGGA
AATGGCGCAA
AACAGCTTGC
GAGGAAAAAA
TTATTATTGA
CTTTCTGTCC
CCTCCCACCT
GAAGCTGTTG
GAAAACCCGC
GTAACAAAAG
GCTACCAGCC
ATGATAACGC
CCTTGAGTTq ACAGCCAGCTf
GCGATCTGA.A
ATTGGTATGA
AAAACCAAAC
TTAAAATAA
ATAACATTGG
ATTCTTATAG
ACAACCCCAIA
TAACCTCTGA
CGTTACACGT
CGCTCAAACT
TGAGCAATTT
GGAAAAACAT
ACGGCACTTT
TTTTTGGGGG
TTAGAGGCAC
ACACGACTTT
GGGGGACTGG
AAGCGAATAT
TCAATAAGGT
CCATTAACAA
TGATTGGGGG
AGAATAATTT
TCAGACAAAA
TTGATTTGCA
AAAAGGGGTT
TAAGCGTGAT
CTTTGGGCAA
GCTTGCTGCA
TAGGGTCTAT
TTTTCGCTCC
AAGGCAATGT
TTAACGCAGG
CTGGAACGCT
CTAGCAACGG
TGTTTATCAA
CTTGCACCAC
CCTTAAATAA
ATATTTACGC
ACCTGTATCT
CGGTATTAGA
ACAACAACGC
CTTTAAGCAC
TCCATTTTAA
TCATTAATCT
CTTACACTTT
AATCGTATTT
ACGGCGTATT
GTGTAGCACT
TTCACAACCA
TACAGGATTA
GGGGGA.ATAA
TTTTTGCGCC
ATCTTCAA.
TTTTAGAAAT
*TTTTTCTCAA
GGGAGGCAAC
TGTTTTTAGC
TGGTAATA
*GCGTATCAAC
TTATAGTTTC
*CCAACTGAGC
CTCTGAAJATC
CGATGACGCA
CCAATCCTAT
ATCTTCTGTC
CTATAACAAG
CTACCCTGAA~
AAAAGGCGAT
TAACGACTTG
GATTATAGGA
CTTGGGCTAT
TTATTTGGGG
TAACGCTAAA
GGGGAGTGCG
CGTAAGCTCG
TTTCAATCAA
AGCCGGGGGA
GTATTTAACG
TGATAACCTC
ATTAGGCTTT
AAACCCTGAA
GTTCAATCAG
GCTGCAAGAT
GCAAATGATT
AAACCAGCAG
TTATGAACAA
CTATGGCTTG
TTTTGTGCAA
AAATTCGCTC
CAATTTGTTG
CCTTAAGATT
CGCTAGCTGC
CGCTCAAAAT
TAACGATGAA
TAACGGGGTG
TTACAATAAC
GAAAAACGCT
CACGCAAAAA
CGGGATTTAT
TTTAGAAAAT
CAACACCACC
ATTAAAAAGC
GAAGCTCTAT
GACTTATTTG
ACCTAACTCA
GATTAAAATG
CATTGTGGGC
CGCTATCAAG
GATTTATTTA
CACCGCAAGC
GGCGAGTTAC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480
ACCCAACAAA
AATTTTTCAG
GAGGTTTTTG
OTGGGAGGAG
TATGACCGAT
TTTAACGGGA
GCTTTTTTGA
AGTCATATCA
ACCTGGACAA
GTGGTGCTAA
GCTAAAATGC
TCGGTTTTAA
TATTTTGTAA
GTGCGTTTTG
GCGAGCGTGA
GTGGGGCTTA
CGAGTGGCGT
CCAGCCGTTT
AGCGCTTGTT
TCAAATACTC
CGAGCTTTAT
TGGTTAAAAG
ACATCATGCA
AAAGAAACGA
ATTCTTCTAA
CGAGCGTGAA
AACCTCAAGT
AAAATCCAGC
CGCTCAACAT
CGGCGAGGTT
TGGGTGAAAA
TCACAGGAGG
AAATGGGCTT
TTTAG
GACAAAACTC
AGAGCTTAAA
TCAACTCAC
TTCTGGGGGC
CGTGATCCTT
TTCTTTGGCT
ATTCACTTTG
TTCCTTGCTC
TGGGAATTAC
GGGCTTOAGC
TTACCAACAA
GGGGTTAGAG
GGGTAGGGAT
CACTTTATTG
CGAAATGCAT
GCAATACCAA
TCTGATTTTA
AACAAGCGTT
?AACACCCCA
AATGGCACGC
GGGGGTTATG
AATAATGTGG
AGCGCGAATG
TCTGTGTTGA
GGCTATGATT
TATCATTTCA
TTCGTCATGC
AGCCGTAAAT
CTTTTGATCA
TACCGCAAGG
TTGTGGCGTT
GATCTTAATA
GOOCTAGAGA
TTAGCGATCC
ATAACCTTTG
TTTATGGCTT
TGGCTTATGG
ATGTGGGGAT
AAACTTATGG
ACCAACGCTA
TCATGTTCAA
TAGGCTTGAG
ATTCAAACCC
ATTTTGGTAA
AAGCTA.AAGG
GGGAAATTTT
TGATGTATGT
TCACTGGGAA
GGGAGAGTCC
TAACCCTAGT
GATTCAAGGG
GAATGTGGGC
CTATAGCGGT
GTATGCGAGG
AGGCAATGCG
CAACTACAAC
ACAAAAAAGC
CGGGATGAAA
TTCTAACGAA
AAATTCCTAT
CGACAATGTG
TAACACTTTT
GAATGCGGGG
TGTGGGCATG
3540 3600 3660 3720 3780 3840 3900 3960 4020 4080 4140 4200 4260 4320 4380 4440 4455 INFORMATION FOR SEQ ID NO:987: SEQUENCE CHARACTERISTICS: LENGTH: 903 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...903 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:987:
ATGAGAAAAA
GCGAGTAACG
ATGAAAGGCG
ATCCCCCCGT
CCAAAAGGCT
GTGGTATTGA
TTGGTAGCGC
TTGAAAGGTT
GCGCGCTTGG
GTTGTTGAAA
CCGATTTTGG
CAAGCAGGGA
CAATACGAAG
TTGGTCTATT
CAAAAACATC
TAA
CGATTTCAGC
CTTTGATTTT
TCGCTTTCAG
ATAACATTTG
CGGTATTTGT
AAACTAAAAA
AAACTTTGGG
CTGAAAAATC
CTTCTGGGGC
TTCCTTACCA
ATATTCAATA
TCAAACGAAA
GGAAAATGCC
TAAACAGCTT
AAATTAAATC
GTTGTTTTTA
ACAAACAGAT
CGTTGATTCT
GGAAGGCGCT
GAGCGTAGTG
CGGCCAGTAT
GATTGATAGC
CTATACTTTC
GATCACATTC
AAAAGCGAAA
TGGCAATGTT
CGATACACTG
GTATGTCC
GTTGAATGTT
TGGTGCTGAC
TCAGCGTGTA
TTTAGCCTAA
AATCTTAAAA
TACCGCTTGT
GATCCGGGCG
TTCGTCTCTC
GTGCGTGAAA
CATGGCCGTG
GAGCAGGTCG
GCCACAAAAG
TGGAGCAATA
TGTGTAACTA
AGCTTTGGCG
TCCGTGGCGT
TGGAATATTG
TAGGGTTATC
AAGATGGGGC
TCTTTGATTT
ATCAGACCGC
TAGGCACTAA
CAGATAACGG
TTGATGAAAA
ATGTGTATGC
GGCCAGAGCT
GGGGAGTGAA
TCAGCGATAA
TTTTTAAAAA
ATGTGCCAGA
TGAATATGGA
ATATAAAGAA
GTCTGTTCAT
CGTCTCGGCG
AACGCACGAA
TAGTTATTGG
TCGTAAATCG
CACGCTGACT
AGCCAACCGC
TTACACTGGG
TCCTATAAA
AGGCAATATC
ACTGCTCAAT
TTCCAAGAAA
AGGCCAGCCG
TAATTTCGCG
GTGCGCTAAA
INFORMATION FOR SEQ ID NO:988: SEQUENCE CHARACTERISTICS: LENGTH: 1581 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1581 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:988:
ATGAAACTTT
TTTTCTGTGC
AGGGGGGGGT
TTAAGCCTAG
ATTAAATCCA
TTAGACGCGC
GAGTTTTATA
TTGCAAGTGA
GTCATTCAGC
GAAGAGCGGC
GATGAAGAAC
AGCGTGTTGT
TTAGATGGCG
GTGAGCTTCA
GTGGGCAAAC
GAACGCATTG
GATTTAGCGA
AGGATTGTAG
GGAGGCTTTA
GCTTGCATGG
GCGACGCTGA
GCTAATATCA
AAGGCGATCC
TCCTTGATCG
CTCACCACAG
ATTTATCAAG
GTGAAAAATA
TTAACCCTCG
CTTCTTTGCT
TGAACATGCT
CGTCTGCTTT
GTTTAGAGGG
TTTTATTGGA
GCGTGAAACT
TAGGGATCAT
AAGGCCGAGA
GCGCTAAAGA
ACAATAAAGA
TGTCTGATGC
AAATGCTTAC
CGCTAGATGC
GCATGGCGAT
GTGGGGGGAG
TCGCTTTAAG
GTCCGAGTTT
TTTTAGTGAT
CGTTAGTGGT
CTTTACCGGG
TTATCAACGA
ATTTAGGCTA
CTTCGGTGTT
GCATTGGGAT
CCCTTTTACC
AAAGAGCTTA
TTTAATCGTT
AGAAACTAAA
TTTAGGGGTG
AGAATACAAC
GATCAGTTTT
ATTGCAAGGT
CACCCCTTTA
TCGTAACCGC
AGAAATTTCG
CTTGATTTCT
TGCGATGAAT
GGAAATGGGG
AGATGCGAAA
GCAAGGGGCT
CGTTTTAGAC
CGGGCAGATT
GAGTGGGGCG
AGGGAAAGAC
GGGCTTTATG
CAATCTTTTT
AATGGCAGGG
ACGCATTAGA
TATCAATGCG
GCTATACGCT
TTTAGCCTCT
TAAATTAGCC
TTTATATTCG
GGCCCTAAAA
CAAACCGATG
GCTAAAAAGC
GAGCTTTTAG
CATAGCCAGT
GAGCAAGAGG
TTGGATCAAT
GTGCAATTGC
AAATCCGCTC
ATGACGGATT
GGTAAAATCT
GTGGTGTATG
AAGATTTTTG
AATAAGGTCT
AACGGGAATT
ATGAACGCGC
AGCATTAAAA
GCACTTTATT
TTGATCGTGG
ATTGTTTTAA
GAAGTTTTAA
AGCCGGGCGA
TATGGCACAG
ATCATCACCG
CAAACGAAAA
CGCTTCTTTT
TCACTTTAGG
AGGCTTTAAA
AAAATATCTT
ATGAAGATGA
TTGAAATCAA
AATTGCGTAA
TTGGTTTGGC
CCGGCATTAA
ATTTGCAGAT
TAGAGGCTCA
TACTCAAAGC
ATCAAAATAA
GGGATTTTTC
ATTCAGCCCC
TTAGCGTGGC
CCATTCAGGT
CTTCCATTAT
ACTCCATGGC
CGGTCATGGC
CCGTGGGGAT
GAGAAGGAGA
TTTTTGACTC
GAGCGATTAA
CTATTATAGG
GCCTTTACTT
AGGGGTAGGG
TTTGGATTTA
AAACAAGTAT
GCTTAAAGAC
GGCGAAAAAA
AAAAGAAGCG
AAACACGATC
AGAGCCTGTA
GACTTTAGAA
GATGGCAGTG
AAAATTAGGC
GATCCCTATT
CCAGCCGGTG
AGGCGCGAAT
GGTGATTAGG
TCAAGCGAGC
TTTAGAAAAA
CGCTCTAGTT
TGGGGTGATC
GATTTTTGGA
TGCCGTGGAT
GGGTGTTGTT
CAATATCACT
AGGCTTTGCC
CACGCAAGGG
TTGGTTTGGC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1581 INFORMATION FOR SEQ ID NO:989: SEQUENCE CHARACTERISTICS: LENGTH: 2382 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2382 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:989:
ATAATGAATG
TTTTGGACTT
ACAACCACCG
CGTAATTCCA
TTGAATATTG
ACGGGCGTGC
AACACCAACA
GAACTGGCGA
GGAACGAGCG
GAAATCCCCA
TCTAATGGGA
AACCAAATGC
ATTAGCGCTC
GTGCAAAACT
TATTACCAGT
GCCTATAACC
TTTGGGATCG
AAATTCACTT
AGCGTGTATA
GCAAAAAACC
TTTTTTGACA
GTCAATACTG
TTATACCGCC
GATGCAGGAA
ATCAATTTCA
TACGAAAAAA
CGTTATAACC
TTTTATTTCA
GTAGGCACAA
TATTTTAACC
TTTACCGGGC
CTGGAACTCT
GACGCTAATA
AAAGATATTT
AGTTACACTT
AGCGATGTGT
ATCGTTACCA
ACCACTTTGT
TTTAACATGA
GTTATTTGAG
TTAATTCTTT
AGCAAAAATT
CAAGCTCTCG
AAAACGCCTT
TGCCTAAAAT
TGATTTTAGT
TTTTCCCTGT
TCCAATACGG
AAGAGTGGGA
ATTTTGTCGA
TGTTTAACAC
AAGGCAATTG
ACTTGTTGGA
ATTATCAATA
GCTTCATCAA
TGTATCAAAA
ATTTCACGCA
TGAGCGGTCA
CTAATTGCGG
ATATCCGCCG
GTAAAGTCAA
GATCCACCAC
CTTCACTCAA
ATAACGGCAT
AAGACGCTCC
AATGGAATCC
ATTACCAAAG
GCACGGATTA
ATCAAGTGAG
GCTATGGGGA
ACTACACGCC
TCACAAGCCA
TTGGCAAAAA
ACGCTAAAAC
TAAACACCGT
AAACAGCGGG
GGGAACGCAA
AATATTGGTT
GGTAAAAACC
TATGAGCGCG
CAGTTCTAGT
CACGGTGATT
GCAAAATGTG
TTCGGTGCGC
CAATGGTATC
AACTTTCCAG
CCCTAACACT
AAATCAAGCG
TCCCAAAGAA
TTATGGGCGA
GATTAATGGG
TGCGATTTAT
CAACTCTTAC
TGAGCGCCCT
CTACTTTGGC
TGACATGAGT
AAATAAGATT
TCTGTATTCT
ATCCGTGGTG
ACAAACTTTT
CAGGAAAAAC
TAATTTCAAC
GCTAACGATC
TCCCTTTAAG
GGCAGTCAAT
AAGCTACATT
TTTTCAAATC
TTTTAACGCG
CAACAGAGAG
TATTAGGGGG
CACGATGGTT
GCTCCCTTTT
CACGATTGGG
GCCTTTTACA
CATGACGCCG
AAkATCAAAGC
TAGCGGGATA
CCGTATTTTT
AAAGATAAGC
GCCCCAATTT
TCCAACAAAG
CCAGGGATTC
GGTTTTGGTG
CCCATTTATG
TCAGTGGATA
TTTGGAGGCG
GCTGAAAGGA
AAAGGCAAGC
ACGGCTGGGA
CAAGGTTTCA
AAGATCAATG
CATCCAGGCA
GACAATCAAG
GATCCGGATA
AGGGATTTTG
TTACCCTTTA
TATAGCGACA
AATGCCTTTG
AACATGGGAA
CCTAGCATGC
AATTATACCG
ACGCCGGGCT
GTGGGTCAAA
GTCGGCTATA
CCGCCCCAAT
TTTAATGTCA
AACTATTTTG
CCGGTCAATG
CTTAATTTCC
ACTAACCCTG
GTCAGCCCGC
TTAAGCTCTT
GAATACGCGC
TACTATTGGG
GTGAATGCGA
GGCACTAGCC
TAGCGTCGGT
ACCATTTTTT
CATGGCAAAG
AACTCAAAAA
AAATCAGAGA
GGGGCGGTAA
GCGCGCCGTA
GGATTGATGT
TGGTGAATGT
TCACTTTTTG
CCTTAGCCCA
TGTTGGGTAA
GGCAAAATAG
CGACCAATAC
CTTTGAGCGC
ATGGAGGGCG
GGAAAGTGGG
GGTTTTCCAA
AAGGCAAGGG
CGAATAGCCC
AGCCAAAACT
TGCGCTTTTT
CTAATAATGG
CTGTGTATGC
TGAGATACAC
CCC CAAAAAAC
AACCCATTAA
TCAGCAATAT
TGGAAGGCGG
TGATTTTTGC
CGAGATCGCA
ATGCGGCTTA
CTAATCCTAA
ACCAATTCAT
TCTTTTATAG
CCACGATTAA
TGTGGAATTT
GCTTGCAAAT
CTAACGGGAA
CGTTTTGACT
AAAAAAAGTT
CGAAGAGGTG
AACGGGGAAT
CGCTACAGGC
CGGGCATAGC
TTCCAATATT
GATTAAGGGG
CATCACTAAA
GGGGCGCTCC
AACTTTAGGA
ATATATAGGC
CCCTACAAAG
TTTTAAAGCT
GCAAGATTAC
AGCCAAGCGC
GGGCGATTTT
CCAATACCAA
AGAAATCAGC
TTGCTGGCAA
CAATCTTATC
AACTGAAGAT
GAGTGGGTTT
CAGCGATGAA
TTTTTTAAAT
CACGAAAGAG
AGAATTATTG
CGGTAATTTT
CTCAAGATAT
GAATCATTAT
AGGCGTGGAG
CACTTTTATA
AGGGCCTAAA
TTTAGACGCG
CCGCGCTTAT
AAATGGGGCT
GCAAATTTCT
CAATAACATT
AGAAGCTGCG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 CCTCCCAGGA GCATCACAGC GTATGTGAGT TATAATTTTT AA 2382 INFORMATION FOR SEQ ID NO:990: SEQUENCE CHARACTERISTICS: LENGTH: 1002 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1002 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:990:
ATGCTAATCG
TCATCGTTAT
GGGGAGCAAA
GTGCCTGCCA
TCTGATATTG
CATGTGGCGG
TTTGTGGGTA
TTCCAAGAAA
GAAGTTGACG
CGTTTGAAAA
AGCGGCCATC
TTGAAATACG
CCTGAGATTA
CCTAAATTTT
GATATTGGCG
GAAGCGTTTA
GATTTGAATG
CTCGCTTTAA
TAGGCGTGGC
CCATAAAGCT
TGTTCAATAC
TTAAAGCCAC
CGTTGAATGT
ACCCTAAAGC
AAACCATTGT
CTTCTAAAAA
ATGTCAAAAA
AAGCCCTTGA
TCAAGTTTGG
TTTTTATTTG
CCACTATCAA
GACCTAGAGC
AGGGCGTGGA
ACGCGGAAGT
AAAAGCTTTA ATTTCCTATT CTTTAGGGGT TCTTCTTGTT TAACGCTTCT AATCAAGAGA TCCAAGTCAA GGATTATTTT TCCTGTTTCT AAAATAATCT ACTTGGGCAG CTTTGCAGAA TTGGGATAGG GTCGTGGGCA TTTCGGATTA CGCTTTTAAA TCTCAAAGAT CCTGAACGCA TTAAACCCAT GAGCAGTGAT GGAATTATTA AAAAAGCTTA GCCCTGATCT TGTGGTAACC GGTAGAGCAT GCGAAAAAAT TTGGGATTTC ATTCCTTTCT AGAGGTCATG GAAGATATTG ACGCTCAAGC TAAGGCCTTA ATTGGCTAAA ATGCAAGAAA CTTTGGATTT TATCAAAGAG GAAAAAGGGG GTGGAGCTTT TCCATAAAGC CAATAAAATC TTCAGATATT TTAGAAAAAG GGGGCATAGA CAATTTTGGC GCGCGCTGAC ATTAGTGTGG AAAAAATCGT TAAAGAAAAC GTGGATAAGC CCGCTTAGCC CTGAAGACGT GTTGAACAAC AGCCATTAAA AACAAGCAAG TCTATAAGCT CCCCACAATG CCCACTCATT AGTCTTTTTA TCGCTTTAAA AGCCCACCCT TATTAATGCG ATAATCAAAG ATTACTATAA AGTGGTCTTT TGAACCCTTT TTGTGGCACT GA 120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1002 INFORMATION FOR SEQ ID NO:991: SEQUENCE CHARACTERISTICS: LENGTH: 1203 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1203 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:991:
ATGTATAAAT
AGCGATATTG
AAAAACCAAC
CAAAAGGTTG
AACGAAAGTT
CAAAAACAAC
TTCTCACAAG
GCGTTTGAAA
AAGAAGCTCA
ATAGATGAGC
CTCATTTTGA
AAAGAACGCC
GAAAATGAAG
GCTAGCTCTT
TTGAACGATT
AAAATTTTTA
GTTTTAGACG
ATTGAGCATA
ATTAAAAGCG
GGCTTTGAAG
TAA
TAGGAATATT
CTAAAGACAT
TGAACAGTCG
AGATTGAACG
TGGTGCAAGA
GATCGTTTTT
CCCTAAAGGG
ACTTGCACCA
ATGTGCAAGC
AAAAAACTCG
GCATGCAAAA
AGAATTTAAA
AGAGGGTCAG
ATCAAAATAT
ACGAAGTGGT
GCGAGTCTAT
GGAAAATCGT
AAAACGGGAT
GCATGCGGAT
TTACCATGAG
TTTATTAGCC
CCAACATAAA
TTTGAGTTCT
CCAAATGGTC
AAAAGTCCTA
ACAAAAGAGA
GCAGAATTTA
AAGCACCCTG
TTTAGAAGTC
TGAATCAACC
AGATTATGCG
CGCTCTTTTA
TTTGAAAAAA
CAACACCACG
GCAAAAATTT
CACGCTAGTG
GTTCGCTAAA
CCGCACGATT
CCAAAAAGGC
AGAAAAGCAC
ACCTTACTAT
GAAACCCTTT
TTAGGCGAAG
CCTTAAAAA
ACCAACTACC
GTGTTTGATA
GCCTCTTCTA
TCTAAAATGT
AAAAGCAGCA
TTAAAATCCT
ATCTATPJACC
AAACGCTTGA
TCTTCTCAAG
AGCTATAACG
GGCCCCTATA
TCAAAAACCC
GAAATCAACA
TATTCTCAAT
TATGTTTTAG
ATTAACCCCT
CAGCTAACAC
TGAAAAAAAC
CGATCCGCTC
AGAGTCTTGA
GCAAGTCTTT
CGCTTTTAGA
ATGATGTGAT
CGCAACTGAG
TTCAAAAAAT
TGCAAACCGA
AACGGCTATC
ATATCATCAA
CTTTAGAAGT
GGCCAAAAAC
TTGACCCGGT
CAAACGCTTT
TGCTTAAAAA
TGGATAAAAT
GACGCATTGA
TAGAACTCAT
GCAAAAAGTG
CCATGAAGAA
TAAAGAGCTT
AAAAAATCGT
AGATCATTTG
GGATTTTCTT
TTTGCAAGTG
CCAAGAAGAA
CTCCTCTGTC
ACAAAATAAG
CCTTTTAGAA
ACAAAATAGG
CAAACAAGTG
GATCGCTCCC
TTATAATTTA
GGTGCGTAAT
AGTCGTTATC
CGCCCCCACC
TCAACGCTTG
CGCGCGCAAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1203 INFORMATION FOR SEQ ID NO:992: SEQUENCE CHARACTERISTICS: LENGTH: 1650 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1650 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:992: ATGGATAAAA ACAACAATAA TAATCTCCGC TTGATTTTAG CGATCGCTCT GTCTTTCTTG TTTATCGCTC TTTATAGCTA TTTTTTCCAA GAACCAAACA AAACAACAAC CGAAACCACA AAGCAAGAAA CAACCAACAA CCACACAGCA ACAAGCCCCA CCGCGTCCAA TACCATCACC
CAAGATTTTA
GAACATGCCA
AAAAAGTATC
TCCAAAGAAA
AAGCCTTTAG
TATAGCGCCT
GGCACTCTTA
ATCGCATTCA
CCGGTGGCTG
AAAATTGAAA
TTTTTATCCA
TTTGAAGCCT
AAAAACGAAG
ATTTCACCCA
GTGTTTGTTT
CTTTTAACGA
ATGCAAAAGC
GAACCCCAAA
CTAGGGGGTT
CTTTATAACG
ATCATGGATC
AGCGTTACGC
CTATTATTCA
CACAACATCC
CGCGCGCACG
GCGTTACCCA
AAATTGAAAT
TAACCCCTAA
ACTCGCAACC
AAGTGCGTTT
CAAAAACCAC
GCATCATTAA
AATCGCCCAA
ATTTAGACAG
AAATTGAAGA
GCGTGGATAG
TAATTGATTC
CGAATTTGCA
TGCTCACTGA
TACTGGATTA
TTATCGTGCG
TCAAAGAATT
AGTTACAAGC
GTTTGCCCTT
CTGTGGAATT
CGTATTTCAT
CAAACACCAT
CGATCTTTCT
TTTCGGTGTT
CTCAAAACAT
AACCATCCCT
TGATTCTTTA
AGAAAAGGGC
CTCCCTAAAA
TTTAGACCCC
CCTTGGGCCT
AACCCTGACT
TAACCTTATC
CTACACCTTT
TAAAGACGCT
GTATTTCACT
AGAAATAGGG
TGGCTATATT
TGTGATAGAG
TTTGTATCAA
CATTATCCTC
AGCCCCTAAA
CCACATGATG
AATCTTACAA
GAAAAGCTCA
TTTACCGCTT
GACCGATCCC
AATCACTTTC
GCAGCAACTC
AAAGGAATAA
CAAGAGAGTT
GGGCGCATCA
TTTTTAGAGC
GAGCTCCCCC
ACGCTCAATA
AACGAACAGC
TTTTATGATG
CCTAGCTATG
TCAGGCGTGC
AAAGAAATCA
ACTTTGCTTT
ACTAAAAACC
GGCCCTAAAG
TATGGCTTGA
TTTGTGGGCA
TACCCCTTAA
ATGAAAGAAC
CAGCTTTACA
ATCCCAGTAT
GAGTGGGTTT
CTTATGGGAG
ATGCAAGCAA
CCTGCAGGGT
ATCATTAATA
TGTTAAGCAC
AACAGGTTTA
ATGTGAGCCA
TTTTAGCAGC
ATAAAGCGTT
TCGTTTTAAC
ATTTGCATTA
TGATCACCAI\
TTTTAGAAAA
AACGCTTTTC
TCACTAAAGA
CCTTAGGGTT
ATTACCGCTC
TCACTTTCTT
ATTGGGGTTG
GCTATAAGGG
TCCAAGAAAA
A.AAAACATGG
TTTTTGCGAT
TATGGATCCA
CGTCTATGTA
AGATTTTTAA
TAGTCTTGTA
AAGTCTTAGA
GATTTCTTTT
TCTCAAAGAT
TCTTTTTAGC
CGATAAACTC
CAACACCCCT
CCAAGATTTA
TGATTTAAAA
TGGTTACAGG
CACCGACAAA
TAACACCCTC
TCCTCAAGGT
CATTTCCCTT
TTTGAAAGCG
TGCCAAAGGC
GGCTATCATT
CATGGTGAGC
ATACAAGGGC
GGCTAACCCG
TTATAGAGTG
TGATTTATCC
TTGGCACCAG
ACTCTTGCCC
TTGGACCACA
GAATAAAAAA
240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1650 INFORMATION FOR SEQ ID NO:993: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:993:
ATGAAAAAAT
GTCATTGAGC
GTCCTTGATT
AAACTCACGC
AAACTTTCTT
ATCGTTTTTT
GCGCCCATGC
TCTTACCCAG
CCCTTTGTTT
TTTTAGAAGA
CTAAAGAACC
TCACTATCAC
TAGAAACGAC
CTTCAAAAGA
AAAAACCACA
AGTCCAAACT
GTCTTTCTTT
GATTAAAACT
AAGACAAGTT
TCACATATCC
CTTAAGCCCT
ATTGAAAGAA
AAATAAACCC
AGGCTCTMA
CTGACCTTCT
TCGCCGCATA
TTAGGCGTTT
ACGGCAATCG
AACCGCCCTA
CCGCACTCAA
AGCTCATCGC
AACTCTAAAA
CTAACCCTCT
AAGGCACTTT
ATAATATCTC
TCTATCAACC
CTATCCCTAG
ACCCAATACC
AACAATCTCC
ACAGCCTTTT
TCAAGCCCTT
TAAGGCTAAA
CCCACACAAA
CCTTGATGAA
AAACACCCAA
TTCTTTAAAC
TCAAAACTTT
ACAGCCTTTA
GTAACTCCTA
CCCCCTTTAA
GAAAAAACGC
GAAAATAGGG
TGCGGGAAGT
CGCGTTGATA
GAAAATAAA
CCTTTAGAAG
AGCTCTACAG
TATCTGATAG
TTAGTGAAAG
GAAACCAGCG
AATTTGAATG
ATAATCAAAG
GGGGTGTGCA
ATTGAAAACT
GCAAAGTAAG
AGCATTCTTC
TCCCTAACAA
ATAATGTGGA
GGGTTTATGA
AAGACAAAGA
GCGGTAAAAT
ACCCTCAAAC
AAAAGTGTAA
AAGAGCCTTT
CCATATATGA
AATTGGCTTA
AAAAATTCAT
AGAGCAGCGA
TGGCCTTAGA
CTCGTGTTGT
CCCCACTAAC
ACAAGATCpA
CACCTCTAGT
AAAACAAGCG
TGATGAAAAM
GATTACAACA
CATTACCCCC
TTTTGAAGCT
AAGGGCTAGA
AAAACAAGCG
ACGCCCCAAA
TTCTTCCACA
GGAATTTGTG
ATACAAAGAA
AATAGAAGAG
GTGTGTCAAA
GAAGTTAMA
GAAAACAACC
GCTGATGCGA
ATTAGAGATC
TTACAAGCCT
GACATTACCC
TATACTAAAA
AAAAATAATT
GCGAGAAAAG
TGGGAGAGCG
CAAGACGATC
CGAAAAAGCG
GAAGTGTATG
TGGGTTAAAA
CAACCACGAG
AAGGGGAATT
CGCCAACAAA
TCTTTGTAGC
GTGAAAACAAk
CTAATATCAA
ATCGCCCAAG
CTTGCGATTA
TCTCTGTTCA
TCGCCATTCT
ACCOCACGAC
AGTATGAAAT
AAGTAGAGCC
AAATAACGCG
AGGGGCATTA
ACCATGTGCG
CCAAAAGCAC
ATTTATTCAA
CGACGCTAAT
GCCACCCACT
TGAAAGCAAC
AGAATTCGCA
CATTTTAAAA
CAGCACCGCT
TAAAACAGAG
CCAAGCCAGA
CAGGCAATGC
CACCACGCAA
GACTTTTTAT
TAATGAATTG
TTTAAACGAT
CTTTAAAGAA
GCCTTTGAGT
TGAAGTTTAA
INFORMATION FOR SEQ ID NO:994: SEQUENCE CHARACTERISTICS: LENGTH: 810 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...810 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:994:
GTGAAAGTTT
ATACAGGCCG
TGCATCACCG
GTGAAAGCGC
GCGTTATGCA
GGGTTGTGCG
GAGGCGATTT
CTCCCTGAAG
GCGATGGGTG
GAACATTTTC
TTAAGCGCCA
TTGATTGTGG
GAGCTTTTAA
AATTTGTGGA
ATCCGCAAGT
ATTTGAAAGC
CTCAAAACAC
AAATCCTAGC
ACGCTCAAAT
TTTTAGATCC
TAAGCTTAAA
TCTATGCGCT
TTTTAAGGGA
AAGGGGAATT
AGCGATTCAA
GCTTACTCGC
CTATCATCAT
GCATCAAAAA
TTTAAGCATT
GTTTCAAACT
ACAGGGCGTG
TATTAGAGAT
CATTGAATGC
GGTGATGGTG
AAAACGCCTT
CACAGGCGTT
TTTAGGCGTT
TAGCAACGAT
CACCAAAAAC
TCAAGGGCTG
TCAAAACCCC
GCATGTTTGA
GCTGGTAGCG
TTTGGCGTGT
CATGGGGTGT
GATTTTTCGA
GTAGCGAACG
GCAAAGAATG
TTACCTAAAA
CAAGCGCGAG
AAAAACGCTG
TGGGTGTTTT
ACGCATGGCA
GATTTAAAAA
TTAAACATTG
ATAGCGGTGG
TTGGGACAAG
ATCCATTGAG
TCAAGGCGTT
CTTTAGAAAC
GGGCTTTGCT
CCAATTTACT
ACGATAAAAG
TGATTAAAGG
TAGAAGACGC
CGGGTTGTAC
ACGCTATCAC
GGCATGGGCA
GGGCGCTGGG
CGTGATCACT
TGTAGAGAGC
CAAAATGGGG
TTGTGATTTT
TTTAGAAGAG
AACCCCTAAC
CGCTTCAAAA
GGGGCATACA
TGAATTTATC
TTTGTCTAGC
AAAGGCTAAA
TGGGCCTTTG
INFORMATION FOR SEQ ID NO:995: SEQUENCE CHARACTERISTICS: LENGTH: 1197 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1197 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:995:
TTGGGGGCTA
CTCAATCAGC
ATTACCAAAG
AGCGGTTCGA
TTAAGGTTTT
GAGTTTTTTG
TTCATCAATT
CAATTTTTTT
GACACGAATA
TTCCCTTTAT
AGCACGCCTC
TTTTGGTGGG
GCTTTTTTAG
ATCTCAGTAA
TATGATGGGC
GGAGGGATCC
AATGCGTTAG
GCGTTCACTG
AAAGGGTATC
AGCGCTAATT
TCATGGATAC TTTTTTAACA AGTATCTTGG GAGCTATCAT GCAAGGCCTT ATGTGGTGGA ATGTTTTTGG GTTTAAGGCG GGGCGCTATG ATCAAGGGTG GGAAGTGTAT TATCAGCCCT GGTGGTGGAG TTCTTTTGGG AGAGGTTTGG CGACCGTGCC TTATTTGAAA AAGGGAGGCA ATGGCTGGCA TGGGATCACC ACGACTTATT ATTATTTTGC ACCTAAAACT TATAACGCTC GGAATTTTGA AAACGTAGGC TTTCGCTCTC ATTATAGGGG GTGGTATAAC CCAGAGACAA ATGGCTCTTT GTTAGGGAGG AATGGCGTTA ATAATTTCAA CTGGTCCATT GGCTTTTATA GCTCTCACAC CATGCCAAGG GGGAATAACA CGACTAGGCA TGCCGGAATG ATCGGCTATG TGGCTGATGC GATCACTAAC GCGAACACTT ATAAGCGTTT TGCATGGCAT GTTTTTGGGC GGCAAGTGGG GAGGGCTAAT GAATATTCCT AATCGGTTCT CCTTAACTTT AGGATCACTT AAGCAGGGTA TTTTGGAGCG CCCAAATTCA ACCAAGACAG AAGTTACATG ATGACCAACC
TCCTGCTTAT
TACCGCTTTT
AAGCGAATAT
ATAAGACTGA
CGTTTAACTC
ACCCTAGTAA
CTTATAAAGG
CCGGTTTTAA
AAAGCATGAT
ACACTTATAG
CTTTAAATAT
ACACCTTTGG
CTTCCTATAT
ATTTTTGGGA
TCACTTTTTA
GCGTCTCTCA
TGCAATTCAA
ATTATGGGGC
ATAACCCGGA
TCACGCTGAA
GCGGGGACTT
TTACGATACG
TGATTTCATG
GACGCAAAGG
TTGGATTTAT
CAGCAACGAT
TTTAGACGCT
GCTAGTCTAT
CATGACGACC
CTTAGAAGAC
CCGCCAGGTT
CAATTCAGAC
CAGTAATGAA
TAATACGGCT
CACTTCGGTT
TGCGAATAAA
TGCGAGCTAT
TAGGATCAAT
TGGCGATTTT
GTTTTGA
INFORMATION FOR SEQ ID NO:996: SEQUENCE CHARACTERISTICS: LENGTH: 1335 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1335 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:996:
TTGAGTAACG
AATGGTAACG
GCGGCTGCAG
GGCTATTACT
AACTACGGCA
GGGAATGTGT
AATAGCACTA
GCTATTCAAT
CAAGATGAAT
GAATTCAATT
GTTTATAACG
AACTCCAACC
AACGTGCAAG
TACTTGCCCC
ACTAAAGTGG
GGTTTCTTTT
TTATACACTT
AACCGCTCTG
TCCACGCTCA
CAGTTCCTCT
CGCCACAACC
TACAAATCAG
GGGTATTCAT
CTTTCTCCCA
ACGTGTCTGC
GGACGGCTGG
GGCTCCCTAG
CGAACACCAA
TTTTTAATGC
GTAGCACTAG
ACCTTCAAGG
TGCTCCTAGA
CAGCCGCTTT
AGCTCACTAA
AAGCTAACGC
TkACTTTGGA
AATTCAGAGC
GCTATAAGCA
CTTATAACGG
ATGGGGTGGG
TGGATATGGG
GAGATGACCC
TTGACTTCGG
AGCACACGGT
CAGGGACTAC
TCTAA
ATACCTTTAT
GAACGCTCTT
TGGCACAACT
CTTGACTGAT
TTTCCCCAAC
GATGAATAAG
CGGTGCGACT
CCAACAAAAT
AGCTTTCAAC
TACAGGTTTG
AAACACCATT
TGTGCAAGGG
TAAAATCAAC
CGGGAACAGC
ATTCTTCGGG
AGCGAGCGTG
GACTGATGTG
CTTTTTTAGC
CAATGTGAAA
TATGAGAATG
GGAATTTGGC
CGTGAAGTAT
TCGCTTTTAG
TTAAGTGGTG
CTTAACACTC
AGGATTTTAA
ATGCAACAAC
GCTTTAGAGA
GGTTCAGATG
ATCTTAAATA
TCTGCCGTAG
GTGCAAGGCA
AGCGGGAGCG
CGTGCTAGTC
GCGCTCAACA
CGTGCAACGA
AAGAAAAGGA
GGCTTTAGAT
TTGTATAACA
GGTATCCAAT
TTGCATGGGA
AACTTCGGTA
GTAGTAGTGC
TTCCGTCCTT
GGGCGTATCC
CGGTAGGCAG
AAAGCGCTTG
GCACGATCGG
AGCTCACCTA
AGAATGGGAC
GTCAAACTTA
ACGCAGCGAA
CTGCTAACAT
TTATTGATCA
CGGTTAATAA
AGCTCCCTAA
ATCAGGTGAG
ATATTTTAAA
ATATCGGTTT
CCACTCAAAA
TCTTTAGCCG
TAGCCGGTGA
AAATCAATAA
AGTTGGACGG
CTACGATTTA
ATAGCGTTTA
CACGAAACTC
TGGGACTTGC
CACCGCTGCG
CAGCCAGACT
CTTGAATGCG
TGCTACTGCT
CTCTCAACAA
CTTGCTCAAG
TGGGAATAAG
ATCTCAATTG
CGCTGGGATA
CGCTCTTTAT
AAGCATGCCT
CGGGTTTTAC
GCGCTATTAT
TAATGTAGGG
CTCCTATCAA
GACCTTCCAA
CACGCACTTC
GAAATCCAAC
TAACACTTAT
TTGGTCTTAT
INFORMATION FOR SEQ ID NO:997: SEQUENCE CHARACTERISTICS: LENGTH: 402 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...402 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:997:
ATGGAGTTAA TTAAGAAATT GGAAAAAGAA AGCGAAGTTT TAAAGAAAGA TTTACAACAA
CATTCAAACG AGCTTTTTAA AATGTTGATT ATTGATAATG AAGATTTGTT TAAAGAGCAA
TTTGAGATTA
AAAACCAAAT
AGACATTTTT
GGTGATGTTA
GAGTTGGAAG
TGTTTAAAGC ATGGGTTGAA ATCGTAAAAA TGATGTTTGA ATTAACCAAA TTGATGGCGA AATGATTGGC TACACAGAAG AACTTTTAAC CTTTTTAGTT TTALATGGGAT TTTTAAATCC AAAGTAATAC CTAAAATGCC TATTTTTTGC AATGCGAAGA TTTTAATGCC CTAAGAAGTT TAGTTTATCT TTCTGTGCTT AAACGATAAA TCCTAATAAA ATCCCATTTT AA
INFORMATION FOR SEQ ID NO:998: SEQUENCE CHARACTERISTICS: LENGTH: 2091 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2091 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:998:
ATGAAAATCA
GCTGAAGATG
GTGAAAAACA
TTAAACCAAT
ACCGGTGCGA
TCTCCGGCCT
ATAGCCTTTG
GGGGGCGTTC
ACCACCACCA
GAATACCAGG
GGAGGCGGGA
TTCACAAAAA
GGCGGTTCAT
TTGCAACAAG
GGCGGTGGGG
AGTTTTAACG
CAGCTTAACG
TCTAAAGACA
TTGAGCTTAG
GATCTAGAAG
GGTTGCGCGT
AACCAGGTCA
AGCACTTTAG
AAGTCCCTTC
ACTTATTCTT
AAAAACCCCT
GGCATCGGCG
AGGTATTATG
AAAAATCCCT
ACGGATTTTA
CTCGAGCATT
ACAATTACTT
TTGACAATCT
ATCAAGCTGT
GCATCAGCTG
GATCGTTTGA
CTTGTAATGG
TTCTCAATAC
TGCCTGCCTT
ACCCTACCAC
CAATCCCAAT
CCGCTACTAT
CATGGGGGTT
CTATTAACGA
CTAATGAAAA
CGCAGTTCGC
CCCAACAAGT
AATGCACCGC
TTGTGAAAGA
ATCAGGATAG
GGAACGACTC
AAAACATGAC
TGGATACCAG
TTAGGCGCAT
TGCAAGCGGG
GTTTCTTTGA
CTTTGCTCTC
CATGAGTGTG
ACAAAATCTT
AAATTCCTTA
AAGCTCAAGC
GGCTTTAGCG
TGGCCCTGGC
CAACACGCCA
AGCCAGTAAT
CGCTTATCAA
GAATAGCTCC
AGAATACACT
CCAGCTAAAG
CATCAATGTC
TGGCGGTAAG
AATGATCAAA
CACCCAAATC
TCAAGAAATG
AGCGGACAAT
AGGATCAGCT
GACTCTCAAT
GGCTTTGTCT
AAAAGCGATC
GCATGCCACT
CAAATACAAC
CGGCGTGATT
CTATAAGCAA
TTATAACCAT
TCTTTCTCTC
GGCTATCAAA
GCAGACAGAT
GTCAATCTAG
GCGATCAATC
CTCAATGCGG
CCCAATCTTG
AACTACAGCT
GTAGGGCCCA
ACTATCCAAA
AAAAATATGG
TACCCCGATG
ATTAGTAGCG
CTTACCACCC
ACCGGGAATG
AACGCTCAAG
ACGCAACCAG
CTCAATAGAG
TTCCACAGCA
GGTGTGATTA
TCCTTAGAGC
CAAACCATTT
AATAGCGGTA
CAAAACCCTA
CAGCTCCAAA
AACTATCAAA
TTCTTTGGCA
GCTTATATCA
TCATGGCTTC
TCGGTGAAGC
ACGATAACTT
CCAGCACGCC
TCACTAGCGC
CTGTGGGCAT
GCCCAGAACA
ACAACACCGG
ATGGTATCCT
CCGCTTTAAA
TAGTCAATAT
GGAATGGCAA
TCAATGACGC
AAAACCCGCA
TGATGGATAT
CCGTTTTAGA
ACAATTTCAA
CTAACGCTCA
TTCAAGGGCC
ACGACAACAC
AACACACCGC
TGAATTTTAA
TCTCTAACTT
ATTCCCCAGA
CTGTTGCGCA
ACAATAACGG
AAAAAAGGAA
AATCTAATTT
ATTATCAAGG
GGTCCAAAAA
GAGCAACCTT
TAGCGCGATT
TACCACCACT
GTGGCAAGTC
TTTAGAAAAT
TAGCGGAACG
ATCTAGCAGC
CCAAAACCAA
CAATCAAACT
TTATTATTCA
TGAAAACCTT
TGTGAATGGT
TTTTGGCGAT
AAAAACCCAA
CCCCTACACT
AGCAGAGATT
TATCCAACAA
TTATGGTTCA
TTATTATGGC
AGAAGCCCTT
GCCTAACGCT
AGGTTTGCTC
AGAATTAGGC
GGCGATGAAC
TTGGGGGTTA
TTTTAACTCG
GCTTCTGATG
AAAAACACCA
GCGTTAGCCG
ATTTATAACG
ATGAACCTCG
TTGGGCGTGA
AAATACAGAA
TGTGGACTTA
ACTTTTTAGG
GGACTTCGTG
CTAATGTCAG
CTAGGCCCAA
AAATCCCCAC
GGCTCTATAG
TGGGGTGGGT
CAAAAATAAC
GCTTAATTCC
CGCTTCTAAC
GAAAAAAGAC
CATTAACACG
CGTGTATCTC
ATGGACGCGC
AAGCTTTCTG
CAACAAGTGA
TTCCAATTTT
AGCGATCATG
GATTATTATT
AATTATGTGT
TTTATAACTT
TGGGGCTTTT
ATTTGACCAT
TGTTTGATTT
CCGCTCAGCA
CTTTCATGGG
TTGCTTACTA
CATCAACGAT
TGGTGGCTTT
GATGAATGGC
AGGCTTGAGA
TGGCATGGAA
GGCTGAACTC
G
INFORMATION FOR SEQ ID NO:999: (i) SEQUENCE CHARACTERISTICS: LENGTH: 1116 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1116 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:999:
ATGCAATTTC
TCTTTTAGTA
AGTCATGCCG
AACGCTCAAA
AACACCTTTA
AAGCAAGCCG
CTTAGCGGGG
CCCATTACTA
CAATCTCAAA
TTGAACGCGC
TTGAGCGTAG
TTATTCTATG
GGCAAAATGA
AATGCGCAAA
TGGGTAGGGA
ACGGATTATC
CGTGTGAATG
AATTCCTTTT
GTCATGTTTA
AAAAAACCTT
TCGCTGAAGA
TTGAACACAA
ACAAAATCTA
ACTACATCAA
AAACCTACTA
GCGTTGCGTC
ACCCTTTAGA
ACAGCATGCT
TTGATCCCAG
GGTATAAGCA
ACTATGGTTA
ATAACCACCT
AACATTCGAG
GTGGTTTAGG
GGGCTAAAAT
TGGATAGGCA
ATGAAACGCA
ACGTGAGTTA
ATCTTCCTTA
AAATGGGGCG
TAACCCTTTT
TAAACTCAAT
CAACGCTTTA
CCTGCAATCT
TAACCCTAAA
ATTGGTAGAA
TTCTTCTTTG
CTCTTATTCT
TTTCTTTACC
CACTAATTTT
CTATGGGCTT
CGTGGGGTTT
CATGTGGGTG
GCACACGAGT
CAATGGCTTT
TGGCAAAGGG
TGTTTATAGT
TCTTTATTTT
TATGCGAGCG
TTGAATCAAG
CAAGTCAAAA
AAAAACAATG
ACCCTTCAAA
TTAGCCCAAG
AACTTAAAAA
TCTTCTCAGA
AAAAACGTTT
AAGAAAAAAA
GGTTTTGTGG
GGCATAGATT
TATGTAGGCT
AGTCAAATGG
TTTTTCCAAA
GAAATGGGCT
TTAAACGCTT
TTTTAG
TATCTTTATC
TGGGTTTTGA
AACGCATCCA
ATGAAATCAC
CTAAATTAAC
ACATTGAAAA
CGTTAGAAAA
ATTTAGAATT
TCGCTCAAAT
CAAGCATGTA
ATCAAGGGTT
GTAATGGCTT
ATCTTTTCAA
TTGCTTTAGC
ATTTCATCAA
TCCCTTTGAA
TAAAGATCCC
CCCTCTTTTT
TTTATTCTTA
ATATTCCATC
AACGATTTCT
AAACATGCCC
CCCTACTGAA
AATAGTCATG
AATGCAAGAA
ACAATTCAGC
TTCAAATTCT
TGGGGTAGGT
TCGTTATTAC
TGATGGTTTA
TTTCATTGAT
GGGGAGTTCG
CAACTATTTG
TTTTGGGGTT
TTTAGCGGTC
CAAACGCCTT
INFORMATION FOR SEQ ID NO:1000: (i) SEQUENCE CHARACTERISTICS: LENGTH: 1455 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1455 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1000:
ATGGCTAAAA
AAAAAGACGA
TCAGAGGGCT
AAAGCCTTGA
GTGGCGACCT
AAGGCTTGCG
ATCGCGTGCG
AGCACCGAGT
GGGACGATTC
GCCTTTATCC
GGGGGGACGA
TCTTTGCATG
ATGCTCAAAG
AGGAGCGGGG
ACTAGCGGGG
CATAAATTCC
CATGAAAAGC
CCTTTGCATA
AGCATGGGTA
AACGCTTTGA
TTTAGCCATA
AGCCTTTTGA
ACAGACAGCC
GATGCGCTTT
ATAGAGTTTG
TCACAACCGT
GCCAGTTTGG
GTTATGCGTT
GCGAATTTAA
CAGCGGTGCG
GTTTGCAAAT
CGAATTTGCT
GCGCGTTGAT
GCATTAAAGA
AAAAAGAGGT
TCAGAGCCTT
GCTATGAAAT
AAGACAAATT
CGTTGATTTT
TAGGGGTGAG
CCCCCAATAT
ACAGCCAAAA
AAATAGATGA
AGATTTTAAG
GTTACGGCTT
AAAAAATCCC
CCTTGCAGTG
ATCATTTAAA
ACTTGGCTAA
CTTGA
GATTGACATA
GTTTTACTTG
TAAAGGGGTT
AGAAATCGCC
CGATGCCCCT
CAAAATCATT
GCATAAAAAT
TGAGAAAGGT
AATGTTTCTA
CTCTAAACTG
GAGTAAGGTG
AGATGCGCAT
ACGGCTTTTA
ATCAGTCGTG
AGAGGGCGTG
CAACCCCTCT
GGTCAAAAAA
AAAATACCTT
TGTATATTTA
TAGCCACCAG
TAAAGACAAC
GCTGAGTTTT
ATACACGCTA
AGAAATGCTC
GGCTCTAATT
CTTTTTGAGA
TTACAAGAAA
CTCAAATACA
AATCGGTTAG
GATGGGCAAA
TCAGGGATCA
AAGATTAAGG
GACAAAGACT
CCTTTTAAGC
CTGATGAAAC
AAAAATTTAG
GGGGTGAATG
TTAGAGCATT
TTTTTGAGCG
CTCATCTCTC
GAATGCGTGA
TTCCATTTAA
GCCCACAAGC
GATAGGGCGA
GCTATCGCCC
ATCCTTTCTT
GAAAAAAACA
CCCAAACTCA
CAGTGCGTTT
CTAAGTCCAA
TCCCCATGCA
AAAGCAAAAA
AGTTTGTAGC
AAGAAGCGCT
CGATAGACAT
ACTTAATCTC
TAGATGTCAA
ATAAAAACGC
GCTTTGATTA
CGTTCATTGA
AAGAGCGTTT
TGAAAACTTC
ATTTATTGCG
TAAAAGATCG
AATTGTTTGA
AGATTGCGGG
ACAGCGCGTA
TCATTTGCTT
ACATGAGTGC
TAGCCGAAAA
AGCTTGTGAT
TTAAGCCCAT
GGCTGTCTTT
GGTTAGGATT
ACGAGCCGTT
AATCCTGTGC
GAGGGTGAM.A
CTATGGTGGG
TGGAGGGGGT
GCTTGATGTC
ATTGGCTAAA
CTTTGGGGTG
CCCTATAGAT
AAAAATCGTC
GGATAGCATT
CTTGATGATC
CCACCATTAC
CTTTTTACCC
AGTTTTATCG
GGAATTGGCG
TTTCATTTTA
ATTGGCGCAA
GATGATGCCA
TTTGTGCCTG
CCATTCTAAT
TCCTTTGACG
INFORMATION FOR SEQ ID NO:1001: SEQUENCE CHARACTERISTICS: LENGTH: 762 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...762 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1001:
AAACAGGTTA
GAAGAGCTTT
AATGATTTGG
AGGCTTTTAG
CTGCCCTTAC
GCCATTATCC
ATTGGGGCGT
GAACTGGTGC
GCCCAAAACA
GCGATCTATC
AATGAAAACA
ACCCCGGATT
TACAGCATCA
AATCTCGCAT
TAAGCCACCA
ACATGCAAAG
GTTTGATGAG
AAAAAACCGA
AAAGAGCGGA
ATGTGTTAAA
GCTTGCAAAG
GCATCTATGT
AGCAAGACAA
AGCTCGTGTA
TTGATTACGC
CTGAAATCGC
GAAAGAAAAG
AGAAAAGCAT
CGTGTTTCGG
CTTTATCGCG
ACACCATTTC
TAAAAGCATT
CCGAGAGAGC
CAGCTCTAAA
GCAAAGCCAT
TAACCCCATT
TGAAAAGCGT
TTCCATGCCT
GCCGACTAAT
CCGTTCAATA
TTAGAGAGCA
CTAGAAAGGA
CTTGTCTTAG
GTGGATTTTT
TCCAGTAATG
ATTAATCGCA
GTGTGGCAAC
TTAGAAAGAG
GCGAGCGTGT
TATAAAATCG
AAAAACCCTA
AGGAATGATT
GCGAGCAGAT
AGCTTTCTGG
ACCGCTTGAA
CGATCGTTTT
TAAACCAGGA
AAGCGTTGGC
TTGACGATAA
GCTTTGAAGA
AAGTCCATAT
CCATTGCGGC
TATTGAGCTA
CCGGCTTTAA
GA
GGCTTTTTTA
TTTTTCGGTG
AATCGCTTAC
AATCAGTATC
CAAGCATTAC
TCGTTCGCTC
ATCCCGCTAT
TTTGATTAAA
CGTCAATATT
TAAACTTTTG
TTTGTTTGAC
AATCACTCGC
INFORMATION FOR SEQ ID NO:1002: SEQUENCE CHARACTERISTICS: LENGTH: 1440 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1440 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1002:
ATGAAAAAAT
GTCATTGAGC
GTCCTTGATT
AAACTCACGC
AAACTTTCTT
ATCGTTTTTT
GCGCCCATGC
TCTTACCCAG
GTAACTCCTA
CCCCCTTTAA
GAAAAAACGC
CCCTTTGTTT
TTTTAGAAGA
CTAAAGAACC
TCACTATCAC
TAGAAACGAC
CTTCAAAAGA
AAAAACCACA
AGTCCAAACT
GCAAAGTAAG
AGCATTCTTC
TCCCTAACAA
GTCTTTCTTT
GATTAAAACT
AAGACAAGTT
TCACATATCC
CTTAAGCCCT
ATTGAAAGAA
AAATAAACCC
AGGCTCTAAA
CCCCACTAAC
ACAAGATCAA
CACCTCTAGT
CTGACCTTCT
TCGCCGCATA
TTAGGCGTTT
ACGGCAATCG
AACCGCCCTA
CCGCACTCAA
AGCTCATCGC
AACTCTAAAA
GAAGTTAAAA
GAAAACAACC
GCTGATGCGA
CTAACCCTCT
AAGGCACTTT
ATAATATCTC
TCTATCAACC
CTATCCCTAG
ACCCAATACC
AACAATCTCC
ACAGCCTTTT
CGCCAACAAA
TCTTTGTAGC
GTGAAAACAA
TCAAGCCCTT
TAAGGCTAAA
CCCACACAAA
CCTTGATGAA
AAACACCCAA
TTCTTTAAAC
TCAAAACTTT
ACAGCCTTTA
CGACGCTAAT
GCCACCCACT
TGAAAGCAAC
GAAAATAGGG ATAATGTGGA AAAACAAGCG ATTAGAGATC
TGCGGGAAGT
CGCGTTGATA
GAAAATAAAA
CCTTTAGAAG
AGCTCTACAG
TATCTGATAG
TTAGTGAAAG
GAAACCAGCG
AATTTGAATG
ATAATCAAAG
GGGGTGTGCA
ATTGAAAACT
GGGTTTATGA
AAGACAAAGA
GCGGTAAAAT
ACCCTCAAAC
AAAAGTGTAA
AAGAGCCTTT
CCATATATGA
AATTGGCTTA
AAAAATTCAT
AGAGCAGCGA
TGGCCTTAGA
CTCGTGTTGT
TGATGAAAAT
GATTACAACA
CATTACCCCC
TTTTGAAGCT
AAGGGCTAGA
AAAACAAGCG
ACGCCCCAAA
TTCTTCCACA
GGAATTTGTG
ATACAAAGAA
AATAGAAGAG
GTGTGTCAAA
TTACAAGCCT
GACATTACCC
TATACTAAAA
AAAAATAATT
GCGAGAAAAG
TGGGAGAGCG
CAAGACGATC
CGAAAAAGCG
GAAGTGTATG
TGGGTTAAAA
CAACCACGAG
AAGGGGAATT
CTAATATCAA
ATCGCCCAAG
CTTGCGATTA
TCTCTGTTCA
TCGCCATTCT
ACGGCACGAC
AGTATGAAAT
AAGTAGAGCC
AAATAACGCG
AGGGGCATTA
ACCATGTGCG
CCAAAAGCAC
ATTTATTCAA
AGAATTCGCA
CATTTTAAAA
CAGCACCGCT
TAAAACAGAG
CCAAGCCAGA
CAGGCAATGC
CACCACGCAA
GACTTTTTAT
TAATGAATTG
TTTAAACGAT
CTTTAAAGAA
GCCTTTGAGT
TGAAGTTTAA
INFORMATION FOR SEQ ID NO:1003: SEQUENCE CHARACTERISTICS: LENGTH: 1704 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1704 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1003:
ATGATTTTTG
AACGAGCTTA
GGGTATCTTT
CCTTTTTTGT
GAGCATGCGT
TTTAAGGCCG
GATTTATTGT
CAAAACACGC
CCGGAATTGT
ATGAAAGGCA
TTGCAAAATG
GATTTGAGCC
AGTTTGCCGA
AGCTTGTTTG
ATCAAAACCA
GCGATAGGCA
AAAAGAOCGC
AAAGCTTCAA
GAATTTGAGA
AATAAAAACG
GGGATTTTAA
AAAACGCCCT
TGTATGAAGC
ATTTTGAACA
TTTACCCTAA
TTAAAGAGCA
TAAACACTAA
CTTTTAAGGC
TTTTTGAATT
CGATCGCTCG
ATGACAAAAA
GCTTGGCCTT
GCGTGTATCA
AGATTTTTAA
TGCAAATCAT
TGGTTGGAGG
ATGAAGATTT
AGGAATATGA
TCGTGGAAAC
CCCATAAAGA
ATATCAAAAA
GGATTTTATC
GCGCTTAGCG
ATTTTTAGAA
AATCCACAOC
TCTCAAAAAC
AGCCAAACCA
TTTTATAGAA
AGAGTTTTTA
CTCAAACAAC
TAGAAGCGAA
AAAAAATAGC
AATGATAAGC
GGCGTTGTTC
TGAAAGTTTA
AAAAAAAGCC
TTTGCATTTA
AGAGAGCTTT
GATGAGAGTT
ACGCTTAATG
AGCGTTAAAA
TCTCAAAATA
TTTTTAGATG
AGAAAAAAAT
TCTTTAGATC
GGCGATACCT
AAGCGCGTTT
AATGAGTTTG
GACACAGCGA
CCCTTOATAG
AATGTGATGA
GTGAAAGTCA
OAGATTGAAG
CCTTGCGGCT
GAAAAACGCC
CTTTTTAGCG
GOGGTAGGGA
TTAAAATCCT
ATCAAAAGGG
CATAGCGCCC
AACTCACAGC
GGGGGAAGGG
AAAATTTTCA
ACCCTTTAGA
AAAAAACTTA
ATCAAGTGAA
TTAAGGAAGT
GGAGCGTTTT
TTAAGATTAT
ATGAAAAAAA
TCGTGGATTT
ATCAATTGTT
CTCAATTGCC
CTGTGACCGG
CTAGGGGGGT
TGCCTATCCG
GTGGGGTAAC
TTTTTGTGAT
ATCAAAAATT
AATATTTTAA
CACTAACCTT
GTATTTTGTG
AAGCCAAACC
GCCTTTAAAA
TTTCAAGCAG
TTTGACAATG
GATACACAAC
AAGCTTTTCG
CACAAAACCC
CCGATTGTTT
ATTGCGTAAC
TGAAATCATC
CCTAAAAACA
ATGCCCTAAA
GTATTGTOG
CACTTTAGAA
TTATAAAAGT
GCCCAAAATA
AGAGATTAAC
CTTTAAATAC
GATGAAAATC
AAAAGGGGCA
CGTTTGAGTG
GCCCCTTTTT
TTTTATAACC
CACAACAGGC
GGGTTGTTAA
GCGGCTAAAA
TACCCAATGG
TTTTAGACTT
AGCTCATTAA
AAGCCCCCAT
ATCAAAACGC
AGGATTTGGA
TTTTAACCCC
AAAAAGGTCT
TCTATTGTAT
AGCAAAAAAG
TGAATTAGAA
AGAATACAGA
TGACAAACAC
TCGAGCGCTC
ACTCACTGAG
TTATTTCAGC
TGTTGAGCAT
TAACGCGCTA
TTAA
AAAGAAGGGG
ACCTTAGAGC
GATGATTTTT
ATTAAAAAAG
GGCGCTAGGA
GTGGGCGCAT
GCCCCTTTAA
TACGGCTTAG
TTTTAAGGGT
CTTTAAAAAG
TATACCATAA
GCGTTATTTT
GTAATCTTAT
TAACCGGGAC
AATTACAAGA
TGGAGGTGGG
TTTACTCAAT
CCTAGAAATC
GACCGCTTAT
TGATGAAATC
TTTAGAAATC
GGGCGTTGTG
CTTGCAAAGA
AATAATAGGT
INFORMATION FOR SEQ ID NO:1004: SEQUENCE CHARACTERISTICS: LENGTH: 1125 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1125 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1004:
ATGAAAATCA
TTGGATAAAA
AAGCTCTTTT
TCTACCGATA
TGTTTGAAAG
AATAAAAGCT
ATTGATCCAA
ATCGCTCCTG
TTCAATCAAA
ACGCAGTTAG
AAAAGGGCTT
GGCATGTTAG
AATTACCCTG
AAGGAAGAAT
ACTTTAGAAA
AAAACCTCTG
AACGCAAAAT
TGCAATGAGC
TTGAACGCTA
GTGTTAGTAA
AGGACGCTTC
TAAAAGCCAG
AAGAGGGCGT
ACTCTAATAT
CTTTCAAACT
AAGTGAGTTT
TGATTGAGCA
AACACCAAAC
AAAAAATCTC
TATTGGAAAT
CGGTGGTTGA
ATTATCAAAA
TTAAAGAGGG
AAAACAACGC
TTGAGATTGA
TTTTCCTTGA
CTTCTTCGCC
AAATTTCCAC
AAACGATTTA
CTCTATCGCT
CGATTCAGAT
AGGCACGATC
TGTTTTAGAG
CCCCATGTTT
AGAAATCAAT
AACTAGCCAT
CCTTTCAGTG
TATCCATTCC
CCTTAAACTT
AAACGAAACG
AATCCTCCCT
CATTAAGTTG
TTTGTTTGAA
AAAAGGTTTG
AGCCTTAAAC
TTTTTTGATT
TTTGATGATG
GAAAACACTT
TCACACATCC
ATTGGACTAA
AACGGGAAGA
ACTAAAGATG
GACGCTGATG
GCCCCCTTTT
AAAAGGGAAT
GTAGGCACGG
ACCGAAGAAG
TTTTATGAAA
CACGCTTTTT
AAAGAATACA
TGCAGTTCTT
TCTTTGGATT
GATATTGAAA
GCTTTAGGGA
CAAGAACCTC
CCAATCACAC
TGCGCTACTT
ATTTAGAAGT
AAAGCTATAT
AGTTTTTAGA
ACAGCTTGGT
AGTTCCCTGA
TGGTGGATGC
TAGCCGGTGT
ATACCAAGCG
ACATCTCTTG
ATTTCAGTTT
TCACCAAGCT
CTTCTTCTTT
TAAGCTCCAC
CTGAGCATAG
AAGCCTTTCA
CAACGCAATT
TTGATGAAAA
TATAA
GCAAGCTTTT
CATTAAAGAA
TTCTACGCAA
TATTATCTCA
GATCAAACAA
ATTCCCTGTT
GTTTAAAAAG
TTTAATGCAA
GCTCTCTTAC
CATTTTGCCT
TAAAAGCGAC
CATTGATGGG
CACTTTAGGC
CATTAAGCTC
CGAAACCGCC
TTTGGGCGTG
TGTTTTAAAA
GCAAAGCCAC
INFORMATION FOR SEQ ID NO:1005: SEQUENCE CHARACTERISTICS: LENGTH: 1107 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1107 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1005:
ATGGTTGTTT
TTTGGTGTAA
ATCAAAGTTT
GAGCCTTTGG
TGCGTTTCCA
AAAAGCTTAC
GGGCATTTAT
GCGTTGTTGC
GCTTTTAGGG
CACCCTAATT
CTTTTTGAAA
ATTGAAAGAA
CATTTAGCGA
TCTTTGAGTG
CCCATTAGCG
AAGCTTGGCG
GAAATCGCTT
AAATCTTTCA
CGTTTTGGAA
TAGGAAGCAC
AAATAGAAGC
TCAAACCCAA
GCGCGGAAGT
ATTTAGTCAT
AAAGGAACAA
TAGACATTTC
AAAACAAGAC
ACACGCCCTT
GGAGCATGGG
TTTTAGAAAC
GTTCTATCGT
GTGCGGACAT
CTTCCATTAA
TGGAGCGCTA
TGGTGCTGAA
TTGGTGGCTT
AGCTCTCTAG
GTGTAGCGAG
CGGCTCTATC
CTTAAGCTGT
AAAAGTAGCG
GTTTGTGGGG
TAACGCCATT
AAAACTAGCC
ACAAATCACG
TTTAAAGCCT
AGATCTTATC
GGATAAAATC
CTATTGGCTT
GCATGCTTTG
GCAATTACCC
ACCCTTAGAT
CACTTTGTGG
TGCGAGCAAT
TATCCAAATC
TTTAGATGAA
AGTGTAG
GGGAAAAACG
GGGAAAAATA
ATCTTAGATC
TTAGACGGCA
GTGGGCGTAG
CTAGCGAATA
CCCGTTGATA
AAATCTTTAA
GCTATTCAAA
ACCATTGATT
TTTGGTGCGT
GTGGAGTTTG
ATAAGCTATG
TTATACGCTT
CGTTATAAAG
GAAGTGGCGA
ATCTCTCAAG
GTGCTAGCGT
CCCTTAAAAT
TCGCTTTAAT
CTAACGATTT
TTGATGCGAT
CCGGATTGAA
AAGAAAGTTT
GCGAGCATTT
TCATTAGCGC
ATGCGCAAAA
CAGCGAGCAT
CTTTAAAGAT
AAGACAACTC
CCATTAACCC
TAAGCGCGAT
ATTTGTTGTT
TGAAGAAGTT
CTTTAGAATT
TAGATAAGGA
CGCAAAAAA
CAATGAACAA
GAATAATTTA
GATAGAGGAG
AGCGAGCTTT
AGTGAGCGCG
TGGTTTGTGG
GAGTGGGGGG
CGCGCTCAAG
GGTCAATAAA
TGACGCGCTG
TGTCATTGCG
TAAATTAGCT
TAAATTCGAA
GGAAAACCCA
CTTGAATCAA
GTATGCTAAA
AGTTAGGGAG
INFORMATION FOR SEQ ID NO:1006: SEQUENCE CHARACTERISTICS: LENGTH: 1254 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1254 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1006:
ATGAAATATT
CTAGATATTA
AACGATGCCA
CAGCATTTTG
GATAAAAAAG
TCACGATTAA
ATTGTAAGTT
TATTTAAAAG
GGACCAGGAA
ATCAAAAACA
TATTACACGC
ACTCATGAGA
GGCTCTAAAA
ACGCATAAAA
TTTTTAGAAG
TATATGAAGA
AATGAATCCA
GAATTTGGCA
AGGCTTACCG
ATGTATATCA
CAGAGTTTTT
TATGGCTTTT
TTAAAACCAT
ATTACGCTTT
ATGTTTCTCA
TCCATCTTGT
AACTTTATGA
TAGATCTATA
CCCCTTCTAT
TCACAAACAT
ACCGACTCAA
AGTATGGCGA
ATATCGCTAG
TTTTAATGTC
AAACTAAAAC
ATGACAAGTC
AATTGGGGTT
TTGACGCCTA
AAACGGTGTT
TGAATGGCTC
AAAAGACATC
TATTCCCTTT
TTTAATATAC
TCAAAAACTT
AAAATTGCAT
AAACAAAGAG
AGCGCTTGTG
TGTGGATACA
CCCTTTTGCA
CGCTTGGATG
CGCACTAGCG
TATTTTCCCC
AAAAACGCCC
CTCTCAAGGA
TTTAGCCCCT
CAAAATAACG
TATGGCTTTT
AAAAGAGAGC
TAAGGATAGT
TAATTTGAAT
TAACCAGATG
TCAAGAATAC
AAAGAATGTG
GCTATAGGGC
CCTAAGATTG
GAAGTCTTAG
CAAGGTGCTA
AGCGTGGCGG
GGAACGCTCA
GCGCACAACA
AAGCGCCTGA
GATTATACGA
AAATGGGCGA
ATGATTTTAA
ATGGCTGTGG
GATGGCCAAC
CGCTATCCGG
GTTTCAGATA
GCGGAGCAAC
ATCGTGTATG
TTGATCGCTC
CCTAGGTTTT
GCCATGGGGC
AAAATACAAG
TTTTTGCAAC
AAGTGCGCTA
CGAACGATTT
TCAATTACGC
TAGAAAACGG
AAAAGACTTT
TGGCCATTGT
TTGTTTTTTC
TGCGTTATCA
ACGCTGAGCA
AATACAACAT
TCTCTAGCGT
CGGATGTTTA
GGATAGATGT
GATCCGGTTA
TCCTTTATGA
TGAGCCGAGA
TAAACAGCAA
CTATGGATGG
TTATTTTGCT
CCTTTGATTG
AGATAAAACG
CTCCATAGAT
AAAGACTAGC
AGAACTCAAG
CAATAAAATT
TGACTACCCC
GGTGAATGAT
TAAATACATT
AAAAGAAATC
AACGGAGTTT
TCAAAAAGCC
GAGTTCTGAT
TTTGTATGAC
TTCAGGCGTG
TCCTAACATC
AGGAAGAAGC
AAACCTTAAT
GTATATCCGC
GAGAAATATC
AGACTATAAT
GTAA
INFORMATION FOR SEQ ID NO:1007: SEQUENCE CHARACTERISTICS: LENGTH: 660 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...660 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1007:
GTGAATAAAA
AGCGCGTGCC
TTCAATTATA
TATATCAATG
GCGGAATTCC
TCGGCTGTGC
TATTTTCTTDG
AGCACCGCTT
AACACCGCCT
ATTCCACTAT
TGATTGTTTA
CGCTAGATAA
CGTTTTATCA
GGGGGCAGTT
GGCTTTTTTA
GCACATGGGG
CGTGGTGGCT
TGAATTGCAA
AGCAAAAAAT
ATACAATTAC
TTATTGCTCC
GGGCTAATTG
TATTATTOAG
TATGACAGAA
AAATTAGGCT
GTGATCTATG
CAACTGATCG
CAAACCCCTA
TTGGGGTTCT
AAGTTACAGG
GCGATGCGAT
TTTATAATGT
TGCGTATTGT
ATCAAGTCAA
CGGATTACTA
ATTCTTCTTT
GGATTATTTG
GACTTCTTAT
CTTCACGCAA
CCGTAATATC
TAAGCCTTGT
CGCTATCAAT
GGCTTTAGCG AGCAAATTTT GATGAGCGCT AATTCGCATT TTATTTTGGA
TTGGTATGAT
GTGGTGCTGC AAAAACGGGT TTTATATGTG GATGGGAGCG TGAGCGGGAG GACTTGTGGC TATCAGATGC TTTATAGAGA TTTGATTAAA AGCACGATCA AACGCATTGA TTTTAACCGC CCTGAACGCT ACTACTACAA TTTAAGACTG CCCCTTTATC AGCCATGTTA TAGGCA ATGA
INFORMATION FOR SEQ ID NO:1008: SEQUENCE CHARACTERISTICS: LENGTH: 1251 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1251 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1008:
ATGCAATATA
AATGGTCTGT
TTGTATAATC
GGGGTGGATT
AATATTTATA
AACCCTAACA
TACGCTAACC
GATTOGATAG
TATTTTGGGA
GGGAATCGGA
TTGTATGTAG
TTCGTTCCTT
CAAGTGGGGG
CTAGTGCATG
GGCTTGTTTT
TATATCGTTC
TTCTATGGCC
ATTTCAGGCT
GCTTTTGGGG
GATTTGTCTT
AAAAGCCTTG
AGAAAAATAA
CTTTGAAAGC
AAGGGACGCA
TTTCTTATAG
ACAAACAGCG
ACGTTAAGCC
AGCGTTTTAA
GGGGCAATAT
TTTTTATGGA
TCGCCACTTC
GGGGGGAAGT
TTATTTTAAC
GTAAGTTGGA
GCATGTATCA
TGATCGATCA
CGGCAAGGAA
GTGGGATCAA
ATGTTTTTTT
ATTATCAAGA
TTGATATGGG
GAGATAGCTC
GAAAAGATAT
TTTAGAAATT
AAAAAACCCT
CAATGGGTGG
TTTGGCTAAC
TTATTTGAGC
AATCGCTTTA
TCAAGGGGTC
TAGCATGCTT
CCTAAACGCT
GTTTGTTTTG
GGACACCCGC
GTATGACGCT
ATACGGCAAC
AACTTTCAAA
CAATAAGGGC
CGCGCCCGGC
AGGGCTTAAG
ATATTCTTTG
TGGGGGGTAT
TTTTGTCTTT
TATCATTTAG
GCCGTCAAAC
CACAGCTATG
TCGTTTGGTA
CTTTACATCA
GCCGGCGATG
GGGCGTTTTA
TCTGTGGCTT
TATAATGGGC
CTAGCGTCTT
GGCGCAGAAT
TTGCCTTTGC
TCTTTAOCTA
ACCGATATTA
TATAAAATTT
TATCTATOGA
GTACCAGCGA
ACTAAAAGGG
ATGAGCAGTT
GTGTATGCTT
TTTGGGAAGT
CGTTAGGGAT
CCTTTGGCTA
TGGGGGCTTT
TTGGGGCTAT
GTCTAGGGAA
TTTCTGATGC
ATACCGATTT
TCAAGCAkA
ATCAAATCAA
ATGATCCTGT
ACAAAAATAA
CCACCCAAAA
AGGGTTTCAC
CTACAAGCGC
TTAATTTTGG
CTTTTAATGA
TTTATTTTGC
TGCGTTTAGA
TTAGGGTTTG
ATAATTCTAA
TTTTGTTTTA
CCTTTTTTGT
TCTGGGGCTA
AGCGCGTCTT
TGGGGCTTGG
TTTTTTTGGT
GTATCTTCAA
TGTGGATTTT
TTCCATGCGT
TAAAGAGCAA
GTCTAAACGC
AAATTTGATA
TGTTTTAGTG
TTCOCGCACT
TAAAAATGCC
AACGGGTTTT
CAGGACTAAA
TAACTCCAGC
TGCGATGGTG
GACTTATAGG
AGCCACGAGA
A
INFORMATION FOR SEQ ID NO:1009: SEQUENCE CHARACTERISTICS: LENGTH: 2007 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: (A) ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2007 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1009:
ATGGTGAAAA
CTTTTAAATA
GTCAACGACG
AATTCCCCCG
GTTACAAGCT
ATCCAAACCT
TATGAGCCAG
TATCAAATCA
ACCACTACAA
AAAAAATTAG
ATAACAOCAC
CAAGCGAGCA
AGCGGTTATT
ATCAGCGCTA
GTTAGTGAAA
ACAGACGCTA
AACCAAGCCG
TCTTTAGGGG
ACGACTTCTA
AAAAACAGCA
GCTGACACTC
ATCACCACTG
AAGAATAACC
TACAACCAAA
ATTGTTAGTT
AAACAATTCT
AACCATGCGT
TTTGGAGCGG
AACAACAAGC
AATTCTGAGT
GCGAATTTCC
AAAGACAGCG
AACACGAACT
TATTTGAATT
ACACCAAAGG
ATTACAACAC
CAAGGGATAA
CCTATCAAGC
ATGCCTTTAC
TTAATAATGT
GACATGGCGG
TTCAAAAGGC
AACTTGATTT
TATACCCATG
CAACAACAGA
TCATTATCAC
GGGCAGGGAT
TCCAAGGCAT
ACACGCAAAA
GTTTTGCTGA
AACAAGTGGT
TGTGTTATGA
ACACTTGGGG
TCGCCCATTT
TGGTGAATTT
CGCTCTCTAA
CCTATAGCCC
TCCAAACCAT
CTCAAACCAA
TTGGCCAAAA
TCATTAAATC
ACGCTCTTTA
TTTCTGTGGG
ATGTGAATTT
AATTCTTATT
ATCATGCGGC
ACTACTCCTT
ACGTGTTCGC
CATTCAACAG
CCTAAACACT
TCTAGGCTCA
GGTGCTTTTA
CGCTTGTGGT
GCCAGGACAA
GCCAATATCC
TTTGACAGCC
CACTATCAAT
GAGTCATGGG
AAATATCAAT
TACCCTGAAT
AAGTGGCAAT
GATCGCTAI.C
TCAAAACAGC
AAGCATGCTC
GAAAAACTTT
AGTGCAAGGA
GGCAGGCTGT
TGGCACTCAA
CAAATCTAGA
TATCCCTAAC
GCAAGGCATA
CAACCAAGAA
TAATGGCGCG
AAGAAAATGG
CAGCTTCTTC
TAATTTCATC
GCTTTTTGGG
AGCCACCATG
CAACATGGGA
TCAGCATGGG
TATGGGGGCT
TTACTAA
CTTTCAGAGA
CTTGTAAAGC
AGCACTAGGA
GCATTGAACG
CCTGOTAGCA
AACACGACGA
ACTAAAAATT
AATGGAGAAG
GGAGACAAAA
AAAGCTATTT
ACAACCAATA
AGTGCATGCC
GGGACAATGT
GCGCAAGAAG
CTAGACGCTG
AAAAACGCGC
GAAAAAATCC
GGTGAGCGTC
GCGTATGTAG
GAGCAGCAGA
TACAGCGAAT
GCGCAAAGCT
GACACCAATT
CTCGGGCGTA
ATGAATGGGA
GGCGCTAGGT
AACTCGGCTT
AACGATAAAG
GGTATTGCAT
AATAACGTCT
GTGAGGATGA
ATTGAGTTAG
GAACTCAAAT
ATTATGAAAA
TGTCCTCCGA
ATTTGCTAGA
CTGCAGTGGG
ATGAGAACGC
CCATCACTTG
ATGCGATCAT
GGATCCCAGT
GAACGGGTGG
CAACCTCGTG
GCGCTCAAGA
CAAACTTCCA
GTGGGATGTT
CTGTCGCGCA
GAAAACCATT
AAGCCCAAGC
CTACAGCCTT
GTGGCACCAA
GACAAACGAT
TACAGCAAGC
TGGGCAACAC
TGCAAAATGC
ACTACCTCAA
ACCCCTTTAG
TCGGTATTCA
ATTACGGCTT
CTGACGTGTG
CCACCAATTT
TAGCCGGGAC
ATAACGCTAA
ATTTAGCCAG
GGCTTAAAAT
ACCGAAGGCT
GTTGAACAAT
TCCGAGTGCT
TGTCAAAGCC
CTTGTGGCAA
GAATGGAGGT
TAATTCGTAT
CAACAAGGCT
TTTAAGCAAC
CGAACCAAAT
GAATGCAACC
GCTTTTAAAA
AAATGGTGGT
TAAGAATGAA
AGCCAAAATC
CAACCCCTAC
GGAGATTTTA
TGTAAATGAC
TCCGGGTCAG
AACAAATCTT
CGAAAACATC
TTATAACAGC
GGTGAGTAAA
TCAAAACTCT
GAAAGTGGGG
GGTGGGTTAC
TTTTGATTAC
GACTTATGGC
CTTAGGCAAA
TTCATGGCTT
AATGAACGTG
GCCTAAGAAA
CCCCACCATC
CTATAGCGTG
INFORMATION FOR SEQ ID NO:1010: SEQUENCE CHARACTERISTICS: LENGTH: 1029 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1029 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1010:
TTGACTAAAA
GGGGTGTTTA
GCTTATTTAG
TGCGAAGGGT
AACTCCCCTA
TCTCTTACCC
CAAAAAATCA
ACGCGCTTGA
TTAAAATGCG
TTAATGGAGG
GACGCTAAAG
GTTTCTATCC
GAGAGTTTGG
TATGAAAGCG
AAAGACGATA
GCTAAAGACA
ACTTTTAACG
CATGAGTGA
AATTCATGTC
TCTTCTTCAC
AGCAACGCCC
TTTTTAAAAT
TTATGGATTT
TTTCTATCCA
GCCAAATCCC
ATTGCTCTTT
ATTTGACTAA
CGCAAGAAAA
CCATAGAAGA
AAGCACGCCA
GCTTTTTTTC
CGTTAGCTTC
CCGCGCTCCA
AACGATCTCA
CCTTGTTAGA
TTGGATGGTG
TAGCATGTCG
AAATATCGAG
CGCATGCGTT
TAAAAATTTA
TTCTCAAATC
CTTAAAAAAC
AACATTCAAC
TGCAGAGAAT
TCTCTCCCTT
GTTGCAAGAC
TTTTAAAAAC
CCCTTATTTT
TTTAGAAAAC
ACAGAATTTT
AATCGTTCTT
AAGCCTAAGC
GTTATCGGGG
GTTAAAAAAT
GGCATGGGGA
TCTAAAGAAC
AAGATCAAGC
CAATCCCCTA
TTGAATGCCT
GCTTTAGATG
ATCCTTGCTT
AAAAATATTT
AAACTGCGTT
GTTTTAGAGT
AGTTTGCGTT
TATTTTATGG
AAAGGATTGT
AACGCCCAAG
GTGAATTTCT
CTTTAATTTG
CTTTAACCGC
TTATAGGAGT
TTCGTTTTTT
TCCATTCTTT
TTTTAGAACA
TATTGGAAAA
AAAAAACCTT
ACACTTTTTT
TTAAAACCTT
TTTTAGCGCC
CCTTTTACCA
CTCAAACCCC
CCTTGTTCCA
TGCAAGCCTT
CTAAAGACAA
TTCAATCTTA
CGTGCTTTTA
TTATCTTAAC
TCCTTTTAAA
AGATCCTCAA
AGATAAAAGC
ATCCATTCAG
ATTCAAACCC
AAACGATAAT
TCAAGAAGGT
GAGTTCTAAA
AAAGTTGAGT
ACAAAATAA
TAGCGTCTCT
ATCCCATTTT
TGTTTCTATG
CACCAAGCTG
CAAAATAAGC
INFORMATION FOR SEQ ID NO:1011: SEQUENCE CHARACTERISTICS: LENGTH: 882 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...882 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1011:
ATGAGATTGT
CCTTTAAGCG
CAACCCCATT
TTTGAJXACGC
GTGTTAAAAA
AATGCCTTAG
GCGGGATTAG
TTACCCAAAG
AATCTCGCAA
AAACCCCCAC
TTAATATTGT
GATTTGAAGC
ATAGAAACCA
GACAAAATAI\
AAGAATTCAA
TGTTCTTGTT
ATGACGCCCC
CAAACGCTCC
CTTTTAATAA
ACGCTCAATT
AAATGTTTTC
AAATCCAAGC
GTTTTCATTT
GCCAAGATGA
TGGATTTAAG
ATGTAATCAA
TAGACATTAG
ATAAGGAGCG
GCCCAAAAAC
AATTAGGAAA
ACTGAGCGCT
CATTAAATTG
GGCAACGCCA
AACGCCTAAA
GGATTCTAAA
CTACCAAAAT
TTCAAGCAGC
AGCCCCACCG
AAAAGAGCGA
CCATGCTTAC
AAAAAAAATT
CGTTTTGGGT
TTACTTGGTC
ATCTAAAGAA
TTTATATGCC
ACTTTGATGT
OTTCATTGGC
CCTATAAAGG
ATCATGGAGG
AAAACGATGG
GACATCTACC
AAGGATAAA.A
CCTAGCCTGA
GCCAAAhAGCC
AAGGGGTTAG
GTTCCTACAC
CGTGTTGATG
TTACTAAGCG
GAACTGATTA
GGAAAATTCT
TACTGGCTGA
AAAATGCGCT
CCGTGCAAAC
TTGAAGGGCA
ATTTTAAAGA
TCTTGTCTAA
AGCAACTCCG
AAGAAAAATC
CCTCAAACGC
CGGTTATTGC
CAGGGTCTTT
CGAACCATAA
ACAAATACGG
AAGAAGCTGA
AA
AGAAAAAATA
AAAAGAAGTC
CACCCTCACT
AAAAGTCATC
AGCCTCTTTG
AAAAGCTAAA
CTTTCTTTTT
TCAGCATGCC
TCTAGAGTTA
TGTCTTACTC
TTTAAAAAAA
AATCATCTCA
CCTGCTTTTA
AAATAAGATA
INFORMATION FOR SEQ ID NO:1012:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 861 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...861
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1012:
ATGCTTAGAA
GCTCAAGAAA
TACAACAATA
CGTGATATTT
AAAATCTATG
CAAAATGGTA
GAAGTGGAAG
AAATTGTCTT
GCTAAAGCGG
TATCGGGGGA
AGAGACGGGA
AGCAATAGCG
AATGGCTATA
TCTACAAGGC
GCCAGCCAGC
ATCAATTTCG
ATACCCACAC
AAATGTATAT
TTAGGACTAG
TTAGGGGGAT
ACATTTTCCA
TGATTAAGGG
TCACCACGAT
AAGCAGCCTT
AAAACTGGGA
ATAACGCTTT
AAATGAGCGT
TCAACGAAAC
TTTTACGCCC
CCCCTTTGTG
TATCGTGTTT
TTTGGGTAAG
TGACAGGAAA
GGCGGATGTG
TGAGAGCCGT
CCATGACGCT
GGCAGCGAAC
TGACGCTAAC
TTACACTAAC
CATCCTAGCC
TAGGAGTCTC
AGGGACTCCT
CGATAGCATT
TAACACCACT
GTCTCTTGTA
GTAACCACTA
GAGCTCCAAC
AATGTGGCCA
CTCTTAAGGG
AACACCGTGA
GCTTCAGCAG
GACTTTTTAA
TTTGGATACC
TATTACAACC
TTCCACCCTA
AGTGAAGTCA
AGCGTGAGCT
TCAGCCCTCT
TTGTCGCTAG
AGGGTGAAAG
AACGCCAAAG
GTGGGGGCTT
TAACGATAGA
TCGATCCCAA
GTCCAGGCGC
GAAAGAATCA
GCATGAACGC
ATCAAkATAT
ACTACGATTT
ATAGCGTTTT
ACAACCTCAC
CCAAAGCCAA
CAGTTTGCAA
GACTTTTGAA
CAACCAAATC
AATGGCGCAA
TGGCGTCGCT
CATGATTAAA
GGTTGCGGGT
AACTTATGGC
CACTGCAGCC
TTTTTACTAC
ACAAGATCCA
AGCTAAAATT
ACGAGACAAT
TGACCAGGAA
INFORMATION FOR SEQ ID NO:1013:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1026 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1026
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1013:
ATGAAACGGC TTTTATTGTT AGCCTTGGCC CTCTTTTTTA GCCTCTCATG CACTAACGCT
CAAGAAATTA AAGAAACTCA AGAGACTAAA AAAACTAAAG AAACTAAAAG CCAAACCCGT 120
TTTAACATTT CCACCACTAA GGTCATAGAA AAAGAATTTT CTCAAAGCCG GCGCTATTAC 180
GCGCTTTTAG AGCCTAATGA AGCGCTGATT TTTTCTCAAA CCCTGCGTTT TGATGGCTAT 240
GTGGAAAAGC TTTATGCGAA TAAAACCTAT ACCCCCATTA AAAAGGGCGA TAGGCTATTG 300
AGCGTGTATT CCCCTGAATT AGCGGGCGTT CAAAGCGAGC TGTTATCGTC ATTGAAATTC 360
AACCAGCAAG TGGGAGCGAT TAAAGAAAAA TTAAAACTAC TAGGGCTAGA AAACTTTAGC 420
ATTGAAAAAA TCATCAGCAG CCATAAAGTC CAAAATGAAA TCACTATTTA CTCTCGTTTC 480
AACGGCGTTA TTTTTAAAAA AAGCCCGGAT CTCAATGAGG GGAGTTTCAT TAAAAAAGGG 540
CAAGAGCTTT TCAAAATCAT AGATTTAAGC CGATTGTGGG CGCTTGTTAA AGTCAATCAA 600
GAGGATTTAG AATTTTTAAA AAACACGCAT CAAGCGATTT TGTTCGTAGA GGGGGTTAAA 660
GGCAAGCAAG CAATCACGCT TGAAAACATC AACCCCATCA TAAATGCGCA AGATAAAATG 720
CTAGAAGCGC GCTTCAATGT GCCTAATCTT AAATTGCTTT ATTACCCTAA CATGTTCGCT 780
CAAGTAGAAA TCTTTCACAA ACCCCAAAAA ATGAAGATCT TGCCTAAAGA AGCGGTTTTG 840
ATTAAGGGGG GGAAAGCTAT CGTGTTTAAA AAAGATGATT TTGGCTTAAG CCCGTTAGAA 900
ATTAAAGCCG TCCGCTTGAG CGATGGGAGT TATGAAATTT TAGAGGGTTT AAAAGCGGGC 960
GAAGAAGTCG CTAATAACGC TTTATTCGTG CTAGACGCTG ACGCTCAAAA CAATGGGGAT 1020
TATTGA 1026
INFORMATION FOR SEQ ID NO:1014:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1392 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1392
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1014:
ATGGCTTTTT
TTTATGGGTA
AAAAAGCCCG
ACGAGTTTAG
CAAAAAGCCA
CCTGAAGATA
TTCCAAGCGA
CCCATTCAAA
GAAGAATTAG
CAAGTGTTTA
AAGCATATCC
CGCTTGATCT
AAAAGTAACG
TTGCACAAAA
GACGATGATT
AAAACTTTGG
CGCAAACTCA
GAGAACCCTC
TTGAGTTTGA
AGCGCTCCTG
TTATCCCCTT
GCGATCGCTA
GGGCTTATGC
ATGTTGATCT
GCGCTCGCCA
TGCTTGGCGT
CCGGGATCGT
AAAACGCTAA
TGCAAGAAAA
TTTATTGCAA
GTTGTATCGC
CCTTTAAACC
AAATCTTGCA
GCGCGCTCTT
CGATTAAAGA
ATCAAGTCAT
CCACCCAAAG
AAACTTCTAA
TTTCAAAAGA
AGCACCTTTC
AAACCACGCC
CTTTTAACAC
AAGATGAGGG
AATTGACTTA
ATGAGGGCAT
GGCAAGAAAG
AGATCATGCC
AA
AAAGCCTCTA
TGGTTTTTCT
TAGGGATTAT
AAAAGCCTAT
AGGCGTAGAA
GCAAATTACC
TATCGCTTTA
TTTACAAGAA
AAGTAAGAAC
TAATCATTTG
GTTAAACCGC
CTTAGATCCT
CAACGCGCAA
AGCGCTCAAG
CAGAGCGATT
ACAAAGCCCA
CAGTTATCGC
TTACGATCCT
CGCGTTTAAT
CCTTTTAAGC
TATTGAGTGG
CTTTTTGCTC
TTTTAATGTA
AAGGAAATCA
CAAACCGAGT
TATTTATGGC
GAATTGACTC
AATTCAGATA
TTAGAAAGCA
AAATCAAAAA
AAAATCAAAG
GTGAGCGCTT
AGTTATGAAA
CTTTTAGATG
AAATTGGATC
ACCTTTTTTA
TATTTTGAAC
TTTTGGCAGT
GCCCTAAACC
ATCATTTCTC
TTTTCGTGGC
GCGATGCTAA
CAACGCAATA
CAAAATACAG
CCGGCTTTAA
GGGCCTTTCG
TGCGTTTTTT
TGGATTTAAA
GTTATATTAG
AAAATAAAAA
AAAGCCCTGA
TGTTAGAAAC
TCAGAGATTT
AGGCTTACCC
CTTTGTTTAA
AAAAGCTCCA
AAGATTACCC
ATTTTAAAGA
TTCTAGGAAT
GCTCAGAAGC
ATTTAGCCTC
TCTATAGTCT
ATATCCAAAA
AAATTTTTAA
AAAGCCTGTA
AAGACAAGAT
ATGAAAGGGC
TCTCTCGCTC
CTAAAGCCTT
TATTTTATTT
GGATTTAGAA
CGATAAAAAA
CAGCGCCCTA
TGCCAAAATG
AACAGACGCT
TGATAAAATC
CATTCTTTAT
GGCTAACGCG
AATTTTTGAA
GGCGTTTAAC
CGCTCTCGCT
CAATGAAATC
GGTTGTCAAG
TAAAAAGAAA
TTATGCGAGC
TTTAAGCCAA
GGAAAAAACT
TTATGAAAAA
TTATTATTAT
TATGGCGTAT
CTTCGCTCTA
GGCATGGATA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1392
INFORMATION FOR SEQ ID NO:1015:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1125 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1125
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1015:
ATGCCAAAAA GAATGAAGTG TTTTAGTCAA AAATGGTTGG TTTTTTTTGT TACCCATTGG
CTTTTGTTGG
GCGTTAGAGG
AAGGTGAATA
TTCATCCCTA
TTAGCCATGG
ATTTGGCAAT
GATGAAAGAA
TACAAGCAAA
GTTCAAAACG
AAAAAATACC
AAATTCAACA
GTGAGTTTAG
TTGAATTTGA
CCCTCTAAAG
CAACGCCAAC
CATGTGGTCT
TCCAGTATCC
ATCATCCCCA
CTTCTTTAAG
CTTTTGGGGT
AGGAAGAGGA
TCATTAAAAA
CCGAGTCTAA
TCATGCCAAG
GAGATCCCAT
CCGGTGAGTG
CTATTAAAGC
TCCCTAAAGA
GCCTAGACAA
TGGGCGTCCC
GTTTAGAAAC
ACCCCACCTA
TCAAACAAA
TGCCTAAAGA
AATTAGCCAA
CTAACAAAAA
TCATGCGAAG
CAATGCGAGC
ATGGAAAAAG
CATGCTCATA
GTTTTCATCA
CACGGCCAAA
TAAAAGCACT
GTATTTGGTC
CGCCGGCACT
AACACGAGAG
TCTCAAAGAT
GTTTAAAGA
CTTAAAATCT
CACCATTTAT
TAAAAACGCT
AACCTTATCT
TAACCTCAAA
ATTACTCGCT
ATGGCTTTTG
TTTTTATCCC
TTAGTCAAAA
GGAGCGAGCG
AGGGCTTATA
GAGTTAGGGC
CAAGCGGCGA
GCTATGGCGT
TCAGACATTA
TATATCCGCT
AAAGAGTATT
CACACTTCTT
TACAACCACC
ATCCCTTATG
CAAGCTAATC
TCTATCGCTA
GATTCTAATA
ACAAGGGAAT
AATCCAATAT
AAATGCCAGG
GATTTGACGT
TGCCGCAAGA
GCAGGAAAAA
TTAAGGTCAA
TCGCTTATTT
ATAATTACGG
AAGTCTTGCT
CCATTCTAAG
TGCTCAATCG
TAGTCCAGGT
AATTCCGCTA
AAAAACTCGC
CTAAAAGCCC
AACGCTATCA
TCTTTATCCA
TTTAA
TGACACCAAA
CGCTTTAAALA
GAATTACCAA
ATTTTTGTTT
AGCGGTAGGG
TCATTACATT
GAAACGGCTC
CTTACGCAAG
AGATGAAGAT
CCTGGCGTTG
TGGGGCGAGG
GGCCAAAAAT
TAACATTCTG
CCTTTTCAAA
TTTCATCACC
AGTCAGCATT
CCAGCGCTTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1125
INFORMATION FOR SEQ ID NO:1016:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1002 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1002
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1016:
ATGCTAATCG
TCATCGTTAT
GGGGAGCAAA
GTGCCTGCCA
TCTGATATTG
CATGTGGCGG
TTTGTGGGTA
TTCCAAGAAA
GAAGTTGACG
CGTTTGAAA
AGCGGCCATC
TTGAAATACG
CCTGAGATTA
CCTAAATTTT
GATATTGGCG
CTCGCTTTAA
TAGGCGTGGC
CCATAAAGCT
TGTTCAATAC
TTAAAGCCAC
CGTTGAATGT
ACCCTAAAGC
AAACCATTGT
CTTCTAAAAA
ATGTCAAAAA
AAGCCCTTGA
TCAAGTTTGG
TTTTTATTTG
CCACTATCAA
GACCTAGAGC
AAAAGCTTTA
TAACGCTTCT
TCCTGTTTCT
TTGGGATAGG
TCTCAAAGAT
GGAATTATTA
GGTAGAGCAT
AGAGGTCATG
ATTGGCTAAA
GAAAAAGGGG
TTCAGATATT
GCGCGCTGAC
GTGGATAAGC
AGCCATTAAk
CCCACTCATT
ATTTCCTATT
AATCAAGAGA
AAAATAATCT
GTCGTGGGCA
CCTGAACGCA
AAAAAGCTTA
GCGAAAAAAT
GAAGATATTG
ATGCAAGAAA
GTGGAGCTTT
TTAGAAAAAG
ATTAGTGTGG
CCGCTTAGCC
AACAAGCAAG
AGTCTTTTTA
CTTTAGGGGT
TCCAAGTCAA
ACTTGGGCAG
TTTCGGATTA
TTAAACCCAT
GCCCTGATCT
TTGGGATTTC
ACGCTCAAGC
CTTTGGATTT
TCCATAAAGC
GGCATAGA
AAAAAATCGT
CTGAAGACGT
TCTATAAGCT
TCGCTTTAAA
TCTTCTTGTT
GGATTATTTT
CTTTGCAGAA
CGCTTTTAAA
GAGCAGTGAT
TGTGGTAACC
ATTCCTTTCT
TAAGGCCTTA
TATCAAAGAG
CAATAAAATC
CAATTTTGGC
TAAAGAAAAC
GTTGAACAAC
CCCCACAATG
AGCCCACCCT
120 180 240 300 360 420 480 540 600 660 720 780 840 900
GAAGCGTTTA AGGGCGTGGA TATTAATGCG ATAATCAAAG ATTACTATAA AGTGGTCTTT 960
GATTTGAATG ACGCGGAAGT TGAACCCTTT TTGTGGCACT GA 1002
INFORMATION FOR SEQ ID NO:1017:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1602 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1602
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1017:
ATGGAAATTA
TTAAAGATCG
GGGCAGGCTA
GCGATGAATA
GCCATTAGCA
ATTGCTGGGG
AACGATGCGC
TTGCATGTAG
ATTATGGGGG
GGCATTGTGG
TTAATGCTCA
GTCGTGCCTT
GTTTTAAAAC
GCGCTTTTGA
AATAGCCACG
TTAAGCTTTG
AGTCAAACTT
TGGATTATG
CTCATTAAAA
GCGCTTTCAG
ACGCATATTG
CAATCCAGAA
GATTTAGAAG
TCGCTCATGC
AAGAAAGAAA
TTGAAAAAGA
CTTCTTTTTG
GAAACATCAA
CTCTCGCTCT
ATTCTAAGGG
TTGGCGCGAA
TGGGTGGGGC
GGGAAGTGGT
ATGTTTTCAT
CCACTTTAAT
CTGGAATGGC
CTAGTTGGGT
TTAAAAAGAC
ATTTGGTGGC
GCCTCTATGC
TTTTTATCCT
AAAGCGTCAA
CGCATGGGGC
TAGAAGATGC
TAGTAGGGGC
COGTGGGGTC
CGGTCATCAC
TOOTGCGC
GGCGTTTTGC
AAATTGAAGG
TAGAGAGCTT
AAAAGTCGCT
TTGTTACCGC
TAGCTCTTGG
AGAGTTTGAA
TTTGTTCCTC
ATTGTTGCTC
CGATGTGTCT
GATTTTGATC
TTCTACGATT
TAATGTCATG
AGGCGCTCCC
AGCAGCTGGA
AATCTCGCCT
TATCGCTTAT
GTTGATGAGC
GGTGGGTTTT
TTTTAAAAGA
TGAGCTTTTT
TAATGATGTA
GAGCAGCCCT
AGCTGGGATC
TGAAATCACA
CGTGCTTTTA
GGTGTTTGGG
TAGAATCAGA
CTTTTTAGAG
GAAAAAAAGC
TAAAAAAGTG
TTGGTTGGTA
TTTTATAGAA
AAAGCTTCCA
ATTGGCGCCG
ATCTTTGCGG
AATAATGTCG
GCTGCGGTTT
AAGGGCCGTA
TTGGCTAGCC
GTTTCCACTT
ATGTCTGCTA
TTAATGGGGG
AAAGAAGATA
TTAGCCTTTA
GAAATCCAGC
TTTGTGTTAA
AATGTCCCTT
GCTAACGCTA
ATGGGGAGTA
GCTTTAGGCT
GAATTAGACA
GCCTCTCAAT
GTGGGCTTTT
GACAACATTG
CGCTTTGATA
AAGAACACCG
TATAAAGAAG
ACCGTGCCOG
AAGTATTTCT
AAAAACTCCA
CTTTGCTCGC
CCGTGATTGG
GCCCTGCCGT
GTGAAATGCT
TCGTTTCGCC
TTTTGAGTGG
CGCACTCCGT
TCAATTGGCA
CTTTGATAGC
AAAAGAGCGC
GCTGGTATTT
TCGCTTGCGG
AAAAAGCCCC
TGATTTTTGC
TCGGTCCTTT
CTTTAAACTC
TGAGCTTGTA
AAATGCAAGC
TAGGCTTACC
TAAGGGAGCG
TAGCGGCGCA
AAGCCAATTT
CTATCGCTTT
AAGTGATCAA
TTTCTGCGCT
AG
AAAAGACACT
TCTCATTTTT
GGGGTATATG
AGGCTCTAAA
AGGAGCGATC
TGAATTTATT
GGCGTTATGG
AGTGGGAGGG
TTTTTTATCC
CATGTTTTTT
GGCTTTAAAG
GATCGTGAAG
TTGTGTCCTT
GCAATTAGAA
CGCCGCGCTT
AGCCGCAATC
TGTGCCGTTG
TGGGCCAAAG
TTTTTGCATC
GGTAAGCTCT
TTTGAGGGAG
CTTTGGGGAA
GAAAGAAAAA
GGAATTGAAA
ACGCTCCATT
TTTAGGGGCG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1602
INFORMATION FOR SEQ ID NO:1018:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 990 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...990
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1018:
ATGTCAAATA
TTAGGGCTAA
TTTTTAGAGG
TTTGTTAAAA
GAATTAGAAG
GATGAAGTGA
GCGGCAAAAT
GACAATGGCG
AAATACAACG
GAAAGCAAGA
GAATCCTACT
CTTTTAAGCG
AAGGATAGTT
AAGGAATTTG
TTGAGCGTGA
ATGAAAAGCT
ATGAGCGTGT
GCATGTTGGA
TCGTGCTTTT
CTAGGGAATA
AAGGCGATCG
CCAAGCTCGC
AAAGAGGCTC
CCCAAGCGAA
TGGCGAGTTT
AGAGCGCGGC
TTGCCGCTAA
TAAAAGACGT
GTGGCGAGCT
GGTTAAAAAT
AAGGCTATAT
TGGGGGATTT
ATGAAGTGGA
TGGTTACCAT
TAAAAATAAA
TTATTTGGCT
CAGCGTGAGC
CATTAAAAAG
TCAAGCTGAA
AAGAGATGAA
TTTGGCTAAA
GCAAAAGCGC
TTACCAAAAG
GGCTAAAGAG
CAAAGCCCTA
TAGCCCTAAG
CAGCGTGCCT
CCCAGCGTTG
TGCGACCTGG
GGCCATACCC
TAAACCTTAA
GCGATTCTTA
TATCGCCCTA
TCTAAAGTCC
GGCGATTTAG
GCCGGGCATA
ACGATCAATT
GAGACTTATA
GATGAAGCCT
TATAAAATGG
AGCGCGGCTT
GCCCCTATTG
GGCTTTCCTG
GAAAAGTATT
AAAAGAAGCG
AAAGCGACGA
TTAGAAGAGT
CAGGGGGTGG
AGGCTGAAGT
CTGGCCGCAT
TTTTTAGCAT
AAGCCGCTAA
CTGCGAGGGA
AGCGCGTTCA
ATGCGGCTTA
CTTTAGGGGG
TAGGGCAAGT
ATGGGGAAGT
TGGTGCTCAT
TGAACGAGTT
CGAAATTCAG
ATAATTCCAA
TGGAAAACTT
GGCTTTATTG
GTTGCAAGGG
TGAAAAGGTG
TTCTAGCCCT
AGCCGTTAGC
CGTTTGGCAA
AGATTTGTAT
TGAAAGCACC
GGCGAGTTCT
GAATGAAGTG
GAGTAACGTG
GATAGATTTA
TAAAGTGGGT
GGTCAAATAT
CACTTACGAC
TAGGGTGGGG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 990
INFORMATION FOR SEQ ID NO:1019:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 879 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...879
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1019:
AGACTGGAAT CACCGCTACA CTCAGGCTTT AGCCCTAGCG CGGCCCTTGT GATCAGCCCC
ATTGAGCCTT TATCTCTAAA AATCACTTAT TCTCAAGTTA CAAGGGGGGT TATGCCAGGA
GATGGCGTGT ACATGCGTCA AAACGATTTA CGATACGCTA AAAATATCAA GCCTGAAGTO
GCCTCTAACG CTGAATTTAA TATTGACTAT TCAAGCCAGT ATTTTAGCGG GAGGCCTGCG
GCGTTCTATC AAGCTTTGGA TAATTTCATC TCACAATACG CGCAACGCTT GATCGTAACC
AATTTGAGTC AAGCGATTCG CATTTATGGC TATGAAGTGG GTGGGACTTT TAGATACAAG
GGCGTGAGTT TGAATGTGGG GATCTCGCGC ACTTGGCCCA CCACTAGGGG GTATTTAATG
GCGGACAGCT ATGAGCTTGC CGCAAGCACC GGTAATGTTT TTATCATTAA ATTGGATTAC
ACCATTCCAA AAACAGGGAT CAATCTTGCA TGGCTTAGCC GCTTTGTTAC CGGTTTGGAT
TATTGCGGGT TTGATATTTA CTTGCCTGAT TATGGGACAG CTGAAAAACC CAAAACCCCT
ACCGATTTAG CCAAATGCGG ATCCCAATTG GGGTTAGTGC ATATGCACAA ACCGGGCTAT
GGCGTGAGTA ATTTTTATAT CAATTGGAGC CCTAAAACCA AAAGCCACTG GAAGGGTTTG
CTGCTTTCAG CCCTGTTTAA TAATGTTTTC AACAAATTCT ATGTGGATCA AACAAGCCCC
TATGTCATGA GCCCGGATAT GCCAGGCACT GACGCTATTA AAAGAGCGAT CGCAGAGCCT
GGGTTTAATG CGCGTTTTGA AGTGGCTTAC AAATGGTAG
120 180 240 300 360 420 480 540 600 660 720 780 840 879
INFORMATION FOR SEQ ID NO:1020:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1455 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1455
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1020:
ATGGCTAAAA
AAAAAGACGA
TCAGAGGGCT
AAAGCCTTGA
GTGGCGACCT
AAGGCTTGCG
ATCGCGTGCG
AGCACCGAGT
GGGACGATTC
GCCTTTATCC
GGGGGGACGA
TCTTTGCATG
ATGCTCAAAG
AGGAGCGGGG
TCACAACCGT
GCCAGTTTGG
GTTATGCGTT
GCGAATTTAA
CAGCGGTGCG
GTTTGCAAAT
CGAATTTGCT
GCGCGTTGAT
GCATTAAAGA
AAAAAGAGGT
TCAGAGCCTT
GCTATGAAAT
AAGACAAATT
CGTTGATTTT
GATTGACATA
GTTTTACTTG
TAAAGGGGTT
AGAAATCGCC
CGATGCCCCT
CAAAATCATT
GCATAAAAAT
TGAGAAAGGT
AATGTTTCTA
CTCTAAACTG
GAGTAAGGTG
AGATGCGCAT
ACGGCTTTTA
ATCAGTCGTG
GGCTCTAATT
CTTTTTGAGA
TTACAAGAAA
CTCAAATACA
AATCGGTTAG
GATGGGCAAA
TCAGGGATCA
AAGATTAAGG
GACAAAGACT
CCTTTTAAGC
CTGATGAAAC
AAAAATTTAG
GGGGTGAATG
TTAGAGCATT
CAGTGCGTTT
CTAAGTCCAA
TCCCCATGCA
AAAGCAAAAA
AGTTTGTAGC
AAGAAGCGCT
CGATAGACAT
ACTTAATCTC
TAGATGTCAA
ATAAAAACGC
GCTTTGATTA
CGTTCATTGA
AAGAGCGTTT
TGAAAACTTC
GGCTGTCTTT
GGTTAGGATT
ACGAGCCGTT
AATCCTGTGC
GAGGGTGAAA
CTATGGTGGG
TGGAGGGGGT
GCTTGATGTC
ATTGGCTAAA
CTTTGGGGTG
CCCTATAGAT
AAAAATCGTC
GGATAGCATT
CTTGATGATC
120 180 240 300 360 420 480 540 600 660 720 780 840
ACTAGCGGGG TAGGGGTGAG AGAGGGCGTG CATAAATTCC CCCCCAATAT CAACCCCTCT CATGAAAAGC ACAGCCAAAA GGTCAAAAAA CCTTTGCATA AAATAGATGA AAAATACCTT AGCATGGGTA AGATTTTAAG TGTATATTTA AACGCTTTGA GTTACGGCTT TAGCCACCAG TTTAGCCATA AAAAAATCCC TAAAGACAAC AGCCTTTTGA CCTTGCAGTG GCTGAGTTTT ACAGACAGCC ATCATTTAAA ATACACGCTA GATGCGCTTT ACTTGGCTAA AGAAATGCTC ATAGAGTTTG CTTGA TTTTTGAGCG ATTTATTGCG CTCATCTCTC TAAAAGATCG GAATGCGTGA AATTGTTTGA TTCCATTTAA AGATTGCGGG GCCCACAAGC ACAGCGCGTA GATAGGGCGA TCATTTGCTT GCTATCGCCC ACATGAGTGC ATCCTTTCTT TAGCCGAAAA GAAAAAAACA AGCTTGTGAT CCCAAACTCA TTAAGCCCAT
CCACCATTAC
CTTTTTACCC
AGTTTTATCG
GGAATTGGCG
TTTCATTTTA
ATTGGCGCAA
GATGATGCCA
TTTGTGCCTG
CCATTCTAAT
TCCTTTGACG
900 960 1020 1080 1140 1200 1260 1320 1380 1440 1455
INFORMATION FOR SEQ ID NO:1021:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 669 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...669
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1021:
ATGGTAGGAA
GTGGTTTTAG
AAAAAAGACA
AGCACGTTTT
ACCAAAGACA
ATTAAACAAG
ATTAAACAAG
TCGCCCGTTC
GAGTATAAAG
ATCATTGGCT
GCTGAAATTG
GCTGAATGA
TGAAAACTGA GATGAAATCT TTTTTAAAAC CGTTCATGTT GTTGTATGCT TTAGCGCATG GCGCTCCAAT GAGTCCAAAT GTAGAAAAAA CGCCTAAAGA AGAAGCCAAC GCAACCACAA CAGTGCCACC TTTAGACACA GCCACACAAA AGATTAAACA AGAGATTAAA CAAGAGATTA AAACTAAACA AGAGCAAGAA AAAGAAAATA AAAACGATCA AAAAACCCCC ACAACCCCCT TCGCAGTCAG TGGCGTGAAC GTGCGCGCTT CGCTTATAAA AAATAAAAGC GTGAAGGTTT AATTTTCTCA CGAAACAAAG GGATATGTGT TTTTTGCACA GCCTTTGTTG CTGTGCTTGG TTTTTACGTG GCGAGACAGA GCGCCAAAAC CCGCCACAGA ACAAAACCCC AACAAGAGAT TAAACAAGAG AACAAGAGAT TAAACAAGAG AGCCTAAACA AAACAGTGTC TAATGGGAAA AAAACCTCTA TTCCTAGCAC AAAAGGCAAA TAGAAATCCA Ak-ACGATTGG TTTTAAAACT TTTAAAAAAG
120 180 240 300 360 420 480 540 600 660 669
INFORMATION FOR SEQ ID NO:1022:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 408 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...408
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1022:
GAGCCTGATG CAACAAGCCG CAGAACATTG AATCACCCCT ATTTTGGCGT TTTTGTTTTG
TTGGTATTCA CCTTTTGGGT GTTTAACTTA ACCTTAAGGA TTCAAAGGTT TTTAAGCCGT
AAAATGGCTC AAAAAAAGGG CGAAAAGCTC AAGCTCGCTC CCTATGAATG TGGGCCTGTG
GCTCTCAAGC AGCCTAATAG GGTGTCCCAT CATTTTTATA TCATGGCCAT GCTTTTTATT
TTATTTGATG TAGAAATCGT TTTCATGTTC CCTTGGGCGA TTGATTTTAA AAAGTTAGGC
TTGTTTGGGC TCGTTGAAAT GCTAGGCTTT GTCTTCTTTT TGGCAATCGG TTTTATTTAC
GCTTTAAAGC GAAACGCTTT GAGCTGGCAA AkATTAGAGG TGAAATAA
120 180 240 300 360 408
INFORMATION FOR SEQ ID NO:1023:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1908 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1908
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1023:
CTGTCCTCCG
AATTTGCTAG
GCTGCAGTGG
AATGAGAACG
ACCATCACTT
TATGCGATCA
GGGATCCCAG
AGAACGGGTG
TCAACCTCGT
AGCGCTCAAG
CCAAACTTCC
TGTGGGATGT
ATCCGAGTGC
ATGTCAAAGC
GCTTGTGGCA
CGAATGGAGG
GTAATTCGTA
TCAACAAGGC
TTTTAAGCAA
GCGAACCAAA
GGAATGCAAC
AGCTTTTAAA
AAAATGGTGG
TTAAGAATGA
TGTCAACGAC
CAATTCCCCC
AGTTACAAGC
TATCCAAACC
TTATGAGCCA
TTATCAAATC
CACCACTACA
TAAAAAATTA
CATAACAGCA
ACAAGCGAGC
TAGCGGTTAT
AATCAGCGCT
GCAAGGGATA
GCCTATCAAG
TATGCCTTTA
TTTAATAATG
GGACATGGCG
ATTCAAAAGG
AAACTTGATT
GTATACCCAT
CCAACAACAG
ATCATTATCA
TGGGCAGGGA
ATCCAAGGCA
ATCTAGGCTC
CGGTGCTTTT
CCGCTTGTGG
TGCCAGGACA
GGCCAATATC
CTTTGACAGC
TCACTATCAA
GGAGTCATGG
AAAATATCAA
CTACCCTGAA
TAAGTGGCAA
TGATCGCTAA
AAGCACTAGG
AGCATTGAAC
TCCTGGTAGC
AAACACGACG
CACTAAAAAT
CAATGGAGAA
TGGAGACAAA
GAAAGCTATT
TACAACCAAT
TAGTGCATGC
TGGGACAATG
CGCGCAAGAA
120 180 240 300 360 420 480 540 600 660 720
GCTGTCGCGC
GGAAAACCAT
CAAGCCCAAG
CCTACAGCCT
CGTGGCACCA
GGACAAACGA
ATACAGCAAG
TTGGGCAC
TTGCAAAATG
TACTACCTCA
AACCCCTTTA
ATCGGTATTC
TATTACGGCT
TCTGACGTGT
GCCACCAATT
TTAGCCGGGA
TATAACGCTA
AATTTAGCCA
GGGCTTAAAA
TACCGAAGGC
AAGCCAAAAT
TCAACCCCTA
CGGAGATTTT
TTGTAAATGA
ATCCGGGTCA
TAACAAATCT
CCGAAAACAT
CTTATAACAG
CGGTGAGTAA
ATCAAAACTC
GGAAAGTGGG
AGGTGGGTTA
TTTTTGATTA
GGACTTATGG
TCTTAGGCAA
CTTCATGGCT
AAATGAACGT
GGCCTAAGAA
TCCCCACCAT
TCTATAGCGT
CGTTAGTGAA
CACAGACGCT
AAACCAAGCC
CTCTTTAGGG
GACGACTTCT
TAAAAACAGC
CGCTGACACT
CATCACCACT
AAAGAATAAC
TTACAACCAA
GATTGTTAGT
CAAACAATTC
CAACCATGCG
CTTTGGAGCG
AAACAACAAG
TAATTCTGAG
GGCGAATTTC
AAAAGACAGC
CAACACGAAC
GTATTTGAAT
AACACGCAAA
AGTTTTGCTG
GAACAAGTGG
GTGTGTTATG
AACACTTGGG
ATCGCCCATT
CTGGTGAATT
GCGCTCTCTA
CCCTATAGCC
ATCCAA~cCA
TCTCAAACCA
TTTGGCCAAA
TTCATTAAAT
GACGCTCTTT
CTTTCTGTGG
TATGTGAATT
CAATTCTTAT
GATCATGCGG
TACTACTCCT
TACGTGTTCG
ATCAAAACAG
AAAGCATGCT
TGAAAAACTT
AAGTGCAAGG
GGGCAGGCTG
TTGGCACTCA
TCAAATCTAG
ATATCCCTAA
CGCAAGGCAT
TCAACCAAGA
ATAATGGCGC
AAAGAAAATG
CCAGCTTCTT
ATAATTTCAT
GGCTTTTTGG
TAGCCACCAT
TCAACATGGG
CTCAGCATGG
TTATGGGGGC
CTTACTAA
CCTAGACGCT
CAAAAACGCG
TGAAAAATC
AGGTGAGCGT
TGCGTATGTA
AGAGCAGCAG
ATACAGCGAA
CGCGCAAAGC
AGACACCAAT
ACTCGGGCGT
GATGAATGGG
GGGCGCTAGG
CAACTCGGCT
CAACGATAAA
GGGTATTGCA
GAATAACGTC
AGTGAGGATG
GATTGAGTTA
TGAACTCAAA
780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1908
INFORMATION FOR SEQ ID NO:1024:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 525 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...525
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1024:
ATCAAACGCA TCATTAAATC AAACGCATCA TTAAATCAAT TAAATACCAC TAGATACAAT
ACCCCAAGTC ATCTTTTTTT CAAAAAAGGG GTAGGAATGG CAACCATTCA ACCATTCAAC
CATTCAACCA TTCAACCATT CAACCATTCA ACCATTCAAC CATTCAATCA TTCAATCATT
CAATCATTCA ATCATTCAAC CATTCAAGCA ACGCTACCTT ATTTTTATA CTATTTATCT
TTTTACAAAA ACCTATTTAA AAATCCCCTA TTTTTTATTA TTCCCCCTTT TATTAACCCT
TTTATTAACC CTTTTATTAA CCCTTTTATT AACCCTTTTA TTAACCCTTT TATTAACCCT
TTTATTAACC CTTTTATTAA CCCTTTTATT AACCCTTTTA TTAACCCTTT TATTAACCCT
TTTATTAACC CTTTTATTAA CCCTTTTATT AACCCTTTTA TTAACCCTTT TATTTCCCCT
TCATTAACCC ACGCTACAAC ATTTTCAAAC CATTTGATTC CATAA
120 180 240 300 360 420 480 525
INFORMATION FOR SEQ ID NO:1025:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2247 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...2247
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1025:
ATGGAAGACT
TTTAGTTTCA
AAGGTTTTTA
ATAGATGGGC
CTATCCTATT
GATGATAATA
CAAGATTTTC
TTCAATATAT
TTTGTGTGGT
GATGATCTCT
ATCACACCCA
GGCTTAGGGG
AGGAGCGGTA
GTTGTGTTTG
TTCAACCAAA
TTCGCTTATG
GACACACGCC
TTTGGATTGG
AATCAGGCGC
AAAAAGGGGC
TTTTTCATAG
AAAGTCGTGT
AGAGCGCTTA
AAAGAAACTT
GCCATGATCA
GCAGTGAGTA
GAGCTGTTTT
TGCAAAAGAA
TTTGTTAAAG
AGTAAGGCGC
ATTTTAGACA
CATTTTGAAA
ATTGATGATA
ACCCCTGATG
CCCATCAAGT
AAGGTGAGTC
TTTTATGATG
TTTTGTATAA
TAGGGTTAAT
AAGATAAAGC
TGAAAGAAAG
CTTTTATCAC
GATTGCCTGA
CTGCATACTA
CCTTAGTCGT
TCTTAAAATA
TTGGTAGCGC
ACAATAAAAA
ATTTTATCGC
AGGGTGTGGG
ATCCTAAGGC
AAGTGTTCAT
TGGATTTTGG
TAAAAGGGCA
CTAAGTTGGT
GAAACCTTTT
TTGAGTTTGT
GTTCTATGGC
CTTTAATGGA
GTCCTGCCAC
ACAGCTCGGT
GGAATTTCAC
TTGGTGTGAT
TCAACGTGAT
GCTGCTTGAT
CGGTAGGGAT
AATTAGAAAA
ACCTTTCTTT
AGCTTTCTAA
ATACGGGTAA
AATTGATGAC
GCCACAAGGC
CAAGCTTGAG
ACTTGCAAGC
CACCTTATAT
AGCGTTATTT
TAACCAGCCT
GGTTAAAACC
ATCAGGGTTA
GAGCGATGAT
TTTTAAGGCA
ATATAGTTCT
CTTAACTCGG
GAGTTGGGAA
ACGCGCCTTT
TTACGCAGGA
TTTCATCATG
TGACACTATG
CTATGAACCT
TAATGATGTG
TGGCATGGTG
GTTCCCTGAA
TGTCATCAAT
CAAAAGAAAA
AAGCGGTATC
ATTTTTTGGA
TAGAAACATG
TCAAGGGGTC
GAGCGCTAAT
CGCTAATCCT
GATTTATAGC
GCTCATGGAT
CATGGCAGAA
CGACCCCCCA
GAATATGTAT
AGTGTTAGGG
GACCAACACT
TATGGGCGAT
GCTTTACTAT
CAAGAAATAC
CGCTAAAACC
TTCATAGAGG
TTCCTCTACA
CAAAAGAAAA
TTTGGCTTTT
TTTTTCTTGA
GATCTTTTTG
CTCACTTTTA
ATTTTATGCT
ACTAGGGACA
ACTGAAGAGA
GACAAACGAG
CAGGCGTTTA
CCCAATATGA
GAAACTTGCG
TTCTCCTTAA
GTTTTGACTG
GCTAGTGGAG
AGACCTAATG
TGCAATATTT
AAAATCATCA
AACTTGATTG
GGTGAAGAAG
TGGAATAACT
TATACATCAG
GATTTTGATT
AAAGAAAGCA
AATTTGATTC
GAATTTACGC
TACAACATGC
CTTGGTTATG
TATGGGATCA
AAATACACAA
TCTATCAGCA
GAGCTTATCA
GATGATCCCT
AAATTGGGGA
AGAGGCGAAT
ATTATAAGTT
AATTCATAAA
AAAGCTTTAA
GGTTGCAAGC
TTCTCTTAGG
ATATATGGGT
GCTCGCTCAA
CTTATATATT
TAGGAGCGAA
AAATGATCAA
AGGTTATTGT
TTGGCTTGAT
TCAATTACCC
GGAAAATCAG
AAACACACCG
AAGACATACT
GGGATTTTTC
AAAAAGATCC
ATAGGGATCT
TGCCTGAAAC
ATGAAGACAC
ATAAGAGTGG
TCAAGACAAT
CGTTTGCGCC
TCAGGCGTTT
CTATTGTTGG
TGCCAATCCA
TCTGTGGCTA
GCCCCGCTTT
GTAGGAATGG
ACAACGATAA
GGCAAGATGT
ACAAGGAGCG
TTTTAGAAAA
TCTTCACCGA
AAGTGCCTAA
TGAGCTATGA
GGTTGTTATT
AACTCAAAAA
AGAAATCATT
TATACTATTA
TAATTTTTAT
CTATGCGATA
GATTTATGGG
CATTACCTTT
TAAAAAAGTT
AGCCAAGCTT
AGGCAGGCGT
TGCTCCTACT
TCAAAATATC
AGAAAAACGC
ATTTAATCCT
CTCTCAAATT
CACTCAAATC
TTTTTTTAGC
CATGTGGACT
CCCCACGATG
AAACATGGAA
CGACAATCTA
GGGTGGCGCT
TTACAATAAC
AAGGATCGAT
ACCGATATTA
TGATCCACAA
TTTAGAGACC
TGTGTTTCAA
CGCTAAGACT
CTACTATGAA
GAGCCGGAGC
GTTTTTGATG
CACGCTCAAA
TGAGCTCATT
TCAAGCAACA
TAAATCTTTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220
GTGCCTGTGG GTTCAAGCGA ACTGTGA 2247
INFORMATION FOR SEQ ID NO:1026:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1215 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...1215
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1026:
CTGAATGCAA
CATGCTAGAA
AGCGTTTTGA
ACAGCCTTAA
GAGCCTAAGC
ATGACCAATT
GCTTATCAAT
GAATCATGCG
GCGTATATCC
GTGATGGGGG
TTGCTCTATT
AAGGGCAGTC
GAGATACAAG
AATGACCAAG
GCCATGCCCC
ATAGAGGAAG
ATTAAGAAGC
GCTGATTTCG
TCCTCTAAAA
GTGTGCAAAA
ATGCAAGAGC
AATCTCAATC
TTATACTTGA
AACATTTAGC
GCATGTTCTT
CCGCTAAAGG
GCGATAACCT
TCGGATCTAA
CAGGGACTTA
CTAGCGTTTT
AATTGCTCAT
GGAAAACGCG
GTTGGGAAAA
ATAGAATCAG
AATTGCAAAA
AAACTTTAGC
CTCAAACGCA
CGCTAGAAAA
CGCCCGCTAA
ATAATATCAG
ATTGCTCTCC
TTTAA
CTCCCTAAAA
ATTTATTCCT
CCCACTCATT
AAGTTTGAAT
GGTTAAAAAC
TAAAGACTTT
AGAAAATTTA
TAAAATCAAT
AAAAAGCTAT
TAAAGACGAT
CTACCATGAC
AAACGAGAAA
GCGTTTGAAA
AAGCGCTAAT
TCAAACAGAG
AAAATCTCAA
AGAAAAAGAT
AAAAAGCCCA
CGTTAAAAGC
AGGGCAAAGG
AACAAAATCA
AGTTTAATTT
CATATCCCTT
TTGAACGCTG
AAGCCCAAAT
AACGCTAAGC
GGCTATGAAA
TTTTCGGATC
GGGCATAATA
GCGTTTGCTT
AACCTAAAAG
GCTAACGCTG
GAATCTAAAA
AGCAATTTGG
ACCCAAAAAT
GAAATGAAAG
AAACCCATGT
AAAAAACCGG
AACACCAAAA
AATGCGATTT
CTCTTAAGAA
ATTTCTTAAT
TTAAAGCCTT
AAGAAAACCC
CGCCTGTTAC
AAAAAGAAGT
TGGCGGGCAT
CGAGTGCGGG
ATAGCCCCTT
CTGAAGTGGC
ACATGATCAA
ATGCTGAAAA
TCTTTGACTC
ATCTAGACCC
CTCAAATAGA
AGGCAGCTAG
ATTTGGCTCA
CTAAAGCGAG
CCGCTTCCAA
TAGCTAACCA
TAAGCTAAAT
ACAAAAGGTG
ATGGCTAGGC
TACTAAAACA
TAATGTCATG
TTTAAAAGCC
TGCGTGGAAA
CATTTACCAT
TTTGCGTAAC
CCTAAAAGAA
GTCTTACAAT
ATATTATGAA
GCAGTCTAGT
TATCGGCAAC
AAAATCTCAA
CGAGCAAGCC
AATCAATAGT
TCCAAAACGC
AAATAAAGAA
TATCACCCTC
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1215
INFORMATION FOR SEQ ID NO:1027:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 813 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...813
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1027:
ATGAAAAP.GT TTGTAGTGTT TAAAACGCTC TGTTTATCGG TAGTGTTAGG TAATAGTCTT
GTGGCAGCAG AAGGTAGCAC AGAAGTGCAA AAGCAATTGG AAAAGCCAAA AGATTATAAA
GCAGTGAAAG GCGAGAAAAA CGCTTGGTAT CTGGGGATTA GCTATCAAGT CGGTCAGGCT
TCACAAAGCG TTAAAAACCC CCCCAAAAGC AGTGAATTTA ACTATCCTAA GTTCCCTGTG
GGTAAAACCG ACTATCTGGC CGTTATGCAA GGCTTAGGGC TTACTGTGGG TTATAAGCAG
TTTTTCGGGG AGAAAAGATG GTTTGGTGCG CGCTATTACG GCTTTATGGA TTATGGGCAT
GCCGTGTTTG GAGCGAACGC TTTGACATCA GATAATGGTG GGGTGTGCAA ACTCAATGAG
CCATGCGCGA CCAAkAGTAGG GACTATGC AATCTGTCTG ACATGTTCAC TTATGGTGTG
GGTATTGACA CTTTATACAA TGTCATTAAT AAGGAAGACG CGAGTTTTGG TTTCTTTTTT
GGGGCTCAAA TCGCGGGTAA CTCTTGGGGT AATACGACAG GGGCCTTTTT GGAAACTAAA
AGCCCTTATA AGCACACCTC dTATAGCCTT GATCCGGCGA TTTTCCAGTT CCTTTTTAAT
TTAGGGATTC GCACCCATAT TGGTCAGCAT CAAGAATTTG ACTTTGGCGT GAAGATTCCT
ACTATCAATG TTTATTATTT TAACCATGGG AACTTGAGCT TCACTTACCG CCGTCAATAC
AGCCTTTATG TAGGGTATCG TTACAATTTC TGA
INFORMATION FOR SEQ ID NO:1028:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 759 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...759
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1028:
ATGAAACCTT
GAGACTTTTG
ATTGAATACC
AATTGGCTTA
CCTTTGGATA
AAAGAATGCG
ATTTCAGTTT
AAAAAGGTTT
AAGGGAGTAA
AAAATTGTTA
CGAATAAGCA
GTTGGAAATA
GGAGAAATTG
TTATGATTTG
TGATTTTAGG
TTTTTGGGGT
TGGCAAGCAA
CCAAAATACG
GATTTATACC
TGCGTTACTT
GCTTATGACG
AAGGAACAGG
AGTTTGGGGG
ATTATTTGGA
ATGGCGATGC
CACCGCCCTA
ATTATTTGAA
CGAGGTTGTG
CTGATATTAC
ATGAAAGCAA
GAGCGTTTTA
TAATTTGAGT
TTGGTGTAAA
TTTGAATGTC
TATAGTGGCT
TATCTCAAGA
120 180 240 300 360
CGCACCGCTT
TTGATCGTTG
AGAGAGGAAT
CGCCTAAAGC
TCTTTTTTGG
GCCAACGCTT
TCTAAAAAGC
GGGGGAGTTG
TTTTTTATAA
TTTTACTCTA
ACCCAGCCCC
AAGACACGAT
TAGGGCGTTT
GTATTTTAGA
GCTGCAAGCT
AAACGAATAC
CACGAACGGG
ATTCCCAAGG
TTTTGATCCC
TAGCGTGGGT
GAGTTTGTCG
AGCGCACCTT
AAACGAAAAA
CTTTGGAATT
GAATTACCCA
TTTAGCGGCT
TTAGAGATTG
TTAGTGTGA
ATGCGATCGC
AACAAACTTC
TTAGCGGTGA
GGCGTTGCAT
CTGGCACGAC
AAAAAGAATA
TCCTCTGGAG
TACGATGAGT
ATCCAAAAAG
CCAATTGTTT
TATTTTAGAG
TTGCGAGTTG
420 480 540 600 660 720 759
INFORMATION FOR SEQ ID NO:1029:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 2778 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION: 1...2778
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1029:
TTAGTTATAA
AAAAAATTTA
GAAAATGACG
CAAAAAAATG
GTCAAGTCCA
GCAAAGATTC
TTGTTCAACC
TTTCATTTTG
AAGGAAAAAG
CAAAAATTAG
GGGAGCGTGG
GATGTGAATA
TTAAAACGCC
ATGTGGGGCT
ATCCAAGATG
AAAACGGATT
TACAGGATTT
GAAAAAGCGC
GCGCAAATTT
GACTTGGATA
GATATGGTGC
ATTAGGAGGG
TCCGAAAATT
GTCAATAGCT
CAATTCGGGT
AGGAATCTTT
TATCGCTACT
TTCTATCTTC
GCTCTAAACC
AAGCTCAAAA
TTTCTTATGT
GCGTGGGCGA
AAGGGTATTT
ATGAAAAAGC
ACGGCTTAAA
AGCATGCTAA
TGGAGGTGCG
GGGGGGACAG
GTGTGATTGA
TGAATGACGG
TGTATATGCG
TTTCCACCCA
CAGATATTTT
TTAAAGTTAA
TAAAAACCGA
AAGACGAAAA
ATATCAATGA
AATTGTTACT
CTTTGAGGCG
CATTGATGGA
TGOCTATOG
TTGGCACAGG
AACAACACTC
TCTTGTTTTC
AAACGATTTG
CGAAACCTCT
CGGGCTTTCT
TATGGTGGAT
TAAAGACGTT
CAGGATTGCC
ATCCCAAATG
AACGGCTTTA
CACAGAAAAG
TATTTATATC
ATCTTTGAGC
GAAATTGCGC
TAGGGGTTAC
TGACGCTAAG
AATAGAGATT
AAGGAAAGAT
AATCGCCGAT
AAACGGGCTT
TGTCATCATT
AGGGCCTAAA
TTTAGGGTTT
TTTGTTAGTG
CTCTTATGGA
GCAAAGCATG
AAGCTAAAAT
GCATGTATCA
GCTTCTCCAA
CAATCCAATC
TACATGTCTG
TCTAAAAAAA
TATGCCACTT
GGGGTAGAAA
GGGATCAAAA
AAAACGGCTT
GTCAGTGAGG
AAACAATCCA
GCGAACAAGC
TTAGATCAAT
TTAGACGCTC
CTCCATTATA
GACAACCCGG
GTCTTTAATA
AAGGGCTATG
GTGAAAGTCA
TCAGGGAACC
GATAAATACA
TTCTCTAAAG
AGCGTAGAAG
GGGCTCATGC
AGCTTGTATG
CCATTAAGGA AATCAGTATT
ATACCGOCOT
AAGAAACCCC
AAACGCCTAA
ACATGCTCGC
TAGACACCOC
TTGAAAACGG
TCAAGGGTTA
AGGGCGACAC
TAGAGGGGCA
GAGCGTTATT
TTTATGAGGG
AGCGCGATTT
TAGAATACGA
ATATTTCTTC
AGGTCAAAGA
TAGTCCCCTT
TTGAGCATTT
CGTTTGCGGT
TTTATCGTAT
AGCGCACGAG
ACTTGACCAA
TCAAGATTGA
AGGGGCGCAC
TTAATGGGAG
CTAACATTGC
TGAAGCTTTA
AAAAGAGGCT
AGAAATGAAA
TAATGAAATT
TGTTTTAGCT
CATTTTAGAG
TGGGACTGAA
CTTTGATGAG
GGGCTATTAT
GATCGTGTTT
AAGCGATAAA
CATGGGCTGG
TTCTTTGCGT
GCCTTTTTTG
GGGGATCCAA
AAAAACCTTA
AAGAGCGGAT
GGTGAAGCCA
TGAAGTGGGC
CGATAGGATC
ACTGAGAAAT
AGAAAAAAGG
CGGGCAGTTG
CGTGAGCGAA
CACAGGOGGOG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560
GGTAGATCTT ATCCGGGCAT
GCCAAAAGGG
TTGACTAATC CAAGGATTTT
TGACAGCTGG
TACAGGATAA GCTACCAATA
CATCCAACAA
ATGCTGGGTA ATAGJACCCA
TGTGAGCTTA
GGTTTCAGCA GCCCCTTATA
CAACCGCTAC
AGGCAATGTT CCACACCCGC
ATCGGTGATT
TTGGTTCCTG AAAGCTGTTC
TAGTCCTGGA
ATTTGGGATA GGGATTACCA
CACGCCTATC
GACAACACCG ATGATTATTA
TTTCCCTAGA
ATGTCTGGTT TGCCAAGCTC TGGCACGCTC CGTAACACCA AAGTTTATGG TAAATTCGCC ATAGATTTGA TCGCTCGTTT TAAAACGCAA GATTACTTGC CCTTAAACTC CACTTTCTAC AGGAACGGCT CAATCACACC TAAAGATGAG TTTACCGCTT CTACTGAATT GAGCTATGGG TGGTTTTTTG ACTTTGGTTT CTTAACCTTT AACGCTCCCA CCACGACGGC GAATTTTAAA AGGGCGACTT GGAGGGCTTC TACAGGCTTA TTGGTGTTGA TTTTCCCTAT AGCGTTTTTC TGTAAGGGC TGTGCTTTAA
CCCTAACATG
ATGGGAACAA GGTTTTA.A
GCGGGGCGTA
TATAGCTCTA
GGCGGGGGCT
GGGTATAACT
TATTCCTCTG
ATCAACCGCT
GCGATCACCA
ACCAGTTCTT
AATGGGGTTA
AATTCTTGGA
GCTTACCACC
GGGGGCTATA
ATGGGGGGCG
TTTGGCTTGT
GTGTTAAAAG
AAAACCCCAA
GATTATGGCG
CAGATTGAAT
AACCALATGGG
AACGATTACA
TGTTTGCCGG
CGATCAATCT
TTGGGGTGAA
TGAATGTTAC
TTAATGAAGT
TATCAGGCGG
CTTCACCAGA
TCACCCTTGA
TCTTTAGTTC
ACGGGTTAGG
ATTTGCAAAA
TCTTTAGGTA
TAACCACGGT
GGCTTGGAGG
CGGCTAAAAT
CTAGGGGGAG
TTGTAGGGGC
GGATTTCGCC
GCGATGGCAA
CGCAACATTT
GAATTTGAGC
TTATGCGGAT
TGTCGGGCGC
CAk.ACTCCTT
GGCCTCTCCA
TAGAACTCCA
AATAAAAGGT
TGTGAGCTAT
CTATGCGACA
CGGGAATGTC
ATATTTATTG
TAACACCGAT
GAGAGGCTTT
CGATGGGATT
GCGTTTAGCG
TTTCTTCTAT
TGGGTTTGAA
CATGGGGCCT
TGGCAAAAAA
TGAATTTTCT
1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2778
INFORMATION FOR SEQ ID NO:1030:
SEQUENCE CHARACTERISTICS:
LENGTH: 576 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...576
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1030:
ATGAAACAAC
CTTATCGCTA
GCTGAGAGCG
GAATTAGTGC
AAAGGGATCA
AAGGAATTGA
GAAAACACCG
GTGTTTCATA
AAGGCCAAGC
GAAATGCAAA
TATTTTTGAT
AGAATAIXCAG
CTAAAAAGAC
CTTTAGAAAT
TTTTAATTGA
ACGCTCAAAA
CTAAAGAAAG
ACCGCATGCG
ATTTGCATAA
AATATATTTT
TATTGGAGCC CCAGGGAGTG CGAAACAATC GCTCATTTTT TGAGCGAGGC TTATTGATTG TGTGGTAGAA
ACGATCCTTT
TGGCTATCCC AGGAGCGTGG TGAAGTGATC TTAAAAAGCG GGTTTTAGGG CGCTCTAGGG GGTGTTTTTA GATCCGTTGG AATTATCAAT GGTGAAAGAA GTCTTTTGCA AATTAA
GTAAAACCAC
CTACTGGGGA
AGAAATTCAC
CAGCGATTAA
AGCAAATGCA
TGATTGAAGT
GGGCTGATGA
TTGAAATCCA
GCATTGAAGA
TGACGCAGAG
TTTACTCAGG
TTCTCAJ\GGC
AAGCTCTAGT
AGCTTTGGAT
AGAAGTGAGC
TAATGAAAGG
AAACTTCTAC
AATCGTGAAT
120 180 240 300 360 420 480 540 576
INFORMATION FOR SEQ ID NO:1031:
SEQUENCE CHARACTERISTICS:
LENGTH: 1023 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1023
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1031:
TTGAGTTCGT
GGGATTAGCT
TTGTTTCAAG
CAGACCGGTG
AATTGCGTGG
CGTAACCAGC
AAAAGGTGGT
AACTCTAGGG
TATACTTATG
GCCGGGTTTT
TATTTTAAGG
GCTGATGGCA
CCTAACAGCG
TTTTTAGTGA
ATCAAAkATCC
AAACAAGGCC
AGCTTAACCC
TAA
TATGGCTGAC
TGGAAGTGGG
TGCCTTTTGG
GCTGTCAGCC
TCAATTGGAC
CGATGTATGG
TTGGGCTGCG
CCGCTAACGC
GTTTTGGCAC
TTGTGGGCGT
ACGGGTATGT
CGATCACATG
TCTATACCAC
ATGTGGGCAT
CCACGCTCCC
CACTAGAGAA
AAACCTTACG
AAACCCCTTA
TAGGGTCGAT
CGATGTTTCG
AGCTTCAGGG
TTCTCGCACC
GCTAGGCGTG
TTATTATGGC
CATATCGCCC
AGACATGCTT
GAATTTTGCG
TTATGGCGTC
CGGAGACACG
AGGAAAATTA
TAGAACCAAT
TAATTACTTT
TGGGAACCCG
CCGTCAGTAT
AATGCCCATG
CAAAAGACCA
GCTAATGATG
ACTCCAGGAA
ATGCTTAGCA
ATGACAGGGT
TTTTTTGATT
TTTTATTTGA
TTTAACGTTA
GGTAACACTT
AATACGGACG
ACGCCGGCGA
AACGCTAAAG
ATTTTTGAAC
TTCAAAGGTT
ACTACTATCA
TCTATGTATT
AAAAGAATGG
ACGCTTATAG
ATGGCAAAGT
CGCCAGGCTA
CCAATAAGGA
ATAAGCATTT
ACGGGCATAC
GCGATCAAAA
TAGACAAGCC
GGACTAATAA
CTGACGCTTA
GTTGTAATGT
TGAATCACAC
ACCATGGCAT
CTACTACTAT
CCGGAGCAGA
TGCGTTATGT
CGCGTTTGTG
AAACGGCGAG
CCCTGATGGG
TACTAAAGCT
CATTCCTGGC
TGTGGGTAAA
CAATTTCTCT
AGCGGACATG
TAAAGCCACG
TCGTGTGGGG
CATGACTAAC
GGGGATTAAC
GATTTTCCAA
TGAGTTTGGT
AAGAGCGAAA
AACCAATTTC
TTATACTTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1023
INFORMATION FOR SEQ ID NO:1032:
(i) SEQUENCE CHARACTERISTICS:
LENGTH: 1587 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1587
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1032:
ATGAAATTAA AAAAACGAAA AGTTGCGGCT ACATTGCTAA AGCGTTTGAC
TTGTTCACTA
TTCTCCAAAG
GAAACTTTTG
ACCGTGAATG
TGGGCAAAGG
TTGAGCCTTT
GACCCTAGGG
TATCCGGCTA
CTTACCTATG
GAGCAAATGG
AAGAACATGA
TTGTTCCCTA
CCTACAAAGA
CCCGGTGCTA
AAAACGACTT
GCACCCGCTC
CAAGGTCCTG
GTGGTTGGTG
AACCCTGTGG
GCGGGGATTG
AAGCATGGTA
GAATATGGTA
AAACTCGTGT
GGGCCAAACG
CAAGGCCCTC
ATGACACACA
CGGGTTCATT
TGGGTTTTAA
TTAACCTTAC
TAGGCGGTGT
ATTTTACCCC
GTATGAATGC
GTATTGGCTA
ACGCCTACTT
ACAGCGACAG
ATTGGATTTA
AATTCTTGCT
TCTATCGTGA
ATCTAATGAT
AAATAGAATA
TCTATGTGTT
GTTATAACAC
GTGGTGCGAC
GTGCTTACCT
CTCTTGATGG
ACAACATTAC
AGTTCAGTTG
TTGGTATGTA
GGTTAGAGTT
GTCAGCCGCT
AAAACATGGG
TCAGTTATAG
AGGGGCGGTT
CCGTTCGCCT
GGGTAAGCTA
TTTGGGCGGA
CCCAAGCTAT
GACTAAAATG
TATGTATATG
GCCCGGGCAT
AGTCCATATG
CCAATTGTTC
CTTTAGCTCT
AAAGCCTTGG
CCACCCTTAT
CGATACCAAT
GTATGACTAT
TTGGGATCCG
GCTCTATTTG
CAACATCGGT
TATCGAACAA
CGATGCTGAT
GAGCGTTTAT
TCTAGACTAT
CCAAATCCGT
CAACTTGAAT
TGGTATCGCA
TTTCTAA
ACTTATGAAG
ATTAACCCTG
GAGGGGTCTG
CAAGTTTATG
TGGGATAAAA
TGGCAACAGC
GGTGAGTGGA
TCAAGGCGCT
GTAATGGGGC
CAAGGTTTTT
TGGGGTCGTG
GGTATTCATA
GTGTATCTCA
CCTGAGTTTA
CGTTGGAATA
TTCTTGGATA
CACCACCACA
AACCCTAACA
TGGGTCGGTG
GCGTTCACTG
CAGCGCTTCA
CAGTTCAGCA
GCGGGTTACA
AATGGTTTGT
AAAAGCATTA
TGCATGGGGA
TTAAAGGTAT
TGCATTTAGG
ATAACACTAG
CTTCTTGCGG
AAGGGCCAGG
ACGGCTTGTT
ATGAAGTTTA
GCTTTGATGT
ATGGGACTTT
GTATCGCTGA
AGGCGGGTAT
TCCCAATGGT
GCGGTAGAGG
ACGCTGAATA
ATGGTAAGTG
TAGACATTAA
TGAACTTAGG
GCATCTACAG
AGTATGTTAA
CTACCGCACC
AGCATGTTAA
ACCCTGGAAC
TTGAATCTTC
CTCAAGACAG
CTTGCCACTA
TTTTATCAAC
CTATCCTACA
TAGGGGATGG
GTATGATAGG
CACTGATTCT
TGGTATCATT
CCCTAATTAC
TAAAGCGAAT
TACCGAGCAG
CAAGCTTACT
TGGTCAATGG
TATTTATCGC
AGGTACATTG
TATAAGGAAT
CGGCCGTTAC
GCGTGGCTTG
CAACTACTTT
TACTTGGGGT
CTTAGGCTTT
AGGTGGAGGT
AAGGGCTTTG
AGCGGGTCTC
CGGTTTCCTT
GGCGTTCGCG
AAGCCATTTG
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1587
INFORMATION FOR SEQ ID NO:1033:
SEQUENCE CHARACTERISTICS:
LENGTH: 705 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...705
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1033:
ATGACAAAAA
TATTTAAAAG
CCTGATTTTT
GCTTACCCTA
CTGAAAATCC
CCTAGCTTTT
TGTATTGGCG
AGCGAGCAAT
ATTTGGGCGA
TTTTTAAAGC
CAAAACGCTA
TCTTTGGAAT
TTGCCATGGC
AATTGGAAAA
TGGGGTTATT
GAGATTGTGG
ACACGCTTTT
TGAAAGAAAA
AAGATTTAAC
TAGAAAATAT
TTGGCACTAA
AAATCTTAAA
AAGAAATTTT
TAGAAAATTT
TAATTTCAAA
AACTTTAAAG
GCCTAACGCG
GGCTTTTACC
AATAGGGCAT
ATTTGATTTT
GACCAGAGAA
TGATCTTAAT
AAAAAGCGCG
TCAAAAAACG
AGGGATTGAT
TAAAACAATC
TCCGCTATGC
CCGCAGCATT
TTTTTGCATT
GGTGAAATCA
AGCGAGAGGC
TTTAAAGGTA
AAGGGTTTTA
TATTCCAATT
TCTTTAGAAG
CCCTTATTGT
AGCGTGGATG
ATTTCATTTT
CTGTTTTTAA
TTGATAGGGT
TCACTTTAGG
CTTCCAAACA
GAGTGCTTTT
AAAATTTTAA
GGGCTGTAAA
TAATTGTGGC
ATATTTACCT
ATGGGGGGAG
GCTTATTGAT
TATAA
AAGCCATGCG
GTTTGTATTC
AGCGCAAAAC
TTTAGAAGAA
AAAGGAAAGC
AATTGTCTAT
GGAATTTTTA
TTATGAGCCT
CACGCATGGT
CGTGAATGCG
TGGGAGCGCG
120 180 240 300 360 420 480 540 600 660
INFORMATION FOR SEQ ID NO:1034:
SEQUENCE CHARACTERISTICS:
LENGTH: 1398 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1398
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1034:
ATGTTCATTT
AAGGCGAATA
AGGAGCGCGA
GTGCTAGTAA
AACAAAAGCA
GCTTTGAACG
GTGCGCATGA
ATTTATTTAG
GTGGAATTTA
TGGAAAAGCT
CCTGGCTGGC
CCTTATCAAA
GAAGCGTGCC
GGCTTTGTGA
AAAGACGCCC
TACCGCTCTG
AAAATCTATC
AAAAAAGAAA
TTAGAAAGCA
TTGAAAGGCG
ATGATACCAA
TTTATGTGTG
TTGCTTTTGA
GGAATTTCAC
TTCAAGAATT
TGAAACAACC
TTGAAACGCT
ACACGAGCAA
GCCGTATCGG
ATAAGGGGGA
ACATAGAATG
TTGATATTCA
AAACCCGTTG
ATATCAATAA
TCAAAAACTA
TTTTGAATTT
GTTTGAAACA
TTTTAGAATG
TGCTTTCTTC
AAATTTTAGC
ATTAAAACAA
TGGGCCTACG
TTTGTTGAGG
GGATATTGAC
AAGCAGCATT
CAGCCTAGAG
TTTAGAAAAA
GGACAAAGAT
TTTGGTGCAA
TAATGATGTG
CTCTAGCATG
TGCTGGCGGA
TGCCTTTGGC
CGAAAAAATG
TGATGGCGAG
CAATGAAGAA
GCGCGTTTTA
CATGCAAGAT
TACGAATGAA
GAATCTGAA
AAAGTCCCTT
GTGTATGATG
CGCACGCTTG
GATAAGATTA
TACATTGAAT
CCTAAAGCGA
AATTTCGCTT
TACGGCTCTT
GAAAAACGGC
GGCTTTGATA
GTTTTTGAAA
ACGGATTTGT
GTGGAGATCG
TCTAAAAGTT
ATTTTGCGTA
GACTTGTTAA
GGGACTTTAG
GATTTAAACG
AAACTGGATC
TTCATAGAAG
TTGAGCCTTT
ACGCTCATTT
AATTGAGCGG
TCAATAAAGC
CTTACACGAG
GCGAGTATTT
ATAGAGTTTC
TGAGCATGCA
TTGAGCAGGA
GCCCTTTAGG
CTTTAGCGCT
TATTCCCCCA
CTAAATACTG
TAGGGAATAG
ATTACTTACT
TGAGTAAAAA
GAGGAATAAA
TTTCTAAAGC
AAAACCCTAA
AACTGCTTGG
AGTCCACAAT
AGGGCATGCC
CTATGAAGTG
CTTCAAAGAA
GGATCTAAAC
AGACGCTATG
TAATGGGGAT
TAATAGCAGC
TTTTGTGTTA
CAAAGGGCGC
CGCTAACGCC
CCATGAAAAT
GATGCACAAT
CTTTTTCATT
AGGGGTGCAT
ACGCTTGGAT
TCCAAACTTT
GTTGAGCGTT
AAACAAGGCT
TATTGGGTTT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200
AAAGACCCTA GCGCGTATTT CCAATTAGGC GTGAGCGAGA GCGAAAAACA AGAAATTGAA 1260
AACAAGATAG AAGAAAGAAA ACGTGCCAAA GAACAAAAAG ATTTTTTAAA AGCCGATAGC 1320
ATCAGAGAAG AGCTTTTACA ACAAAAAATC GCTTTGATGG ACACCCCACA AGGCACGATT 1380
TGGGAGAAGC TTTTTTAA 1398
INFORMATION FOR SEQ ID NO:1035:
SEQUENCE CHARACTERISTICS:
LENGTH: 3642 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...3642
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1035:
ATGAAACAAT
ATCTTAAAGC
TATGCGGATG
GTGCATCGTC
TTTACAGGAA
CACTCTCAAA
TTAAGCGGTC
TTAGGCAGTA
TATCCTAATG
GTAGAAGTGG
AACTTGAACG
GTGAATGTAG
ACTTGCAGTT
TATTCTTTTA
TTCACTTTTG
TTCAATAAAA
AAAGGCACAA
CAAGCCACTT
AATCAAAGCA
CTTAAGGGTT
CAAAACGCTT
GATGCGAGCT
AGTGGTGGCG
GGGAGTTCTA
GAGAAGAGCG
CATGCGATCA
AGTTTCGCTC
AATGAAAACG
TCTACGCAAG
TTAAAAAGAA
GTCCTTTATG
GAACAGACAT
CATGGTATGC
ACCAACTCAT
ACAACCAAGA
TGTATAACTA
ACGCTACTTT
GGCATACTGA
GCAATCGTGT
CTAATAAGGT
GCAATGCTAA
CTTTAGCTAG
AAGGGACGAC
AAGAGAACGC
AGTTTAACGC
GCTCTTTTAA
TCCAAAACAG
CCCAGCACCC
TTGCGACTTT
CCTTTAATAA
TTAATAACAC
TTACTTTAAG
AAATCACTCT
TAACGATTTT
ATAGCTTGAC
AAGGTTTGTG
CTGCAACATC
TCTATCAAGT
ACCAAAAAAG
GCTTATGCCT
TTTGGGGCTT
TATATGGAGT
CACAAAAACT
CATCACAGCC
CACCGGAGGG
TAATCTAGGT
TGTTACTTTT
GGGATCGGGA
TACTATCAAT
CAGCGTTATT
GGTGGGCGTA
TAACGCTACT
CACTTTTAGC
TACCAATAAT
TGGTGCGAAT
CTCCTTTAAT
CCAAATTCAA
TGAGCAAGCC
CGCTACTTTC
TTCGTTCAAC
CGGTAAAAAT
CACTCAAGGG
AAATTCTAGA
AAACGCCCTA
GGATATGATC
TAAACCCACT
GGGTTACAAA
ATAAAACGAT
TTACTGATTG
AGTTGGGGGG
TGCGATAAAT
TGGGCAGGGG
AATTTAAAAA
GAATATAATG
GCGAGTAGTG
AGCGCTGGGA
GCTGGCACGC
TCCAATATCA
ACCATTAATT
GGGGCTAATT
AACACGACTT
GGGGCGAAAT
ACCGCTTTTA
TTTAGTAACG
GGGGGGACTT
AACAGCTCTT
TTTAACAATT
AACAATACCG
ACTCCTGTTG
GACTTGAAAA
ACGACTTTCA
GGTGGGATCA
AAAACGAACG
ACTTACAATG
GACTCTTCGC
ATAGGGGATA
CGCATCAAAA
GCGGGTTTGC
AAAAAAGCCA
GGGAGGAAAA
GTAATGCGGC
ATGATAACGG
GGGGGAATTT
GGAATAGCTT
CTATCAATGT
ACACCGGCAC
GCGCGTATAA
CGGTTTCTTT
GCTCCACTTC
TTAGCAATTC
TAAATGGGGG
ATAGCGGTAG
CTTCCTATAC
TTACTTTTAA
TTAGCGGCAG
CAAACCACCA
GTAAAATCAC
ATACAAACAA
ATGGTGCAAC
ACCTCACAAG
CTTACAATCA
AAAGCTCTTC
GGGTTACCGG
CCTCTAAATC
CTATCTACAA
TCAAAAAACA
TAGTGGGGTG
AAAGGTATGC
AACACAACAA
TAACTACTAC
CACTTATTTT
AGACATTGAA
CACTTCTTGG
GAATAACAGC
AGCCACTTTA
AACTTCGCAA
AAATGGGGAT
TGGGCCTAGC
AAGCGGCAGT
GGCATTCACT
TTTTACTTTT
TTTTAATAAT
TGACCAGACC
TGCTACCACT
ACTAACGATA
TATAGAAAAA
CATGACTATT
CCTTGATTTT
TTTAGGCAGT
TCTTTTAAAC
AAAACCGCAA
GCAGCTTTTG
CTCTACAAAC
ACTGCAAGAA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740
ACTTTCAGCC
CCTGTCATTA
CCTTGGTATA
TATTACTTGC
AGCGCAAGTA
TCTTCTAGCG
GGGCCTTGGC
GTCTATATCA
CTAATCTTTA
GCCGGGGCTT
TTGAATGGCC
GCCAAAGACG
AATGGAGGGA
AATAGCGGTT
AACAGCACTT
ACTAACGCTA
GGCTCTCAAA
ACCTTTGATA
CTCAATAACA
GTTTTTAGCG
ACCCTTGTAA
CAGCTCATCA
AATGGCGTCT
TTTTCACCCA
GATATGGAAA
CCTAATAGCT
AGCATTGACT
TTCACTGGGC
GATGCGCCGC
GATGCGAGTG
GCGCAAAAGA
GCGCGAGCCT
ACAATTCCAT
ACGGCTCCAA
ACCATAAATA
CGAGCGTTCA
ATAGCAATCT
ACACOOTGOC
CCTATTACCA
CAGCGAATCT
ATGGGGTAGA
ATTCAAGCTC
TAAATTCTAA
GGAAATTCAT
GTTACCAATT
CGTTTGAGAT
CTTTTAATTT
ATTCAAATTT
ATACCGCTAA
ACGTGGTATT
TCACTTTAAA
CTCATTCGGT
GCTCTTCTAA
ACTACCAAGG
ATGATGTGGT
ACAGCATTTC
AATCGGATCG
ATAACAATAA
TTTATGCGAG
AAAACAGCGC
AATCTAATGT
GGCATTGCTG
TTTACATCAC
TAATTTTAAC
TATTATTCAG
ATTTGACTTA
TTATATTCCT
AATATGGGGG
GGTGATTGGG
TTTTGGGGAC
ATGCACAGGC
GCGTTCTGGC
TAGTATCAAT
TATGACTTTT
CGGCAAGCTT
TTTCAATGCA
CAGCGGCGAT
TGGCGCAAAA
CAATAATTCT
GCAAATCGCT
TTTTAATAAT
TAACAGCCCT
AAACTTGAAC
GATTAATATT
AGCAATTGAA
GCATCGGGCT
GTATTCTTTC
TATCCGGCGT
TTTGTATTAT
TTTAGGGAAT
CGGGAAAACT
GATCGTTTTT
GATCATTCGC
GAATTTGCAA
CGGCAGTATT
GGGCTTCAAG
GCTTTAGAGA
TCCGCTTCAA
AAATCCCAAA
AGCTACACTA
TATAACGCAA
ACTTCAGGGA
ACGACTAACG
AATCGTATAG
ATCGCTAACG
TCCACGCAAA
TTGGTGTATG
GGGCAAGCGA
AGCTTGAATT
AATACTATTT
AGCGCGACCA
GGGAACGCTG
ACCGGCTCTG
ACGAACACGA
GCTCCTTTGT
GGTGAAGCTA
TACAACGACG
AGCAGTGAAA
AACAACCAAA
TTGGGCGTTG
CAAAACGCTC
TTAAACAACA
CTATTCACTA
GGGGCTAAAA
TTTGGGGACA
TGCATAGGCT
GAAAGCGGGA
GCATTCTTTT
GCGGGACTTA
ATTATATCAA
ATTTTACAGA
ACTCGTTTAA
CATGGACTGA
GCGCTCTTAA
GCACTTATAG
GCACCGGTGG
CTACCATCAC
ACATGGACAA
GCACAACTTT
CTTTTGAAAA
TTTCAAATAA
TTAATAACGC
CTTCGTTTGT
TTTTTGGGAA
TTAATATTGC
GCGTGAAAGG
CTTTTGGCGA
TCACAAATGG
CTTTCAGTAA
AGCTCGTTTC
CCTACAATTT
GCATGGTGTT
TCGGTTTTAT
CCATTTACTA
AAGCGGAATT
ATATATGGAC
ATAAGGGAGC
TTATCACAGG
ACCGCATTTC
AA
CACGCCACCC
TGCTGACATG
GAGCGGGACT
ACAAACCTTT
TCACAATGTT
TGGGCATTGC
CGCTTATCAT
GGCAGCCAAT
GCAACATAAC
TTCGCAGAAT
CACTAACCAA
CACCAACTTT
CAACCAGTTC
TAATTTTAAC
GGGGGATTTC
CTCTACTAAT
AGGGAATGCA
GAAAGTTACT
TGGGACGATT
CAACCCTATC
AAATCTATGG
TAGTGCGGGT
CCAAGAGGTT
TGATTATGTG
GACCTACATG
TTACGACAAC
TTCTCAAACG
GAGCGTAAGC
AGGGAGTAAT
GCATTATGAA
TAGCGGTGGG
1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2700 2760 2820 2880 2940 3000 3060 3120 3180 3240 3300 3360 3420 3480 3540 3600 3642
INFORMATION FOR SEQ ID NO:1036:
SEQUENCE CHARACTERISTICS:
LENGTH: 1503 base pairs
TYPE: nucleic acid
STRANDEDNESS: double
TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...1503
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1036:
ATGGCTGAAT GGAAAACGGA CACAGAAGAA GTCAAAAAGG TTGTTGGAAG ATGCAGGGAA
TTTAAAAGAT CCCTGCAAGA AGAAAAATGC GCGCTAAJA
TCATAGTGGA
GAATTAAAAA AAGCCAAAAG GGTGCTTCTG
TCGTCTCGTG
GTAGCCGCTG
AAGCTGTTTT
GTGGAGCTAT
TAGAACGAAT
CTTATGAATC
AAGCTTGGGA
CAGGAATTTA
TCAGGCGTTT
CCAACTCCAG GCGTTTTATT AAAAAAAGCG TTTTTAATGA TACGCTGATT
TGAAAGACAA
ATGATAGCTT
TTAGAGAGCA
TCTTATGATG
AAAACCCTAA
TTGGAAAAGA GTTTAGAGGA AGCTTGACTT TAGCCAAAAA ACAGAAAAA. TCAAAGAATC TTAGAAACAG AGTTAGAGCG GAACGGAATG AAAATTATAC AAAGAGAGCT TTATTGTAGA TGGCGTTGGG
CGGAATTTGA
AAAACCGCAT GCGCTCATCA TTGGGCTTTG
ATGCGACAGA
TTGTGGCATT TTGACGCGAA TTAGGCGTGA AACACACGAA
TGA
GCGCAGAAAA
CAATGAAGAT
GATTTGGCCT
GAAATTTATG
GCTTGAAATC
GGGCATTAAA
GAAACAAGCG
GGAATACGAT
GCGCTTGALAG
GATCCATCAT
AGAATTTGAA
CGATGAATTA
TTTGATGCCA
TCGTTGCGTT
CCCTAACGAT
CGCTACAGAA
GCAAAAAGCG
TGAAAAAAAC
TAGCGTTTTT
TGCCCTAAAG
GTTGGAACAG
TGTTTTAGGG
AGGATATTCA
AGTCCATTTA
ACTGAAATGC
GACGCTAAAG
CCTGCTAGAA
AAAGAAGACA
TACTCGAATC
AAGAGGTTGC
AGCGATGCGA
TTTGAACGGC
GATTTGAGGG
AACGCCTTGT
AAGAGTTTAG
GATCGCATGG
AGCGTTTTAG
AAAAACTTTA
TCAAACGCTA
AATTTGAGTC
TTGGAGTATA
CCCTATCCGG
AGCGCCATTG
GCTTTACAAG
ATCGCAAAAG
AATTTAGCGC
CTATGGACAG
TCAAAGACCT
AATTGGAAAA
TTGCTTTAAG
TAGCTGCTAC
CAGAAAAATG
AAGCCAAAGC
ATTTTTATAC
TAGATAACGA
CTGCTATCTC
AAAATTTTAG
CTAACGATGA
AGGATTGGAT
CAATATCTAA
GCGTGCCTTC
AGGAAGCTTT
TCAATGALAGC
AAAAAATCGA
GGGAGTTTTT
AAGAAGTCAG
TGCCTTTAGA
CTACGCTTAA
GATTCATCCC
TGGTGAGAGA
AATTTCTACA
TGATAGTTAC
AGCCATAGGA
AGTCTTGCALA
GGCAGCCATT
CAAAAGGAAT
GAGCGCGAT
TGATAAGCAC
ATACAACTTT
TTATACCCCT
CGCTTCTTTA
TTTAGAACGC
GGGTGCATAT
AGAGCAGGAG
TTATAACGA\
AGAAGGTTTT
CTTTGATAAT
TCCCGTTTTA
AGAAAGCCGT
CTTTAATGA
AGATTTAAAT
AGACAACGAT
TAGGGGGTAT
AGAGTTATTA
GAAACAGAAT
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1503
INFORMATION FOR SEQ ID NO:1037:
SEQUENCE CHARACTERISTICS:
LENGTH: 245 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...245
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1037:
Met Gln Phe Gln Lys Thr Leu Ser Ser Leu Ser Leu Phe Leu Ser Leu
1 5 10
Ser Leu Phe Leu Ser Phe Ser Ile Ala Glu Glu Asn Gly Ala Tyr Ala
25
Ser Val Gly Phe Glu Tyr Ser Ile Ser His Ala Val Glu His Asn Asn
40
Pro Phe Leu Asn Gln Glu Arg Ile Gln Thr Ile Ser Asn Ala Gln Asn
55
Gln Ile Tyr Lys Leu Asn Gln Ile Glu Asn Gii Ile Thr Asn Met Gln
70 75
Asn Thr Phe Asn Tyr Thr Asn Asn Ala Leu Lys Asn Asn Ala Lys Leu
90
Thr Pro Thr Glu Met Gln Ala Glu Gln Tyr Tyr Leu Gln Ser Thr Leu
100 105 110
Gln Asn Ile Glu Lys Ile Val Met Leu Ser Gly Gly Val Ala Ser Asn
115 120 125
Pro Lys Leu Val Gln Ala Leu Glu Lys Met Gln Glu Pro Ile Thr Asn
130 135 140
Pro Leu Glu Leu Val Glu Asn Leu Lys Asn Leu Glu Leu Gln Phe Ser
145 150 155 160
Gln Ser Gln Asn Ser Met Leu Ser Ser Leu Ser Ser Gln Ile Ala Gln
165 170 175
Ile Ser Asn Ser Leu Asn Ala Leu Asp Pro Ser Ser Tyr Ser Lys Asn
180 185 190
Val Ser Ser Met Tyr Gly Val Gly Leu Ser Val Gly Tyr Lys His Phe
195 200 205
Phe Thr Lys Lys Lys Asn Gln Gly Phe Arg Tyr Tyr Leu Phe Tyr Asp
210 215 220
Tyr Gly Tyr Thr Asn Phe Gly Phe Val Gly Asn Gly Phe Asp Gly Leu
225 230 235 240
Gly Lys Met Asn Asn
245
INFORMATION FOR SEQ ID NO:1038:
SEQUENCE CHARACTERISTICS:
LENGTH: 632 amino acids
TYPE: amino acid
TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori
(ix) FEATURE:
NAME/KEY: misc_feature
LOCATION 1...632
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1038:
Met Lys Pro Thr Asn Glu Pro Lys Lys Pro Phe Phe Gln Ser Pro Ile
1 5 10
Val Leu Ala Val Leu Gly Gly Ile Leu Leu Ile Phe Phe Leu Arg Ser
25
Phe Asn Ser Asp Gly Ser Phe Ser Asp Asn Phe Leu Ala Ser Ser Thr
40
Lys Asn Val Ser Tyr His Glu Ile Lys Gln Leu Ile Ser Asn Asn Glu
55
Val Glu Asn Val Ser Ile Gly Gln Thr Leu Ile Lys Ala Ser His Lys
70 75
Glu Gly Asn Asn Arg Val Ile
Tyr Ile Ala Lys Arg Val Pro Asp Leu 90 Thr Leu Val Pro Leu Leu Asp Glu Lys Lys Ile Asn Tyr Ser Gly Phe 100 105 110 Ser Glu Ser Asn Phe Phe Thr Asp Met Leu Gly Trp Leu Met Pro Ile WO 97/37044 PCT/US97/05223 Leu Val 130 Asn Met 145 Asn Ala Glu Ala Glu Arg Val Gly 210 Gly Glu 225 Glu Met Thr Ala Ala Ile Glu Arg 290 Gly Ser 305 Glu Ile Val Leu Val His Val Ala 370 Ile Asn 385 Lys Gin Glu Lys Tyr His Thr Arg 450 Gly Tyr 465 His Glu Glu Glu Glu Arg Ser Ser 530 Leu Gly 545 115 Ile Gly Glu Lys Tyr 195 Pro Ala Phe Lys Gly 275 Glu Glu Leu Val Ile 355 Lys Glu Gin Lys Glu 435 Val Thr Leu Val Ala 515 Val Gly Leu Gly Leu Trp Met Phe Gly Lys Glu 180 Ala Pro His Val Lys 260 Lys Gin Asn Asp Asp 340 Lys Leu Ala His Ser 420 Ser Asn Leu Ile Phe 500 Thr Ser Gly Gly Pro 165 Glu Asn Gly Val Gly 245 Gin Ser Thr Ala Pro 325 Lys Gly Thr Ala Leu 405 Arg Gly Lys Asn Ala 485 Leu Asp Gly Tyr Ile 150 Asn Val Leu Thr Pro 230 Leu Ala Arg Leu Pro 310 Ala Pro Val Ala Leu 390 Lys Arg His Val Thr 470 Glu Glu Ile Leu Gly 550 135 Phe Gly Val Arg Val Glu Gly Ala 200 Gly Lys 215 Phe Phe Gly Ala Pro Ser Ala Ala 280 Asn Gin 295 Val Ile Leu Met Asp Phe Lys Leu 360 Gly Leu 375 Leu Ala Glu Ala Ile Ser Ala Val 440 Ser Ile 455 Pro Glu Ile Asp Glu Ile Ile Lys 520 Met Val 535 Ser Ser Met Phe lie 185 Lys Thr Ser Ser Ile 265 Gly Leu Val Arg Asn 345 Ala Ala Gly Val Pro 425 Ile Ile Glu Val Ser 505 Gly Leu Arg Met Gly Asn 170 Val Ile Leu Met Arg 250 Ile Gly Leu Leu Pro 330 Gly Asn Gly Arg Glu 410 Lys Ser Pro Asn Leu 490 Thr Met Glu Glu Ala Ser 155 Asp Asp Pro Leu Gly 235 Val Phe Met Ala Ala 315 Gly Arg Asp Ala Asn 395 Arg Glu Glu Arg Lys 475 Leu Gly Val Lys Phe 555 Asn 140 Ala Met Phe Lys Ala 220 Gly Arg Ile Ile Glu 300 Ala Arg Val Val Asp 380 Asn Gly Lys Met Gly 460 Tyr Gly Ala Ser Gin 540 Ser 125 Arg Lys Ala Leu Gly 205 Lys Ser Asp Asp Ser 285 Met Thr Phe Glu Asn 365 Leu Gin Ile Lys Thr 445 Met Leu Gly Ser Tyr 525 Arg Glu Met Lys Gly Lys 190 Val Ala Ser Leu Glu 270 Gly Asp Asn Asp Ile 350 Leu Ala Lys Ala Ile 430 Lys 
Ala Met Arg Asn 510 Tyr Asn Lys Gin Leu Asn 175 Tyr Leu Val Phe Phe 255 Ile Asn Gly Arg Arg 335 Leu Gin Asn Glu Gly 415 Val Gly Ala Gin Ala 495 Asp Gly Ala Thr Lys Ile 160 Glu Pro Leu Ala Ile 240 Glu Asp Asp Phe Pro 320 Gin Lys Glu Ile Val 400 Leu Ala Ser Leu Lys 480 Ala Leu Met Phe Ala 560 Glu Glu Met Asp Leu Phe Ile Lys Asn Leu Leu Glu Glu Arg Tyr Gin 565 570 575 WO 97/37044 PCT/S97/05223 939 His Val Lys Gin Thr Leu Ser Asp Tyr Arg Glu Ala Ile Glu Ile Met 580 585 590 Val Lys Glu Leu Phe Asp Lys Glu Val Ile Thr Gly Glu Arg Val Arg 595 600 605 Glu Ile Ile Ser Glu Tyr Glu Ala Ala Asn Asn Leu Glu Ser Arg Leu 610 615 620 Ile Pro Leu Glu Glu Gin Ala Ser 625 630 INFORMATION FOR SEQ ID NO:1039: SEQUENCE CHARACTERISTICS: LENGTH: 1288 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...1288 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1039: Met Glu Ile Gin Gin Thr His Arg Lys Ile Asn Arg Pro Leu Val Ser 1 5 10 Leu Val Leu Ala Gly Ala Leu Ile Ser Ala Ile Pro Gin Glu Ser His 25 Ala Ala Phe Phe Thr Thr Val Ile Ile Pro Ala Ile Val Gly Gly Ile 40 Ala Thr Gly Thr Ala Val Gly Thr Val Ser Gly Leu Leu Ser Trp Gly 55 Leu Lys Gin Ala Glu Glu Ala Asn Lys Thr Pro Asp Lys Pro Asp Lys 70 75 Val Trp Arg Ile Gin Ala Gly Lys Gly Phe Asn Glu Phe Pro Asn Lys 90 Glu Tyr Asp Leu Tyr Lys Ser Leu Leu Ser Ser Lys Ile Asp Gly Gly 100 105 110 Trp Asp Trp Gly Asn Ala Ala Arg His Tyr Trp Val Lys Gly Gly Gin 115 120 125 Trp Asn Lys Leu Glu Val Asp Met Lys Asp Ala Val Gly Thr Tyr Lys 130 135 140 Leu Ser Gly Leu Arg Asn Phe Thr Gly Gly Asp Leu Asp Val Asn Met 145 150 155 160 Gin Lys Ala Thr Leu Arg Leu Gly Gin Phe Asn Gly Asn Ser Phe Thr 165 170 175 Ser Tyr Lys Asp Ser Ala Asp Arg Thr Thr Arg Val Asn Phe Asn Ala 180 185 190 Lys Asn Ile Ser Ile Asp Asn Phe Val Glu Ile Asn Asn Arg Val Gly 195 200 205 Ser Gly Ala Gly Arg Lys Ala Ser Ser Thr Val Leu Thr Leu Gin Ala 210 
215 220 WO 97/37044 PCTIUS97/05223 Ser 225 Gly Va1 Tyr His Ser 305 Leu Ser Gin Gin Gly 385 Asp Ala Ala Val Ala 465 Asn Asn Gly Val 2 Asn I 545 Gly C Asn 'J Val I Ser 1
E
Thr 1 625 Lys I Tyr S
GI
Al Trj Ser Leu 290 Asr Asn Thr Asn Lys 370 Gly Gly Ala Ser Asp 450 ly Gly Leu ksn ksn 530 ?he lu 'hr Jys )ro 10 ~rg ~eu ;er i Gly a Thr Met Thr 275 Thr Lys Ile Thr Asn 355 Thr Lys Thr His Gly.
435 Gly Ser Thr Lys Gly 515 Ile 2 Asn Tyr Val 2 Phe I 595 Trp Lys Met I Gin I Ile Leu Gly 260 Ile Val Thr Ile Ser 340 Asn Glu Asp Ile Leu 420 Arg Pro Ser Ala Jal 500 ,ly %sn Ile rhr krg 280 Jys ~sn 2 he 2 he 2 he Thr Asn 245 Arg Asn Gly His Ala 325 Gin Ser Thr Thr Lys 405 Asn Thr Leu Ala Thr 485 Asp Phe Lys Asn His 565 Leu Ser Tyr I Ala 5 Asn 2 645 Ser I Sei 230 Leu Leu Thr Asp Ile 310 Pro Ser Asn Glu Val 390 Val Ile Leu Arg Asn 470 Phe Ala Asn Leu Glu 550 Phe 3Au ly ?he 3er 530 ksn ~sn Ser 1 Ala Gin Ser Gin 295 Gly Pro Gly Thr Pro 375 Val Gly Gly Leu Va1 455 Phe Asn His Thr Ile 535 Leu Ser 4 Thr Glu Asp 615 Ser Leu Leu Lys Ser Tyr Lys 280 Asn Thr Glu Thr Glu 360 Thr Asn Gly Lys Vai 440 As Glu Asn Thr Leu 520 Thr Ile lu ly Lys 600 la rhr rhr Thr Asn Asn Val 265 Val Ala Leu Gly Lys 345 Vai Gin Ile Phe Gly 425 Glu Asn Phe Asp Ala 505 Asp Ala Val Asp Thr 585 Leu Arg 2 Pro Leu Ile C Ala Ser 250 Gly Gin Ala Asp Gly 330 Asn Ile Val Phe Lys 410 Gly Asn Gin Lys Ile 490 Asn Phe Ser Lys lie 570 krg ,a 1 ksn Slu Ily 650 31n Glu 235 Val Ala Gly Gin Leu 315 Tyr Asp Asn Ile His 395 Ala Val Leu Val Ala 475 Ser Phe Ser Thr Thr 555 Gly Ser Ile Val Asn 635 Gin 2 Gly 2 ilE Lys Tyr Glu Ala 300 Trp Lys Lys Pro Asp 380 Leu Ser Asn Thr Gly 460 Gly Leu Lys Gly Asn 540 Asn Ser Ile Asn Lys 520 Pro %sn ksp Ser Leu Leu I Vai 285 Gly Gin Asp Lys Pro 365 Gly Asn Leu Leu Gly 445 Gly Val Gly Gly Val 525 Va1 Gly Gin Phe Asp 605 Asn Trp Ala Phe Leu Asn Ala 270 Asp Ile Ser Lys Glu 350 Asn Pro Thr Thr Ser 430 Asn Tyr Asp Arg Ile 510 Thr Ala Ile Ser Ser 590 Phe Va1 ly Val Ile Tyr Gly 255 Pro Phe Ile Ala Pro 335 Ile Asn Phe Lys Thr 415 Asn Ile Ala Thr Phe 495 Asp Asp Val Ser Arg 575 Gly Tyr Glu Thr Met 2 655 Asn 2 Asp 240 Asn Ser Asn Ala Gly 320 Asn Ser Thr Ala Ala 400 Asn Gin Thr Leu Lys 480 Val Thr Lys Lys ial 560 Ile ly Tyr Ile 3er 640 sp ksn 660 Gin Giy Thr Ile Asn Tyr Leu Val beb 670 Arg Gly Gly Lys Val Ala Thr Leu WO 97/37044 PCT/US97/05223 941 675 680 685 Asn Val Gly Asn Ala Ala Ala Met Met Phe 
Asn Asn Asp Ile Asp Ser 690 695 700 Ala Thr Gly Phe Tyr Lys Pro Leu Ile Lys Ile Asn Ser Ala Gin Asp 705 710 715 720 Leu Ile Lys Asn Thr Glu His Val Leu Leu Lys Ala Lys Ile Ile Gly 725 730 735 Tyr Gly Asn Val Ser Thr Gly Thr Asn Gly Ile Ser Asn Val Asn Leu 740 745 750 Glu Glu Gin Phe Lys Glu Arg Leu Ala Leu Tyr Asn Asn Asn Asn Arg 755 760 765 Met Asp Thr Cys Val Val Arg Asn Thr Asp Asp Ile Lys Ala Cys Gly 770 775 780 Met Ala Ile Gly Asn Gin Ser Met Val Asn Asn Pro Asp Asn Tyr Lys 785 790 795 800 Tyr Leu Ile Gly Lys Ala Trp Arg Asn Ile Gly Ile Ser Lys Thr Ala 805 810 815 Asn Gly Ser Lys Ile Ser Val Tyr Tyr Leu Gly Asn Ser Thr Pro Thr 820 825 830 Glu Asn Gly Gly Asn Thr Thr Asn Leu Pro Thr Asn Thr Thr Asn Asn 835 840 845 Ala His Ser Ala Asn Tyr Ala Leu Val Lys Asn Ala Pro Phe Ala His 850 855 860 Ser Ala Thr Pro Asn Leu Val Ala Ile Asn Gin His Asp Phe Gly Thr 865 870 875 880 Ile Glu Ser Val Phe Glu Leu Ala Asn Arg Ser Lys Asp Ile Asp Thr 885 890 895 Leu Tyr Thr His Ser Gly Ala Gin Gly Arg Asp Leu Leu Gin Thr Leu 900 905 910 Leu Ile Asp Ser His Asp Ala Gly Tyr Ala Arg Gin Met Ile Asp Asn 915 920 925 Thr Ser Thr Gly Glu Ile Thr Lys Gin Leu Asn Ala Ala Thr Asp Ala 930 935 940 Leu Asn Asn Val Ala Ser Leu Glu His Lys Gin Ser Gly Leu Gin Thr 945 950 955 960 Leu Ser Leu Ser Asn Ala Met Ile Leu Asn Ser Arg Leu Val Asn Leu 965 970 975 Ser Arg Lys His Thr Asn His Ile Asn Ser Phe Ala Gin Arg Leu Gin 980 985 990 Ala Leu Lys Gly Gin Glu Phe Ala Ser Leu Glu Ser Ala Ala Glu Val 995 1000 1005 Leu Tyr Gin Phe Ala Pro Lys Tyr Glu Lys Pro Thr Asn Val Trp Ala 1010 1015 1020 Asn Ala Ile Gly Gly Ala Ser Leu Asn Ser Gly Ser Asn Ala Ser Leu 1025 1030 1035 1040 Tyr Gly Thr Ser Ala Gly Val Asp Ala Phe Leu Asn Gly Asn Val Glu 1045 1050 1055 Ala Ile Val Gly Gly Phe Gly Ser Tyr Gly Tyr Ser Ser Phe Ser Asn 1060 1065 1070 Gin Ala Asn Ser Leu Asn Ser Gly Ala Asn Asn Ala Asn Phe Gly Val 1075 1080 1085 Tyr Ser Arg Phe Phe Ala Asn Gin His Glu Phe Asp Phe Glu Ala Gin 1090 1095 1100 Gly Ala Leu Gly Ser 
Asp Gin Ser Ser Leu Asn Phe Lys Ser Thr Leu 1105 1110 1115 1120 Leu Gin Asp Leu Asn Gin Ser Tyr Asn Tyr Leu Ala Tyr Ser Ala Thr 1125 1130 1135 WO 97/37044 PCT/US97/05223 942 Ala Arg Ala Ser Tyr Gly Tyr Asp Phe Ala Phe Phe Arg Asn Ala Leu 1140 1145 1150 Val Leu Lys Pro Ser Val Gly Val Ser Tyr Asn His Leu Gly Ser Thr 1155 1160 1165 Asn Phe Lys Ser Asn Ser Gin Ser Gin Val Ala Leu Lys Asn Gly Ala 1170 1175 1180 Ser Ser Gin His Leu Phe Asn Ala Asn Ala Asn Val Glu Ala Arg Tyr 1185 1190 1195 1200 Tyr Tyr Gly Asp Thr Ser Tyr Phe Tyr Leu His Ala Gly Val Leu Gin 1205 1210 1215 Glu Phe Ala His Phe Gly Ser Asn Asp Val Ala Ser Leu Asn Thr Phe 1220 1225 1230 Lys Ile Asn Ala Ala Arg Ser Pro Leu Ser Thr Tyr Ala Arg Ala Met 1235 1240 1245 Met Gly Gly Glu Leu Gin Leu Ala Lys Glu Val Phe Leu Asn Leu Gly 1250 1255 1260 Val Val Tyr Leu His Asn Leu Ile Ser Asn Ala Ser His Phe Ala Ser 1265 1270 1275 1280 Asn Leu Gly Met Arg Tyr Ser Phe 1285 INFORMATION FOR SEQ ID NO:1040: SEQUENCE CHARACTERISTICS: LENGTH: 1484 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1484 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1040: Gly Ala Val Ser Leu Arg Gly Gin Phe Asn Leu Ser Asn Asn Ser Ser 1 5 10 Leu Asp Phe Gin Gly Ser Ser Ala Ile Thr Ser Asn Thr Ala Phe Asn 25 Phe Tyr Asp Asn Ala Phe Ser Gin Ser Pro Ile Thr Phe His Gin Ala 40 Leu Asp Ile Lys Val Pro Leu Ser Leu Gly Gly Asn Leu Leu Asn Pro 55 Asn Asn Ser Ser Val Leu Asn Leu Lys Asn Ser Gin Leu Val Phe Ser 70 75 Asp Gin Gly Ser Leu Asn Ile Ala Asn Ile Asp Leu Leu Ser Asp Leu 90 Asn Gly Asn Lys Asn Arg Val Tyr Asn Ile Ile Gin Ala Asp Met Asn 100 105 110 Gly Asn Trp Tyr Glu Arg Ile Asn Phe Phe Gly Met Arg Ile Asn Asp 115 120 125 WO 97/37044 PCTIS97/05223 Gly Asn 145 Va1 Gly Val Ser Ser 225 Ile Val Met Phe Leu 305 Ile Leu Val Ile Ser 385 Glu Gly Asn Leu Asn 465 Leu Gly Gly Ala Val 545 Lys Ile 130 Asn Thr Ser Tyr 
Val 210 Asn Ser Tyr Ala Ser 290 Thr Asp Ile Va1 Val 370 Ile Ile Gly Gin Gly 450 Ile Gly Tyr ln Ile 530 ly Leu Tyr Ala Leu Glu Ser 195 Lys Asn Gin Asn Leu 275 Leu Lys Asn Ile Phe 355 Cys Ser Tyr Ser Ala 435 Gin Leu Asn Leu Asn 515 Lys Gly Ile Asp Leu Sex Ile 180 Tyr Gly Thr Thr Lys 260 Lys Ser Leu Ala Gly 340 Gly Gin Ala Leu Ala 420 Asn Glu Gly Leu Thr 500 Asn Asp Leu Gly Ala Lys Gin 165 Phe Sex Tyr Thr Tyr 245 Gly Leu Ser Ile Asn 325 Ala Gly Lys Asp Thr 405 Ser Ile Gly Glu Ile 485 Pro Phe Leu Ala Sex 1 565 Lys Ile 150 lie Asn Asp Tyr Lys 230 Asn Tyr Tyr Leu Thr 310 Asn Thr Leu Phe Leu 390 Gly Val Va1 Ile Va1 470 Val Glu Asp lie Gly 550 vIet Asn 135 Thr Pro Tyr Asp Asn 215 Asn Ala Asn Pro Ser 295 Pro Ser Lys Gly Arg 375 Gly Thr Thr Ser Asn 455 Ala Asn Gin Asn Arg 535 Leu Ser Gin Glu Cly Gin Ala 200 Pro Asn Gin Phe Glu 280 Asn Ser Val Ile Tyr 360 Gly Tyr Leu Phe Ser 440 Lys Met Thr Lys Leu I 520 Gin Gly Ile Thr Ser Ile Lys 185 Gin Asn Asn Gly Ser 265 Ile Leu Asp Val Gly 345 Gin Thr Ile Gly Asn 425 Gin Val Leu Asn 505 Met Lys Gly Asn Tyr Phe Lys 170 Va1 Gly Gin Leu Asn 250 Asn Lys Lys Trp Gln 330 Gin Lys Tyr Asp Ser 410 Sex Thr Phe Sex Gly 490 Gin Asn Leu Ile Asp Ser Lys 155 Asn Tyr Val Ser Thr 235 Pro Ile Lys Gly Lys 315 Asn Thr Pro Leu Thr 395 Gly Gin Asp Asn Ile 475 Sex Thx Asp ly Asp 555 Leu Phe 140 Asn Thr Asn Phe Tyr 220 Ser Ile Lys Ile Asp 300 Asn Phe Asp Cys Gly 380 Thr Asn Thr Gly Gin 460 Asn Asp Leu Ser Phe 540 Leu Leu Thr Asn Leu Asn Tyr 205 Gin Glu Sex Ala Leu 285 Ala Ile Asn Thr Asp 365 Gin Phe Ala Sex Ile 445 Ala Lys Sex Sex Gly 525 Trp Gin Ser Asn Gin Tyr Ala 190 Leu Ala Ser Ala Leu 270 Gly Leu Asn Asn Asn 350 Tyr Leu Asn Trp Leu 430 Phe Gly Ala Val Gin 510 Leu Thr Asn Lys Pro Leu Asn 175 Asn Thr Ser Ser Leu 255 Gly Asn Asn Glu Gly 335 Ser Thr Leu Ala Gly 415 Ile Ser Leu Gly Ile 495 Leu Asn Gly Pro Lys 575 Leu Sex 160 Ile Gly Ser Gly Va1 240 His Gin Asp Gin Leu 320 Thx Ala Asp Glu Lys 400 Thr Leu Met Ala Gly 480 Gly Leu Thr Leu Glu 560 Gly 570 Leu Phe 
Asn Gin Ile Thr Gly Phe Ile Ser Ala Asn Asp Ile Gly Gin WO 97/37044 PCT/us97/05223 580 Val Lys G1 625 Ile Ile Gly Phe Ser 705 Ile Leu Asn Val1 Asp 785 Asn Asn Gly Ser Ala 865 Ser Ala Ala Asn Val 945 Gin Leu Leu His Ile Ser 595 Asn Asp 610 Gin Asp -Lys Ser Tyr Glu Sle Phe 675 Ser Phe 690 Asn Ala Phe Ala Asn Leu Ala Ser 755 Ser Gin 770 Pro Thr Ala Ser Asn Asp Asn Ile 835 Ala Asn 850 Asn Asn Phe Vai Thr Gin Ser Leu 915 Asn Leu 930 Asn Pro Thr Pro Leu Lys Gin Ser 995 Ile GiuC 1010 Va Val Thi Val Gir 660 Ala Asn Asn Gly Leu 740 Asn Gly Thr Ser Glu 820 Tyr Val1 Leu Thr Lys 900 Ser 1
Y
Leu ?he 3er 980 Pyr iiu LMet -Ala Leu *Leu 645 Gly Pro Ala Gly Asn 725 Ser Gly Asn Ala Ser 805 Ser Ala Lys Thr Asn 885 Ile Thr Ala IleC Met 1 965 Ser Leu I Lys I Lei- Ala Asr 630 Asp Leu Tyr Gin Gly 710 Asn As n Leu Leu Ser 790 Asn Leu Asn Asn Ile 870 ksn 31u 31y Ilie .,1n 150 1 sn ~rg ~ys ~sn Gin Asp Ile 600 Leu Gly Lys 615 Ser Leu Glu Lys Val Leu Gly Asp Leu 665 Gly Leu Ser 680 Gly Asn Val 695 Thr Leu Ser His Ile Ala Gin Val Ser 745 Lys Ile Asn 760 Phe Ile Asn 775 Ala Thr Asn Ala Ser Asn Val Vai Thr 825 Gly Vai Val 840 Leu Tyr Leu 855 Ser Asn Gin Leu Asn Ile Val Leu Gin 905 Ile Tyr Gly 920 His Phe Asn 935 Vai Gly Gly Val Ser Val Tyr Ile Asp r 985 Leu Tyr Thr I 1000 Giy Val Leu 9I 1015 Val Gln Ser Ala 650 Ile Gin Phe Phe Phe 730 Asn Ala Ala Pro Asn 810 Ala Asp Tyr Al a Gln 890 ksn Leu eu Ile k a 170 ryr .eu ?hr *Lys Met Leu 635 *Ala Pro Val1 Val1 Asn 715 Thr Ile Thr Ser Cys 795 Ala Asn Phe Asn Vai 875 Gly Leu Glu Giu Ile 955 Asn Asn Ile TyrI Prc Ile 620 Leu Lys Asn Trp Gin 700 Al a Asn Asn Asn Cys 780 Thr Pro Gly Ser Asn 860 Leu Ala Val Ia 1 kisn 940 %sn 3lY Ile k.sn ,eu Ser Asp 605 Gly Giu Gin Asn Gly Leu Leu Gly 670 Gin Lys 685 Asn Ser Gly Asn His Ser Vai Thr 750 Asn Asn 765 Val Gin Thr Ala Ile Ala Phe Asn 830 Lys Ile 845 Ala Gin Giu Lys Phe Asn Ile Ala 910 Gly Gly 925 Ser Gin Leu Asn Gly Thr Asn Pro 990 Ile AsnC 1005 Gly Gin I AlE Phe Gir Gly 655 Lys Gly Thr Ser Gly 735 Met Val Gin Gin Leu 815 Phe Lys Phe As n Asn 895 Ser kia rhr rhr Pyr 975 ksn 'ly rg Leu Leu 1Gin 640 Ser Lys Asp Phe Leu 720 Thr Leu Ser Ser Asn 800 Asn Ser Gly Gin Ala 880 Asn Asn Leu Pro Thr 960 Thr Ser Asn Val 1020 Leu Leu Gin Asp Lys Gly Leu Leu Leu Ser Val Ala Leu Pro Asn Ser 1025 1030 1035 1040 WO 97/37044 PCT/US97/05223 945 Asn Asn Ala Ser Gin Asn Asn Ile Leu Ser Leu Ser Val Leu His Asn 1045 1050 1055 Gin Ile Lys Met Ser Tyr Gly Asn Lys Val Met Asp Phe Thr Pro Pro 1060 1065 1070 Thr Leu Gin Asp Tyr Ile Val Gly Ile Gin Gly Gin Ser Ala Leu Asn 1075 1080 1085 Gin Ile Glu Ala Val Gly Gly Asn 
Asn Ala Ile Lys Trp Leu Ser Thr 1090 1095 1100 Leu Met Met Glu Thr Lys Glu Asn Pro Leu Phe Ala Pro Ile Tyr Leu 1105 1110 1115 1120 Glu Asn His Ser Leu Asn Glu Ile Leu Gly Val Thr Lys Asp Leu Gln 1125 1130 1135 Asn Thr Ala Ser Leu Ile Ser Asn Pro Asn Phe Arg Asn Asn Ala Thr 1140 1145 1150 Ser Leu Leu Glu Met Ala Ser Tyr Thr Gln Gln Thr Ser Arg Leu Thr 1155 1160 1165 Lys Leu Ser Asp Phe Arg Ala Arg Glu Gly Glu Ser Asn Phe Ser Glu 1170 1175 1180 Arg Leu Leu Glu Leu Lys Asn Lys Arg Phe Ser Asp Pro Asn Pro Ser 1185 1190 1195 1200 Glu Val Phe Val Lys Tyr Ser Gln Leu Ser Lys His Pro Asn Asn Leu 1205 1210 1215 Trp Ile Gln Gly Val Gly Gly Ala Ser Phe Ile Ser Gly Gly Asn Gly 1220 1225 1230 Thr Leu Tyr Gly Leu Asn Val Gly Tyr Asp Arg Leu Val Lys Ser Val 1235 1240 1245 Ile Leu Gly Gly Tyr Val Ala Tyr Gly Tyr Ser Gly Phe Asn Gly Asn 1250 1255 1260 Ile Met His Ser Leu Ala Asn Asn Val Asp Val Gly Met Tyr Ala Arg 1265 1270 1275 1280 Ala Phe Leu Lys Arg Asn Glu Phe Thr Leu Ser Ala Asn Glu Thr Tyr 1285 1290 1295 Gly Gly Asn Ala Ser His Ile Asn Ser Ser Asn Ser Leu Leu Ser Val 1300 1305 1310 Leu Asn Gln Arg Tyr Asn Tyr Asn Thr Trp Thr Thr Ser Val Asn Gly 1315 1320 1325 Asn Tyr Gly Tyr Asp Phe Met Phe Lys Gln Lys Ser Val Val Leu Lys 1330 1335 1340 Pro Gln Val Gly Leu Ser Tyr His Phe Ile Gly Leu Ser Gly Met Lys 1345 1350 1355 1360 Gly Lys Met Gln Asn Pro Ala Tyr Gln Gln Phe Val Met His Ser Asn 1365 1370 1375 Pro Ser Asn Glu Ser Val Leu Thr Leu Asn Met Gly Leu Glu Ser Arg 1380 1385 1390 Lys Tyr Phe Gly Lys Asn Ser Tyr Tyr Phe Val Thr Ala Arg Leu Gly 1395 1400 1405 Arg Asp Leu Leu Ile Lys Ala Lys Gly Asp Asn Val Val Arg Phe Val 1410 1415 1420 Gly Glu Asn Thr Leu Leu Tyr Arg Lys Gly Glu Ile Phe Asn Thr Phe 1425 1430 1435 1440 Ala Ser Val Ile Thr Gly Gly Glu Met His Leu Trp Arg Leu Met Tyr 1445 1450 1455 Val Asn Ala Gly Val Gly Leu Lys Met Gly Leu Gln Tyr Gln Asp Leu 1460 1465 1470 Asn Ile Thr Gly Asn Val Gly Met Arg Val Ala Phe 1475 1480 INFORMATION FOR SEQ
ID NO:1041: SEQUENCE CHARACTERISTICS: LENGTH: 300 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...300 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1041: Met Arg Lys Thr Ile Ser Ala Leu Phe Leu Ser Ala Cys Ile Gly Leu 1 5 10 Ser Ser Val His Ala Ser Asn Ala Leu Ile Leu Gln Thr Asp Phe Ser 25 Leu Lys Asp Gly Ala Val Ser Ala Met Lys Gly Val Ala Phe Ser Val 40 Asp Ser Asn Leu Lys Ile Phe Asp Leu Thr His Glu Ile Pro Pro Tyr 55 Asn Ile Trp Glu Gly Ala Tyr Arg Leu Tyr Gln Thr Ala Ser Tyr Trp 70 75 Pro Lys Gly Ser Val Phe Val Ser Val Val Asp Pro Gly Val Gly Thr 90 Asn Arg Lys Ser Val Val Leu Lys Thr Lys Asn Gly Gln Tyr Phe Val 100 105 110 Ser Pro Asp Asn Gly Thr Leu Thr Leu Val Ala Gln Thr Leu Gly Ile 115 120 125 Asp Ser Val Arg Glu Ile Asp Glu Lys Ala Asn Arg Leu Lys Gly Ser 130 135 140 Glu Lys Ser Tyr Thr Phe His Gly Arg Asp Val Tyr Ala Tyr Thr Gly 145 150 155 160 Ala Arg Leu Ala Ser Gly Ala Ile Thr Phe Glu Gln Val Gly Pro Glu 165 170 175 Leu Pro Ile Lys Val Val Glu Ile Pro Tyr Gln Lys Ala Lys Ala Thr 180 185 190 Lys Gly Gly Val Lys Gly Asn Ile Pro Ile Leu Asp Ile Gln Tyr Gly 195 200 205 Asn Val Trp Ser Asn Ile Ser Asp Lys Leu Leu Asn Gln Ala Gly Ile 210 215 220 Lys Arg Asn Asp Thr Val Cys Val Thr Ile Phe Lys Asn Ser Lys Lys 225 230 235 240 Gln Tyr Glu Gly Lys Met Pro Tyr Val Ala Ser Phe Gly Asp Val Pro 245 250 255 Glu Gly Gln Pro Leu Val Tyr Leu Asn Ser Leu Leu Asn Val Ser Val 260 265 270 Ala Leu Asn Met Asp Asn Phe Ala Gln Lys His Gln Ile Lys Ser Gly 275 280 285 Ala Asp Trp Asn Ile Asp Ile Lys Lys Cys Ala Lys 290 295 300 INFORMATION FOR SEQ ID NO:1042: SEQUENCE CHARACTERISTICS: LENGTH: 526 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...526 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1042: Met 1 Leu Lys Gly Ser Ile Glu Gin Pro Gly 145 Val Lys Ala Met Ser 225 Leu Asn Lys Leu Phe Asn Pro Arg Leu Ile Val Phe Ile Phe Ala r.Le 1 Gly Ile Val Ala Lys Ala Phe Leu 130 Ile Ile Thr His Asn 210 Asp Asp Gln Val Thr Gin Leu Ser Lys Glu 115 Glu Ile Gin Leu Leu 195 Met Ala Gly Pro SGly Leu Thr SGlu Ser Lys 100 Ile Gin Arg Gin Glu 180 Gin Thr Glu Glu Val 260 5 Phe Gly Asp Tyr Leu Leu Lys Glu Asn Gly 165 Glu Met Asp Met Met 245 Val SSer SLeu Glu Asn 70 Glu Asp Lys Glu Arg 150 Arg Glu Met Leu Gly 230 Leu Ser I Val Asp Ala 55 Ala Gly Ala Glu Leu 135 Leu Glu Arg Ala 31u 215 Gly Thr 'he SPro SLeu 40 SLeu Lys Ile Leu Ala 120 Arg Asp Glu Arg Val 200 Ala Lys Asp Thr Ser 25 Arg Lys Lys Ser Leu 105 Glu Lys Gln Ile Ala 185 Asp Gln Ile Ala Leu 1 10 Leu Gly Asn Gin Phe 90 Leu Phe Asn Phe Ser 170 Lys Glu Lys Leu Lys 250 ksp iLeu Gly Lys Asn 75 Glu Glu Tyr Thr Gly 155 Val Asp Glu Leu Leu 235 Val Ala Glu Leu Tyr Ile Leu Leu Ser Ile 140 Leu Gin Leu His Gly 220 Lys 3al Gln Thr Asn Leu Leu Leu Gln Val 125 Leu Ala Leu Ile Asn 205 Ser Ala Tyr Gly SLys Met SSer Leu Asp Gly 110 Lys Gin Glu Pro Ser 190 Lys Val Ile Asp Ala 270 Gly Leu Leu Lys Glu His Leu Val Pro Gly 175 Lys Asp Leu Pro Gin 255 Lys Leu Pro Leu Ala Asp Asp Ser Thr Ile Val 160 Ile Ser Ala Leu Ile 240 Asn Ile Ala3l Phe Gly Asp Phe Ser Gly Ala Asn Val Gly Lys Arg Met Ala Ile Val WO 97/37044 PCT/US97/05223 Leu Gly 305 Asp Val Lys Phe Leu 385 Ala Ile Leu Asn Ser 465 Leu Gly Lys Asp 290 Gly Leu Leu Thr Met 370 Val Thr Ala Arg Ala 450 Val Thr Thr Ser 275 Asn Ser Ala Glu Ser 355 Ala Val Leu Val Glu 435 Ser Leu Thr Gin Leu 515 Lys Gly lie Lys 340 Ile Leu Asn Thr Asp 420 Gly Arg Leu Gly Gly 500 Tyr Val Gin Ala 325 Arg Ile Tyr Leu Leu 405 Ala Glu Ala Tyr Ile 485 Ile Phe Tyr Ile 310 Leu Ile Ala Tyr Phe 390 Pro Asn Gly Ile Ala 470 Gly Tyr Trp Ser 295 Asn Arg Val Leu Ser 375 Leu Gly Ile Val Phe 
455 Tyr Ile Gin Phe 280 Ala Gly Ser Gly Val 360 Met Ile Met Ile Val 440 Asp Gly Leu Ala Gly 520 Pro Asn Gly Pro 345 Gly Ala Val Ala Ile 425 Lys Ser Thr Ala Leu 505 Val Val Phe Ala 330 Ser Gly Gly Ala Gly 410 Asn Ala Asn Gly Ser 490 Leu Ile Ser 315 Met Leu Phe Val Val 395 Ile Glu Ile Ile Ala 475 Ile Pro Arg 300 Val Asn Gly Ile Ile 380 Met Val Arg His Thr 460 Ile Ile Lys 285 Glu Ala Ala Lys Leu 365 Ala Ala Leu Ile Leu 445 Ser Lys Thr Leu Arg Gin Pro Asp 350 Val Cys Ile Thr Arg 430 Gly Leu Gly Ala Ala 510 Ile Ala Ile 335 Ser Met Met Phe Val 415 Glu Tyr Ile Phe Ile 495 Gin Gly Ser 320 Gin Ile Gly Ala Gly 400 Gly Val Ile Ala Ala 480 Ile Thr Lys Asn Lys Arg Ala INFORMATION FOR SEQ ID NO:1043: SEQUENCE CHARACTERISTICS: LENGTH: 793 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...793 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1043: Met Asn Gly Tyr Leu Arg Val Lys Thr Pro Tyr Phe Leu Ala Ser 5 10 Val Leu Thr Phe Trp Thr Phe Asn Ser Phe Met Ser Ala Lys Asp 25 His His Phe Leu Lys Lys Val Thr Thr Thr Glu Gln Lys Phe Ser Ile 1 Val Lys WO 97/37044 PCT/US97/05223 949 Ser Ser Leu Asp Gly Gly Phe 145 Gly Va1 Arg Lys Phe 225 Ile Ser Asr Ser Phe 305 Phe Gly Phe Lys Asn 385 Phe Leu Gly Lys Ser 465 Ile Ser so Ser Asn Ala Gly Ile 130 Pro Thr Ile Ile Glu 210 Asn Ser Pro Ala Tyr 290 Ile Gly Gly Gly Ile 370 Cys Phe Asn Met Asn 450 Leu Asn Ala Arg Ile Thr Gly 115 Pro Val Ser Thr Thr 195 Lys Thr Ala Thr Thr 275 His Asn Ile Asp Phe 355 Leu Gly Asp Leu Arg 435 Pro Asn Phe 40 Pro Ile Ser Trp Gin Ser Thr Glu Gly 100 Gly Ile Thr Va1 Lys 180 Phe Gly Tyr Gin Lys 260 Asn Pro Glu Va1 Phe 340 Ser Pro Leu Asn Ile 420 Phe Ser Asn Asn Val Asn Thr Asn Tyr Phe Gin 165 Glu Trp Lys Gly Gly 245 Va1 Thr Gly Arg Tyr 325 Lys Asn Phe Tyr Ile 405 Va1 Leu Met Phe Asn 485 Ile 70 Ala Gly Gly Gly Gin 150 Tyr Ile Gly Pro Arg 230 Asn Gin Phe Thr Pro 310 Gin Phe Gln Lys Ser 390 
Arg Asn Thr Pro Asn 470 Gly 55 Ser Leu Vai His Ala 135 Ser Gly Pro Arg Leu 215 Thr Trp Asn Lys Leu 295 Asp Asn Thr Tyr Gly 375 Tyr Arg Thr Glu Asn 455 Asn Met Asn Gin Leu Ser 120 Pro Val Pro Lys Ser 200 Ala Ala Ile Tyr Ala 280 Ser Asn Tyr Tyr Gin 360 Lys Ser Ser Gly Asp 440 Asn Tyr Leu Lys Asn Pro 105 Asn Tyr Asp Asn Glu 185 Ser Gin Gly Asn Leu 265 Tyr Ala Gin Phe Phe 345 Ser Gly Asp Val Lys 425 Leu Gly Thr Thr Glu Glu Val 90 Lys Thr Ser Arg Thr 170 Trp Asn Thr Met Gly 250 Leu Tyr Gin Asp Gly 330 Thr Va1 Glu Thr Va1 410 Val Tyr Ser Ala Ile 490 Glu Leu 75 Pro Ile Asn Asn Ile 155 Phe Glu Gly Leu Leu 235 Gin Asp Gin Asp Gly 315 Asp His Tyr Ile Asn 395 Asn Lys Arg Gly Va1 475 Thr Val Lys Gly Ser Met Ile 140 Asp Gly Asn Asn Gly 220 Gly Gly Ala Tyr Tyr 300 Gly Pro Asp Met Ser 380 Ser Ala Gin Arg Phe 460 Tyr Pro Arg Lys Ile Va1 Ile 125 Glu Val Gly Gin Phe 205 Asn Lys Phe Ile Tyr 285 Ala Arg Asp Met Ser 365 Ala Pro Phe Thr Ser 445 Asp Ala Gly Asn Thr Gin Arg 110 Leu Leu Ile Va1 Ala 190 Val Gin Tyr Arg Tyr 270 Gin Tyr Ala Arg Ser 350 Gly Lys Cys Glu Phe 430 Thr Ala Ser Leu Ser Gly Ile Gly Va1 Ala Lys Va1 175 Ala Asp Met Ile Gin 255 Lys Tyr Asn Lys Lys 335 Arg Gin Asn Trp Pro 415 Asn Thr Gly Asp Arg 495 Thr Asn Arg Phe Asn Ile Gly 160 Asn Glu Pro Leu Gly 240 Asn Ile Asn Arg Arg 320 Val Asp Asn Pro Gin 400 Lys Met Arg Thr G1u 480 Tyr WO 97/37044 PCT/US97/05223 Thr Gin Val Tyr 545 Val Gly Phe Arg Tyr 625 Asp Lys Pro Ile Asn 705 Ile Leu Ala Gly Ile 785 Phe Thr Asn 530 Gin Gly Ser Val Glu 610 Thr Ala Gly His Gly 690 Thr Val Gin Ser Ile 770 Thr Leu Pro 515 Val Arg Thr Arg Ile 595 Pro Pro Asn Pro Gin 675 Leu Val Thr Ile Leu 755 Gly Ala Asn 500 Lys Gly Ser Ser Tyr 580 Phe Val Ile Ile Lys 660 Phe Ser Pro Lys Ser 740 Gin Thr Tyr Tyr Glu Lys Lys Thr Tyr Tyr Thr 565 Tyr Ala Asn Arg Thr 645 Lys Ile Ser Phe Thr 725 Thr Ile Ser Val Thr Lys Ile 550 Asp Phe Asn Ala Gly 630 Ser Asp Leu Phe Thr 710 Ala Thr Asn Pro Ser 790 Lys Pro 535 Pro Tyr Asn His Arg 615 Leu His Ile Asp Phe 695 Glu Gly Leu 
Asn Asn 775 Tyr Glu 520 Ile Pro Phe His Tyr 600 Ser Asn Thr Phe Ala 680 Tyr Tyr Met Trp Ile 760 Gly Asn Asp Ala 505 Arg Tyr Lys Glu Gin Phe Gin Ile 570 Gin Val 585 Phe Thr Gin Gly Phe His Met Val 650 Gly Lys 665 Ser Tyr Ser Arg Ala Pro Thr Pro 730 Glu Arg 745 Phe Asn Lys Glu Phe Pro Asn Leu Ser 555 Phe Ser Gly Val Ala 635 Thr Lys Thr Ala Thr 715 Tyr Lys Met Ala Pro Gin Leu 540 Asn Asn Phe Arg Glu 620 Ala Asn Leu Tyr Tyr 700 Ile Tyr Asn Lys Ala 780 Phe Trp 525 Phe Ile Val Asn Tyr 605 Leu Tyr Pro Pro Ala 685 Ser Lys Trp Gin Tyr 765 Pro Lys 510 Asn Tyr Gly Met Ala 590 Gly Glu Thr Ala Phe 670 Lys Asp Asn Val Ser 750 Trp Pro Val Pro Phe Asn Glu 575 Asn Asp Leu Phe Asn 655 Val Thr Val Gly Trp 735 Val Phe Arg Gly Ala Asn Phe 560 Gly Tyr Asn Tyr Ile 640 Pro Ser Thr Leu Ala 720 Asn Asn Ser Ser INFORMATION FOR SEQ ID NO:1044: SEQUENCE CHARACTERISTICS: LENGTH: 333 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...333 WO 97/37044 PCT/US97/05223 951 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1044: Met 1 Val Glu Val Phe Ser Met Leu Glu Thr 145 Glu Phe Leu Asp Lys 225 Pro Val Gin Leu Gly 305 Asp Leu Ile Ala Arg Phe Lys Lys Ala Leu Ile Ser Tyr Ser Leu 1 Leu Ile Ser Asn Asp Ser Ser His 130 Ile Val Ile Phe Ile 210 Phe Glu Leu Val Ile 290 Val Leu Leu Gin Lys Thr Ile Ser Pro 115 Ala Val Asp Lys His 195 Leu Gly Ile Asn Tyr 275 Ser Asp Asn Val Val Ile Trp Val Asp 100 Asp Lys Glu Ala Glu 180 Lys Glu Arg Ile Asn 260 Lys Leu Ile Asp 5 Ser Lys Ile Asp Lys His Leu Lys Val Ser 165 Arg Ala Lys Ala Phe 245 Pro Leu Phe Asn Ala 325 Ser Asp Tyr Arg 70 Ala Val Val Phe Met 150 Lys Leu Asn Gly Asp 230 Ile Lys Pro Ile Ala 310 Glu Leu Tyr Leu 55 Val Thr Ala Val Gly 135 Glu Lys Lys Lys Gly 215 Ile Trp Phe Thr Ala 295 lle Val Leu Phe 40 Gly Val Leu Ala Thr 120 lle Asp Leu Asn Ile 200 lle Ser Trp Ser Met 280 Leu lle Glu Gly 25 Gly Ser Gly Lys Leu 105 Phe Ser Ile 
Ala Val 185 Ser Asp Val Ile Thr 265 Asp Lys Lys Pro 10 Val Glu Phe Ile Asp 90 Asn Val Phe Asp Lys 170 Lys Gly Asn Glu Ser 250 Ile lie Ala Asp Ala Gin Ala Ser 75 Pro Val Gly Leu Ala 155 Met Lys His Phe Lys 235 Pro Lys Gly His Tyr 315 Asn Thr Glu Asp Glu Glu Asn Ser 140 Gin Gin Lys Gin Gly 220 Ile Leu Ala Gly Pro 300 Tyr Ala Ile Val Tyr Arg Leu Pro 125 Phe Ala Glu Lys Ala 205 Leu Val Ser Ile Pro 285 Glu Lys Ser Lys Pro Ala Ile Leu 110 Lys Gin Lys Thr Gly 190 Leu Lys Lys Pro Lys 270 Arg Ala Val Asn Leu Ala Phe Lys Lys Ala Glu Ala Leu 175 Val Asp Tyr Glu Glu 255 Asn Ala Phe Val Gly Gin Pro Met Lys Pro Lys Val Lys Leu 160 Asp Glu Ser Val Asn 240 Asp Lys Pro Lys Phe Phe Leu Trp His INFORMATION FOR SEQ ID NO:1045: SEQUENCE CHARACTERISTICS: (ii) (iii) (vi) LENGTH: 400 amino acids TYPE: amino acid TOPOLOGY: linear MOLECULE TYPE: protein HYPOTHETICAL: YES ORIGINAL SOURCE: WO 97/37044 WO 9737044PCTIUS97/05223 952 ORGANISM: Helicobacter Pylori (ix) FEATURE: NAME/KEY: mnisc feature LOCATION 1 .400 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1045: Met Tyr Lys Leu Gly Ile Phe Leu Leu Ala Thr Leu 1 Thr Gin Leu Leu Ser Sen s0 Ile Giu Asn Glu Leu Asp Asp Thr Asn Leu 130 Leu His 145 Lys Lys Ile Ser Sen Leu Tyr Ala 210 Asn Leu 225 Giu Asn Val Lys Asn Gly Lys Phe 290 Giu Sen 305 Val Leu Lys Val Gin Leu Lys Gly 370 Thr Met Lys Lys Leu Arg Ser His Leu 115 Ala Gin Leu Ser Gin 195 Ile Asn Giu Gin Pro 275 Gly Ile Asp Val Asp 355 Tyr Arg Val1 Lys Gly Gin Leu Leu 100 Leu Ser Ser Asn Val 180 Thr Tyr Aia Giu Val1 260 Lys Pro Thr Gly Ile 340 Lys Val1 Glu 5 Ser Thr Giu Met Val Gin Giu Ser Thr Val1 165 Ile Giu Asn Leu Arg 245 Ala Thr Tyr Leu Lys 325 Ile Ile Leu Lys Asp His Ala Val1 70 Gin Lys Asp Asn Leu 150 Gin Asp Gin Gin Leu 230 Val Ser Ile Ile Val1 310 Ile Giu Ala Giy His Ile Ala Glu Giu 40 Ile Arg 55 Ala Leu Giu Lys Gin Arg Phe Leu 120 Asp Val 135 Ser Lys Ala Leu Giu Gin Asn Lys 200 Arg Leu 215 Lys Arg Ser Leu Ser Tyr Ala Pro 280 Asp Pro 295 Ser Lys Val Phe His Lys Pro Thr 360 Arg Ile 375 Ile Asn 
Lys 25 Lys Ser Lys Val Sen 105 Phe Ile Met Giu Lys 185 Leu Ser Leu Lys Gin 265 Leu Val1 Thr Ala Asn 345 Ile Asp Pro 10 Asp Asn Lys Lys Leu 90 Phe Ser Leu Ser Val 170 Thr Ile Leu Asn Lys 250 Asn Asn Tyr Pro Lys 330 Gly Lys Gin Leu Ile Gin Giu Ser 75 Thr Leu Gin Gin Gin 155 Lys Arg Leu Leu Ile 235 Ser Ile Asp Asn Asn 315 Giu Ile Ser Arg Glu Gin Leu Leu Leu Asn Gin Ala Val1 140 Leu Ser Giu Ser Giu 220 Ile Ser Asn Tyr Leu 300 Ala Ile Arg Gly Leu 380 Leu Leu His Asn Gin Giu Tyr Lys Leu 125 Ala Ser Ser Ser Met 205 Lys Lys Gin Thr Giu 285 Lys Leu Asn Thr Met 365 Giy Ile Ser Lys Ser Lys Lys Arg Arg 110 Lys Phe Gin Ile Thr 190 Gin Giu Gin Ala Thr 270 Val Ile Val1 Met Ile 350 Arg Phe Ala Giu Arg Val Asn Lys Val Gly Giu Giu Gin 175 Leu Lys Arg Asn Leu 255 Ser Val1 Phe Arg Leu 335 Tyr Ile Giu Asn Thr Leu Glu Arg Ser Phe Gin Asn Glu 160 Lys Lys Asp Gin Arg 240 Giu Tyr Gin Ser Asn 320 Lys Ser Gin Val1 Ala Arg Asn WO 97/37044 PCT/US97/05223 953 385 390 395 400 INFORMATION FOR SEQ ID NO:1046: SEQUENCE CHARACTERISTICS: LENGTH: 549 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...549 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1046: Met Asp Lys Asn Asn Asn Asn Asn Leu Arg Leu Ile Leu Ala Ile Ala 1 5 10 Leu Ser Phe Leu Phe Ile Ala Leu Tyr Ser Tyr Phe Phe Gin Glu Pro 25 Asn Lys Thr Thr Thr Glu Thr Thr Lys Gin Glu Thr Thr Asn Asn His 40 Thr Ala Thr Ser Pro Thr Ala Ser Asn Thr Ile Thr Gin Asp Phe Ser 55 Val Thr Gin Thr Ile Pro Gin Glu Ser Leu Leu Ser Thr Ile Ser Phe 70 75 Glu His Ala Lys Ile Glu Ile Asp Ser Leu Gly Arg Ile Lys Gln Val 90 Tyr Leu Lys Asp Lys Lys Tyr Leu Thr Pro Lys Glu Lys Gly Phe Leu 100 105 110 Glu His Val Ser His Leu Phe Ser Ser Lys Glu Asn Ser Gin Pro Ser 115 120 125 Leu Lys Glu Leu Pro Leu Leu Ala Ala Asp Lys Leu Lys Pro Leu Glu 130 135 140 Val Arg Phe Leu Asp Pro Thr Leu Asn Asn Lys Ala Phe Asn Thr Pro 145 150 155 160 Tyr 
Ser Ala Ser Lys Thr Thr Leu Gly Pro Asn Glu Gin Leu Val Leu 165 170 175 Thr Gin Asp Leu Gly Thr Leu Ser Ile Ile Lys Thr Leu Thr Phe Tyr 180 185 190 Asp Asp Leu His Tyr Asp Leu Lys Ile Ala Phe Lys Ser Pro Asn Asn 195 200 205 Leu Ile Pro Ser Tyr Val Ile Thr Asn Gly Tyr Arg Pro Val Ala Asp 210 215 220 Leu Asp Ser Tyr Thr Phe Ser Gly Val Leu Leu Glu Asn Thr Asp Lys 225 230 235 240 Lys Ile Glu Lys Ile Glu Asp Lys Asp Ala Lys Glu Ile Lys Arg Phe 245 250 255 Ser Asn Thr Leu Phe Leu Ser Ser Val Asp Arg Tyr Phe Thr Thr Leu 260 265 270 Leu Phe Thr Lys Asp Pro Gin Gly Phe Glu Ala Leu Ile Asp Ser Glu WO 97/37044 PCT/US97/05223 lie Asn 305 Ile Phe Gly Ile Lys 385 Glu Gly Val Ser Tyr 465 Ser Lys Gly Gin Gin 545 (2) Gly 290 Leu Ser Ala Asn Leu 370 Glu Pro Ala Phe Ser 450 Phe Val Leu Leu Leu 530 Asn 275 Thr His Pro Lys Trp 355 Tyr Leu Gin Asn Phe 435 Glu Ile Thr Leu Val 515 Ile Ile Lys Asn Pro Leu Gly Phe Gly Tyr Met Leu 325 Gly Val 340 Gly Trp Pro Leu Ala Pro Lys Leu 405 Pro Leu 420 Ala Ile Trp Val Leu Pro Pro Asn 485 Pro Leu 500 Leu Tyr Ile Asn Lys Glu Ile 310 Thr Phe Ala Ser Lys 390 Gin Gly Tyr Leu Leu 470 Thr Leu Trp Lys 295 Gly Pro Asp Val Val Leu Ile Ile 360 Tyr Lys 375 Met Lys Ala His Gly Cys Arg Val 440 Trp Ile 455 Leu Met Met Thr Phe Thr Thr Thr 520 Val Leu 535 Lys Ile Leu 345 Leu Gly Glu Met Leu 425 Leu His Gly Asp Ile 505 His Ile Asp Glu 330 Asp Leu Met Leu Met 410 Pro Tyr Asp Ala Pro 490 Phe Asn Ser Tyr 315 Tyr Tyr Thr Val Gin 395 Gin Leu Asn Leu Ser 475 Met Leu Ile Leu 300 Arg Gly Leu Ile Ser 380 Glu Leu Ile Ala Ser 460 Met Gin Ile Leu 285 Lys Ser Leu Tyr Ile 365 Met Lys Tyr Leu Val 445 Ile Tyr Ala Thr Ser 525 Asn Leu Ile Gin 350 Val Gin Tyr Lys Gin 430 Glu Met Trp Lys Phe 510 Val Glu Ala Lys Ala 320 Thr Phe 335 Phe Val Arg Ile Lys Leu Lys Gly 400 Lys His 415 Ile Pro Leu Lys Asp Pro His Gin 480 Ile Phe 495 Pro Ala Leu Gin Glu Asn Lys Lys Arg Ala His Ala 540 INFORMATION FOR SEQ ID NO:1047: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: 
linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1047: Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro WO 97/37044 PCT/US97/05223 Leu His Gin Thr Lys Arg Ser Lys Ser 145 Va1 Asn Asn Ser Asn 225 Cys Ser Thr Thr Pro 305 Ser Thr Ser Pro Leu 385 Asn Tyr Lys Gin Lys Va1 Ile Leu Asn Asn Pro 130 Lys Thr Asp Leu Ser 210 Va1 Gly Ile Pro Pro 290 Gin Ser Arg lu Lys 370 Ala Leu Leu %sn Ala Gly Leu Thr Ser Thr Pro 115 Ser Leu Pro Ala Phe 195 Ala Glu Lys Leu Cys 275 Tyr Thr Thr Gin Tyr 355 GIn Tyr Asn Asn His 435 Leu Thr Gly His Leu Gin 100 Ile Ser Gly Ser Asn 180 Va1 Asp Lys Trp Lys 260 Asp Thr Phe Glu Cys 340 Giu Asp Ser Glu Asp 420 Jal Val Phe Vai Ile Glu Ile Pro Ser Ser Lys 165 Pro Ala Ala Gln Val 245 Arg Tyr Lys Glu Lys 325 Tyr Ile Asp Ser Lys 405 Ile Arg I lE Lys Tyr Ser 70 Thr Val Ser Gin Lys 150 Va1 Pro Pro Ser Ala 230 Tyr Val Ser Ile Ala 310 Cys Leu Thr 21n rhr 390 The Ile The Glu Ala Asn 55 Thr Thr Phe Leu Gin 135 Asn Ser Leu Pro Glu 215 Ile Asp Asp Thr Ser 295 Lys Lys Ile Thr Val 375 Arg Met Lys Lys Leu Lys 40 Ile Ala Leu Ser Asn 120 Ser Ser Pro Lys Thr 200 Asn Arg Asp Lys Ala 280 Val Asn Arg Glu 7ln 360 lu Lys Glu 31u G1u 440 Leu 25 Val Ser Ile Ser Ser 105 Ala Pro Lys Thr His 185 Glu Asn Asp Glu Asp 265 Glu His Asn Ala Glu 345 Leu Pro Ser Phe Z Ser 425 Gly 10 Glu Leu Pro Va1 Pro 90 Lys Pro Gin Asn Asn 170 Ser Lys Glu Pro Asn 250 Lys Asn Lys Phe Arg 330 Pro ial rhr 3lu lal 410 3er ial Glu Asp His Tyr 75 Asn Glu Met Asn Ser 155 Glu Ser Thr Ser Asn 235 Leu Glu Lys Thr Ala 315 Ala Leu Lys Phe Ile 395 Glu Glu Cys I IlE Sei Lys Glr Arc Leu Gin Phe 140 Leu Va1 Gin Leu Asn 220 Ile Gin Ile Ser Glu 300 Ile Arg Lys Ala Tyr 380 rhr 1al Tyr ve t Lys Lys Lys 1 Pro Pro Lys Lys 125 Ser Leu Lys Asp Pro 205 Glu Lys Ala Thr Gly 285 Pro Leu Lys Gin Ile 365 Glu Arg Tyr Lys Ala 445 Thr Glu Leu Leu Thr Glu 110 Pro Tyr Gin 
Thr Gin 190 Asn Asn Glu Tyr Thr 270 Lys Leu Gin Asp Ala 350 Tyr Thr Asn Glu Glu 430 Leu Sei Prc Thr Asp Ile Pro Gin Pro Pro Pro 175 Glu Asn Arg Phe Arg 255 Asp Ile Glu Ala Gly 335 Trp Glu Ser lu Gly 415 rrp Glu Pro Arg Leu Glu Pro His Asn Glu Leu 160 Thr Asn Thr Asp Ala 240 Pro Ile Ile Asp Arg 320 Thr Glu Arg Glu Leu 400 His Val Ile Glu Giu Gin Pro Arg Ala Lys Ser Thr Pro Leu Ser Ile Giu Asn Ser 450 455 460 WO 97/37044 PCT/US97/05223 956 Arg Val Val Cys Val Lys Lys Gly Asn Tyr Leu Phe Asn Glu Val 465 470 475 INFORMATION FOR SEQ ID NO:1048: SEQUENCE CHARACTERISTICS: LENGTH: 269 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...269 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1048: Val Lys Val Tyr Pro Gln Val Leu Ser Ile Ala Gly Ser Asp Ser Gly 1 5 10 Gly Gly Ala Gly Ile Gin Ala Asp Leu Lys Ala Phe Gin Thr Phe Gly 25 Val Phe Gly Thr Ser Val Ile Thr Cys Ile Thr Ala Gin Asn Thr Gin 40 Gly Val His Gly Val Tyr Pro Leu Ser Val Glu Ser Val Lys Ala Gin 55 Ile Leu Ala Ile Arg Asp Asp Phe Ser Ile Lys Ala Phe Lys Met Gly 70 75 Ala Leu Cys Asn Ala Gin Ile Ile Glu Cys Val Ala Asn Ala Leu Glu 90 Thr Cys Asp Phe Gly Leu Cys Val Leu Asp Pro Val Met Val Ala Lys 100 105 110 Asn Gly Ala Leu Leu Leu Glu Glu Glu Ala Ile Leu Ser Leu Lys Lys 115 120 125 Arg Leu Leu Pro Lys Thr Asn Leu Leu Thr Pro Asn Leu Pro Glu Val 130 135 140 Tyr Ala Leu Thr Gly Val Gin Ala Arg Asp Asp Lys Ser Ala Ser Lys 145 150 155 160 Ala Met Gly Val Leu Arg Asp Leu Gly Val Lys Asn Ala Val Ile Lys 165 170 175 Gly Gly His Thr Glu His Phe Gin Gly Glu Phe Ser Asn Asp Trp Val 180 185 190 Phe Leu Glu Asp Ala Glu Phe Ile Leu Ser Ala Lys Arg Phe Asn Thr 195 200 205 Lys Asn Thr His Gly Thr Gly Cys Thr Leu Ser Ser Leu Ile Val Gly 210 215 220 Leu Leu Ala Gin Gly Leu Asp Leu Lys Asn Ala Ile Thr Lys Ala Lys 225 230 235 240 Glu Leu Leu Thr Ile Ile Ile Gin Asn Pro Leu Asn Ile Gly His Gly 245 
250 255 His Gly Pro Leu Asn Leu Trp Ser Ile Lys Lys His Val 260 265 WO 97/3 7044 PCTUS97/05223 957 INFORMAkTION FOR SEQ ID NO:1049: SEQUENCE CHARACTERISTICS: LENGTH: 398 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: Misc feature LOCATION 398 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1049: Leu Met Trp Lys Gin Leu Ser Gly Ile Tyr 145 Asp Ile Thr Gly Asn 225 Ala Ile Gly Ala Ile Met Asp Thr Phe Leu Thr Ser Ile Arg Ile Ala Gly Arg Trp Asn Thr 130 Phe Thr Met Asn Arg 210 Phe Phe Ser Gly Pro Gly Trp Phe Ile Pro Thr Ala Asn Thr Thr 195 Asn Asn Leu Asn Leu Leu Arg Giu Trp Tyr 100 Ser Thr Pro Arg Thr 180 Tyr Gly Trp 2 iy G1u 260 Leu Phe Tyr Val1 Trp Glu Asn Tyr Lys Asn 165 Phe Ser Vali Ser Ser 245 Ile Asn Tyr Giu Tyr 70 Trp Phe Ser Ser Thr 150 Phe Pro Leu Thr Ile 230 His Ser *Gin Asp Ala 55 Tyr Ser Phe Asn Tyr 135 Tyr Giu Leu Giu Leu 215 Gly Thr Val Arg Thr 40 Asn Gin Ser Ala Asp i2 0 Lys Asn Asn Tyr Asp 200 Asn Phe Me t Thr Ala 25 Ile Ile Pro Phe Thr 105 Phe Gly Ala Val1 Tyr 185 Ser Ile Tyr Pro Thr 265 10 Ile Thr Asp Tyr Gly 90 Val Ile Leu Pro Gly 170 Arg Thr Arg Asn Arg 250 Arg Met Lys Phe Lys 75 Arg Pro Asn Asp Giy 155 Phe Gly Pro Gin Thr 235 Giy His Gin Asp Met Thr Giy Tyr Tyr Ala 140 Phe Arg Trp His Val1 220 Phe Asn Ala Leu Gly Val Ser Giu Leu Leu Gly 125 Gin Lys Ser Tyr Gly 205 Phe Giy Asn Gly Val1 Leu Phe Gly Thr Ala Lys 110 Trp Phe Leu Gin Asn 190 Sen Tnp Asn Thr Met Leu Met Gly Ser Gin Phe Lys His Phe Val1 Ser 175 Pro Leu Trp Sen Ser 255 Ile Leu Tnp Phe Asn Arg Asn Giy Gly Tyr Tyr 160 Met Giu Leu Asp Asp 240 Tyr Giy Tyr Asp Phe Trp Asp Asn Thr Ala Tyr Asp Gly Leu Ala Asp Ala Ile z 280 285 WO 97/37044 PCT/US97/05223 958 Thr Asn Ala Asn Thr Phe Thr Phe Tyr Thr Ser Val Gly Gly Ile His 290 295 300 Lys Arg Phe Ala Trp His Val Phe Gly Arg Val Ser His Ala Asn Lys 305 310 315 320 Asn Ala Leu Gly Gin Val Gly Arg Ala Asn Glu Tyr Ser Leu Gin Phe 325 330 
335 Asn Ala Ser Tyr Ala Phe Thr Glu Ser Val Leu Leu Asn Phe Arg Ile 340 345 350 Thr Tyr Tyr Gly Ala Arg Ile Asn Lys Gly Tyr Gin Ala Gly Tyr Phe 355 360 365 Gly Ala Pro Lys Phe Asn Asn Pro Asp Gly Asp Phe Ser Ala Asn Tyr 370 375 380 Gin Asp Arg Ser Tyr Met Met Thr Asn Leu Thr Leu Lys Phe 385 390 395 INFORMATION FOR SEQ ID NO:1050: SEQUENCE CHARACTERISTICS: LENGTH: 444 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...444 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1050: Leu Ser Asn Ala Phe Ser Gin Tyr Leu Tyr Ser Leu Leu Gly Ala Tyr 1 5 10 Pro Thr Lys Leu Asn Gly Asn Asp Val Ser Ala Asn Ala Leu Leu Ser 25 Gly Ala Val Gly Ser Gly Thr Cys Ala Ala Ala Gly Thr Ala Gly Gly 40 Thr Thr Leu Asn Thr Gin Ser Ala Cys Thr Ala Ala Gly Tyr Tyr Trp 55 Leu Pro Ser Leu Thr Asp Arg Ile Leu Ser Thr Ile Gly Ser Gin Thr 70 75 Asn Tyr Gly Thr Asn Thr Asn Phe Pro Asn Met Gin Gin Gin Leu Thr 90 Tyr Leu Asn Ala Gly Asn Val Phe Phe Asn Ala Met Asn Lys Ala Leu 100 105 110 Glu Lys Asn Gly Thr Ala Thr Ala Asn Ser Thr Ser Ser Thr Ser Gly 115 120 125 Ala Thr Gly Ser Asp Gly Gin Thr Tyr Ser Gin Gin Ala Ile Gin Tyr 130 135 140 Leu Gin Gly Gin Gin Asn Ile Leu Asn Asn Ala Ala Asn Leu Leu Lys 145 150 155 160 Gin Asp Glu Leu Leu Leu Glu Ala Phe Asn Ser Ala Val Ala Ala Asn 165 170 175 WO 97/37044 PCT/US97/05223 Ile Gly Thr Ala 225 Asn Arg Thr Phe Tyr 305 Leu Arg Gin Val Asp 385 Arg Tyr Pro Gly Ile Ile 210 Asn Val Ser Asn Gly 290 Asn Tyr Ser Leu Lys 370 Phe His Asn Tyr Asn Ile 195 Ser Ala Gin Met Ile 275 Lys Gly Thr Tyr Ala 355 Leu Gly Asn Thr Ser 435 Lys 180 Asp Gly Val Val Pro 260 Leu Lys Ala Tyr Gin 340 Gly His Met Gln Tyr 420 Jal Glu Phe Asn Ser Ala Ala Phe Thr Gly Leu Val Gin 185 Gin Ser Gin Thr 245 Tyr Asn Arg Ser Gly 325 Asn Glu Gly Arg His 405 Tyr Tyr Ser Ala Gly 230 Leu Leu Gly Asn Val 310 Val Arg Thr Lys Met 390 Thr Lys Trp Gin Val 215 Arg Asp Pro Phe Ile 
295 Gly Gly Ser Phe Ile 375 Asn Val Ser Ser Leu 200 Asn Ala Lys Gin Tyr 280 Gly Phe Thr Val Gin 360 Asn Phe Glu Ala Tyr 440 Val Asn Ser Ile Phe 265 Thr Leu Arg Asp Asp 345 Ser Asn Gly Phe Gly 425 Gly Tyr Ala Gin Asn 250 Arg Lys Arg Ser Val 330 Met Thr Thr Lys Gly 410 Thr Tyr Asn Gly Leu 235 Ala Ala Val Tyr Thr 315 Leu Gly Leu His Leu 395 Val Thr Ser Glu Ile 220 Pro Leu Gly Gly Tyr 300 Gin Tyr Phe Arg Phe 380 Asp Val Val Phe Leu 205 Asn Asn Asn Asn Tyr 285 Gly Asn Asn Phe Asp 365 Gin Gly Val Lys 190 Thr Ser Ala Asn Ser 270 Lys Phe Asn Ile Ser 350 Asp Phe Lys Pro Tyr 430 Lys Asn Leu Gin 255 Arg Gin Phe Val Phe 335 Gly Pro Leu Ser Thr 415 Phe Asn Gin Tyr 240 Val Ala Phe Ser Gly 320 Ser Ile Asn Phe Asn 400 Ile Arg INFORMATION FOR SEQ ID NO:1051: SEQUENCE CHARACTERISTICS: LENGTH: 133 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...133 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1051: Met Glu Leu Ile Lys Lys Leu Glu Lys Glu Ser Glu Val Leu Lys Lys 1 5 10 WO 97/37044 PCT/US97/05223 960 Asp Leu Gin Gin His Ser Asn Glu Leu Phe Lys Met Leu Ile Ile Asp 25 Asn Glu Asp Leu Phe Lys Glu Gin Phe Glu Ile Met Phe Lys Ala Trp 40 Val Glu Ile Val Lys Met Met Phe Glu Leu Thr Lys Lys Thr Lys Phe 55 Asp Gly Glu Met Ile Gly Tyr Thr Glu Glu Leu Leu Thr Phe Leu Val 70 75 Arg Asp Phe Phe Asn Gly Ile Phe Lys Ser Lys Val Ile Pro Lys Met 90 Pro Ile Phe Cys Gly Asp Val Lys Cys Glu Asp Phe Asn Ala Leu Arg 100 105 110 Ser Leu Val Tyr Leu Ser Val Leu Glu Leu Glu Glu Thr Ile Asn Pro 115 120 125 Asn Lys Ile Pro Phe 130 INFORMATION FOR SEQ ID NO:1052: SEQUENCE CHARACTERISTICS: LENGTH: 696 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...696 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1052: Met Lys Ile Lys Lys Ser Leu 
Phe Ala Leu Ser Phe Ser Leu Met Ala 1 5 10 Ser Leu Ser Arg Ala Glu Asp Asp Gly Phe Tyr Met Ser Val Gly Tyr 25 Gin Ile Gly Glu Ala Val Gin Lys Val Lys Asn Thr Gly Ala Leu Gin 40 Asn Leu Ala Asp Arg Tyr Asp Asn Leu Ser Asn Leu Leu Asn Gin Tyr 55 Asn Tyr Leu Asn Ser Leu Val Asn Leu Ala Ser Thr Pro Ser Ala Ile 70 75 Thr Gly Ala Ile Asp Asn Leu Ser Ser Ser Ala Ile Asn Leu Thr Ser 90 Ala Thr Thr Thr Ser Pro Ala Tyr Gin Ala Val Ala Leu Ala Leu Asn 100 105 110 Ala Ala Val Gly Met Trp Gin Val Ile Ala Phe Gly Ile Ser Cys Gly 115 120 125 Pro Gly Pro Asn Leu Gly Pro Glu His Leu Glu Asn Gly Gly Val Arg 130 135 140 Ser Phe Asp Asn Thr Pro Asn Tyr Ser Tyr Asn Thr Gly Ser Gly Thr 145 150 155 160 WO 97/37044 PCTIUS97/05223 Thr Leu Gin Ser Pro 225 Gly Ala Thr Gly Ile 305 Gin Asn Arg Asp Cys 385 Gly Ala Ile Ala Asn 465 Thr Gin Gin Lys Phe 545 Ala Phe Ser Asn Thr Ser Thr Ser 210 Thr Gly Glu Gin Lys 290 Asn Leu Pro Ala Asn 370 Thr Cys Tyr Leu lie 450 Met Tyr Glu Asn Gin 530 Phe Ser Ile Val Ser Thr Ser Ala 195 Lys Thr Ser Asn Asn 275 Thr Glu Asn Tyr Asn 355 Phe Ala Ala Tyr Asn 435 Asn Thr Ser Leu Asn 515 Phe Asp Asp Asn Gly 595 Gin Thr Ser 180 Leu Asn Glu Ser Leu 260 Pro Gly Met Ala Thr 340 Ala His Gly Phe Gly 420 Phe Ser His Leu Gly 500 Asn Phe Tyr Val Asp 580 Leu Gin Cys 165 Glu Asn Met Tyr Ile 245 Leu His Asn Ile Asn 325 Ser Gin Ser Ser Vai 405 Asn Lys Gly Ala Asp 485 Lys Gly Gly Asn Trp 565 Lys Phe Va1 Asn Tyr Gin Va1 Thr 230 Pro Gin Val Va1 Lys 310 Glu Lys Ala Ile Ala 390 Lys Gin Glu Ile Thr 470 Thr Asn Ala Lys His 550 Thr Asn Gly Asn Gly Gin Asn Va1 215 Tyr Ile Gin Asn Met 295 Asn Asn Asp Giu Gin 375 Gly Glu Val Ala Ser 455 Gin Ser Pro Met Lys 535 Ala Tyr Thr Gly Leu Ala Val Gin 200 Asn Pro Gin Ala Gly 280 Asp Ala Thr Thr Ile 360 Gly Val Thr Asn Leu 440 Asn Asn Lys Phe Asn 520 Arg Tyr Gly Asn Phe 600 Thr Ser Leu 185 Gly Ile Asp Leu Ala 265 Gly Ile Gin Gin Gin 345 Leu Pro Ile Leu Gin 425 Ser Leu Pro Tyr Arg 505 Gly Asn Ile Va1 Phe 585 Ala Met Asn 170 Asn Gly Asn Gly Lys 250 Thr Gly 
Phe Ala Ile 330 Phe Ser Ile Asn Asn 410 Asp Thr Pro Asn Asn 490 Arg Ile Trp Lys Gly 570 Leu Leu Met Val Thr Gly Gin Asn 235 Ile Ile Gly Gly Va1 315 Thr Ala Leu Gin Asp 395 Ser Arg Leu Asn Ser 475 Gin Ile Gly Gly Ser 555 Met Gly Ala Asn Gly Ala Met Thr 220 Gly Ser Ile Ala Asp 300 Leu Gin Gin Ala Gin 380 Asn Leu Ala Gly Ala 460 Pro Leu Gly Val Leu 540 Asn Asp Lys Gly Gly Pro Tyr Pro 205 Phe Asn Ser Asn Trp 285 Ser Glu Pro Glu Gin 365 Asp Thr Glu Leu Asn 445 Lys Giu Gin Val Gin 525 Arg Phe Ala Asn Thr 605 Ile Asn Gin 190 Ala Thr Tyr Val Val 270 Gly Phe Lys Asp Met 350 Gin Leu Tyr Gin Ser 430 Asp Ser Gly Thr Ile 510 Ala Tyr Phe Leu Asn 590 Ser Tyr Gly 175 Thr Lev Lys Tyr Asn 255 Leu Phe Asn Thr Asn 335 Leu Val Glu Gly His 415 Gin Ser Leu Le Val 495 Asn Gly Tyr Asn Tyr 575 Lys Trp Asn Ile Ile Asn Asn Ser 240 Asp Thr Gly Ala Gin 320 Phe Asn Ala Glu Ser 400 Thr Thr Lys Gin Leu 480 Ala Tyr Tyr Gly Ser 560 Asn Leu Leu Ala WO 97/37044 PCT/US97/05223 610 615 620 Asn Val Ser Ala Ser Asn Phe Gin Phe Leu Phe Asp Leu Gly Leu Arg 625 630 635 640 Met Asn Leu Ala Arg Pro Lys Lys Lys Asp Ser Asp His Ala Ala Gin 645 650 655 His Gly Met Glu Leu Gly Val Lys Ile Pro Thr Ile Asn Thr Asp Tyr 660 665 670 Tyr Ser Phe Met Gly Ala Glu Leu Lys Tyr Arg Arg Leu Tyr Ser Val 675 680 685 Tyr Leu Asn Tyr Val Phe Ala Tyr 690 695 INFORMATION FOR SEQ ID NO:1053: SEQUENCE CHARACTERISTICS: LENGTH: 371 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...371 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1053: Met 1 Ser Ser Pro Lys Asn Thr Gin Pro Pro 145 Gin Ile Gin Phe Gin Lys Thr Leu Ser Ser Leu Ser Leu Leu Val Phe Ile Thr Pro Asn Lys 130 Leu Ser Ser Phe Gly Leu Tyr Phe Thr Ile 115 Leu Glu Gin Asn 5 Leu Ser Phe Glu Asn Gin Lys Leu Asn Tyr Glu Lys 100 Glu Lys Ala Gin Leu Val Asn Ser 165 Ser Leu Phe Tyr Glu Asn 70 Ile Gin Ile Ala Glu 150 Met Asn Ser Ser Arg 55 Gin Asn Ala 
Val Leu 135 Asn Leu Ala Ile Ile 40 Ile Val Asn Glu Met 120 Glu Leu Ser Leu Ala 25 Ser Gin Lys Ala Thr 105 Leu Lys Lys Ser Asp 10 Glu His Thr Asn Leu 90 Tyr Ser Met Asn Leu 170 Pro Glu Ala Ile Glu 75 Lys Tyr Gly Gin Leu 155 Ser Ser Asn Val Ser Ile Asn Leu Gly Glu 140 Glu Ser Ser Phe Leu Gly Ala Glu His Asn Ala Thr Asn Asn Ala Gin Ser 110 Val Ala 125 Pro Ile Leu Gin Gin Ile Tyr Ser Ser Leu Tyr Ala Asn Asn Gin Asn Met Pro Lys Leu Thr Leu Ser Asn Thr Asn Phe Ser 160 Ala Gin 175 Lys Asn 180 185 190 Val Ser Ser Met Tyr Gly Val Gly Leu Ser Val Gly Tyr Lys His Phe WO 97/37044 PCTIUS97/05223 Phe Thr 210 Tyr Gly 225 Gly Lys Asn Phe Gly Phe Trp Val 290 Ala Lys 305 Arg Val Pro Leu Ala Ser Tyr Ser 370 195 Lys Tyr Met Ile Ala 275 Ser Met Asn Ala Leu 355 Phe Lys Thr Asn Asp 260 Leu Gin His Val Val 340 Phe Lys Asn Asn 245 Asn Ala Met Thr Asp 325 Asn Phe Asn Phe 230 His Ala Gly Asp Ser 310 Arg Ser Lys 200 Gin Gly 215 Gly Phe Leu Tyr Gin Lys Ser Ser 280 Phe Ile 295 Phe Phe His Asn Phe Tyr Arg Leu 360 Phe Val Gly His 265 Trp Asn Gin Gly Glu 345 Val Arg Gly Leu 250 Ser Val Asn Ile Phe 330 Thr Met Tyr Asn 235 Gly Ser Gly Tyr Pro 315 Glu His Phe Tyr 220 Gly Ile Val Ser Leu 300 Leu Met Gly Asn INFORMATION FOR SEQ ID NO:1054: SEQUENCE CHARACTERISTICS: LENGTH: 484 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...484 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1054: Met Ala Lys Ile Thr Thr Val Ile Asp Ile Gly Ser 1 5 10 Leu Ala Val Phe Lys Lys Thr Ser Gin Phe Gly Phe 25 Glu Thr Lys Ser Lys Val Arg Ile Ser Glu Gly Cys 40 Gly Val Leu Gin Glu Ile Pro Met Gin Arg Ala Val 55 Glu Phe Lys Glu Ile Ala Leu Lys Tyr Lys Ser Lys 70 75 Val Ala Thr Ser Ala Val Arg Asp Ala Pro Asn Arg 90 Ala Arg Val Lys Lys Ala Cys Gly Leu Gin Ile Lys 205 Leu Phe Asp Gly Gly 285 Thr Asn Gly Lys Val 365 Asn Tyr Tyr Lys Lys Leu Ile Phe Asp Tyr Phe 270 Leu Asp Phe Leu Gly 350 Ser Ser 
Leu Ala Ala Ile Glu Ile Tyr Gly Leu 255 Tyr Gly Tyr Gly Lys 335 Leu Tyr Val Leu Phe Leu Leu Phe Asp Asp Leu 240 Phe Val Met Arg Val 320 Ile Asn Val Arg Phe Lys Ser Cys Val Gly 100 105 110 Gln Lys Glu Ala Leu Tyr Gly Gly Ile Ala Cys Ala Asn Leu Leu His 115 120 125 Lys Asn Ser Gly Ile Thr Ile Asp Ile Gly Gly Gly Ser Thr Glu Cys 130 135 140 Ala Leu Ile Glu Lys Gly Lys Ile Lys Asp Leu Ile Ser Leu Asp Val 145 150 155 160 Gly Thr Ile Arg Ile Lys Glu Met Phe Leu Asp Lys Asp Leu Asp Val 165 170 175 Lys Leu Ala Lys Ala Phe Ile Gln Lys Glu Val Ser Lys Leu Pro Phe 180 185 190 Lys His Lys Asn Ala Phe Gly Val Gly Gly Thr Ile Arg Ala Leu Ser 195 200 205 Lys Val Leu Met Lys Arg Phe Asp Tyr Pro Ile Asp Ser Leu His Gly 210 215 220 Tyr Glu Ile Asp Ala His Lys Asn Leu Ala Phe Ile Glu Lys Ile Val 225 230 235 240 Met Leu Lys Glu Asp Lys Leu Arg Leu Leu Gly Val Asn Glu Glu Arg 245 250 255 Leu Asp Ser Ile Arg Ser Gly Ala Leu Ile Leu Ser Val Val Leu Glu 260 265 270 His Leu Lys Thr Ser Leu Met Ile Thr Ser Gly Val Gly Val Arg Glu 275 280 285 Gly Val Phe Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro 290 295 300 Pro Asn Ile Asn Pro Ser Leu Ile Ser Leu Lys Asp Arg Phe Leu Pro 305 310 315 320 His Glu Lys His Ser Gln Lys Val Lys Lys Glu Cys Val Lys Leu Phe 325 330 335 Glu Val Leu Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His 340 345 350 Leu Lys Ile Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val 355 360 365 Tyr Leu Ala His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser 370 375 380 Tyr Gly Phe Ser His Gln Asp Arg Ala Ile Ile Cys Leu Leu Ala Gln 385 390 395 400 Phe Ser His Lys Lys Ile Pro Lys Asp Asn Ala Ile Ala His Met Ser 405 410 415 Ala Met Met Pro Ser Leu Leu Thr Leu Gln Trp Leu Ser Phe Ile Leu 420 425 430 Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr 435 440 445 Thr Leu Glu Lys Asn Lys Leu Val Ile His Ser Asn Asp Ala Leu Tyr 450 455 460 Leu Ala Lys Glu Met Leu Pro Lys Leu Ile Lys Pro Ile Pro Leu Thr 465 470 475 480 Ile Glu Phe
Ala INFORMATION FOR SEQ ID NO:1055: SEQUENCE CHARACTERISTICS: LENGTH: 253 amino acids TYPE: amino acid TOPOLOGY: linear WO 97/37044 WO 9737044PCTIUS97/05223 965 (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc -feature LOCATION .253 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1055: Lys Gin Val Lys Ser Arg Met Lys Giu Lys Pro Phe 1 Met Ser Phe Leu Leu Asp Asn Glu Leu 145 Ala Ile Val Lys Asp 225 Tyr Ala Lys Arg Met Pro Lys Glu Sen 130 Gin Gin Vali Sen Arg 210 Tyr Ser Phe Leu Leu Sen Leu His Ala 115 Ile Ser Asn Asn Ile 195 Tyr Ala Ile Leu Ser Glu Phe Gin Tyr 100 Leu Asn Ser Ser Ile 180 Ala Lys Ser Thn 5 Giu Gly Arg Ile Lys Ala Ala Ang Ser Ile 165 Ala Ala Ile Met Giu 245 Giu Phe As n Ala 70 Thr Ile Ang Ile Lys 150 Tyr Ile Lys Vali Pro 230 Ile Leu Ser Arg 55 Leu Giu Ile Sen Asp 135 Val1 Val Tyr Leu Leu 215 Lys Ala Leu Val1 40 Leu Val1 His Gin Leu 120 Asp Trp Gin Gin Leu 200 Sen Asn Pro Ser 25 Asn Lys Leu His Arg 105 Ile Lys Gin Ser Gin 185 Asn Tyr Pro Thr 10 His Asp Ile Ala Phe 90 Ala Gly Sen Arg His 170 Asp Giu Leu Thr Asn 250 Gin Leu Ala Ile 75 Val1 Asp Ala Ang Phe 155 Leu Asn Asn Phe Gly 235 Arg Glu Asp Tyr Val1 Asp Lys Tyr Tyr 140 Giu Glu Asn Lys Asp 220 Phe Asn Asn Lys Met Arg Leu Phe Sen Val 125 Giu Asp Ang Pro Leu 205 Thr Lys Asp Ser His Gin Leu Ile Leu Ile 110 Leu Leu Leu Giu Ile 190 Val Pro Ile Glu Leu Ser Leu Ser Asn Ser Asn Val Ile Val 175 Ala Tyr Asp Thr Gin Glu Val1 Gly Ile Gin Ser Ang Arg Lys 160 His Ser Glu Phe Arg 240 INFORMATION FOR SEQ ID NO:1056: SEQUENCE CHARACTERISTICS: LENGTH: 479 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES WO 97/37044 PCTIUS97/05223 966 (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...479 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1056: Met Lys Lys Ser Leu Cys Leu Ser Phe Phe Leu Thr Phe Ser Asn Pro 1 5 10 Leu Gin Ala Leu Val Ile Glu Leu Leu Glu Glu Ile 
Lys Thr Ser Pro 25 His Lys Gly Thr Phe Lys Ala Lys Val Leu Asp Ser Lys Glu Pro Arg 40 Gin Val Leu Gly Val Tyr Asn Ile Ser Pro His Lys Lys Leu Thr Leu 55 Thr Ile Thr His Ile Ser Thr Ala Ile Val Tyr Gin Pro Leu Asp Glu 70 75 Lys Leu Ser Leu Glu Thr Thr Leu Ser Pro Asn Arg Pro Thr Ile Pro 90 Arg Asn Thr Gin Ile Val Phe Ser Ser Lys Glu Leu Lys Glu Pro His 100 105 110 Ser Asn Pro Ile Pro Ser Leu Asn Ala Pro Met Gin Lys Pro Gin Asn 115 120 125 Lys Pro Ser Ser Ser Gin Gin Ser Pro Gin Asn Phe Ser Tyr Pro Glu 130 135 140 Ser Lys Leu Gly Ser Lys Asn Ser Lys Asn Ser Leu Leu Gin Pro Leu 145 150 155 160 Val Thr Pro Ser Lys Val Ser Pro Thr Asn Glu Val Lys Thr Pro Thr 165 170 175 Asn Asp Ala Asn Pro Pro Leu Lys His Ser Ser Gin Asp Gin Glu Asn 180 185 190 Asn Leu Phe Val Ala Pro Pro Thr Glu Lys Thr Leu Pro Asn Asn Thr 195 200 205 Ser Ser Ala Asp Ala Ser Glu Asn Asn Glu Ser Asn Glu Asn Arg Asp 210 215 220 Asn Val Glu Lys Gin Ala Ile Arg Asp Pro Asn Ile Lys Glu Phe Ala 225 230 235 240 Cys Gly Lys Trp Val Tyr Asp Asp Glu Asn Leu Gin Ala Tyr Arg Pro 245 250 255 Ser Ile Leu Lys Arg Val Asp Lys Asp Lys Glu Ile Thr Thr Asp Ile 260 265 270 Thr Pro Cys Asp Tyr Ser Thr Ala Glu Asn Lys Ser Gly Lys Ile Ile 275 280 285 Thr Pro Tyr Thr Lys Ile Ser Val His Lys Thr Glu Pro Leu Glu Asp 290 295 300 Pro Gin Thr Phe Glu Ala Lys Asn Asn Phe Ala Ile Leu Gin Ala Arg 305 310 315 320 Ser Ser Thr Glu Lys Cys Lys Arg Ala Arg Ala Arg Lys Asp Gly Thr 325 330 335 Thr Arg Gin Cys Tyr Leu Ile Glu Glu Pro Leu Lys Gin Ala Trp Glu 340 345 350 Ser Glu Tyr Glu Ile Thr Thr Gin Leu Val Lys Ala Ile Tyr Glu Arg 355 360 365 Pro Lys Gin Asp Asp Gin Val Glu Pro Thr Phe Tyr Glu Thr Ser Glu 370 375 380 WO 97/37044 WO 9737044PCT/US97/05223 Leu 385 Asn Tyr Lys Giu Arg 465 (2) Ala Tyr Ser Ser Thr Arg Lys Ser Glu Ile Th 390 395 Leu Asn Glu Lys Phe Met Giu Phe Val Giu Va 405 410 Leu Asn Asp Ile Ile Lys Giu Ser Ser Giu Ty 420 425 Asn His Val Arg Phe Lys Glu Gly Val Cys Me 435 440 Glu Gin Pro Arg Ala Lys Ser Thr Pro Leu Se 450 455 
46 Val Val Cys Vai Lys Lys Gly Asn Tyr Leu Ph 470 475 INFORMATION FOR SEQ ID NO:i057: SEQUENCE CHARACTERISTICS: LENGTH: 567 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1. 567 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1057: 1 t r 0 e Arg Tyr Lys Ala 445 Ile Asn Lys Phe Tyr Pro Giu Asp Lys Asn 125 Gin Leu Ala Asn Asn Glu Giu 430 Leu Glu Giu Lys Ile Giu Phe Pro Gin Asn Thr Asn Sen Ile Asn 190 Glu Gly 415 Trp Glu Asn Val1 Leu Ser Ala Leu Leu Lys Gly Lys Thr Phe Lys 175 Pro Leu 400 His Val1 Ile Ser Thr Gin Arg Tyn Lys Thr Asp Ala Pro Sen 160 Ile Leu Met Ile Phe Gly Asp Phe Lys Tyr Gin Lys Sen Vai 1 Ala Asn Leu Phe Giu Tyr Thn Lys Phe 145 Pro Ile Thr Ang Al a Glu His Phe Tyn Pro 130 Lys Glu Thr Asn Gly Phe Gin Ala Lys Gln 115 Lys Ala Leu Lys Leu Lys Leu Phe Phe Gin 100 Val Arg Phe Phe Pro 180 5 Asn Gly Asp Leu Tyr Phe Asn Val1 Ile Phe 165 Met Glu Tyr Giu Glu 70 Pro Lys Leu Phe Giu 150 Giu Lys Leu Phe Asn 55 Arg Lys Ala Thr Lys 135 Asn Leu Gly Lys Val 40 Phe Lys Ile Val1 Met 120 Giu Giu Giu Thr Asn 25 Gly Gin Lys His Lys 105 Asp Val1 Phe Phe Ile 185 10 Ala Tyr Sen Tyr Ser 90 Glu Leu Ile Gly Leu 170 Ala Leu Leu Gin Pro 75 Ser His Leu His Ser 155 Asp Ang Asp Leu Thr Leu Leu Leu Leu Asn 140 Val1 Thr Sen WO 97/37044 PCTIUS97/05223 lie Ser Leu 225 Ser Pro Gly Ser Val 305 Lys Thr Ser Arg His 385 Asp Val Glu Lys Gin 465 Phe Ile Ala Glu Tyr 545 Tyr Asp Glu 210 Ala Leu Leu Ser Leu 290 Gly Arg Tyr Phe Val 370 Lys Glu Leu Pro His 450 Asn Tyr Leu Leu His 530 Cys Pro Glu 195 Asn Leu Pro Lys Val 275 Glu Gly Ala Lys Phe 355 Ile Glu Asn Leu Leu 435 Asp Ala Asn Glu Thr 515 Ala Ile Met Lys Val Lys Ser Thr 260 Thr Lys Lys His Ser 340 Val Lys Arg Leu Asn 420 Lys Asp Arg Gin Ile 500 Gly Pro Asn Glu Asn Met Asn Val 245 Ser Gly Arg Lys Glu 325 Lys Met Arg Leu Leu 405 Lys Ser Phe Ala Asp 485 His Thr Leu Ala Gin 565 Arg lie Ser 230 Tyr Leu Cys Pro Ala 
310 Asp Ala Pro Asp Met 390 Asp Arg Leu Leu Leu 470 Leu Asn Gly Lys Leu 550 Lys Leu Val 215 Val Gin Phe Pro Arg 295 Leu Phe Ser Lys Gin 375 His Phe Gly Glu Tyr 455 Ile Glu Arg Val Leu 535 Tyr Ser Phe 200 Asp Lys Met Glu Lys 280 Gly Phe Leu Lys Ile 360 Lys Ser Glu Lys Ile 440 His Lys Leu Leu Val 520 Gin Leu Leu Val Ile Ile 265 Ile Val Ser His Glu 345 Glu Leu Ala Leu Leu 425 Arg Lys Lys Thr Leu 505 Gly Asp Gin Leu Asn Ser 250 Phe Lys Tyr Val Leu 330 Tyr Phe Glu Gin Glu 410 Ile Leu Thr Gly Glu 490 Thr Leu Leu Asn Arg Gin 235 Glu Lys Thr Cys Pro 315 Gly Glu Glu Ile Tyr 395 Lys Lys Ser Ala Val 475 Gly Pro Leu Gin Asp Asp 205 Asn Asp 220 Leu Phe Ile Glu Ala Leu Met Gin 285 Gly Ala 300 Ile Arg Val Gly Glu Ser Ile Val 365 Asn Asn 380 Phe Asn Glu Gly Glu Tyr Glu Ala 445 Tyr Ala 460 Ile Phe Ala Arg Tyr Phe Lys Lys 525 Arg Ala 540 Lys Leu Glu Ala Phe 270 lie Ile Thr Ser Phe 350 Glu Lys Phe Val Arg 430 Pro Pro Asp Ser Ser 510 Gly Ala Asn Ser Ile Gin 255 Pro Ile Gly Leu Gly 335 Leu Thr Asn Lys Leu 415 Thr Ile Phe Glu Asn 495 Val Leu Lys Arg Arg Ile 240 Leu Cys Glu Met Glu 320 Val Lys Met Ala Tyr 400 Arg Leu Asp Tyr Ile 480 Leu Gly Val Ile Gly 560 Gly Leu Val Glu Val Gly Ile Ile 555 INFORMATION FOR SEQ ID NO:1058: SEQUENCE CHARACTERISTICS: LENGTH: 374 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 PCT/US97/05223 969 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...374 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1058: Met Lys Ile Ser Val Ser Lys Asn Asp Leu Glu Asn Thr Leu Arg Tyr 1 5 10 Leu Gin Ala Phe Leu Asp Lys Lys Asp Ala Ser Ser Ile Ala Ser His 25 Ile His Leu Glu Val Ile Lys Glu Lys Leu Phe Leu Lys Ala Ser Asp 40 Ser Asp Ile Gly Leu Lys Ser Tyr Ile Ser Thr Gin Ser Thr Asp Lys 55 Glu Gly Val Gly Thr Ile Asn Gly Lys Lys Phe Leu Asp Ile Ile Ser 70 75 Cys Leu Lys Asp Ser Asn Ile Val Leu Glu Thr Lys Asp Asp Ser Leu 90 Val Ile Lys Gin Asn Lys Ser Ser Phe Lys Leu Pro Met 
Phe Asp Ala 100 105 110 Asp Glu Phe Pro Glu Phe Pro Val Ile Asp Pro Lys Val Ser Leu Glu 115 120 125 Ile Asn Ala Pro Phe Leu Val Asp Ala Phe Lys Lys Ile Ala Pro Val 130 135 140 Ile Glu Gln Thr Ser His Lys Arg Glu Leu Ala Gly Val Leu Met Gln 145 150 155 160 Phe Asn Gln Lys His Gln Thr Leu Ser Val Val Gly Thr Asp Thr Lys 165 170 175 Arg Leu Ser Tyr Thr Gln Leu Glu Lys Ile Ser Ile His Ser Thr Glu 180 185 190 Glu Asp Ile Ser Cys Ile Leu Pro Lys Arg Ala Leu Leu Glu Ile Leu 195 200 205 Lys Leu Phe Tyr Glu Asn Phe Ser Phe Lys Ser Asp Gly Met Leu Ala 210 215 220 Val Val Glu Asn Glu Thr His Ala Phe Phe Thr Lys Leu Ile Asp Gly 225 230 235 240 Asn Tyr Pro Asp Tyr Gln Lys Ile Leu Pro Lys Glu Tyr Thr Ser Ser 245 250 255 Phe Thr Leu Gly Lys Glu Glu Phe Lys Glu Gly Ile Lys Leu Cys Ser 260 265 270 Ser Leu Ser Ser Thr Ile Lys Leu Thr Leu Glu Lys Asn Asn Ala Leu 275 280 285 Phe Glu Ser Leu Asp Ser Glu His Ser Glu Thr Ala Lys Thr Ser Val 290 295 300 Glu Ile Glu Lys Gly Leu Asp Ile Glu Lys Ala Phe His Leu Gly Val 305 310 315 320 Asn Ala Lys Phe Phe Leu Glu Ala Leu Asn Ala Leu Gly Thr Thr Gln 325 330 335 Phe Val Leu Lys Cys Asn Glu Pro Ser Ser Pro Phe Leu Ile Gln Glu 340 345 350 Pro Leu Asp Glu Lys Gln Ser His Leu Asn Ala Lys Ile Ser Thr Leu 355 360 365 Met Met Pro Ile Thr Leu 370
INFORMATION FOR SEQ ID NO:1059: SEQUENCE CHARACTERISTICS: LENGTH: 368 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...368 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1059:
Met Val Val Leu Gly Ser Thr Gly Ser Ile Gly Lys Asn Ala Leu Lys 1 5 10 Ile Ala Lys Lys Phe Gly Val Lys Ile Glu Ala Leu Ser Cys Gly Lys 25 Asn Ile Ala Leu Ile Asn Glu Gln Ile Lys Val Phe Lys Pro Lys Lys 40 Val Ala Ile Leu Asp Pro Asn Asp Leu Asn Asn Leu Glu Pro Leu Gly 55 Ala Glu Val Phe Val Gly Leu Asp Gly Ile Asp Ala Met Ile Glu Glu 70 75 Cys Val Ser Asn Leu Val Ile
Asn Ala Ile Val Gly Val Ala Gly Leu 90 Lys Ala Ser Phe Lys Ser Leu Gin Arg Asn Lys Lys Leu Ala Leu Ala 100 105 110 Asn Lys Glu Ser Leu Val Ser Ala Gly His Leu Leu Asp Ile Ser Gin 115 120 125 Ile Thr Pro Val Asp Ser Glu His Phe Gly Leu Trp Ala Leu Leu Gin 130 135 140 Asn Lys Thr Leu Lys Pro Lys Ser Leu Ile Ile Ser Ala Ser Gly Gly 145 150 155 160 Ala Phe Arg Asp Thr Pro Leu Asp Leu Ile Ala Ile Gin Asn Ala Gin 165 170 175 Asn Ala Leu Lys His Pro Asn Trp Ser Met Gly Asp Lys Ile Thr Ile 180 185 190 Asp Ser Ala Ser Met Val Asn Lys Leu Phe Glu Ile Leu Glu Thr Tyr 195 200 205 Trp Leu Phe Gly Ala Ser Leu Lys Ile Asp Ala Leu Ile Glu Arg Ser 210 215 220 Ser Ile Val His Ala Leu Val Glu Phe Glu Asp Asn Ser Val Ile Ala 225 230 235 240 His Leu Ala Ser Ala Asp Met Gin Leu Pro Ile Ser Tyr Ala Ile Asn 245 250 255 Pro Lys Leu Ala Ser Leu Ser Ala Ser Ile Lys Pro Leu Asp Leu Tyr WO 97/37044 PCT/US97/05223 260 265 Ala Leu Ser Ala Ile Lys Phe Glu Pro Ile Ser Val 275 280 Leu Trp Arg Tyr Lys Asp Leu Leu Leu Glu Asn Pro 290 295 300 Val Leu Asn Ala Ser Asn Glu Val Ala Met Lys Lys 305 310 315 Glu Ile Ala Phe Gly Gly Phe Ile Gin Ile Ile Ser 325 330 Leu Tyr Ala Lys Lys Ser Phe Lys Leu Ser Ser Leu 340 345 Ala Leu Asp Lys Glu Val Arg Glu Arg Phe Gly Ser 355 360 INFORMATION FOR SEQ ID NO:1060: SEQUENCE CHARACTERISTICS: LENGTH: 417 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...417 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1060: Met Lys Tyr Leu Trp Leu Phe Leu Ile Tyr Ala Ile Glu 285 Lys Phe Gin Asp Val 365 Gly Lys Tyr Gin Ala Ala Asp Asp 125 Tyr Ser Thr Phe 270 Arg Leu Leu Ala Glu 350 Ala Leu Leu Ala His Glu Val Thr 110 Leu Leu Lys Met Pro Tyr Gly Asn Leu 335 Val Arg Phe Pro Leu Phe Leu Glu Gly Tyr Lys Tyr Arg 175 Lys Thr Val Gin 320 Glu Leu Val Ala Lys Lys Asp Lys Asn Thr Pro Ala Ile 160 Tyr Trp 1 Thr Ile Leu Val Asp Gly Leu Phe Pro 145 Gly Gin Asp 
Glu His Ser Lys Asn Lys Ala 130 Ser Pro Lys Lys Val Glu Gin Lys Lys Lys 115 Ala Ile Gly Glu Thr Arg Val Asn Val Ile 100 Thr His Ala Ile Ile 5 Leu Tyr Leu Lys His Ser Phe Asn Trp Thr 165 Ile Asp Ser Ala Glu 70 Leu Arg Asp Met Met 150 Asn Lys Ile Ile Asn 55 Gin Val Leu Tyr Ala 135 Lys Ile Asn Ile Asp 40 Asp Gly Ala Lys Pro 120 Ile Arg Ala Asn Lys 25 Asn Leu Ala Leu Leu 105 Ile Val Leu Leu Arg 10 Thr Asp Lys Ile Val 90 Tyr Val Val Ile Ala 170 Leu Ile Ala Thr Asn 75 Ser Asp Ser Asn Val 155 Asp Asn Gin Asn Ser Tyr Val Val Leu Asp 140 Phe Tyr Ile WO 97/37044 PCTIUS97/05223 Ala Asn Thr Pro 210 Ile Ala 225 Gly Ser Tyr Leu Pro Gly Ala Phe 290 Leu Gly 305 Asn Glu Glu Asn Ala Leu Gin Met 370 Lys Thr 385 Gin Ser Trp Ala 195 Met Ser Lys Tyr Ile 275 Val Leu Ser Leu Asn 355 Pro Ser Phe 180 Glu Ile Ser Ile Asp 260 Asp Ser Lys Ile Asn 340 Ser Arg Gin Leu Gin Leu Gin Leu 245 Thr Val Asp Glu Asp 325 Glu Lys Phe Glu Phe 405 Thr Lys Gly 230 Met His Ser Arg Ser 310 Ala Phe Tyr Ser Tyr 390 Glu Tyr 215 Met Ser Lys Gly Ser 295 Ala Tyr Gly Ile Met 375 Ala Phe 200 Asn Ala Leu Lys Val 280 Gly Glu Lys Lys Arg 360 Asp Met 185 Tyr Ile Val Ala Thr 265 Phe Tyr Gin Asp Thr 345 Arg Gly Gly Tyr Gin Val Pro 250 Lys Leu Pro Leu Ser 330 Val Leu Arg Leu Val 410 Thr Lys Ser 235 Asp Thr Glu Asn Leu 315 Ile Phe Thr Asn Ile 395 Gin Ala 220 Ser Gly Lys Asp Ile 300 Tyr Val Asn Val Ile 380 Leu Tyr 205 Thr Val Gin Ile Asp 285 Tyr Glu Tyr Leu Asn 365 Met Leu 190 Gly His Ser Pro Thr 270 Lys Met Gly Val Asn 350 Gly Tyr Asp Glu Glu Ser Asp 255 Arg Ser Lys Arg Ser 335 Leu Ser Ile Tyr Lys Asn Asp 240 Val Tyr Met Lys Ser 320 Arg Ile Asn Lys Asn 400 Pro Leu Lys Asn Lys Ile Gin Ala Phe Asp 415 INFORMATION FOR SEQ ID NO:1061: SEQUENCE CHARACTERISTICS: LENGTH: 219 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...219 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1061: Asn Lys Lys His 
Arg Leu Ala Phe Leu Gly Leu Ile Val Gly Val 10 Phe Phe Phe Ser Ala Cys Gln His Arg Leu His Met Gly Tyr Tyr 25 Glu Val Thr Gly Asp Tyr Leu Phe Asn Tyr Asn Ser Thr Ile Val 40 Val Ala Tyr Asp Arg Ser Asp Ala Met Thr Ser Tyr Tyr Ile Asn Val 55 Ile Val Tyr Glu Leu Gln Lys Leu Gly Phe Tyr Asn Val Phe Thr Gln 70 75 Ala Glu Phe Pro Leu Asp Lys Ala Lys Asn Val Ile Tyr Val Arg Ile 90 Val Arg Asn Ile Ser Ala Val Pro Phe Tyr Gln Tyr Asn Tyr Gln Leu 100 105 110 Ile Asp Gln Val Asn Lys Pro Cys Tyr Phe Leu Gly Gly Gln Phe Tyr 115 120 125 Cys Ser Gln Thr Pro Thr Asp Tyr Tyr Ala Ile Asn Gly Phe Ser Glu 130 135 140 Gln Ile Leu Met Ser Ala Asn Ser His Phe Ile Leu Asp Trp Tyr Asp 145 150 155 160 Val Val Leu Gln Lys Arg Val Leu Tyr Val Asp Gly Ser Val Ser Gly 165 170 175 Arg Thr Cys Gly Tyr Gln Met Leu Tyr Arg Asp Leu Ile Lys Ser Thr 180 185 190 Ile Lys Arg Ile Asp Phe Asn Arg Pro Glu Arg Tyr Tyr Tyr Asn Leu 195 200 205 Arg Leu Pro Leu Tyr Gln Pro Cys Tyr Arg Gln 210 215
INFORMATION FOR SEQ ID NO:1062: SEQUENCE CHARACTERISTICS: LENGTH: 416 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...416 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1062:
Met Gln Tyr Lys Lys Asn Lys Lys Arg Tyr Tyr His Leu Ala Leu Gly 1 5 10 Ile Leu Phe Cys Asn Gly Leu Ser Leu Lys Ala Leu Glu Ile Ala Val 25 Lys Pro Phe Gly Tyr Leu Gly Leu Leu Tyr Asn Gln Gly Thr Gln Lys 40 Asn Pro His Ser Tyr Val Gly Ala Leu Ala Arg Leu Gly Val Asp Phe 55 Ser Tyr Ser Asn Gly Trp Ser Phe Gly Ile Gly Ala Ile Gly Ala Trp 70 75 Asn Ile Tyr Asn Lys Gln Arg Leu Ala Asn Leu Tyr Ile Ser Leu Gly 90 Asn Phe Phe Gly Asn Pro Asn Asn Val Lys Pro Tyr Leu Ser Ala Gly 100 105 110 Asp Val Ser Asp Ala Tyr Leu Gln Tyr Ala Asn Gln Arg Phe Lys Ile 115 120 125 Ala Leu Gly Arg Phe Asn Thr Asp Phe Val Asp Phe Asp Trp Ile Gly 130 135 140 Gly Asn Ile Gln
Gly Val Ser Val Ala Phe Lys Gln Asn Ser Met Arg 145 150 155 160 Tyr Phe Gly Ile Phe Met Asp Ser Met Leu Tyr Asn Gly His Gln Ile 165 170 175 Asn Lys Glu Gln Gly Asn Arg Ile Ala Thr Ser Leu Asn Ala Leu Ala 180 185 190 Ser Tyr Asp Pro Val Ser Lys Arg Leu Tyr Val Gly Gly Glu Val Phe 195 200 205 Val Leu Gly Ala Glu Tyr Lys Asn Lys Asn Leu Ile Phe Val Pro Phe 210 215 220 Ile Leu Thr Asp Thr Arg Leu Pro Leu Pro Thr Gln Asn Val Leu Val 225 230 235 240 Gln Val Gly Gly Lys Leu Glu Tyr Asp Ala Ser Leu Ala Lys Gly Phe 245 250 255 Thr Ser Arg Thr Leu Val His Gly Met Tyr Gln Tyr Gly Asn Thr Asp 260 265 270 Ile Thr Thr Ser Ala Lys Asn Ala Gly Leu Phe Leu Ile Asp Gln Thr 275 280 285 Phe Lys Tyr Lys Ile Phe Asn Phe Gly Thr Gly Phe Tyr Ile Val Pro 290 295 300 Ala Arg Asn Asn Lys Gly Tyr Leu Trp Thr Phe Asn Asp Arg Thr Lys 305 310 315 320 Phe Tyr Gly Arg Gly Ile Asn Ala Pro Gly Val Pro Ala Ile Tyr Phe 325 330 335 Ala Asn Ser Ser Ile Ser Gly Tyr Val Phe Leu Gly Leu Lys Thr Lys 340 345 350 Arg Val Arg Leu Asp Ala Met Val Ala Phe Gly Asp Tyr Gln Glu Tyr 355 360 365 Ser Leu Met Ser Ser Phe Arg Val Trp Thr Tyr Arg Asp Leu Ser Phe 370 375 380 Asp Met Gly Gly Gly Tyr Val Tyr Ala Tyr Asn Ser Lys Ala Thr Arg 385 390 395 400 Lys Ser Leu Gly Asp Ser Ser Phe Val Phe Phe Gly Lys Phe Leu Phe 405 410 415
INFORMATION FOR SEQ ID NO:1063: SEQUENCE CHARACTERISTICS: LENGTH: 668 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...668 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1063:
Met Val Lys Asn Thr Lys Gly
1
Lys Lys Gly Tyr Val1 Ala Thr Ile Gin 145 Thr Giy Ile Ile Ile 225 Ser Phe Glu Asn Phe 305 Asn Phe Arg Gly Ala 385 Ala Leu Asn Leu Ser Ser Ser Gin Ala Thr Ser Asn Gly Thr Ile 115 Ser Thr 130 Lys Ala Thr Thr Glu Pro Ser Thr 195 Asn Thr 210 Ile Thr Gly Tyr Lys Asn Ala Val 275 Ser Leu 290 Ala Giu Gin Ala Val Asn Arg Giy 355 Cys Ala 370 His Phe Asp Thr Asn Ser Thr Val1 Tyr Gly 100 Thr Lys Leu Lys Asn 180 Ser Thr Thr Trp Giu 260 Ala Asp Ser Glu Asp 340 Thr Tyr Gly Leu 5 Leu Asp Arg Leu Ala Ile Cys Asn Thr Leu 165 Lys Trp Asn Leu Ala 245 Ile Gin Ala Met Gin 325 Ser Asn Val1 Thr Val1 405 Leu Pro Asn Leu 70 Phe Gin Asn Tyr Ala 150 Asp Lys Asn Ser Asn 230 Gly Ser Ala Gly Leu 310 Val1 Leu Pro Gly Gin 390 Asn Asn Ser Leu 55 Ala Thr Thr Ser Ala 135 Asn Phe Leu Ala Ala 215 Ser Ile Ala Lys Lys 295 Lys Val Gly Gly Gin 375 Giu Phe Ile Gin Gin 10 Asn Tyr Asn 25 Ala Val Asn 40 Leu Asp Vai Leu Ala Phe Tyr 120 Ile Gly Thr Val Thr 200 Gin Ala Ser Ile Ile 280 Pro Asn Lys Val1 Gin 360 Thr Gin Lys Asn Cys Asn 105 Tyr Ile Giu Ile Tyr 185 Ile Giu Cys Gly Gin 265 Val1 Phe Ala Asn Cys 345 Thr Ile Gin Ser Aia Gly 90 Asn Glu Asn Gly Asn 170 Pro Thr Leu Pro Asn 250 Gly Ser Asn Gin Phe 330 Tyr Thr Thr Ile Arg Leu Thr Asp Lys Ala 75 Pro Val1 Pro Lys Ile 155 Gly Trp Ala Leu Asn 235 Giy Met Giu Pro Ala 315 Glu Giu Ser Asn Gin 395 Tyr Ser Leu Ala Ala Val Gly Pro Gly Ala 140 Pro Asp Ser Pro Lys 220 Phe Thr Ile Asn Tyr 300 Gin Lys Val Asn Leu 380 Gin Ser Giu Asn Arg Asn Gly Ser Gly His 125 Tyr Val1 Lys His Thr 205 Gin Gin Met Ala Thr 285 Thr Ala Ile Gin Thr 365 Lys Ala Glu Asn Thr Asp Ser Leu Asn Gin 110 Gly Gin Leu Arg Gly 190 Thr Ala Asn Cys Asn 270 Gin Asp Glu Pro Gly 350 Trp Asn Giu Leu Tyr Leu Asn Pro Trp Giu Asn Gly Ile Ser Thr 175 Lys Giu Ser Gly Gly 255 Ala Asn Ala Ile Thr 335 Gly Gly Ser Asn Gly Glu Val1 Leu Ala Gin Asn Thr Pro Ile Asn 160 Gly Ala Asn Ile Gly 240 Met Gin Gin Ser Leu 320 Ala Giu Al a Ile Ile 400 Asn 415 Thr Tyr Asn Ser Ile Thr Thr Ala Leu Ser Asn Ile Pro Asn Ala Gin WO 97/37044 
PCT/US97/05223 Ser Gly Gin 465 Ile Gin Arg Phe Ala 545 Asn Thr Val Met His 625 Asn Leu Leu lle 450 Thr Val Val Tyr Phe 530 Leu Asn Ser Tyr Gly 610 Ala Thr Tyr Gin 435 Asp Ile Ser Gly Tyr 515 Asn Tyr Lys Trp Asn 595 Val Ala Asn Ser Asn Thr Asn Ser Tyr 500 Gly Ser Asn Leu Leu 580 Ala Arg Gin Tyr Val 660 Ala Val Ser Asn Gin Gin 485 Lys Phe Ala Phe Ser 565 Asn Lys Met His Tyr 645 Tyr Tyr Glu 470 Thr Gin Phe Ser Ile 550 Val Ser Met Asn Gly 630 Ser Leu Tyr 455 Leu Asn Phe Asp Asp 535 Asn Gly Glu Asn Leu 615 Ile Phe Asn Lys Lys Asn 440 Leu Asn Gin Gly Arg Asn Asn Gly Ala 490 Phe Gly Gin 505 Tyr Asn His 520 Val Trp Thr Asp Lys Ala Leu Phe Gly 570 Tyr Val Asn 585 Val Ala Asn 600 Ala Arg Pro Glu Leu Gly Met Gly Ala 650 Tyr Val Phe 665 Asn Asn Pro 475 Met Lys Ala Tyr Thr 555 Gly Leu Phe Lys Leu 635 Glu Ala Pro Ser 460 Phe Asn Arg Phe Gly 540 Asn Ile Ala Gin Lys 620 Lys Leu Tyr Tyr 445 Tyr Arg Gly Lys Ile 525 Phe Phe Ala Thr Phe 605 Lys Ile Lys Ser Asn Lys Ile Trp 510 Lys Gly Leu Leu Met 590 Leu Asp Pro Tyr Pro Gin Val Gly 495 Gly Ser Ala Gly Ala 575 Asn Phe Ser Thr Arg 655 Gin Ile Gly 480 Ile Ala Ser Asp Lys 560 Gly Asn Asn Asp Ile 640 Arg INFORMATION FOR SEQ ID NO:1064: SEQUENCE CHARACTERISTICS: LENGTH: 342 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...342 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1064: Leu Thr Lys Lys Phe Met Ser Trp Met Val Val Ile Gly Ala Leu Ile 1 5 10 Cys Val Leu Leu Gly Val Phe Ile Phe Phe Thr Ser Met Ser Val Lys 25 Lys Ser Leu Thr Ala Tyr Leu Asn Ala Tyr Leu Glu Gin Arg Pro Asn 40 WO 97/37044 PCT/US97/05223 Ile Phe Asn Leu Pro Lys Cys 145 Leu Phe lle Gin Ala 225 Glu Pro Met Asn Arg 305 Thr Tyr Glu Gly Lys Ile Ser Pro Asp Lys Ile Leu 115 Asn Leu 130 Ser Leu Lys Cys Gin Glu Phe Lys 195 Asp Lys 210 Arg His Ser Leu Ser Val Ala Leu 275 Phe Lys 290 Ser Gin Phe Asn Lys Ile Met Gly lie Ile Gly Val Ala lie Ser 100 
Glu Asn Thr Asp Gly 180 Thr Leu Phe Gly Ser 260 Phe Gly Ile Ala Ser 340 Cys Met Ser Gin Ala Phe Leu 165 Leu Leu Arg Lys Phe 245 Tyr Gin Leu Val Leu 325 His Val 70 Asp Leu Ser Leu Asn 150 Thr Met Ser Phe Asn 230 Phe Glu Ser Leu Leu 310 Leu Glu Ser Phe Thr Ile Leu 135 Ala Asn Glu Ser Leu 215 Val Ser Ser His Gin 295 Asn Glu Lys Lys Leu Gin 120 Glu Leu Ala Ala Lys 200 Ala Leu Pro Ala Phe 280 Ala Ala Ser Glu Asn Ser 105 Gin Lys Asp Glu Gin 185 Asp Pro Glu Tyr Leu 265 Lys Phe Gin Leu Pro Leu Leu 90 Ile Lys Phe Glu Asn 170 Glu Ala Lys Ser Phe 250 Ala Asp Val Ala Ser 330 Phe Arg 75 Lys His Ile Lys Lys 155 Ile Asn Lys Leu Phe 235 Ser Ser Asp Ser Lys 315 Val Lys Phe lie Ser Ser Pro 140 Thr Leu Leu Ala Ser 220 Tyr Leu Leu Thr Met 300 Asp Asn Cys Leu Lys Gin Gin 125 Thr Leu Ala Ser Ile 205 Val Gin Arg Glu Ala 285 Ala Asn Phe Glu Asp Leu Ile 110 Ile Arg Asn Tyr Leu 190 Glu Ser Gin Ser Asn 270 Leu Lys Thr Phe Gly Pro His Gin Pro Leu Asp Thr 175 Lys Glu Ile Asn Gin 255 Tyr Gin Asp Lys Gin 335 Phe Gin Ser Ser Leu Asn Asn 160 Phe Asn Leu Gin Lys 240 Thr Phe Gin Lys Leu 320 Ser INFORMATION FOR SEQ ID NO:1065: SEQUENCE CHARACTERISTICS: LENGTH: 293 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...293 WO 97/37044 PCT/US97/05223 978 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1065: Met Arg Leu Leu Phe Leu Leu Leu Ser Ala Thr Leu 1 Glu Trp Thr Phe Val Glu Tyr Ser Phe 145 Asn Ala Leu Lys Asp 225 Ile Gly Ile Tyr Glu Gin Pro Asn Leu Ala Leu Ser 130 His Leu Leu Ala Ile 210 Ile Glu Leu Lys Ala 290 Lys Asn Pro Lys Lys Ser Leu 115 Lys Leu Ala Glu Val 195 Val Ser Thr Leu Glu 275 Gly Ile Ala Ile Thr Asn Leu 100 Ser Asp Ala Ser Leu 180 Ile Pro Val Asn Leu 260 Ala Lys 5 Pro Leu Lys Pro Ala Asn Lys Lys Pro Gin 165 Lys Ala Thr Leu Lys 245 Asp Glu Phe Leu Lys Ala Lys 70 Gin Ala Lys Lys Pro 150 Asp Pro Val Pro Gly 230 Glu Lys Asn Ser Glu Val 55 Ile Leu Leu Ala 
Gin 135 Pro Glu Pro Leu Gly 215 Arg Arg Ile Lys Asp Asp 25 Val Gin 40 Gin Thr Met Glu Asp Ser Glu Met 105 Lys Ala 120 Leu Arg Ser Leu Lys Glu Leu Asp 185 Leu Leu 200 Ser Phe Val Asp Tyr Leu Ser Pro 265 Ile Lys 280 10 Ala Pro Thr Val Lys 90 Phe Gly Phe Lys Arg 170 Leu Ile Leu Ala Val 250 Lys Pro Asp Leu Glu 75 Lys Ser Leu Leu Glu 155 Ala Ser Leu Lys Asn 235 Leu Thr Ile Ser Thr Gly Thr Tyr Glu Phe 140 Lys Lys His Tyr Lys 220 His Leu Ser Met Lys Asn Phe Gin Met Gin Ile 125 Leu Ser Ser Ala Val 205 Asp Lys Ser Lys Leu Leu Ala Glu Lys Asp Asn 110 Gin Pro Gin Pro Tyr 190 Ile Leu Ile Asp Glu 270 Leu Val Pro Thr Val Phe Asp Ala Lys His Ser 175 Lys Lys Lys Ile Lys 255 Glu Ala His Ala Pro Ile Lys Ile Ser Gly Ala 160 Asn Gly Lys Leu Ser 240 Tyr Leu Asn Ser Lys Leu Gly Asn Leu INFORMATION FOR SEQ ID NO:1066: SEQUENCE CHARACTERISTICS: LENGTH: 286 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature WO 97/37044 PCTIUS97/05223 979 LOCATION 1...286 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1066: Met Leu Arg Asn Gin Phe Arg Ile Val Phe Val Ser Cys Ile Val Ala 1 5 10 Ser Ser Leu Gln Ala Gin Glu Asn Thr His Thr Leu Gly Lys Val Thr 25 Thr Lys Gly Glu Arg Thr Phe Glu Tyr Asn Asn Lys Met Tyr Ile Asp 40 Arg Lys Glu Leu Gin Gin Arg Gin Ser Asn Gin Ile Arg Asp Ile Phe 55 Arg Thr Arg Ala Asp Val Asn Val Ala Ser Gly Gly Leu Met Ala Gin 70 75 Lys Ile Tyr Val Arg Gly Ile Glu Ser Arg Leu Leu Arg Val Thr Ile 90 Asp Gly Val Ala Gin Asn Gly Asn Ile Phe His His Asp Ala Asn Thr 100 105 110 Val Ile Asp Pro Asn Met Ile Lys Glu Val Glu Val Ile Lys Gly Ala 115 120 125 Ala Asn Ala Ser Ala Gly Pro Gly Ala Val Ala Gly Lys Leu Ser Phe 130 135 140 Thr Thr Ile Asp Ala Asn Asp Phe Leu Arg Lys Asn Gin Thr Tyr Gly 145 150 155 160 Ala Lys Ala Glu Ala Ala Phe Tyr Thr Asn Phe Gly Tyr Arg Met Asn 165 170 175 Ala Thr Ala Ala Tyr Arg Gly Lys Asn Trp Asp Ile Leu Ala Tyr Tyr 180 185 190 Asn His 
Gln Asn Ile Phe Tyr Tyr Arg Asp Gly Asn Asn Ala Phe Arg 195 200 205 Ser Leu Phe His Pro Asn Tyr Asp Leu Gln Asp Pro Ser Asn Ser Glu 210 215 220 Met Ser Val Gly Thr Pro Ser Glu Val Asn Ser Val Leu Ala Lys Ile 225 230 235 240 Asn Gly Tyr Ile Asn Glu Thr Asp Ser Ile Ser Val Ser Tyr Asn Leu 245 250 255 Thr Arg Asp Asn Ser Thr Arg Leu Leu Arg Pro Asn Thr Thr Ser Ala 260 265 270 Leu Ser Lys Ala Asn Asp Gln Glu Ala Ser Gln Pro Pro Leu 275 280 285
INFORMATION FOR SEQ ID NO:1067: SEQUENCE CHARACTERISTICS: LENGTH: 341 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...341 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1067:
Met Lys Arg Leu Leu Leu Leu Ala Leu Ala Leu Phe Phe Ser Leu Ser 1 5 10 Cys Thr Asn Ala Gln Glu Ile Lys Glu Thr Gln Glu Thr Lys Lys Thr 25 Lys Glu Thr Lys Ser Gln Thr Arg Phe Asn Ile Ser Thr Thr Lys Val 40 Ile Glu Lys Glu Phe Ser Gln Ser Arg Arg Tyr Tyr Ala Leu Leu Glu 55 Pro Asn Glu Ala Leu Ile Phe Ser Gln Thr Leu Arg Phe Asp Gly Tyr 70 75 Val Glu Lys Leu Tyr Ala Asn Lys Thr Tyr Thr Pro Ile Lys Lys Gly 90 Asp Arg Leu Leu Ser Val Tyr Ser Pro Glu Leu Ala Gly Val Gln Ser 100 105 110 Glu Leu Leu Ser Ser Leu Lys Phe Asn Gln Gln Val Gly Ala Ile Lys 115 120 125 Glu Lys Leu Lys Leu Leu Gly Leu Glu Asn Phe Ser Ile Glu Lys Ile 130 135 140 Ile Ser Ser His Lys Val Gln Asn Glu Ile Thr Ile Tyr Ser Arg Phe 145 150 155 160 Asn Gly Val Ile Phe Lys Lys Ser Pro Asp Leu Asn Glu Gly Ser Phe 165 170 175 Ile Lys Lys Gly Gln Glu Leu Phe Lys Ile Ile Asp Leu Ser Arg Leu 180 185 190 Trp Ala Leu Val Lys Val Asn Gln Glu Asp Leu Glu Phe Leu Lys Asn 195 200 205 Thr His Gln Ala Ile Leu Phe Val Glu Gly Val Lys Gly Lys Gln Ala 210 215 220 Ile Thr Leu Glu Asn Ile Asn Pro Ile Ile Asn Ala Gln Asp Lys Met 225 230 235 240 Leu Glu Ala Arg Phe Asn Val Pro Asn Leu Lys Leu Leu Tyr Tyr Pro 245 250 255 Asn Met Phe Ala Gln Val Glu Ile
Phe His Lys Pro Gin Lys Met Lys 260 265 270 Ile Leu Pro Lys Glu Ala Val Leu Ile Lys Gly Gly Lys Ala Ile Val 275 280 285 Phe Lys Lys Asp Asp Phe Gly Leu Ser Pro Leu Glu Ile Lys Ala Val 290 295 300 Arg Leu Ser Asp Gly Ser Tyr Glu Ile Leu Glu Gly Leu Lys Ala Gly 305 310 315 320 Glu Glu Val Ala Asn Asn Ala Leu Phe Val Leu Asp Ala Asp Ala Gin 325 330 335 Asn Asn Gly Asp Tyr 340 INFORMATION FOR SEQ ID NO:1068: SEQUENCE CHARACTERISTICS: LENGTH: 463 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 WO 9737044PCT[US97/05223 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 463 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1068: Met Ala Phe Cys Ala Arg Gin Lys Pro Leu Lys Glu 1 Phe Giu Asp Asn Gin Asp Ser Ala Phe 145 Glu Lys Giu Asn Gin 225 Lys Ile Giu Ala His 305 Arg Asn Ile Leu Tyr Ala Lys Ala Met Leu 130 Lys Glu Ala Lys Arg 210 Val1 Ser Asn Arg Ile 290 Leu Lys Leu Leu Asp Tyr Lys Al a Lys Leu 115 Lys Pro Leu Asn Lys 195 Leu Ile Asn Giu Ser 275 Phe Ser Leu Ser 5 Phe Phe Leu Lys Leu Trp Lys Ala Met Gin Met Pro 100 Giu Thr Ser Lys Leu Gin Giu Ile 165 Ala Gl'n 180 Leu Gin Leu Asp Leu Asp Ala Thr 245 Ile Leu 260 Giu Ala Trp Gin Gin Ser Lys Thr 325 Gin Giu Met Asp Arg Tyr 70 Giu Glu Thr Ile Giu 150 Leu Val Ile Giu Pro 230 Gin His Val Tyr Pro 310 Thr Asn Gly Leu Tyr 55 Glu Lys Asp Asp Arg 135 Lys Gin Phe Phe Asp 215 Lys Ser Lys Val1 Leu 295 Ala Pro Pro Met Giu 40 Ile Leu Gly Ile Ala 120 Asp Ile Ser Ser Giu 200 Tyr Leu Asn Lys Lys 280 Ala Leu Ser Pro Leu 25 Lys Ser Thr Val Tyr 105 Phe Phe Lys Lys Ala 185 Lys Pro Asp Ala Thr 265 Asp Ser Asn Tyr Phe 10 Gly Lys Asp Gin Giu 90 Cys Gin Asp Giu Asn 170 Leu His Ala His Gin 250 Ser Asp Lys Leu Arg 330 Asn Val1 Pro Lys Asn 75 Asn Lys Ala Lys Ala 155 Val1 Phe Ile Phe Phe 235 Thr Lys Asp Lys Tyr 315 Ile Thr Gly Ala Lys Lys Ser Gin Ser Ile 140 Tyr S er Asn Pro As n 220 Lys Phe Al a Phe Lys 300 Ser Ile Tyr Ile Met Phe Ser Oly Ile Thr Ser Asn Ser 
Asp Lys Ile Thr 110 Cys Ile 125 Pro Ile Pro Ile A la Ser His Leu 190 Ile Lys 205 Arg Leu Asp Ala Phe Ile Leu Lys 270 Ser Lys 285 Lys Thr Leu Tyr Ser His Asp Pro Arg Gin Val1 Leu Ala Ser Leu Ala Gin Leu Leu 175 Ser Glu Ile Leu Leu 255 Tyr Asp Leu Ala Ile 335 Phe Phe Thr Arg Glu Leu Pro Glu Ile Thr Tyr 160 Phe Tyr Leu Tyr Ala 240 Gly Phe Arg Giu Ser 320 Gin Ser 340 345 350 Trp Gin Ile Phe Lys Giu Lys Thr Leu Ser Leu Lys Asp Giu Gly Ala WO 97/37044 PCT/US97/05223 982 355 360 365 Phe Asn Ala Met Leu Lys Ser Leu Tyr Tyr Glu Lys Ser Ala Pro Glu 370 375 380 Leu Thr Tyr Leu Leu Ser Gin Arg Asn Lys Asp Lys Ile Tyr Tyr Tyr 385 390 395 400 Leu Ser Pro Tyr Glu Gly Ile Ile Glu Trp Gin Asn Thr Asp Glu Arg 405 410 415 Ala Met Ala Tyr Ala Ile Ala Arg Gin Glu Ser Phe Leu Leu Pro Ala 420 425 430 Leu Ile Ser Arg Ser Phe Ala Leu Gly Leu Met Gin Ile Met Pro Phe 435 440 445 Asn Val Gly Pro Phe Ala Lys Ala Leu Ala Trp Ile Met Leu Ile 450 455 460 INFORMATION FOR SEQ ID NO:1069: SEQUENCE CHARACTERISTICS: LENGTH: 374 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...374 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1069: Met Pro Lys Arg Met Lys Cys Phe Ser Gln Lys Trp Leu Val Phe Phe 1 5 10 Val Thr His Trp Leu Leu Leu Ala Ser Leu Ser His Ala Lys Met Ala 25 Phe Glu Ser Asn Ile Asp Thr Lys Ala Leu Glu Ala Phe Gly Val Asn 40 Ala Ser Phe Leu Ser Gin Met Pro Gly Ala Leu Lys Lys Val Asn Lys 55 Glu Glu Glu Trp Lys Lys Leu Val Lys Arg Phe Asp Val Asn Tyr Gin 70 75 Phe Ile Pro Ile Ile Lys Asn Met Leu Ile Gly Ala Ser Val Pro Gin 90 Glu Phe Leu Phe Leu Ala Met Ala Glu Ser Lys Phe Ser Ser Arg Ala 100 105 110 Tyr Ser Arg Lys Lys Ala Val Gly Ile Trp Gin Phe Met Pro Ser Thr 115 120 125 Ala Lys Glu Leu Gly Leu Lys Val Asn His Tyr Ile Asp Glu Arg Arg 130 135 140 Asp Pro Ile Lys Ser Thr Gin Ala Ala Ile Ala Tyr Leu Lys Arg Leu 145 150 155 160 Tyr Lys Gin Thr Gly Glu Trp 
Tyr Leu Val Ala Met Ala Tyr Asn Tyr 165 170 175 Gly Leu Arg Lys Val Gln Asn Ala Ile Lys Ala Ala Gly Thr Ser Asp 180 185 190 Ile Lys Val Leu Leu Asp Glu Asp Lys Lys Tyr Leu Pro Lys Glu Thr 195 200 205 Arg Glu Tyr Ile Arg Ser Ile Leu Ser Leu Ala Leu Lys Phe Asn Ser 210 215 220 Leu Asp Asn Leu Lys Asp Lys Glu Tyr Leu Leu Asn Arg Gly Ala Arg 225 230 235 240 Val Ser Leu Val Gly Val Pro Phe Lys Arg His Thr Ser Leu Val Gln 245 250 255 Val Ala Lys Asn Leu Asn Leu Ser Leu Glu Thr Leu Lys Ser Tyr Asn 260 265 270 His Gln Phe Arg Tyr Asn Ile Leu Pro Ser Lys Asp Pro Thr Tyr Thr 275 280 285 Ile Tyr Ile Pro Tyr Glu Lys Leu Ala Leu Phe Lys Gln Arg Gln Leu 290 295 300 Lys Gln Asn Lys Asn Ala Gln Ala Asn Pro Lys Ser Pro Phe Ile Thr 305 310 315 320 His Val Val Leu Pro Lys Glu Thr Leu Ser Ser Ile Ala Lys Arg Tyr 325 330 335 Gln Val Ser Ile Ser Ser Ile Gln Leu Ala Asn Asn Leu Lys Asp Ser 340 345 350 Asn Ile Phe Ile His Gln Arg Leu Ile Ile Pro Thr Asn Lys Lys Leu 355 360 365 Leu Ala Thr Arg Glu Phe 370
INFORMATION FOR SEQ ID NO:1070: SEQUENCE CHARACTERISTICS: LENGTH: 333 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...333 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1070:
Met Leu Ile Ala Arg Phe Lys Lys Ala Leu Ile Ser Tyr Ser Leu Gly 1 5 10 Val Leu Leu Val Ser Ser Leu Leu Gly Val Ala Asn Ala Ser Asn Gln 25 Glu Ile Gln Val Lys Asp Tyr Phe Gly Glu Gln Thr Ile Lys Leu Pro 40 Val Ser Lys Ile Ile Tyr Leu Gly Ser Phe Ala Glu Val Pro Ala Met 55 Phe Asn Thr Trp Asp Arg Val Val Gly Ile Ser Asp Tyr Ala Phe Lys 70 75 Ser Asp Ile Val Lys Ala Thr Leu Lys Asp Pro Glu Arg Ile Lys Pro 90 Met Ser Ser Asp His Val Ala Ala Leu Asn Val Glu Leu Leu Lys Lys 100 105 110 Leu Ser Pro Asp Leu Val Val Thr Phe Val Gly Asn Pro Lys Ala Val 115 120 125 Glu His Ala Lys Lys Phe Gly Ile Ser Phe Leu Ser Phe Gln Glu Lys
130 135 140 Thr Ile Val Glu Val Met Glu Asp Ile Asp Ala Gln Ala Lys Ala Leu 145 150 155 160 Glu Val Asp Ala Ser Lys Lys Leu Ala Lys Met Gln Glu Thr Leu Asp 165 170 175 Phe Ile Lys Glu Arg Leu Lys Asn Val Lys Lys Lys Lys Gly Val Glu 180 185 190 Leu Phe His Lys Ala Asn Lys Ile Ser Gly His Gln Ala Leu Asp Ser 195 200 205 Asp Ile Leu Glu Lys Gly Gly Ile Asp Asn Phe Gly Leu Lys Tyr Val 210 215 220 Lys Phe Gly Arg Ala Asp Ile Ser Val Glu Lys Ile Val Lys Glu Asn 225 230 235 240 Pro Glu Ile Ile Phe Ile Trp Trp Ile Ser Pro Leu Ser Pro Glu Asp 245 250 255 Val Leu Asn Asn Pro Lys Phe Ser Thr Ile Lys Ala Ile Lys Asn Lys 260 265 270 Gln Val Tyr Lys Leu Pro Thr Met Asp Ile Gly Gly Pro Arg Ala Pro 275 280 285 Leu Ile Ser Leu Phe Ile Ala Leu Lys Ala His Pro Glu Ala Phe Lys 290 295 300 Gly Val Asp Ile Asn Ala Ile Ile Lys Asp Tyr Tyr Lys Val Val Phe 305 310 315 320 Asp Leu Asn Asp Ala Glu Val Glu Pro Phe Leu Trp His 325 330
INFORMATION FOR SEQ ID NO:1071: SEQUENCE CHARACTERISTICS: LENGTH: 533 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...533 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1071:
Met Glu Ile Arg Asn Ile Lys Glu Phe Glu Lys Ala Ser Lys Lys Leu 1 5 10 Gln Lys Asp Thr Leu Lys Ile Ala Leu Ala Leu Leu Phe Leu Ile Gly 25 Ala Ala Leu Leu Ala Leu Ile Phe Gly Gln Ala Asn Ser Lys Gly Leu 40 Leu Leu Ile Phe Ala Ala Val Ile Gly Gly Tyr Met Ala Met Asn Ile 55 Gly Ala Asn Asp Val Ser Asn Asn Val Gly Pro Ala Val Gly Ser Lys 70 75 Ala Ile Ser Met Gly Gly Ala Ile Leu Ile Ala Ala Val Cys Glu Met 90 Leu Gly Ala Ile Ile Ala Gly Gly Glu Val Val Ser Thr Ile Lys Gly 100 105 110 Arg Ile Val Ser Pro Glu Phe Ile Asn Asp Ala His Val Phe Ile Asn 115 120 125 Val Met Leu Ala Ser Leu Leu Ser Gly Ala Leu Trp Leu His Val Ala 130 135 140 Thr Leu Ile Gly Ala Pro Val Ser Thr Ser His Ser Val Val Gly Gly 145 150 155 160 Ile Met Gly
Ala Gly Met Ala Ala Ala Gly Met Ser Ala Ile Asn Trp 165 170 175 His Phe Leu Ser Gly Ile Val Ala Ser Trp Val Ile Ser Pro Leu Met 180 185 190 Gly Ala Leu Ile Ala Met Phe Phe Leu Met Leu Ile Lys Lys Thr Ile 195 200 205 Ala Tyr Lys Glu Asp Lys Lys Ser Ala Ala Leu Lys Val Val Pro Tyr 210 215 220 Leu Val Ala Leu Met Ser Leu Ala Phe Ser Trp Tyr Leu Ile Val Lys 225 230 235 240 Val Leu Lys Arg Leu Tyr Ala Val Gly Phe Glu Ile Gln Leu Ala Cys 245 250 255 Gly Cys Val Leu Ala Leu Leu Ile Phe Ile Leu Phe Lys Arg Phe Val 260 265 270 Leu Lys Lys Ala Pro Gln Leu Glu Asn Ser His Glu Ser Val Asn Glu 275 280 285 Leu Phe Asn Val Pro Leu Ile Phe Ala Ala Ala Leu Leu Ser Phe Ala 290 295 300 His Gly Ala Asn Asp Val Ala Asn Ala Ile Gly Pro Leu Ala Ala Ile 305 310 315 320 Ser Gln Thr Leu Glu Asp Ala Ser Ser Pro Met Gly Ser Thr Leu Asn 325 330 335 Ser Val Pro Leu Trp Ile Met Val Val Gly Ala Ala Gly Ile Ala Leu 340 345 350 Gly Leu Ser Leu Tyr Gly Pro Lys Leu Ile Lys Thr Val Gly Ser Glu 355 360 365 Ile Thr Glu Leu Asp Lys Met Gln Ala Phe Cys Ile Ala Leu Ser Ala 370 375 380 Val Ile Thr Val Leu Leu Ala Ser Gln Leu Gly Leu Pro Val Ser Ser 385 390 395 400 Thr His Ile Val Val Gly Ala Val Phe Gly Val Gly Phe Leu Arg Glu 405 410 415 Arg Leu Arg Glu Gln Ser Arg Arg Arg Phe Ala Arg Ile Arg Asp Asn 420 425 430 Ile Val Ala Ala His Phe Gly Glu Asp Leu Glu Glu Ile Glu Gly Phe 435 440 445 Leu Glu Arg Phe Asp Lys Ala Asn Leu Lys Glu Lys Ser Leu Met Leu 450 455 460 Glu Ser Leu Lys Lys Ser Lys Asn Thr Ala Ile Ala Leu Glu Leu Lys 465 470 475 480 Lys Lys Glu Lys Lys Ser Leu Lys Lys Val Tyr Lys Glu Glu Val Ile 485 490 495 Lys Arg Ser Ile Leu Lys Lys Ile Val Thr Ala Trp Leu Val Thr Val 500 505 510 Pro Val Ser Ala Leu Leu Gly Ala Leu Leu Phe Val Ala Leu Gly Phe 515 520 525 Ile Glu Lys Tyr Phe 530
INFORMATION FOR SEQ ID NO:1072: SEQUENCE CHARACTERISTICS: LENGTH: 329 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...329 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1072: Met Ser Asn Ser Met Leu Asp Lys Asn Lys Ala Ile Leu Thr Gly Gly 1 5 10 Gly Ala Leu Leu Leu Gly Leu Ile Val Leu Phe Tyr Leu Ala Tyr Arg 25 Pro Lys. Ala Glu Val Leu Gin Gly Phe Leu Glu Ala Arg Glu Tyr Ser 40 Val Ser Ser Lys Val Pro Gly Arg Ile Glu Lys Val Phe Val Lys Lys 55 Gly Asp Arg Ile Lys Lys Gly Asp Leu Val Phe Ser Ile Ser Ser Pro 70 75 Glu Leu Glu Ala Lys Leu Ala Gin Ala Glu Ala Gly His Lys Ala Ala 90 Lys Ala Val Ser Asp Glu Val Lys Arg Gly Ser Arg Asp Glu Thr Ile 100 105 110 Asn Ser Ala Arg Asp Val Trp Gin Ala Ala Lys Ser Gin Ala Asn Leu 115 120 125 Ala Lys Glu Thr Tyr Lys Arg Val Gin Asp Leu Tyr Asp Asn Gly Val 130 135 140 Ala Ser Leu Gin Lys Arg Asp Glu Ala Tyr Ala Ala Tyr Glu Ser Thr 145 150 155 160 Lys Tyr Asn Glu Ser Ala Ala Tyr Gin Lys Tyr Lys Met Ala Leu Gly 165 170 175 Gly Ala Ser Ser Glu Ser Lys Ile Ala Ala Lys Ala Lys Glu Ser Ala 180 185 190 Ala Leu Gly Gln Val Asn Glu Val Glu Ser Tyr Leu Lys Asp Val Lys 195 200 205 Ala Leu Ala Pro Ile Asp Gly Glu Val Ser Asn Val Leu Leu Ser Gly 210 215 220 Gly Glu Leu Ser Pro Lys Gly Phe Pro Val Val Leu Met Ile Asp Leu 225 230 235 240 WO 97/37044 WO 9737044PCT/US97/05223 987 Lys Asp Ser Trp Leu Lys Ile Ser Val Pro Glu L 245 250 Phe Lys Val Gly Lys Glu Phe Glu Gly Tyr Ile P: 260 265 Ser Ala Lys Phe Arg Val Lys Tyr Leu Ser Val M~ 275 280 Thr Trp Lys Ala Thr Asn Asn Ser Asn Thr Tyr A~ 290 2953 Glu Val Glu Ala Ile Pro Leu Glu Glu Leu Glu Az 305 310 315 Met Ser Val Leu Val Thr Ile Lys Pro 325 INFORMATION FOR SEQ ID NO:1073: SEQUENCE CHARACTERISTICS: LENGTH: 292 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL:
YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .292 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1073: Ys Tyr Leu Asr 255 ro Ala Leu Lys 270 et Gly Asp Phe 285 3P Met Lys Ser ~n Phe Arg Val Glu Arg Ala Tyr Gly 320 Lou Gin Asn Ala Ala Arg Glu Ile Tyr Tyr 160 Val1 Gly Arg Leu Giu Ser Pro Lou His Sen Gly Phe Ser 1 Val1 Val Asp Glu Ala Leu Val Sen Glu 145 Thr Thr Ile Thn Leu Phe Phe Ile Gly Ang 130 Leu Ile Gly Ser Arg Ang Asn Tyr Val1 Gly 115 Thn Ala Pro Leu Pro Giy Tyr Ile Gin Thr 100 Thr Trp Ala Lys Asp 180 5 Ile Val Ala Asp Ala Asn Phe Pro Ser Thr 165 Tyr Giu Met Lys Tyr 70 Leu Leu Ang Thn Thn 150 Gly Cys Pro Pro Asn 55 Ser Asp Ser Tyr Thr 135 Gly Ile Gly Lou Gly 40 Ile Ser Asn Gin Lys 120 Ang Asn Asn Phe Ser 25 Asp Lys Gln Phe Ala 105 Gly Gly Val1 Leu Asp 185 10 Leu Gly Pro Tyr I le 90 Ile Val1 Tyr Phe Ala 170 Ile Lys Val Giu Phe 75 Ser Ang Ser Leu Ile 155 Tnp Tyr Pro Ile Tyr Val1 Sen Gin Ile Lou Met 140 Ile Leu Leu Sen Thn Met Gly Gly Tyr Tyr Asn 125 Al a Lys Ser Pro Ala Tyr Arg Sen Ang Ala Gly 110 Val Asp Lou Arg Asp 190 Ala Sen Gin Asn Ala Gin Tyr Gly Ser Asp Phe 175 Tyr WO 97/37044 WO 9737044PCT[US97/05223 988 Thr Ala Glu Lys Pro Lys Thr Pro Thr Asp Leu Al 195 200 Gin Leu Gly Leu Val His Met His Lys Pro Gly Ty 210 215 22 Phe Tyr Ile Asn Trp Ser Pro Lys Thr Lys Ser Hi 225 230 235 Leu Leu Ser Ala Val Phe Asn Asn Val Phe Asn Ly 245 250 Gin Thr Ser Pro Tyr Val Met Ser Pro Asp Met Pr 260 265 Ile Lys Arg Ala Ile Ala Glu Pro Gly Phe Asn Al 275 280 Ala Tyr Lys Trp 290 INFORMATION FOR SEQ ID NO:1074: SEQUENCE CHARACTERISTICS: LENGTH: 484 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .484 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:i074: a r 0
S
S
0 a Lys 205 Gly Trp Phe Giy Arg 285 Asn Tyr Tyr Lys Lys Leu Ile Asn 125 Sen Ser Asp Cys Val1 Lys Tyr Thr 270 Phe Ser Leu Ala Ala Ile Glu Ile 110 Leu Thn Leu Leu Gly Ser Gly Val1 255 Asp Glu Val1 Leu Phe Leu Leu Phe Asp Leu Glu Asp Asp 175 Ser Asn Leu 240 Asp Ala Val1 Arg Phe Lys Ser
CYS
Val1 Gly His Cys Val 160 Val1 Met Ala Lys Ile Thr Thn Val Ile Asp Ile Gly Ser 1 Leu Giu Gly Giu Val Ala Gin Lys Ala 145 Gly Ala Thr Val1 Phe Ala Ang Lys Asn 130 Leu Thr Val Lys Leu Lys Thr Val1 Glu 115 Sen Ile Ie Phe Sen Gin Glu Ser Lys 100 Ala Gly Giu Arg 5 Lys Lys Glu Ile Ala Lys Leu Ile Lys Ile 165 Lys Val Ile Ala 70 Val Ala Tyr Thr Gly 150 Lys Thr Ang Pro 55 Leu Arg Cys Gly Ile 135 Lys Glu Ser Ile 40 Met Lys Asp Gly Gly 120 Asp Ile Met Gin 25 Ser Gin Tyr Ala Leu 105 Ile Ile Lys Phe 10 Phe Glu Arg Lys Pro 90 Gin Ala Gly Asp Leu 170 Gly Gly Ala Sen 75 Asn Ile Cys Gly Leu 155 Asp Phe Cys Val Lys Arg Lys Ala Gly 140 Ile Lys WO 97/37044 PCT/US97/05223 989 Lys Leu Ala Lys Ala Phe Ile Gin Lys Glu Val Ser Lys Leu Pro Phe 180 185 190 Lys His Lys Asn Ala Phe Gly Val Gly Gly Thr Ile Arg Ala Leu Ser 195 200 205 Lys Val Leu Met Lys Arg Phe Asp Tyr Pro Ile Asp Ser Leu His Gly 210 215 220 Tyr Glu Ile Asp Ala His Lys Asn Leu Ala Phe Ile Glu Lys Ile Val 225 230 235 240 Met Leu Lys Glu Asp Lys Leu Arg Leu Leu Gly Val Asn Glu Glu Arg 245 250 255 Leu Asp Ser Ile Arg Ser Gly Ala Leu Ile Leu Ser Val Val Leu Glu 260 265 270 His Leu Lys Thr Ser Leu Met Ile Thr Ser Gly Val Gly Val Arg Glu 275 280 285 Gly Val Phe Leu Ser Asp Leu Leu Arg His His Tyr His Lys Phe Pro 290 295 300 Pro Asn Ile Asn Pro Ser Leu Ile Ser Leu Lys Asp Arg Phe Leu Pro 305 310 315 320 His Glu Lys His Ser Gin Lys Val Lys Lys Glu Cys Val Lys Leu Phe 325 330 335 Glu Val Leu Ser Pro Leu His Lys Ile Asp Glu Lys Tyr Leu Phe His 340 345 350 Leu Lys Ile Ala Gly Glu Leu Ala Ser Met Gly Lys Ile Leu Ser Val 355 360 365 Tyr Leu Ala His Lys His Ser Ala Tyr Phe Ile Leu Asn Ala Leu Ser 370 375 380 Tyr Gly Phe Ser His Gin Asp Arg Ala Ile Ile Cys Leu Leu Ala Gin 385 390 395 400 Phe Ser His Lys Lys Ile Pro Lys Asp Asn Ala Ile Ala His Met Ser 405 410 415 Ala Met Met Pro Ser Leu Leu Thr Leu Gin Trp Leu Ser Phe Ile Leu 420 425 430 Ser Leu Ala Glu Asn Leu Cys Leu Thr Asp Ser His His Leu Lys Tyr 435 440 445 Thr Leu Glu Lys Asn Lys Leu Val 
Ile His Ser Asn Asp Ala Leu Tyr 450 455 460 Leu Ala Lys Glu Met Leu Pro Lys Leu Ile Lys Pro Ile Pro Leu Thr 465 470 475 480 Ile Glu Phe Ala INFORMATION FOR SEQ ID NO:1075: SEQUENCE CHARACTERISTICS: LENGTH: 222 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature WO 97/37044 WO 9737044PCTIUS97/05223 990 LOCATION .222 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1075: Met Val Gly Met Lys Thr Glu Met Lys Ser Phe Gin His Pro Pro Thr Ile Ile Gin Asn 145 Giu Thr Vali Thr Pro Ala Asn Lys Lys Lys Lys Giu 130 Asp Tyr Lys Leu Lys 210 Leu Val Val Giu Asp Gin Gin 115 Lys Gin Lys Gly Giu 195 Gly Leu Leu Giu Glu Thr Giu 100 Giu Glu Lys Val Lys 180 Ile Tyr Val1 Gly Lys Ala Val1 Ile Ile Asn Thr Ala 165 Ile Gin Val Val Phe Ser Asn 70 Pro Lys Lys Lys Pro 150 Val1 Ile Asn Phe Leu Tyr Giu 55 Aia Pro Gin Gin Pro 135 Thr Ser Giy Asp Leu 215 Aia Val 40 Thr Thr Leu Giu Giu 120 Lys Thr Gly Ser Trp 200 Lys Phe 25 Lys Glu Thr Asp Ile 105 Ile Gin Pro Vali Leu 185 Aia Leu 10 Met Lys Arg Thr Thr 90 Lys Lys Asn Leu Asn 170 Ile Giu Leu Asp Gin Ala 75 Ala Gin Gin Ser Met 155 Vali Lys Ile Leu Leu Ser Asn Thr Thr Giu Glu Val1 140 Giy Arg Asn Giu Lys Tyr Ala Ser Glu Gin Ile Thr Ser Lys Ala Lys Phe 205 Leu Ala Pro Thr Gin Lys Lys 110 Lys Pro Lys Phe Ser 190 Ser Phe Leu Met Phe Asn Gin Gin Gin Val Pro Pro 175 Vali His Al a Ala Ser Ser Pro Glu Glu Glu Gin Leu 160 Ser Lys Giu Leu Lys Lys Ala Giu 220 INFORMATION FOR SEQ ID NO:i076: SEQUENCE CHARACTERISTICS: LENGTH: 135 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .135 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1076: Giu Pro Asp Ala Thr Ser Arg Arg Thr Leu Asn His Pro Tyr Phe Gly 10 Val Phe Val Leu Leu Val Phe Thr Phe Trp Val Phe Asn Leu Thr Leu 25 WO 97/37044 WO 9737044PCT/US97/05223 991 
Arg Ile Gin Arg Phe Leu Ser Arg Lys Met Ala GI 40 Lys Leu Lys Leu Ala Pro Tyr Glu Cys Gly Pro Va 55 Pro Asn Arg Val Ser His His Phe Tyr Ile Met Al 70 75 Leu Phe Asp) Val Glu Ile Val Phe Met Phe Pro Tr 90 Lys Lys Leu Gly Leu Phe Gly Leu Val Glu Met Le 100 105 Phe Leu Ala Ile Gly Phe Ile Tyr Ala Leu Lys Ar 115 120 Trp Gin Lys Leu Glu Val Lys 130 135 INFORMATION FOR SEQ ID NO:1077: SEQUENCE CHARACTERISTICS: LENGTH: 635 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1 635 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:i077: an Li Lys Ala Met Ala Gly Asn 125 Asp Ser Leu Asn Gin Gly Gin Leu 125 Arg Giy Thr Lys Leu Leu Ile Phe 110 Ala Asn Pro Trp Glu Asn Gly Ile 110 Ser Thr Lys Glu Gly Lys Phe Asp Val1 Leu Leu Ala Gin Asn Thr Pro Ile Asn Gly Ala Asn 175 Glu Gin Ile Phe Phe Ser Giy Tyr Val Aia Thr Ile Gin Thr Gly Ile 160 Ile Leu Ser Ser Asp Pro Ser Aia Val Asn Asp Ala 1 Ser Gin Thr Asn Thr Ser Lys Thr Giu 145 Ser Ser Aia Ser Gly Ile Thr Ala Thr 130 Pro Thr Thr Val1 Tyr Gly Thr Lys Leu 115 Lys Asn Ser Arg Leu Aia Ile Cys Asn 100 Thr Leu Lys Trp 5 Asn Leu Phe Gin Asn Tyr Ala Asp Lys Asn 165 Leu Ala Thr Thr 70 Ser Ala Asn Phe Leu 150 Ala Leu Leu Ala 55 Phe Tyr Ile Gly Thr 135 Val Thr Asp Asn 40 Cys Asn Tyr Ile Glu 120 Ile Tyr Ile Val 25 Ala Gly Asn Giu Asn 105 Gly Asn Pro Thr 10 Lys Ala Pro Val1 Pro Lys Ile Gly Trp Ala 170 Ala Val1 Gly Pro 75 Gly Ala Pro Asp Ser 155 Pro Arg Asn Gly Ser Gly His Tyr Val1 Lys 140 His Thr WO 97/37044 PCT/US97/05223 992 Asn Thr Thr Asn Ser Ala Gin Glu Leu Leu Lys Gin Ala Ser Ile Ile 180 185 190 Ile Thr Thr Leu Asn Ser Ala Cys Pro Asn Phe Gin Asn Gly Gly Ser 195 200 205 Gly Tyr Trp Ala Gly Ile Ser Gly Asn Gly Thr Met Cys Gly Met Phe 210 215 220 Lys Asn Glu Ile Ser Ala Ile Gin Gly Met Ile Ala Asn Ala Gin Glu 225 230 235 240 Ala Val Ala Gin Ala Lys Ile Val Ser Glu Asn Thr Gin Asn Gin Asn 245 250 255 Ser Leu Asp 
Ala Gly Lys Pro Phe Asn Pro Tyr Thr Asp Ala Ser Phe 260 265 270 Ala Glu Ser Met Leu Lys Asn Ala Gln Ala Gln Ala Glu Ile Leu Asn 275 280 285 Gln Ala Glu Gln Val Val Lys Asn Phe Glu Lys Ile Pro Thr Ala Phe 290 295 300 Val Asn Asp Ser Leu Gly Val Cys Tyr Glu Val Gln Gly Gly Glu Arg 305 310 315 320 Arg Gly Thr Asn Pro Gly Gln Thr Thr Ser Asn Thr Trp Gly Ala Gly 325 330 335 Cys Ala Tyr Val Gly Gln Thr Ile Thr Asn Leu Lys Asn Ser Ile Ala 340 345 350 His Phe Gly Thr Gln Glu Gln Gln Ile Gln Gln Ala Glu Asn Ile Ala 355 360 365 Asp Thr Leu Val Asn Phe Lys Ser Arg Tyr Ser Glu Leu Gly Asn Thr 370 375 380 Tyr Asn Ser Ile Thr Thr Ala Leu Ser Asn Ile Pro Asn Ala Gln Ser 385 390 395 400 Leu Gln Asn Ala Val Ser Lys Lys Asn Asn Pro Tyr Ser Pro Gln Gly 405 410 415 Ile Asp Thr Asn Tyr Tyr Leu Asn Gln Asn Ser Tyr Asn Gln Ile Gln 420 425 430 Thr Ile Asn Gln Glu Leu Gly Arg Asn Pro Phe Arg Lys Val Gly Ile 435 440 445 Val Ser Ser Gln Thr Asn Asn Gly Ala Met Asn Gly Ile Gly Ile Gln 450 455 460 Val Gly Tyr Lys Gln Phe Phe Gly Gln Lys Arg Lys Trp Gly Ala Arg 465 470 475 480 Tyr Tyr Gly Phe Phe Asp Tyr Asn His Ala Phe Ile Lys Ser Ser Phe 485 490 495 Phe Asn Ser Ala Ser Asp Val Trp Thr Tyr Gly Phe Gly Ala Asp Ala 500 505 510 Leu Tyr Asn Phe Ile Asn Asp Lys Ala Thr Asn Phe Leu Gly Lys Asn 515 520 525 Asn Lys Leu Ser Val Gly Leu Phe Gly Gly Ile Ala Leu Ala Gly Thr 530 535 540 Ser Trp Leu Asn Ser Glu Tyr Val Asn Leu Ala Thr Met Asn Asn Val 545 550 555 560 Tyr Asn Ala Lys Met Asn Val Ala Asn Phe Gln Phe Leu Phe Asn Met 565 570 575 Gly Val Arg Met Asn Leu Ala Arg Pro Lys Lys Lys Asp Ser Asp His 580 585 590 Ala Ala Gln His Gly Ile Glu Leu Gly Leu Lys Ile Pro Thr Ile Asn 595 600 605 Thr Asn Tyr Tyr Ser Phe Met Gly Ala Glu Leu Lys Tyr Arg Arg Leu 610 615 620 Tyr Ser Val Tyr Leu Asn Tyr Val Phe Ala Tyr 625 630 635
INFORMATION FOR SEQ ID NO:1078: SEQUENCE CHARACTERISTICS: LENGTH: 174 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii)
HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...174 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1078:
Ile Lys Arg Ile Ile Lys Ser Asn Ala Ser Leu Asn Gln Leu Asn Thr 1 5 10 Thr Arg Tyr Asn Thr Pro Ser His Leu Phe Phe Lys Lys Gly Val Gly 25 Met Ala Thr Ile Gln Pro Phe Asn His Ser Thr Ile Gln Pro Phe Asn 40 His Ser Thr Ile Gln Pro Phe Asn His Ser Ile Ile Gln Ser Phe Asn 55 His Ser Thr Ile Gln Ala Thr Leu Pro Tyr Phe Tyr Asn Tyr Leu Ser 70 75 Phe Tyr Lys Asn Leu Phe Lys Asn Pro Leu Phe Phe Ile Ile Pro Pro 90 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 100 105 110 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 115 120 125 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro 130 135 140 Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Asn Pro Phe Ile Ser Pro 145 150 155 160 Ser Leu Thr His Ala Thr Thr Phe Ser Asn His Leu Ile Pro 165 170
INFORMATION FOR SEQ ID NO:1079: SEQUENCE CHARACTERISTICS: LENGTH: 748 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...748 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1079:
Met Glu Asp Phe Leu Tyr Asn Thr Leu Tyr Phe Ile Glu Asp Tyr Lys 1 5 10 Leu Val Val Ile Phe Ser Phe Ile Gly Leu Ile Ala Leu Phe Phe Leu 25 Tyr Lys Phe Ile Lys Thr Gln Lys Lys Val Phe Lys Asp Lys Ala Asn 40 Gln Pro Gln Lys Lys Lys Ser Phe Lys Glu Ile Ile Ile Asp Gly Leu 55 Lys Glu Arg Val Lys Thr Phe Gly Phe Trp Leu Gln Ala Ile Leu Leu 70 75 Leu Ser Tyr Ser Phe Ile Thr Ser Gly Leu Phe Phe Leu Ile Leu Leu 90 Gly Asn Phe Tyr Asp Asp Asn Arg Leu Pro Glu Ser Asp Asp Asp Leu 100 105 110 Phe Asp Ile Trp Val Tyr Ala Ile Gln Asp Phe Pro Ala Tyr Tyr Phe 115 120 125 Lys Ala Leu Thr Phe Ser Ser Leu Lys Ile Tyr Gly Phe Asn Ile Ser 130 135 140 Leu Val Val Tyr Ser Ser Ile Leu Cys Ser Tyr Ile Phe Ile Thr Phe 145
150 155 160 Phe Val Trp Phe Leu Lys Tyr Leu Thr Arg Thr Arg Asp Ile Gly Ala 165 170 175 Asn Lys Lys Val Asp Asp Leu Phe Gly Ser Ala Ser Trp Glu Thr Glu 180 185 190 Glu Lys Met Ile Lys Ala Lys Leu Ile Thr Pro Asn Asn Lys Lys Arg 195 200 205 Ala Phe Asp Lys Arg Glu Val Ile Val Gly Arg Arg Gly Leu Gly Asp 210 215 220 Phe Ile Ala Tyr Ala Gly Gln Ala Phe Ile Gly Leu Ile Ala Pro Thr 225 230 235 240 Arg Ser Gly Lys Gly Val Gly Phe Ile Met Pro Asn Met Ile Asn Tyr 245 250 255 Pro Gln Asn Ile Val Val Phe Asp Pro Lys Ala Asp Thr Met Glu Thr 260 265 270 Cys Gly Lys Ile Arg Glu Lys Arg Phe Asn Gln Lys Val Phe Ile Tyr 275 280 285 Glu Pro Phe Ser Leu Lys Thr His Arg Phe Asn Pro Phe Ala Tyr Val 290 295 300 Asp Phe Gly Asn Asp Val Val Leu Thr Glu Asp Ile Leu Ser Gln Ile 305 310 315 320 Asp Thr Arg Leu Lys Gly His Gly Met Val Ala Ser Gly Gly Asp Phe 325 330 335 Ser Thr Gln Ile Phe Gly Leu Ala Lys Leu Val Phe Pro Glu Arg Pro 340 345 350 Asn Glu Lys Asp Pro Phe Phe Ser Asn Gln Ala Arg Asn Leu Phe Val 355 360 365 Ile Asn Cys Asn Ile Tyr Arg Asp Leu Met Trp Thr Lys Lys Gly Leu 370 375 380 Glu Phe Val Lys Arg Lys Lys Ile Ile Met Pro Glu Thr Pro Thr Met 385 390 395 400 Phe Phe Ile Gly Ser Met Ala Ser Gly Ile Asn Leu Ile Asp Glu Asp 405 410 415 Thr Asn Met Glu Lys Val Val Ser Leu Met Glu Phe Phe Gly Gly Glu 420 425 430 Glu Asp Lys Ser Gly Asp Asn Leu Arg Ala Leu Ser Pro Ala Thr Arg 435 440 445 Asn Met Trp Asn Asn Phe Lys Thr Met Gly Gly Ala Lys Glu Thr Tyr 450 455 460 Ser Ser Val Gln Gly Val Tyr Thr Ser Ala Phe Ala Pro Tyr Asn Asn 465 470 475 480 Ala Met Ile Arg Asn Phe Thr Ser Ala Asn Asp Phe Asp Phe Arg Arg 485 490 495 Leu Arg Ile Asp Ala Val Ser Ile Gly Val Ile Ala Asn Pro Lys Glu 500 505 510 Ser Thr Ile Val Gly Pro Ile Leu Glu Leu Phe Phe Asn Val Met Ile 515 520 525 Tyr Ser Asn Leu Ile Leu Pro Ile His Asp Pro Gln Cys Lys Arg Ser 530 535 540 Cys Leu Met Leu Met Asp Glu Phe Thr Leu Cys Gly Tyr Leu Glu Thr 545 550 555 560 Phe Val Lys Ala Val Gly Ile Met Ala
Glu Tyr Asn Met Arg Pro Ala 565 570 575 Phe Val Phe Gln Ser Lys Ala Gln Leu Glu Asn Asp Pro Pro Leu Gly 580 585 590 Tyr Gly Arg Asn Gly Ala Lys Thr Ile Leu Asp Asn Leu Ser Leu Asn 595 600 605 Met Tyr Tyr Gly Ile Asn Asn Asp Asn Tyr Tyr Glu His Phe Glu Lys 610 615 620 Leu Ser Lys Val Leu Gly Lys Tyr Thr Arg Gln Asp Val Ser Arg Ser 625 630 635 640 Ile Asp Asp Asn Thr Gly Lys Thr Asn Thr Ser Ile Ser Asn Lys Glu 645 650 655 Arg Phe Leu Met Thr Pro Asp Glu Leu Met Thr Met Gly Asp Glu Leu 660 665 670 Ile Ile Leu Glu Asn Thr Leu Lys Pro Ile Lys Cys His Lys Ala Leu 675 680 685 Tyr Tyr Asp Asp Pro Phe Phe Thr Asp Glu Leu Ile Lys Val Ser Pro 690 695 700 Ser Leu Ser Lys Lys Tyr Lys Leu Gly Lys Val Pro Asn Gln Ala Thr 705 710 715 720 Phe Tyr Asp Asp Leu Gln Ala Ala Lys Thr Arg Gly Glu Leu Ser Tyr 725 730 735 Asp Lys Ser Leu Val Pro Val Gly Ser Ser Glu Leu 740 745
INFORMATION FOR SEQ ID NO:1080: SEQUENCE CHARACTERISTICS: LENGTH: 404 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...404 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1080:
Leu Asn Ala Lys Ser Gln Ser Ser Leu Lys Asn Lys Ile Thr Leu Lys 1 5 10 Asn Lys Leu Asn His Ala Arg Ile Ile Leu Glu Phe Ile Pro Ser Leu 25 Ile Tyr Phe Leu Ile Gln Lys Val Ser Val Leu Lys His Leu Ala Pro 40 Leu Ile His Ile Pro Phe Lys Ala Leu Trp Leu Gly Thr Ala Leu Ser 55 Met Phe Leu Ser Leu Asn Leu Asn Ala Glu Glu Asn Pro Thr Lys Thr 70 75 Glu Pro Lys Pro Ala Lys Gly Val Lys Asn Lys Pro Lys Ser Pro Val 90 Thr Asn Val Met Met Thr Asn Cys Asp Asn Leu Lys Asp Phe Asn Ala 100 105 110 Lys Gln Lys Glu Val Leu Lys Ala Ala Tyr Gln Phe Gly Ser Lys Glu 115 120 125 Asn Leu Gly Tyr Glu Met Ala Gly Ile Ala Trp Lys Glu Ser Cys Ala 130 135 140 Gly Thr Tyr Lys Ile Asn Phe Ser Asp Pro Ser Ala Gly Ile Tyr His 145 150 155 160 Ala Tyr Ile Pro Ser Val Leu Lys Ser Tyr Gly His Asn Asn Ser Pro 165 170
175 Phe Leu Arg Asn Val Met Gly Glu Leu Leu Ile Lys Asp Asp Ala Phe 180 185 190 Ala Ser Glu Val Ala Leu Lys Glu Leu Leu Tyr Trp Lys Thr Arg Tyr 195 200 205 His Asp Asn Leu Lys Asp Met Ile Lys Ser Tyr Asn Lys Gly Ser Arg 210 215 220 Trp Glu Lys Asn Glu Lys Ala Asn Ala Asp Ala Glu Lys Tyr Tyr Glu 225 230 235 240 Glu Ile Gln Asp Arg Ile Arg Arg Leu Lys Glu Ser Lys Ile Phe Asp 245 250 255 Ser Gln Ser Ser Asn Asp Gln Glu Leu Gln Lys Ser Ala Asn Ser Asn 260 265 270 Leu Asp Leu Asp Pro Ile Gly Asn Ala Met Pro Gln Thr Leu Ala Gln 275 280 285 Thr Glu Thr Gln Lys Ser Gln Ile Glu Lys Ser Gln Ile Glu Glu Ala 290 295 300 Gln Thr Gln Lys Ser Gln Glu Met Lys Glu Ala Ala Ser Glu Gln Ala 305 310 315 320 Ile Lys Lys Pro Leu Glu Lys Glu Lys Asp Lys Pro Met Tyr Leu Ala 325 330 335 Gln Ile Asn Ser Ala Asp Phe Ala Pro Ala Lys Lys Ser Pro Lys Lys 340 345 350 Pro Ala Lys Ala Ser Pro Lys Arg Ser Ser Lys Asn Asn Ile Ser Val 355 360 365 Lys Ser Asn Thr Lys Thr Ala Ser Lys Asn Lys Glu Val Cys Lys Asn 370 375 380 Cys Ser Pro Gly Gln Arg Asn Ala Ile Leu Ala Asn His Ile Thr Leu 385 390 395 400 Met Gln Glu Leu
INFORMATION FOR SEQ ID NO:1081: SEQUENCE CHARACTERISTICS: LENGTH: 270 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...270 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1081:
Met Lys Lys Phe Val Val Phe Lys Thr Leu Cys Leu Ser Val Val Leu 1 5 10 Gly Asn Ser Leu Val Ala Ala Glu Gly Ser Thr Glu Val Gln Lys Gln 25 Leu Glu Lys Pro Lys Asp Tyr Lys Ala Val Lys Gly Glu Lys Asn Ala 40 Trp Tyr Leu Gly Ile Ser Tyr Gln Val Gly Gln Ala Ser Gln Ser Val 55 Lys Asn Pro Pro Lys Ser Ser Glu Phe Asn Tyr Pro Lys Phe Pro Val 70 75 Gly Lys Thr Asp Tyr Leu Ala Val Met Gln Gly Leu Gly Leu Thr Val 90 Gly Tyr Lys Gln Phe Phe Gly Glu Lys Arg Trp Phe Gly Ala Arg Tyr 100 105 110 Tyr Gly Phe Met Asp Tyr Gly His Ala Val Phe Gly Ala Asn Ala Leu 115 120
125 Thr Ser Asp Asn Gly Gly Val Cys Lys Leu Asn Glu Pro Cys Ala Thr 130 135 140 Lys Val Gly Thr Met Gly Asn Leu Ser Asp Met Phe Thr Tyr Gly Val 145 150 155 160 Gly Ile Asp Thr Leu Tyr Asn Val Ile Asn Lys Glu Asp Ala Ser Phe 165 170 175 Gly Phe Phe Phe Gly Ala Gln Ile Ala Gly Asn Ser Trp Gly Asn Thr 180 185 190 Thr Gly Ala Phe Leu Glu Thr Lys Ser Pro Tyr Lys His Thr Ser Tyr 195 200 205 Ser Leu Asp Pro Ala Ile Phe Gln Phe Leu Phe Asn Leu Gly Ile Arg 210 215 220 Thr His Ile Gly Gln His Gln Glu Phe Asp Phe Gly Val Lys Ile Pro 225 230 235 240 Thr Ile Asn Val Tyr Tyr Phe Asn His Gly Asn Leu Ser Phe Thr Tyr 245 250 255 Arg Arg Gln Tyr Ser Leu Tyr Val Gly Tyr Arg Tyr Asn Phe 260 265 270
INFORMATION FOR SEQ ID NO:1082: SEQUENCE CHARACTERISTICS: LENGTH: 252 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...252 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1082:
Met Lys Pro Tyr Phe Ser Leu Glu Lys Leu Asp Leu Tyr His Gly Asp 1 5 10 Ala Ser Val Leu Glu Thr Phe Glu Lys Gly Phe Tyr Asp Leu Cys Val 25 Thr Ser Pro Pro Tyr Asn Leu Ser Ile Glu Tyr Gln Gly Ser Asn Asp 40 Phe Arg Ala Tyr Asp Asp Tyr Leu Asn Trp Cys Lys Asn Trp Leu Lys 55 Asn Cys Tyr Phe Trp Gly Lys Glu Gln Ala Arg Leu Cys Leu Asn Val 70 75 Pro Leu Asp Thr Asn Lys His Gly Lys Gln Ser Leu Gly Ala Asp Ile 90 Thr Ile Val Ala Lys Glu Cys Gly Trp Lys Tyr Gln Asn Thr Ile Ile
100 105 110 Trp Asn Glu Ser Asn Ile Ser Arg Arg Thr Ala Trp Gly Ser Trp Leu 115 120 125 Gln Ala Ser Ala Pro Tyr Ala Ile Ala Pro Val Glu Leu Ile Val Val 130 135 140 Phe Tyr Lys Asn Glu Tyr Lys Arg Lys Lys Gln Thr Ser Thr Met Ser 145 150 155 160 Arg Glu Glu Phe Leu Leu Tyr Thr Asn Gly Leu Trp Asn Phe Ser Gly 165 170 175 Glu Ser Lys Lys Arg Leu Lys His Pro Ala Pro Phe Pro Arg Glu Leu 180 185 190 Pro Arg Arg Cys Ile Gln Leu Phe Ser Phe Leu Glu Asp Thr Ile Phe 195 200 205 Asp Pro Phe Ser Gly Ser Gly Thr Thr Ile Leu Glu Ala Asn Ala Leu 210 215 220 Gly Arg Phe Ser Val Gly Leu Glu Ile Glu Lys Glu Tyr Cys Glu Leu 225 230 235 240 Ser Lys Lys Arg Ile Leu Glu Ser Leu Ser Leu Val 245 250
INFORMATION FOR SEQ ID NO:1083: SEQUENCE CHARACTERISTICS: LENGTH: 925 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...925 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1083:
Leu Val Ile Ile Ser Leu Leu Thr Thr Leu Lys Leu 1 Glu Ile Asp Ala Val Ala Lys Asp Glu 145 Lys Thr Ala Glu Gly 225 Leu Phe Gln Ile Asn Leu Gln Lys Asn Ile Val 130 Lys Glu Phe Leu Lys 210 Asp Lys Met Leu Ser Thr Ala Asn Ser Glu Asp 115 Tyr Ala Lys Asp Glu 195 Val Ser Arg Gly Glu 275 Ile Gly Ser Glu Ile Ile 100 Thr Ala Arg Asp Glu 180 Gly Ser Ile Arg Trp 260 Tyr 5 Lys Val Pro Thr Ser Ala Ala Thr Ile Gly 165 Gln Gln Glu Tyr Val 245 Met Asp Lys Glu Lys Ser 70 Tyr Lys Val Phe Ala 150 Leu Lys Gly Gly Ile 230 Ile Trp Ser Phe Ala Glu 55 Gln Val Ile Leu Glu 135 Gly Lys Leu Tyr Ala 215 Lys Glu Gly Leu Ile Leu 40 Thr Ser Gly Arg Ala 120 Asn Val Ser Glu Tyr 200 Leu Gln Ser Leu Arg 280 Leu 25 Glu Pro Asn Leu Val 105 Leu Gly Glu Gln His 185 Gly Leu Ser Leu Asn 265 Ile 10 Ser Asn Lys Gln Ser 90 Gly Phe Ile Ile Met 170 Ala Ser Ile Ile Ser 250 Asp Gln Ser Asp Glu Thr 75 Tyr Asp Asn Leu Lys 155 Gly Lys Thr Val Val Tyr 235 Ala Gly Asp Leu Gly Ala Pro Met Met Gln
Glu 140 Gly Ile Thr Val Phe 220 Glu Asn Lys Val Lys Val Ser Gln Lys Ser Val Gly 125 Phe Tyr Lys Ala Glu 205 Val Val Ser Gln
Arg 270 Met Ile Ala Pro Asn Met Met Ser Phe Phe Thr Gly 175 Lys Arg Asn Asp Arg 255 Leu Arg Lys Cys Asn Glu Lys Leu Lys Lys Asp Glu 160 Asp Thr Thr Arg Lys 240 Asp Asp Arg Gly Tyr Leu Asp Ala His Ile Ser Ser Pro Phe Leu Lys Thr Asp Phe 290 295 300 Ser Thr His Asp Ala Lys Leu His Tyr Lys Val Lys Glu Gly Ile Gln 305 310 315 320 Tyr Arg Ile Ser Asp Ile Leu Ile Glu Ile Asp Asn Pro Val Val Pro 325 330 335 Leu Lys Thr Leu Glu Lys Ala Leu Lys Val Lys Arg Lys Asp Val Phe 340 345 350 Asn Ile Glu His Leu Arg Ala Asp Ala Gln Ile Leu Lys Thr Glu Ile 355 360 365 Ala Asp Lys Gly Tyr Ala Phe Ala Val Val Lys Pro Asp Leu Asp Lys 370 375 380 Asp Glu Lys Asn Gly Leu Val Lys Val Ile Tyr Arg Ile Glu Val Gly 385 390 395 400 Asp Met Val His Ile Asn Asp Val Ile Ile Ser Gly Asn Gln Arg Thr 405 410 415 Ser Asp Arg Ile Ile Arg Arg Glu Leu Leu Leu Gly Pro Lys Asp Lys 420 425 430 Tyr Asn Leu Thr Lys Leu Arg Asn Ser Glu Asn Ser Leu Arg Arg Leu 435 440 445 Gly Phe Phe Ser Lys Val Lys Ile Glu Glu Lys Arg Val Asn Ser Ser 450 455 460 Leu Met Asp Leu Leu Val Ser Val Glu Glu Gly Arg Thr Gly Gln Leu 465 470 475 480 Gln Phe Gly Leu Gly Tyr Gly Ser Tyr Gly Gly Leu Met Leu Asn Gly 485 490 495 Ser Val Ser Glu Arg Asn Leu Phe Gly Thr Gly Gln Ser Met Ser Leu 500 505 510 Tyr Ala Asn Ile Ala Thr Gly Gly Gly Arg Ser Tyr Pro Gly Met Pro 515 520 525 Lys Gly Ala Gly Arg Met Phe Ala Gly Asn Leu Ser Leu Thr Asn Pro 530 535 540 Arg Ile Phe Asp Ser Trp Tyr Ser Ser Thr Ile Asn Leu Tyr Ala Asp 545 550 555 560 Tyr Arg Ile Ser Tyr Gln Tyr Ile Gln Gln Gly Gly Gly Phe Gly Val 565 570 575 Asn Val Gly Arg Met Leu Gly Asn Arg Thr His Val Ser Leu Gly Tyr 580 585 590 Asn Leu Asn Val Thr Lys Leu Leu Gly Phe Ser Ser Pro Leu Tyr Asn 595 600 605 Arg Tyr Tyr Ser Ser Val Asn Glu Val Ala Ser Pro Arg Gln Cys Ser 610 615 620 Thr Pro Ala Ser Val Ile Ile Asn Arg Leu Ser Gly Gly Arg Thr Pro 625 630 635 640 Leu Val Pro Glu Ser Cys Ser Ser Pro Gly Ala Ile Thr Thr Ser Pro 645 650 655 Glu Ile Lys Gly Ile Trp Asp
Arg Asp Tyr His Thr Pro Ile Thr Ser 660 665 670 Ser Phe Thr Leu Asp Val Ser Tyr Asp Asn Thr Asp Asp Tyr Tyr Phe 675 680 685 Pro Arg Asn Gly Val Ile Phe Ser Ser Tyr Ala Thr Met Ser Gly Leu 690 695 700 Pro Ser Ser Gly Thr Leu Asn Ser Trp Asn Gly Leu Gly Gly Asn Val 705 710 715 720 Arg Asn Thr Lys Val Tyr Gly Lys Phe Ala Ala Tyr His His Leu Gln 725 730 735 Lys Tyr Leu Leu Ile Asp Leu Ile Ala Arg Phe Lys Thr Gln Gly Gly 740 745 750 Tyr Ile Phe Arg Tyr Asn Thr Asp Asp Tyr Leu Pro Leu Asn Ser Thr 755 760 765 Phe Tyr Met Gly Gly Val Thr Thr Val Arg Gly Phe Arg Asn Gly Ser 770 775 780 Ile Thr Pro Lys Asp Glu Phe Gly Leu Trp Leu Gly Gly Asp Gly Ile 785 790 795 800 Phe Thr Ala Ser Thr Glu Leu Ser Tyr Gly Val Leu Lys Ala Ala Lys 805 810 815 Met Arg Leu Ala Trp Phe Phe Asp Phe Gly Phe Leu Thr Phe Lys Thr 820 825 830 Pro Thr Arg Gly Ser Phe Phe Tyr Asn Ala Pro Thr Thr Thr Ala Asn 835 840 845 Phe Lys Asp Tyr Gly Val Val Gly Ala Gly Phe Glu Arg Ala Thr Trp 850 855 860 Arg Ala Ser Thr Gly Leu Gln Ile Glu Trp Ile Ser Pro Met Gly Pro 865 870 875 880 Leu Val Leu Ile Phe Pro Ile Ala Phe Phe Asn Gln Trp Gly Asp Gly 885 890 895 Asn Gly Lys Lys Cys Lys Gly Leu Cys Phe Asn Pro Asn Met Asn Asp 900 905 910 Tyr Thr Gln His Phe Glu Phe Ser Met Gly Thr Arg Phe 915 920 925
INFORMATION FOR SEQ ID NO:1084: SEQUENCE CHARACTERISTICS: LENGTH: 191 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...191 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1084:
Met Lys Gln Leu Phe Leu Ile Ile Gly Ala Pro Gly Ser Gly Lys Thr 1 5 10 Thr Asp Ala Glu Leu Ile Ala Lys Asn Asn Ser Glu Thr Ile Ala His 25 Phe Ser Thr Gly Asp Leu Leu Arg Ala Glu Ser Ala Lys Lys Thr Glu 40 Arg Gly Leu Leu Ile Glu Lys Phe Thr Ser Gln Gly Glu Leu Val Pro 55 Leu Glu Ile Val Val Glu Thr Ile Leu Ser Ala Ile Lys Ser Ser Ser 70 75 Lys Gly Ile Ile Leu Ile Asp Gly Tyr Pro Arg
Ser Val Glu Gin Met 90 Gin Ala Leu Asp Lys Glu Leu Asn Ala Gin Asn Glu Val Ile Leu Lys 100 105 110 Ser Val Iie Glu Val Glu Val Ser Glu Asn Thr Ala Lys Glu Arg Val WO 97/37044 PCT/US97/05223 1002 115 120 Leu Gly Arg Ser Arg Gly Ala Asp Asp Asn Glu Ar 130 135 14 Arg Met Arg Val Phe Leu Asp Pro Leu Val Glu Il 145 150 155 Lys Ala Lys His Leu His Lys Ile Ile Asn Gly Gl 165 170 Glu Ile Val Asn Glu Met Gin Lys Tyr Ile Leu Se 180 185 INFORMATION FOR SEQ ID NO:1085: SEQUENCE CHARACTERISTICS: LENGTH: 340 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...340 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1085: 125 g Val Phe His Asn 0 e Gin Asn Phe Tyr 160 u Arg Ser Ile Glu 175 r Phe Ala Asn 190 Leu Ser Ser Leu Trp Leu Thr Asn Pro Leu Asn Ala 1 Gly Thr Val Cys Asn Asp Gly Tyr Ala 145 Tyr Pro Thr Ala Asn Ser Gin Cys Ile Tyr Gly 130 Asn Thr Lys Trp Phe Ala Ala Pro Val Pro Lys 115 Phe Ala Tyr Ala Thr Val Tyr Asn Ala Val Gly 100 His Phe Ile Gly Thr 180 Asn 5 Gly Arg Asp Ser Asn Arg Phe Asp Ser Phe 165 Ala Asn Ile Asn Asp Gly 70 Trp Asn Val Tyr Pro 150 Gly Gly Arg Ser Gly Gly 55 Thr Thr Gin Gly Gly 135 Phe Thr Phe Val Leu Glu 40 Lys Pro Ser Pro Lys 120 His Tyr Asp Phe Gly Glu 25 Leu Val Gly Arg Met 105 Lys Thr Leu Met Val 185 Tyr 10 Val Phe Pro Thr Thr 90 Tyr Arg Asn Ser Leu 170 Gly Phe Gly Gin Asp Pro 75 Met Gly Trp Phe Asp 155 Phe Val Lys Arg Val Gly Gly Leu Leu Phe Ser 140 Gin Asn Asn Asp His Val Pro Gin Tyr Ser Gly Gly 125 Asn Lys Val Phe Gly Glu Asp Phe Thr Thr Thr Val 110 Leu Ser Ala Ile Ala 190 Tyr Lys Gin Gly Gly Lys Asn Met Arg Arg Asp Asp 175 Gly Val Asn Lys Asp Gly Ala Lys Thr Tyr Ala Met 160 Lys Asn Tyr 195 200 205 Gly Val Asn Thr Asp Ala Asp Ala Tyr Met Thr Asn Ala Asp Gly Thr WO 97/37044 WO 9737044PCTIUS97/05223 1003 210 215 220 Ile Thr Cys Gly Asp Thr Thr Pro Ala Ser Cys Asn 225 230 235 Pro Asn Ser Val Tyr Thr Thr Gly Lys Leu Asn 
Ala 245 250 Thr Ilie Phe Gin Phe Leu Val Asn Val Gly Ile Arg 260 265 0Th His His Gly Ile GTh Phe Gly Ile Lys Ile Pro 275 280 Tyr Phe Phe Lys Gly Ser Thr Thr Ile Arg Ala Lys 290 295 300 Leu Giu Asn Gly Asn Pro Thr Thr Ile Thr Gly Ala 305 310 315 Ser Leu Thr Gin Thr Leu Arg Arg Gin Tyr Ser Met 325 330 Val Tyr Thr Phe 340 INFORMA~TION FOR SEQ ID NO:1086: SEQUENCE CHARACTERISTICS: LENGTH: 528 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION .528 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:i086: Val1 Lys Thr Thr 285 Lys 0Th Tyr Gly Val1 Asn 270 Leu Gin Thr Leu Ile Asn 255 Ile Pro Gly Asn Arg 335 Asn 240 His Phe Asn Pro Phe 320 Tyr Met Lys Leu Lys Lys Arg Lys Val Ala Ala Thr Leu Leu Lys Arg Leu 1 Thr Leu Pro Giu Val His Ser Pro Ilie Asn Leu Thr Thr Val Asn Arg Tyr Asp Lys Thr Ser 115 Lys Met Trp 130 Ilie Gly Tyr Leu Gly Asn Gly Val1 Arg 100 Cys Gln Met 5 Leu Asp Pro Lys Gly Trp Gly Gin Tyr Phe Phe Val Leu 70 Gly Ala Thr Gin Met Thr Ile Lys 55 Giu Val Lys Asp Gly 135 Gly Thr Asn 40 Gly Gly Leu Asp Sen 120 Pro GTh Gly 25 Phe Ile Ser Gly Phe 105 Leu Gly Trp 10 Sen Sen Tyr Val Gly 90 Thr Ser Gly Asn Leu Lys Pro His 75 Gin Pro Leu Ile Gly Gly Ala Val Giy Thr 0Th Leu Gly Val Tyr Pro Ser Cys Met 125 Ile Asp 140 Leu Phe Vali Phe Thr Arg Asp Tyr 110 Asn Pro Pro Thn Asn Phe Gly Asn Trp Ala Ang Asn Tyr Ang Val Trp Thr Asp Thr Gly Tyr WO 97/37044 PCT/US97/05223 1004 145 150 155 160 Tyr Pro Ala Asn Ala Tyr Leu Pro Gly His Ser Arg Arg Tyr Glu Val 165 170 175 Tyr Lys Ala Asn Leu Thr Tyr Asp Ser Asp Arg Val His Met Val Met 180 185 190 Gly Arg Phe Asp Val Thr Glu Gin Glu Gin Met Asp Trp Ile Tyr Gin 195 200 205 Leu Phe Gin Gly Phe Tyr Gly Thr Phe Lys Leu Thr Lys Asn Met Lys 210 215 220 Phe Leu Leu Phe Ser Ser Trp Gly Arg Gly Ile Ala Asp Gly Gin Trp 225 230 235 240 Leu Phe Pro Ile Tyr Arg Glu Lys Pro Trp Gly Ile His Lys Ala Gly 245 250 255 Ile Ile Tyr 
Arg Pro Thr Lys Asn Leu Met Ile His Pro Tyr Val Tyr 260 265 270 Leu Ile Pro Met Val Gly Thr Leu Pro Gly Ala Lys Ile Glu Tyr Asp 275 280 285 Thr Asn Pro Glu Phe Ser Gly Arg Gly Ile Arg Asn Lys Thr Thr Phe 290 295 300 Tyr Val Leu Tyr Asp Tyr Arg Trp Asn Asn Ala Glu Tyr Gly Arg Tyr 305 310 315 320 Ala Pro Ala Arg Tyr Asn Thr Trp Asp Pro Phe Leu Asp Asn Gly Lys 325 330 335 Trp Arg Gly Leu Gin Gly Pro Gly Gly Ala Thr Leu Tyr Leu His His 340 345 350 His Ile Asp Ile Asn Asn Tyr Phe Val Val Gly Gly Ala Tyr Leu Asn 355 360 365 Ile Gly Asn Pro Asn Met Asn Leu Gly Thr Trp Gly Asn Pro Val Ala 370 375 380 Leu Asp Gly Ile Glu Gin Trp Val Gly Gly Ile Tyr Ser Leu Gly Phe 385 390 395 400 Ala Gly Ile Asp Asn Ile Thr Asp Ala Asp Ala Phe Thr Glu Tyr Val 405 410 415 Lys Gly Gly Gly Lys His Gly Lys Phe Ser Trp Ser Val Tyr Gin Arg 420 425 430 Phe Thr Thr Ala Pro Arg Ala Leu Glu Tyr Gly Ile Gly Met Tyr Leu 435 440 445 Asp Tyr Gin Phe Ser Lys His Val Lys Ala Gly Leu Lys Leu Val Trp 450 455 460 Leu Glu Phe Gin Ile Arg Ala Gly Tyr Asn Pro Gly Thr Gly Phe Leu 465 470 475 480 Gly Pro Asn Gly Gin Pro Leu Asn Leu Asn Asn Gly Leu Phe Glu Ser 485 490 495 Ser Ala Phe Ala Gin Gly Pro Gin Asn Met Gly Gly Ile Ala Lys Ser 500 505 510 Ile Thr Gin Asp Arg Ser His Leu Met Thr His Ile Ser Tyr Ser Phe 515 520 525 INFORMATION FOR SEQ ID NO:1087: SEQUENCE CHARACTERISTICS: LENGTH: 234 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein WO 97/37044 WO 9737044PCT/US97/05223 1005 (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .234 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1087: Met Thr Lys Ile Ala Met Ala Asn Phe Lys Ser Ala 1 Lys His Asn Asp Leu Leu Gly Arg Giu 145 Ile Leu Leu Ile Giu 225 Ser Phe Ala Cys Lys Lys Lys Giu 130 Asn Trp Thr Tyr Asp 210 Asn His Asp Phe Gly Ile Glu Asn 115 Lys Ile Ala His Gly 195 Ser Phe Ala Arg Leu Ala His Ser 100 Phe Gly Asp Ile Gly 180 Gly Val1 Lys 5 Tyr Val His Phe Thr Pro Lys Phe Leu Gly 
165 Phe Ser Asp Thr Leu Phe Phe Thr 70 Leu Ser Ile Arg Asn 150 Thr Leu Val1 Gly Ile 230 Lys Val1 Thr 55 Gly Leu Phe Val Ala 135 Tyr Lys Lys Asn Leu 215 Ile Glu Phe 40 Leu Giu Ile Leu Tyr 120 Val Ser Lys Gin Ala 200 Leu Sen Leu 25 Pro Gly Ile Gly Lys 105 Cys Lys Asn Ser Ile 185 Gin Ile Phe 10 Giu Asp Ala Thr His 90 Glu Ile Glu Leu Ala 170 Leu Asn Gly Leu Lys Phe Gin Sen 75 Ser Lys Gly Phe Ile 155 Ser Asn Ala Ser Thr Leu Asn Lys Giu Phe Glu Leu 140 Val Leu Gin Lys Ala 220 Met Leu Gly Ala His Arg Asp Asp 125 Sen Ala Giu Lys Giu 205 Ser Pro Lys Leu Tyr Leu Arg Phe 110 Leu Giu Tyr Asp Thr 190 Ile Leu Val1 Pro Leu Pro Giu Val Phe Thr Gln Glu Ile 175 Pro Leu Glu Phe Gin Pro Arg Giu Leu Lys Thr Leu Pro 160 Tyr Leu Gly Leu INFORMATION FOR SEQ ID NO:1088: SEQUENCE CHARACTERISTICS: LENGTH: 465 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacten pyloni (ix) FEATURE: WO 97/37044 PCT/US97/05223 1006 NAME/KEY: misc_feature LOCATION 1...465 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1088: Met Phe Ile Tyr Asp Thr Lys Leu Lys Gin Lys Val Pro Phe Glu Pro 1 5 10 Leu Val His Asn Lys Ala Asn Ile Tyr Val Cys Gly Pro Thr Val Tyr 25 Asp Asp Ala His Leu Gly His Ala Arg Ser Ala Ile Ala Phe Asp Leu 40 Leu Arg Arg Thr Leu Glu Leu Ser Gly Tyr Glu Val Val Leu Val Arg 55 Asn Phe Thr Asp Ile Asp Asp Lys Ile Ile Asn Lys Ala Phe Lys Glu 70 75 Asn Lys Ser Ile Gin Glu Leu Ser Ser Ile Tyr Ile Glu Ser Tyr Thr 90 Arg Asp Leu Asn Ala Leu Asn Val Lys Gin Pro Ser Leu Glu Pro Lys 100 105 110 Ala Ser Glu Tyr Leu Asp Ala Met Val Arg Met Ile Glu Thr Leu Leu 115 120 125 Glu Lys Asn Phe Ala Tyr Arg Val Ser Asn Gly Asp Ile Tyr Leu Asp 130 135 140 Thr Ser Lys Asp Lys Asp Tyr Gly Ser Leu Ser Met His Asn Ser Ser 145 150 155 160 Val Glu Phe Ser Arg Ile Gly Leu Val Gin Glu Lys Arg Leu Glu Gin 165 170 175 Asp Phe Val Leu Trp Lys Ser Tyr Lys Gly Asp Asn Asp Val Gly Phe 180 185 190 Asp Ser Pro Leu Gly Lys Gly Arg Pro Gly Trp His Ile Glu 
Cys Ser 195 200 205 Ser Met Val Phe Glu Thr Leu Ala Leu Ala Asn Ala Pro Tyr Gin Ile 210 215 220 Asp Ile His Ala Gly Gly Thr Asp Leu Leu Phe Pro His His Glu Asn 225 230 235 240 Glu Ala Cys Gin Thr Arg Cys Ala Phe Gly Val Glu Ile Ala Lys Tyr 245 250 255 Trp Met His Asn Gly Phe Val Asn Ile Asn Asn Glu Lys Met Ser Lys 260 265 270 Ser Leu Gly Asn Ser Phe Phe Ile Lys Asp Ala Leu Lys Asn Tyr Asp 275 280 285 Gly Glu Ile Leu Arg Asn Tyr Leu Leu Gly Val His Tyr Arg Ser Val 290 295 300 Leu Asn Phe Asn Glu Glu Asp Leu Leu Met Ser Lys Lys Arg Leu Asp 305 310 315 320 Lys Ile Tyr Arg Leu Lys Gin Arg Val Leu Gly Thr Leu Gly Gly Ile 325 330 335 Asn Pro Asn Phe Lys Lys Glu Ile Leu Glu Cys Met Gin Asp Asp Leu 340 345 350 Asn Val Ser Lys Ala Leu Ser Val Leu Glu Ser Met Leu Ser Ser Thr 355 360 365 Asn Glu Lys Leu Asp Gin Asn Pro Lys Asn Lys Ala Leu Lys Gly Glu 370 375 380 Ile Leu Ala Asn Leu Lys Phe Ile Glu Glu Leu Leu Gly Ile Gly Phe 385 390 395 400 Lys Asp Pro Ser Ala Tyr Phe Gin Leu Gly Val Ser Glu Ser Glu Lys 405 410 415 WO 97/37044 WO 9737044PCTJUS97/05223 Gin Lys Lys Phe 465 1007 Glu le Giu Asn Lys Ile Glu Glu Arg Lys Arg Ala Lys GTh Gin 420 425 430 Asp Phe Leu Lys Ala Asp Ser Ile Arg Giu Giu Leu Leu Gin Gin 435 440 445 Ile Ala Leu Met Asp Thr Pro Gin Gly Thr Ile Trp Glu Lys Leu 450 455 460 INFORMATION FOR SEQ ID NO:1089: SEQUENCE CHARACTERISTICS: LENGTH: 1213 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pyloni (ix) FEATURE: NAME/KEY: misc feature LOCATION 1. 
.1213 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1089: Met Lys Gin Phe Lys Lys Lys Pro Lys Lys Ile Lys 1 Asn Ile Gly Trp Phe Ala Lys Gly Ala 145 Tyr Val1 Thr Ile Gin Gly Leu Tyr Thr Asn Asn Gly 130 Thr Pro Asn His Asn 210 Lys Gly Sen Ala Gly Tyr Asp 115 Giu Phe Asn Asn Thr 195 Ser Thr Phe Trp Ile Asn Tyr 100 Asn Tyr Asn Gly Ser 180 Gly Asn 5 Ile Ala Gly Trp Gin His Gly Asn Leu His 165 Vali Thr Ile Leu Ser Glu Sen 70 Leu Ser Thr Gly Gly 150 Thn Giu Ala Ser Lys Gly Lys 55 Cys Ile Gin Tyr Gly 135 Ala Asp Val Thr Ala 215 Ang Val1 40 Ser Asp Thr Asn Phe 120 Asn Ser Val1 Gly Leu 200 Tyr Pro 25 Tyn Gin Lys Lys Asn 105 Leu Leu Ser Thr Asn 185 Asn Lys 10 Leu Ala Lys Trp Thr 90 Gin Ser Asp Gly Phe 170 Arg Leu Thr Trp Asp Val Glu 75 Tnp Asp Gly Ile Asn 155 Ser Val Asn Ser Leu Gly Cys Giu Ala Ile Leu Glu 140 Ser Ala Gly Ala Gin 220 Ang Met Thr Val Lys Gly Thr Tyr 125 Leu Phe Gly Sen Asn 205 Val Ser Pro Asp His Thn Gly Ala Asn Gly Thr Thr Gly 190 Lys Asn His Leu Ile Arg Gin Asn Asn Tyr Ser S er Ile 175 Ala Vali Val Gin Leu Leu Pro Gin Ala Leu Thr Asn Trp 160 Asn Gly Thr Gly WO 97/37044 PCT/US97/05223 1008 Asn Ala Asn Ser Val Ile Thr Ile Asn Ser Val Ser Leu Asn Gly Asp 225 230 235 240 Thr Cys Ser Ser Leu Ala Arg Val Gly Val Gly Ala Asn Cys Ser Thr 245 250 255 Ser Gly Pro Ser Tyr Ser Phe Lys Gly Thr Thr Asn Ala Thr Asn Thr 260 265 270 Thr Phe Ser Asn Ser Ser Gly Ser Phe Thr Phe Glu Glu Asn Ala Thr 275 280 285 Phe Ser Gly Ala Lys Leu Asn Gly Gly Ala Phe Thr Phe Asn Lys Lys 290 295 300 Phe Asn Ala Thr Asn Asn Thr Ala Phe Asn Ser Gly Ser Phe Thr Phe 305 310 315 320 Lys Gly Thr Ser Ser Phe Asn Gly Ala Asn Phe Ser Asn Ala Ser Tyr 325 330 335 Thr Phe Asn Asn Gin Ala Thr Phe Gin Asn Ser Ser Phe Asn Gly Gly 340 345 350 Thr Phe Thr Phe Asn Asp Gin Thr Asn Gin Ser Thr Gin His Pro Gin 355 360 365 Ile Gin Asn Ser Ser Phe Ser Gly Ser Ala Thr Thr Leu Lys Gly Phe 370 375 380 Ala Thr Phe Glu Gin Ala Phe Asn Asn Ser Asn His Gin Leu Thr Ile 385 390 395 400 Gin Asn Ala Ser Phe Asn Asn Ala Thr Phe Asn Asn 
Thr Gly Lys Ile 405 410 415 Thr Ile Glu Lys Asp Ala Ser Phe Asn Asn Thr Ser Phe Asn Thr Pro 420 425 430 Val Asp Thr Asn Asn Met Thr Ile Ser Gly Gly Val Thr Leu Ser Gly 435 440 445 Lys Asn Asp Leu Lys Asn Gly Ala Thr Leu Asp Phe Gly Ser Ser Lys 450 455 460 Ile Thr Leu Thr Gin Gly Thr Thr Phe Asn Leu Thr Ser Leu Gly Ser 465 470 475 480 Glu Lys Ser Val Thr Ile Leu Asn Ser Arg Gly Gly Ile Thr Tyr Asn 485 490 495 His Leu Leu Asn His Ala Ile Asn Ser Leu Thr Asn Ala Leu Lys Thr 500 505 510 Asn Glu Ser Ser Ser Lys Pro Gin Ser Phe Ala Gin Gly Leu Trp Asp 515 520 525 Met Ile Thr Tyr Asn Gly Val Thr Gly Gin Leu Leu Asn Glu Asn Ala 530 535 540 Ala Thr Ser Lys Pro Thr Asp Ser Ser Pro Ser Lys Ser Ser Thr Asn 545 550 555 560 Ser Thr Gin Val Tyr Gin Val Gly Tyr Lys Ile Gly Asp Thr Ile Tyr 565 570 575 Lys Leu Gin Glu Thr Phe Ser His Asn Ser Ile Ile Ile Gin Ala Leu 580 585 590 Glu Ser Gly Thr Tyr Thr Pro Pro Pro Val Ile Asn Gly Ser Lys Phe 595 600 605 Asp Leu Ser Ala Ser Asn Tyr Ile Asn Ala Asp Met Pro Trp Tyr Asn 610 615 620 His Lys Tyr Tyr Ile Pro Lys Ser Gin Asn Phe Thr Glu Ser Gly Thr 625 630 635 640 Tyr Tyr Leu Pro Ser Val Gin Ile Trp Gly Ser Tyr Thr Asn Ser Phe 645 650 655 Lys Gin Thr Phe Ser Ala Ser Asn Ser Asn Leu Val Ile Gly Tyr Asn 660 665 670 Ala Thr Trp Thr Asp His Asn Val Ser Ser Ser Asp Thr Val Ala Phe WO 97/37044 WO 9737044PCT1US97/05223 1009 675 680 Gly Asp Thr Ser Gly Ser Ala Leu Asn 690 695 Tyr Tyr Gin Cys Thr Gly Thr Thr Asn 705 710 Vai Tyr Ile Thr Ala Asn Leu Arg Ser 725 Gly Ala Ala Asn Leu Ile Phe Asn Gly 740 745 Asn Ala Thr Ile Thr Gin His Asn Ala 755 760 Thr Phe Ser Thr Gin Asn Met Asp Asn 770 775 Asn Ser Asn Gly Lys Leu Leu Val Tyr 785 790 Ala Lys Asp Gly Lys Phe Ile Phe Asn 805 Asn Thr Asn Phe Asn Gly Gly Ser Tyr 820 825 Asn Phe Ser Asn Asn Asn Gin Phe Asn 835 840 Ala Lys Asn Thr Ile Phe Asn Asn Ala 850 855 Phe Asn Phe Asn Asn Ser Ser Ala Thr 865 870 Thr Asn Ala Asn Ser Asn Leu Gin Ile 885 Asn Ser Thr Asn Gly Ser Gin Asn Thr 900 905 Ser Val Asn Ile Ala Gly 
Asn Ala Thr 915 920 Ser Pro Thr Asn Thr Ser Val Lys Gly 930 935 Thr Leu Lys Asn Leu Asn Ala Pro Leu 945 950 Val Phe Ser Ala His Ser Val Ile Asn 965 Gly Asn Pro Ile Thr Leu Val Ser Ser 980 985 Asp Ala Phe Ser Lys Asn Leu Trp Gin 995 1000 Gly Ala Ser Ser Giu Lys Leu Val Ser 1010 1015 Asp Val Val Tyr Ser Phe Asn Asn Gin 1025 1030 Phe Ser Pro Asn Ser Ile Ser Ile Arg 1045 Phe Asp Tyr Val Asp Met Giu Lys Ser 1060 1065 Ala Leu Gly Phe Met Thr Tyr Met Pro 1075 1080 Gly Asn Leu Asn Asn Thr Ile Tyr Tyr 1090 1095 Tyr Ala Ser Gly Lys Thr Leu Phe Thr 1105 1110 685 Gly His Cys Gly Pro Trp Pro 700 Gly Thr Tyr Ser Ala Tyr His 715 720 Gly Asn Arg Ile Gly Thr Gly 730 735 Val Asp Ser Ile Asn Ile Ala 750 Gly Ala Tyr Ser Ser Ser Met 765 Ser Gin Asn Leu Asn Gly Leu 780 Gly Thr Thr Phe Thr Asn Gin 795 800 Ala Gly Gin Ala Thr Phe Giu 810 815 Gin Phe Ser Gly Asp Ser Leu 830 Ser Gly Ser Phe Giu Ile Gly 845 Asn Phe Asn Asn Ser Thr Ser 860 Thr Ser Phe Val Gly Asp Phe 875 880 Ala Gly Asn Ala Val Phe Gly 890 895 Ala Asn Phe Asn Asn Thr Gly 910 Phe Asp Asn Val Val Phe Asn 925 Lys Val Thr Leu Asn Asn Ile 940 Ser Phe Gly Asp Gly Thr Ile 955 960 Ile Gly Glu Ala Ile Thr Asn 970 975 Ser Lys Ala Ile Giu Tyr Asn 990 Leu Ile Asn Tyr Gin Gly His 1005 Ser Ala Gly Asn Gly Val Tyr 1020 Thr Tyr Asn Phe Gin Giu Val 1035 1040 Arg Leu Gly Val Gly Met Val 1050 1055 Asp Arg Leu Tyr Tyr Gin Asn 1070 Asn Ser Tyr Asn Asn Asn Leu 1085 Tyr Asp Asn Ser Ile Asp Phe 1100 Lys Ala Giu Phe Ser Gin Thr 1115 1120 Phe Thr Gly Gin Asn Ser Ala Ile Val Phe Gly Ala Lys Asn Ile Trp 1125 1130 1135 WO 97/37044 PCT/US97/05223 1010 Thr Ser Val Ser Asp Ala Pro Gin Ser Asn Val Ile lie Arg Phe Gly 1140 1145 1150 Asp Asn Lys Gly Ala Gly Ser Asn Asp Ala Ser Gly His Cys Trp Asn 1155 1160 1165 Leu Gin Cys Ile Gly Phe Ile Thr Gly His Tyr Glu Ala Gin Lys Ile 1170 1175 1180 Tyr Ile Thr Gly Ser Ile Glu Ser Gly Asn Arg Ile Ser Ser Gly Gly 1185 1190 1195 1200 Ala Arg Ala Leu Ile Leu Thr Gly Phe Lys Ala Phe Phe 1205 1210 INFORMATION FOR SEQ ID 
NO:1090:
SEQUENCE CHARACTERISTICS: LENGTH: 500 amino acids TYPE: amino acid TOPOLOGY: linear
(ii) MOLECULE TYPE: protein
(iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...500 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1090: Met Ala Glu Trp Lys Thr Asp Thr Glu Glu Val Lys Lys Val Val Gly 1 5 10 Arg Cys Arg Glu Phe Lys Arg Ser Leu Gin Glu Glu Lys Cys Ser Pro 25 Phe Ile Lys Asp Leu Asp Ser Tyr Ala Leu Lys Ile Ile Val Glu Arg 40 Arg Lys Thr Glu Met Gln Leu Glu Lys Ala Ile Gly Glu Leu Lys Lys 55 Ala Lys Ser Asn Glu Asp Asp Ala Lys Val Ala Leu Arg Val Leu Gin 70 75 Gly Ala Ser Val Val Ser Trp Ile Trp Pro Pro Ala Arg Ile Ala Ala 90 Thr Ala Ala Ile Val Ala Ala Glu Ala Val Leu Lys Phe Met Lys Glu 100 105 110 Asp Thr Glu Lys Cys Lys Arg Asn Val Glu Leu Leu Glu Arg Met Leu 115 120 125 Glu Ile Tyr Ser Asn Gin Ala Lys Ala Ser Ala Asn Leu Met Asn Gin 130 135 140 Ala Trp Glu Gly Ile Lys Lys Arg Leu His Phe Tyr Thr Asp Lys His 145 150 155 160 Gin Glu Phe Ile Arg Arg Leu Lys Gin Ala Ser Asp Ala Ile Asp Asn 165 170 175 Glu Tyr Asn Phe Pro Thr Pro Gly Val Leu Leu Glu Tyr Asp Phe Glu 180 185 190 Arg Pro Ala Ile Ser Tyr Thr Pro Lys Lys Ser Val Phe Asn Glu Arg 195 200 205 WO 97/37044 PCT/US97/05223 1011 Leu Lys Asp Leu Arg Glu Asn Phe Ser Lys 225 Met Met Met Met Ala 305 Thr Ala Ser Lys Ile 385 Trp Glu Gin Glu Asp 465 Leu Gin 210 Asp Ile Gly Ala Pro 290 Lys Glu Phe Gin Ala 370 Val Arg Asp Ala Gln 450 Ala 3ly Lys Lys Ile Ala Phe Ala Tyr 260 Ile Ser 275 Ser Val Asn Arg Lys Ile Asp Asn 340 Lys Ile 355 Leu Glu Asp Glu Trp Ala Leu Asn 420 Thr Leu 435 Ile Ala Asn Val Val Lys Gin Asn 500 His Arg 245 Ser Lys Leu Cys Lys 325 Leu Asp Tyr Lys Glu 405 Lys Lys Lys Leu His 485 His 230 SGlu Tyr Glu Gly Val 310 Glu Glu Pro Arg Asn 390 Phe Thr Asp Gly Gly 470 Asn Gin Asp Gin Val 295 Lys Ser Thr Val Glu 375 Pro Asp Ala Asn Phe 455 Asn Ala Glu Glu Glu 280 Pro Asn Pro Glu Leu 360 Phe Tyr Ser Cys Asp 440 Ile Leu Leu Phe Asn 265 Leu Ser Phe Asn Leu 345 Glu Leu Pro Val Ala 425 Leu Pro Ala AlE Ser Glu 250 Pro Glu Tyr Lys Asp 330 Glu Arg Glu Glu Phe 410 His Gly Arg Leu Ser Asn 235 SLys 
Asn SLys Asn Glu 315 Ser Arg Asn Ser Glu 395 Ser His Phe Gly Val 475 Leu 220 Asp Ser Asp Ser Glu 300 Ala Asn Ala Glu Arg 380 Val Ala Ala Asp Tyr 460 Arg Tyr Asp Leu Glu Leu 285 Ser Leu Ala Thr Asn 365 Lys Ser Ile Leu Ala 445 Leu Glu Ala Leu Glu Leu 270 Glu Leu Glu Ile Glu 350 Tyr Glu Phe Val Lys 430 Thr Trp Glu Asp Glu Asp 255 Asp Asp Thr Gly Asn 335 Asn Thr Ser Asn Pro 415 Ala Glu His Leu Leu Arg 240 Trp Arg Leu Leu Phe 320 Glu Leu Gin Phe Glu 400 Leu Leu Leu Phe Leu 480 Leu Thr Lys Gly Tyr Ser Leu Trp Thr Glu Phe 495

INFORMATION FOR SEQ ID NO:1091:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1091:
TTAACCATGG TGAAAAGCGA TA 22

INFORMATION FOR SEQ ID NO:1092:
SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1092:
TAGAATTCGC ATAACGATCA ATC 23

INFORMATION FOR SEQ ID NO:1093:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1093:
TTAACCATGG TGAAAAGCGA TA 22

INFORMATION FOR SEQ ID NO:1094:
SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1094:
TAGAATTCGC ATAACGATCA ATC 23

INFORMATION FOR SEQ ID NO:1095:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1095:
ATATCCATGG TGAGTTTGAT GA 22

INFORMATION FOR SEQ ID NO:1096:
SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1096:
ATGAATTCAA TTTTTTATTT TGCCA

INFORMATION FOR SEQ ID NO:1097:
SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1097:
AATTCCATGG CTATCCAAAT CCG 23

INFORMATION FOR SEQ ID NO:1098:
SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1098:
ATGAATTCGC CAAAATCGTA GTATT

INFORMATION FOR SEQ ID NO:1099:
SEQUENCE CHARACTERISTICS: LENGTH: 24 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1099:
GATACCATGG AATTTATGAA AAAG 24

INFORMATION FOR SEQ ID NO:1100:
SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...25
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1100:
TGAATTCGAA AAAGTGTAGT TATAC

INFORMATION FOR SEQ ID NO:1101:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1101:
CCCTTCATTT TAGAAATCG 19

INFORMATION FOR SEQ ID NO:1102:
SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1102:
ATTTCAACCA ATTCAATGCG

INFORMATION FOR SEQ ID NO:1103:
SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1103:
GCCCCTTTTG ATTTGAAGCT

INFORMATION FOR SEQ ID NO:1104:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1104:
TCGCTCCAAG ATACCAAGAA GT 22

INFORMATION FOR SEQ ID NO:1105:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1105:
CTTGAATTAG GGGCAAAGAT CG 22

INFORMATION FOR SEQ ID NO:1106:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1106:
ATGCGTTTTT ACCCAAAGAA GT 22

INFORMATION FOR SEQ ID NO:1107:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1107:
ATAACGCCAC TTCCTTATTG GT 22

INFORMATION FOR SEQ ID NO:1108:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1108:
CTTTGGGTAA AAACGCATC 19

INFORMATION FOR SEQ ID NO:1109:
SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1109:
CGATCTTTGA TCCTAATTCA

INFORMATION FOR SEQ ID NO:1110:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1110:
ATCAAGTTGC CTATGCTGA 19

INFORMATION FOR SEQ ID NO:1111:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1111:
TTGAACACTT TTGATTATGC GG 22

INFORMATION FOR SEQ ID NO:1112:
SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1112:
GGATTATGCG ATTGTTTTAC AAG 23

INFORMATION FOR SEQ ID NO:1113:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1113:
GTCTTTAGCA AAAATGGCGT C 21

INFORMATION FOR SEQ ID NO:1114:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1114:
AATGAGCGTA AGAGAGCCTT C

INFORMATION FOR SEQ ID NO:1115:
SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1115:
CTTATGGGGG TATTGTCA 18

INFORMATION FOR SEQ ID NO:1116:
SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1116:
AGCATGTGGG TATCCAGC 18

INFORMATION FOR SEQ ID NO:1117:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1117:
AGGTTGTTGC CTAAAGACT 19

INFORMATION FOR SEQ ID NO:1118:
SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1118:
CTGCCTCCAC CTTTGATC 18

INFORMATION FOR SEQ ID NO:1119:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1119:
ACCAATATCA ATTGGCACT 19

INFORMATION FOR SEQ ID NO:1120:
SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1120:
ACTTGGAAAA GCTCTGCA 18

INFORMATION FOR SEQ ID NO:1121:
SEQUENCE CHARACTERISTICS: LENGTH: 19 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...19
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1121:
CTTGCTTGTC ATATCTAGC 19

INFORMATION FOR SEQ ID NO:1122:
SEQUENCE CHARACTERISTICS: LENGTH: 18 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...18
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1122:
GTTGAAGTGT TGGTGCTA 18

INFORMATION FOR SEQ ID NO:1123:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1123:
CAAGCAAGTG GTTTGGTTTT AG 22

INFORMATION FOR SEQ ID NO:1124:
SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1124:
TGGAAAGAGC AAATCATTGA AG 22

INFORMATION FOR SEQ ID NO:1125:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1125:
GCCCATAATC AAAAAGCCCA T 21

INFORMATION FOR SEQ ID NO:1126:
SEQUENCE CHARACTERISTICS: LENGTH: 24 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...24
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1126:
CTAAAACCAA ACCACTTGCT TGTC 24

INFORMATION FOR SEQ ID NO:1127:
SEQUENCE CHARACTERISTICS: LENGTH: 16 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...16
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1127:
GTAAAACGAC GGCCAG 16

INFORMATION FOR SEQ ID NO:1128:
SEQUENCE CHARACTERISTICS: LENGTH: 17 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...17
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1128:
CAGGAAACAG CTATGAC 17

INFORMATION FOR SEQ ID NO:1129:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1129:
TTGCCCCATC GTATTGATAG A 21

INFORMATION FOR SEQ ID NO:1130:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1130:
AGAGCGTATT TCACCCGAAA G 21

INFORMATION FOR SEQ ID NO:1131:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1131:
TCTTGCATCT TAATCCACTC C 21

INFORMATION FOR SEQ ID NO:1132:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic)
(iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori
(ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1132:
CGGGTCAAAA CGACCACTTA A 21

INFORMATION FOR SEQ ID NO:1133:
SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular
(ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1133: AATCCGTTTC GCTAATTTAG T 21
INFORMATION FOR SEQ ID NO:1134: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1134: AACACTTCAA TTTCCTCTAT A 21
INFORMATION FOR SEQ ID NO:1135: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1135: TGGTATAAGG ATTTGAATGG A
INFORMATION FOR SEQ ID NO:1136: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1136: TTGACTAAAC ACATGCGAGA A 21
INFORMATION FOR SEQ ID NO:1137: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1137: ATAGAGAGCG TTGTGTTTAG C 21
INFORMATION FOR SEQ ID NO:1138: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1138: CCTTTATTGG TTTTGATCGT G 21
INFORMATION FOR SEQ ID NO:1139: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1139: ATGTCCGTTG TCTGTATGGA A 21
INFORMATION FOR SEQ ID NO:1140: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1140: TAGGGTGTCT AGGGATTTGA T 21
INFORMATION FOR SEQ ID NO:1141: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1141: GCGTTTGGCT TCTTCGTTGT C 21
INFORMATION FOR SEQ ID NO:1142: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1142: GAAATGGAAA ATAGCGGTCA A 21
INFORMATION FOR SEQ ID NO:1143: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid
STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1143: GCAAATCCCC AGCCACTTCC
INFORMATION FOR SEQ ID NO:1144: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1144: GTGGCTAAAA ATGAGGGCTT
INFORMATION FOR SEQ ID NO:1145: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1145: GTTAGGAAAT TAGAAATCAT TG 22
INFORMATION FOR SEQ ID NO:1146: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1146: GCTAAAACTT CATCGCTCAA T 21
INFORMATION FOR SEQ ID NO:1147: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1147: GTTGGGCAGA AAATAAGGTG A 21
INFORMATION FOR SEQ ID NO:1148: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1148: CAAACAAACC TGACAAGAAA C
INFORMATION FOR SEQ ID NO:1149: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1149: CATTGATGCC TAAAACTTCG
INFORMATION FOR SEQ ID NO:1150: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1150: CGTGGTGGTT TTCCCGTTAG
INFORMATION FOR SEQ ID NO:1151: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1151: GGGGCATTGT GTTTGTTTTT
INFORMATION FOR SEQ ID NO:1152: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1152: TGGTCTATCA TGCGAATTAT
INFORMATION FOR SEQ ID NO:1153: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1153: GCCCTGATCC ATTCCCCCCT
INFORMATION FOR SEQ ID NO:1154: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1154: CGCGCTAGAG GCTTGTAAAA
INFORMATION FOR SEQ ID NO:1155: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1155: GCCCTGATCC ATTCCCCCCT
INFORMATION FOR SEQ ID NO:1156: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1156: CTGTTTTTAG CGTCCCTGTA
INFORMATION FOR SEQ ID NO:1157: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1157: GGCGTTATTA AGCGACATCG
INFORMATION FOR SEQ ID NO:1158: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1158: TTTCACCGGC AATTTTAGGC
INFORMATION FOR SEQ ID NO:1159: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1159: GCGTTTTGAT TCTGTCTGTT A 21
INFORMATION FOR SEQ ID NO:1160: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1160: GTAAAAACAC CGCTAACGCA T
INFORMATION FOR SEQ ID NO:1161: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1161: GCGTGTTTTC TAAGGGTTCA
INFORMATION FOR SEQ ID NO:1162: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1162: GGAATTTTAA CGCTCTTTTT
INFORMATION FOR SEQ ID NO:1163: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1163: AAATCTCTGT GGGCTTAGTG
INFORMATION FOR SEQ ID NO:1164: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1164: AATCAAAAAC AAGAGCGTGG
INFORMATION FOR SEQ ID NO:1165: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1165: GCCCCAGCCC CATAATACAA A 21
INFORMATION FOR SEQ ID NO:1166: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1166: GGAGGGCGCA ATTAAACATC G
INFORMATION FOR SEQ ID NO:1167: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1167: TCACGCTTTC TAAATCATCA
INFORMATION FOR SEQ ID NO:1168: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1168: CGCTAATCAC ATCCTTTCTT
INFORMATION FOR SEQ ID NO:1169: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1169: TGCCCAAAAA TCCACTAACG
INFORMATION FOR SEQ ID NO:1170: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1170: AACGGGTTTG ACACTGATGA
INFORMATION FOR SEQ ID NO:1171: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1171: GAATGCGGTG GTTTTAGAGA G 21
INFORMATION FOR SEQ ID NO:1172: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1172: GCGTTTTTAA GACTGAATAC A 21
INFORMATION FOR SEQ ID NO:1173: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1173: GGGCGATGTG ATTGGCGATT
INFORMATION FOR SEQ ID NO:1174: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1174: GGATAGCCTG CCAAAACGCC
INFORMATION FOR SEQ ID NO:1175: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1175: AAGTTTATGC GGGCGAGATT
INFORMATION FOR SEQ ID NO:1176: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1176: GGAGCAATCA GCCATTTTTC
INFORMATION FOR SEQ ID NO:1177: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1177: CTAGCGATTC AAGGCGATGG
INFORMATION FOR SEQ ID NO:1178: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1178: CGGCCTCCTT CAAACACATT
INFORMATION FOR SEQ ID NO:1179: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1179: AGCGGGCAGT TTAGGACCAC
INFORMATION FOR SEQ ID NO:1180: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1180: GCATTGATCG CATTTTTAGC C 21
INFORMATION FOR SEQ ID NO:1181: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1181: ACGGGTTAGC AGGGCAGAAT
INFORMATION FOR SEQ ID NO:1182: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1182: CAAAAGAGGC GGGTTCATGC
INFORMATION FOR SEQ ID NO:1183: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1183: TTTAGAAGTC GTTGATGAGA
INFORMATION FOR SEQ ID NO:1184: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1184: CATACACGCT CACTTCATCG
INFORMATION FOR SEQ ID NO:1185: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1185: AGTGTGGTCG CCTGTGGTGG AG
INFORMATION FOR SEQ ID NO:1186: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1186: CCCCTAATAG TCTGTCAATC AT 22
INFORMATION FOR SEQ ID NO:1187: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1187: CAAAAGATTG AAGCAGAAGA GT 22
INFORMATION FOR SEQ ID NO:1188: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1188: AATGGTTTTC CTATACCCTT GA 22
INFORMATION FOR SEQ ID NO:1189: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1189: GAGAGCAAAT CCTTATCCAG
INFORMATION FOR SEQ ID NO:1190: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1190: CATACACGCT CACTTCATCG
INFORMATION FOR SEQ ID NO:1191: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1191: AGTGTGGTCG CCTGTGGTGG AG 22
INFORMATION FOR SEQ ID NO:1192: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1192: CCCCTAATAG TCTGTCAATC AT 22
INFORMATION FOR SEQ ID NO:1193: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1193: CAAAAGATTG AAGCAGAAGA GT 22
INFORMATION FOR SEQ ID NO:1194: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1194: AATGGTTTTC CTATACCCTT GA
INFORMATION FOR SEQ ID NO:1195: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1195: TTGAAACCCC AAAAGTTTTA C 21
INFORMATION FOR SEQ ID NO:1196: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1196: TCAACTGATA GGTAATATCC C 21
INFORMATION FOR SEQ ID NO:1197: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1197: GCTAGGATTT ATGCCAATTT A 21
INFORMATION FOR SEQ ID NO:1198: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1198: TACGAGACAA AATAGGGATT T 21
INFORMATION FOR SEQ ID NO:1199: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1199: ATAGGCATGC AGAATTTTTC C 21
INFORMATION FOR SEQ ID NO:1200: SEQUENCE CHARACTERISTICS: LENGTH: 17 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...17 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1200: CCATTACATT TCGCCTC 17
INFORMATION FOR SEQ ID NO:1201: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1201: CGATAGATAT TGTAGAAGTC A 21
INFORMATION FOR SEQ ID NO:1202: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1202: GGGCTTGTAT TCATTTTGTA A 21
INFORMATION FOR SEQ ID NO:1203: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
WO 97/37044 PCT/US97/05223 1061 (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1203: GTTTTAAAAA CGCCATAGCC A 21 INFORMATION FOR SEQ ID NO:1204: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1204: TTCTAAAAGG TGGTAATCTT C 21 INFORMATION FOR SEQ ID NO:1205: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: WO 97/37044 PCT/US97/05223 1062 NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1205: TAAGTCAAGC CATAAAACCA AA 22 INFORMATION FOR SEQ ID NO:1206: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1206: TTTTGGGGTA AAAAGGCTGA A 21 INFORMATION FOR SEQ ID NO:1207: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1207: ATCTTTTTGC CCTTGCTCAT A 21 WO 97/37044 PCT/US97/05223 1063 INFORMATION FOR SEQ ID NO:1208: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) 
MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1208: AGACAGCACC AGTTTGATAA A
INFORMATION FOR SEQ ID NO:1209: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1209: CAGCCACACT TCAATGTCTA T 21
INFORMATION FOR SEQ ID NO:1210: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1210: GTAAGGCGTT AGAAAAATAC C 21
INFORMATION FOR SEQ ID NO:1211: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1211: GCCCCATTAA AATCCTTTTC T 21
INFORMATION FOR SEQ ID NO:1212: SEQUENCE CHARACTERISTICS: LENGTH: 17 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...17 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1212: AAAGGATACA AGGGGGA 17
INFORMATION FOR SEQ ID NO:1213: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1213: CTCGCTCCAT TTTATCTTTT A 21
INFORMATION FOR SEQ ID NO:1214: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1214: TTTTTTAGGG AGGATTGAGA T
INFORMATION FOR SEQ ID NO:1215: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1215: TGTTTGGAAA TGCTGGTGAT C 21
INFORMATION FOR SEQ ID NO:1216: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1216: CTTTTGGGGG AGTTTGACAA G 21
INFORMATION FOR SEQ ID NO:1217: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1217: TTTGATAAAC GCCCACTTTT T 21
INFORMATION FOR SEQ ID NO:1218: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1218: TTTCAAAACG CTCACCTTTT G 21
INFORMATION FOR SEQ ID NO:1219: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1219: TCTATTCTTT TGATGCTCTC T 21
INFORMATION FOR SEQ ID NO:1220: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1220: ATAATGAGTT TGATCGTTAC G 21
INFORMATION FOR SEQ ID NO:1221: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE:
ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1221: ACAATAATAG GCTTTGTCTT C 21
INFORMATION FOR SEQ ID NO:1222: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1222: AATTAGCCCT TAAAATAGAT G 21
INFORMATION FOR SEQ ID NO:1223: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1223: AACAACCGCT AAAATCAAAC 20
INFORMATION FOR SEQ ID NO:1224: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1224: CTTCAGCGAT ACTAAAAGAT
INFORMATION FOR SEQ ID NO:1225: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1225: TAGGGGCGAT TGAAAACAGC
INFORMATION FOR SEQ ID NO:1226: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv)
ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1226: GCTGGATAAG GATTTGCTCT
INFORMATION FOR SEQ ID NO:1227: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1227: TTTTTGGGGG TATGCTAAAA
INFORMATION FOR SEQ ID NO:1228: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1228: GGCTGGTAAA TACTGGATAG
INFORMATION FOR SEQ ID NO:1229: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1229: AGGCTATTCA AGGTGGCTAA A 21
INFORMATION FOR SEQ ID NO:1230: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1230: ATTCTCATCA ACGACTTCTA AA 22
INFORMATION FOR SEQ ID NO:1231: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1231: GCAGTTGGCG GTATTTGGTG
INFORMATION FOR SEQ ID NO:1232: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1232: GAGAGCGAAG TTTATGAGAA
INFORMATION FOR SEQ ID NO:1233: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1233: TGATTGTTGG GTAGCTCTCA
INFORMATION FOR SEQ ID NO:1234: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1234: AAAATCGGTC TGATGCTCTT A 21
INFORMATION FOR SEQ ID NO:1235: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1235: CTTTTCCTTT CGCTTGAAGA
INFORMATION FOR SEQ ID NO:1236: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1236: AAAACAAACG CATCAAAAAT
INFORMATION FOR SEQ ID NO:1237: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1237: TTTCAAGGCG AGGAGGCAGA T 21
INFORMATION FOR SEQ ID NO:1238: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1238: GCACAAAGAC CCCACCACGA T 21
INFORMATION FOR SEQ ID NO:1239: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1239: CGCCCGAATG GATGAGTAGG
INFORMATION FOR SEQ ID NO:1240: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1240: TGCAACAAAA ATACGCCCTT
INFORMATION FOR SEQ ID NO:1241: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1241: TTTTTAAGGG CGTATTTTTG T 21
INFORMATION FOR SEQ ID NO:1242: SEQUENCE CHARACTERISTICS:
LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1242: TGGGTTTTAA GGAATGTGAT G 21
INFORMATION FOR SEQ ID NO:1243: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1243: CGCGCTCAAA ATCCCTAAAT
INFORMATION FOR SEQ ID NO:1244: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1244: GGCCCATTCT TTCGGATATT
INFORMATION FOR SEQ ID NO:1245: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1245: GCCCCATTCC TGTTTTTAGC
INFORMATION FOR SEQ ID NO:1246: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1246: CGCTTTAACG CTCCTTTCAC
INFORMATION FOR SEQ ID
NO:1247: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1247: AGCGTTTTTG TAAGGGGGTA T 21
INFORMATION FOR SEQ ID NO:1248: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1248: TCCCTATCAT AGCGTTAGTG C 21
INFORMATION FOR SEQ ID NO:1249: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1249: CTTAGGGGTT TTTAGCATGA A 21
INFORMATION FOR SEQ ID NO:1250: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1250: GCACAATTCC CACACGCTGC
INFORMATION FOR SEQ ID NO:1251: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1251: CCAAAGCTAA AGCGGTGTTT T 21
INFORMATION FOR SEQ ID NO:1252: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1252: GCTCATGGAT ATAAAGGGGT ATT 23
INFORMATION FOR SEQ ID NO:1253: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1253: CTTAGCCCCT TTAGTGTTTA
INFORMATION FOR SEQ ID NO:1254: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1254: CGCAAAAGGG TAGGGGATAA
INFORMATION FOR SEQ ID NO:1255: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1255: TTTTATTTTT AGAAACGAAT C 21
INFORMATION FOR SEQ ID NO:1256: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1256: CAAACTTATC GCCCTCTCTA
INFORMATION FOR SEQ ID NO:1257: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1257: AAAGATAACG CTAGGATTTC TAC 23
INFORMATION FOR SEQ ID NO:1258: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1258: TAATTCTACA GAGTGGTTAA TGG 23
INFORMATION FOR SEQ ID NO:1259: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1259: GCGGTCATGG AATTTTTAGA
INFORMATION FOR SEQ ID NO:1260: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1260: ATTCAAAGAA AGCTGGCTGT CT
INFORMATION FOR SEQ ID NO:1261: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1261: GCTTGTGGGG GTTGTTTTAT
INFORMATION FOR SEQ ID NO:1262: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii)
HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1262: GAACCCCCTA AAATGACAAT
INFORMATION FOR SEQ ID NO:1263: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1263: ACCATGCTCA TTAACGCTAG G 21
INFORMATION FOR SEQ ID NO:1264: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1264: GTAAGTTTGA GCGGCTAATTC C
INFORMATION FOR SEQ ID NO:1265: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1265: AAAAAGAAAG AAGAACTCGT G 21
INFORMATION FOR SEQ ID NO:1266: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1266: AAAGATACTC CCCTGTGATT A 21
INFORMATION FOR SEQ ID NO:1267: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1267: CAAAGAAACG CAAAATACAG
INFORMATION FOR SEQ ID NO:1268: SEQUENCE CHARACTERISTICS: LENGTH: 20 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1268: GCATGGTATT CAGCGTTTTC
INFORMATION FOR SEQ ID NO:1269: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1269: GCAGCGGCAC AGCGACTTTA G 21
INFORMATION FOR SEQ ID NO:1270: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1270: CGCCCCCAAA AAGTCGCAGT A 21
INFORMATION FOR SEQ ID NO:1271: SEQUENCE CHARACTERISTICS: LENGTH: 22 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...22 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1271: AAAGGTTTGA AACAAGAAAT CT 22
INFORMATION FOR SEQ ID NO:1272: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1272: CACTTGAGCG TTAGCAACAA T 21
INFORMATION FOR SEQ ID NO:1273: SEQUENCE
CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1273: GGGGTGTTTG AAATTGATAG A 21
INFORMATION FOR SEQ ID NO:1274: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1274: CGCTAGGAGA AAGGAAGGAA A 21
INFORMATION FOR SEQ ID NO:1275: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1275: CAAGGGCGTT TTTTGGGGTA T 21
INFORMATION FOR SEQ ID NO:1276: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1276: GGGATTGTTA CAGGAAAAGA T 21
INFORMATION FOR SEQ ID NO:1277: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1277: GGAATACAAT AACGCATAAA T 21
INFORMATION FOR SEQ ID NO:1278: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1278: GCCTTTTTAG ACAACCCTAC T 21
INFORMATION FOR SEQ ID NO:1279: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1279: CCCCTAAACT CAAATCTCAA T 21
INFORMATION FOR SEQ ID NO:1280: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv)
ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1280: GCTAGAAATG CCATGAGAAA G 21
INFORMATION FOR SEQ ID NO:1281: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1281: GCGATTATGG GGTATTTATT G
INFORMATION FOR SEQ ID NO:1282: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1282: TTATTGTGGA GTTGCTTGTC A 21
INFORMATION FOR SEQ ID NO:1283: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1283: TATGCGGCTC ATCCTATTAA A 21
INFORMATION FOR SEQ ID NO:1284: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1284: CCCTAAATCC AAATCAAGCA G 21
INFORMATION FOR SEQ ID NO:1285: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1285: ATCTTACCTA TCACCTCAAA T
INFORMATION FOR SEQ ID NO:1286: SEQUENCE CHARACTERISTICS: LENGTH: 21 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...21 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1286: AGACAGCAAC ATCTTTGTGA A 21
INFORMATION FOR SEQ ID NO:1287: SEQUENCE CHARACTERISTICS: LENGTH: 25 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1287: TTATGGATCC AAACCAATTA AAACT
INFORMATION FOR SEQ ID NO:1288: SEQUENCE CHARACTERISTICS: LENGTH: 23 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...23 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1288: TATCTCGAGT TATAGAGAAG GGC 23
INFORMATION FOR SEQ ID NO:1289: SEQUENCE CHARACTERISTICS: LENGTH: 29 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...29 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1289: AAATAGTCAT ATGAAAATAG GCGTTTTTG 29
INFORMATION FOR SEQ ID NO:1290: SEQUENCE CHARACTERISTICS: LENGTH: 28 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...28 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1290: AGAATTCTAT TACAATTTGA GCCATTCT 28
INFORMATION FOR SEQ ID NO:1291: SEQUENCE CHARACTERISTICS: LENGTH: 26 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION 1...26 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1291: GCGAATTCGA TCAGAATTTT TTTTCT 26 INFORMATION FOR SEQ ID N0:1292: SEQUENCE CHARACTERISTICS: LENGTH: 26 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) WO 97/37044 PCTfUS97/05223 1100 (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc-feature LOCATION 1 .26 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1292: ATAAGTACTT GTGAATCTTA
TACTAG
INFORMATION FOR SEQ ID N0KL293: SEQUENCE CHARACTERISTICS: LENGTH: 2694 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL:
NO
(iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc feature LOCATION .2694 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1293:
ATGGAGCAGC
TTTAGAAGCT
ACAGGGCTTT
ATGCCTTTTA
GGCGAATACA
GCCTTAGAAT
GATGATGTGA
GATAAGGATT
TTTTTGGCGA
CAGGGCATTG
AACGCTAAGG
TTGGCGAAAA
GCCTTTTTAA
TTAAGTTGCG
TATGGTTTCA
CCCATATTAA
AAATCGCGCA
GAAAACCCTA
CAGTCATTAA
ATTACATGAG
TAACGGGGCT
TCGTGTTCGC
AACAAAATCG
GGTTGCAAAA
TCGCAAGTTT
TTAACCAGCT
AAGATTGCGT
TGGGGGATAG
AATTGTTGCA
ATTTACTCAG
GCAAAGAATT
CTTTTCCGAG
TTTCTACCTT
ACAGCACGCC
TGATTGTTTT
ACGCTAGGGT
AGAAGGGACT
CGCTAAAAAT
TGTGGGCATG
CCTAGAAAGC
TAAAGACGCC
AATGGGTTTT
AGCCACGCTA
TTTGAGCGAT
GGAAAAATAC
CAGCGATAAT
GCGATTAGGG
CCCTAAAATG
AGCCACTTTA
CGAAAACCCC
AAGGGATTTA
TATATTAGAC
AGAAAGCGCT
TTTTATGCGT
TTAGCTTTAA
AAGCCTTTAA
GTTAAAAAT
CAGACTAAAA
CCTAAAGAGA
ACTTGCGTGG
AGCCCTTATA
AAAATCGCGC
GGGATTTTGC
TACAAGGGGG
AGCTTGGAAA
TATCAAGCCT
GAAAGAGGAT
TTGTTGAAAA
GAAAATTCCC
AATACCCCCG
GAGCCTTTGA
TTGGTTTTGG
TTGATACCTT
CCAACGATAA
TTTATAAAGA
CTAAAAGAGC
TGCTTTTACA
AGGTOGGAGO
AAACCCGCAT
TTTTTGATGG
CGAGTCAATT
TTAAAGGCAT
AAATCTATGA
TGATACAAGA
GCATTAAAGA
TCAAAGATGA
CTTTCATTGT
CCTTAGACA\
GCATGTTTTT
ATAAAGACAA
TGCGTATTTG
GGGCTTTCCT
CAGAAAAAAC
TGAAAAACTG
AATCCCTATC
GTTTGAAGCC
TTATTCTAAA
CAAAACGGAG
CACGGATTAT
TGGGAGCAAG
AAATTTAGAC
CAAAGGAAGC
ATTTGATTTT
ATTGAAAGAA
AGAAAACGTG
CGCCCCTAAA
AGAAAAATTA
AAAAATTCTA
120 180 240 300 360 420 480 540 600 660 720 780 840 900 960 1020 1080
GCCCTAGCGI
TCGCCCT=T
ATCATTGGGC
TTGGAAA.ATPA
GTGGGCTTTC
ATCAAAGATT
AACGCCTTAA
ACTTTAGCTA
GGCTTTAAGA
AACGTTTTAG
AAGCALACTTG
ACCGATGAAA
GAATACAGGG
GACAAAGACG
AGCTCGCATT
CGTAAGGGTT
ATTGAATTAC
AAGGGGCGAG
GAAAAACGAT
AAATTGAGCG
TTCAACCGAT
ACTTCTAAAG
GATTATGTCA
AGCGATTTAT
GTGAGGCTGC
CCAGAGTTGC
CCGTTAGAAA
TTTTATTACA
CTTTGGAATT
ATGATTTAAA
TCCGCATCCA
ATGAGCTTTT
TTAAGACAAA
AGCGTTTGTG
GAGACATTGA
TTGATGCGCC
AGCGCCAAAT
GCGAAGTCTT
AAAACTTGTT
AATTGAATAA
ATAAAATCCA
CGCCTAATTT
TTATCGCTAG
GCTTGTTGGC
ACATCCATTT
CCATCGCTAA
AAACTTTAAG
TCCCTAGCAT
CTTTTACTTT
AGGGCAATTA
TGAAATTAGG
TTTTGCAAGT
AGCAAGAAAT
CAAGCGCGTT
AGATCAAGC
TTTACAAAAC
GCCTTTATTA
AGACACTCAG
AAAGGAATAT
AAGTAAGGCG
CGAATACTTT
AACGCCGTTT
TTATTTCAAG
TTTGGATCTA
GTATGATAAA
AAAAATCCTA
GCTTTTTAAC
TACCACTTTC
GCAAAATATC
CTCTAAAGAA
CCATTTCAGC
AGAAACTTCT
AAGCATTAAT
CATCCCTTTA
TAAAGATTAT
GCTGGGGCGT
TTTGCGAGAG
CATGCTTAAA
GCATGACGAA
CCAACGCATT
TATCGCTAAG
TATTTTTTAC
GCTTTTTCTC
AGCTTTTTAA
ATTTTAGCGT
TTAAAAGAAG
GAAAAATCAG
GAAAAAGGGG
GTCAAAGTTT
CGCTTAGAGC
ATCGCCGTGG
TTAGGGCTTC
GACAAGCACC
ACTTACACGA
ATCCAAACCG
CCGGTGCGAT
TATTGTTTAT
CAGGATAAGG
AAGGCGTTGT
TTTGGGCTGG
AGTGAGGCTA
TTGAATGGCA
TACCGGGTGT
GGCGTGAATG
GTGAGCGAGC
TTGATTTTTG
CTTAATGATG
CGTTGGAATG
CTTTAGAAGA
AAATGTTACA
AAGCCAATA
TTTTAAGAAJ
ATTTAATCCC
AACTTTTAAG
GGCTAGAAGA
TAATGGGCAT
AGGAGTTTAA
ATTTCAACCT
CTAAAAATALA
CALAGCATCCC
CCCCCTTATT
GCACAGCTAC
CGCCTAAAGG
TAGGGGTGGA
ATTTAATGGA
TTGGAGAAGA
TGTATGGCAT
AAAGTTACAT
TGCGAGAAGA
TTGATTTTAC
CGATTTTTCA
GTTTCAAAA
.A'AATTGAAGA
.AGTCTATCC
.ATTAAAAGG
GGCGTTCTTT
GCATGCGTGT
TCAGGTGCCT
TCCGGAAAAA
CCATGAAAAG
CATGGAATTA
GGATTTGCTC
GGAATTTCjA
GAATGAATTA
CAATTCGCCC
AAGCCACTCT
TTTGATTTTA
CCGCTTAAAA
CGGGCGTTTA
CTTACTCATT
TTATTCGCAG
AGCGTTTTTA
TTTAGCCAAA
GGGGAGTAAG
AGAAGCGTAT
GATTTTAAAA
CGGCGTGAAT
AGGGAGTGCC
CAACCCTTCG
AAAAAACGCC
rTTGAGGGTG
TTAG
1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2040 2100 2160 2220 2280 2340 2400 2460 2520 2580 2640 2694
INFORMATION FOR SEQ ID NO:1294: SEQUENCE CHARACTERISTICS: LENGTH: 2007 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...2007 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1294:
ATGAAAAAAC
GGCTTTTTTA
GGCGAATTGA
AACAACCTCA
GATAATTTAA
CAAGCGGTGT
CCTTTTACTC
TCAGCGCGGG
AAAAACTTTC
ATCAGGCGGT
AAGCAAACAC
ATTTGGCGCT
TCTCTCTCTC
CTATCAAATC
AGACACTTAT
AACGAACGCG
GCAAGGGCTA
CAATGCGGCG
GCTTCATICGC
GGTGAAGCCG
GAGAATTTGA
AGCAGCCCTT
ATTGGCGAAA
GTAGGGCTGT
TTTTAAACGC
CTCAAATGGT
GCAACCTTTT
CAGAAAkTCAA
AAACCAATTC
GGAATGTCAT
TGAAGACAAC
GAAAAACACC
AACCAATTTT
TGCTGCGATC
CCCGGCGTAT
CGCCTATAAT
120 180 240 300 360
GTCCAATGCG
CATAATTCA
TTATCCATTG
AAACAAGATA
ACAACGCA3A.
GACGCTCAAA
CCATGGGTCA
GGGAATGTGT
GCCCAAGAAA
CAAGATTTCA
GCGCAAGCGC
ATCCCAAGCC
CCTGATGCGG
ATAACGGCTT
TCTGAGTTGT
ACTTATAACA
TTGATAAGCC
AACCAAAGCG
AGACGCGTTG
CAAGTGGGCT
TTTTTTGACT
TTCACTTATG
AACAGCAAGA
AATTCCCAGT
GCGAATTTCC
AAAGCGAGCG
AACACGAATT
TATTTGAATT
GTCCTGGTAA. CAGTGGACAJA
CAAAGCGTAA
GTTCCATTAA
AGAATTTTALA
GCGGATTTCC
CTAATGGAGC
CCCTTTTGCA
ATCACAATCA
GTCAGGTTTT
TCGTAACGCA
ATCCTTACAC
AAGCCAAGAT
AATTTATCAC
GGGTTACTAA
TAAACAACAG
TGGCGCGCAC
GCATCACCAC
AATCCACTAA
CTTATTCGCA
GATTAATCAG
ACAAACAATT
ATAACCATGC
GGGTAGGGAC
TTTCTTTTGG
ATGTGAATTT
AATTCTTGTT
ATCATGCGGC
ACTATTCTTT
ATGTGTTCGC
TTGCAATTTA
AAAGCTTAAT
TGTTTTGGAT
TAATAAAAGT
AGAAGCCAGT
AGGACAAAAC
TGCCACGGAA
AGCTCAAAGC
CTCTGCTGAT
ACTTGAGCTA
AAATTACTTG
CAACACTTGG
CCTTGCGCAT
CATACTTGAT
GACCGCTTCA
CCCTAATAAC
ATTATTAAGC
CTCTCAAACC
TTTTGGTGAA
TTATATCAAA
AGATGTCCTC
GGTGTTTGGG
AGCGACCTTC
CAATTTAGGC
TCAGCATGGC
GCTAGGCACT
TTACTAA
ACCGGTTATA
CAGGCTTATC
AGTGCAGGAA
GAAACTACTA
AAAATGATAA
GGGGGCGCGC
TTTAGCGCCG
CTTAACCAGC
AGGGCTTTCG
GCCGATCAAA
GCAGCTTGCC
GGGGCCGGTT
TTTGGCACTC
TTTAGAGGCA
AACACGCCTA
CCCGGGGGCT
GCCACGCAAo
AACAATGGTG
AAGAGAAGGT
TCTAGCTTTT
TATAACTTTA
GGGATTGCGT
AATAATTTCT
TTGAGAATGA
GTGGAATTAG
CAACTCCAAT
CCTTTGAGGG
ACAACGGGGT
AAACTATCCA
AACAAGTAAC
CTACTACTAC
GCGTCCTCAC
CGTGGGGTTT
TTACTAGCAT
AAAACAATCA
CTCAAAACAT
TGAAAAAIAGA
ACAATGGGGG
GCGCGTATGT
AAGCTGAGCA
GCCTTAGTAA
ATTCCCCATT
TACAGGCCGT
AATTAGGGCA
CCATGAATGG
GGGGGTTAAG
TCAATTCGGC
TCAATGATAA
TAGCTGGCAC
ATAGCGCT.
ACCTCGCTAA
GCGTGAAGAT
ACCGAAGATT
CCAACCAGGA
TAGCGGCCCT
ACAAGCTTTA
TATAACAATA
TACTACTAAT
TACAAACTGC
AGATACGGCA
GATCAAAAAC
AAACGCGCCG
GCTCAATCAC
CCTTAACACT
TOGGACATTA
GGAAGAGACG
AATCAAGCAA
TTTAAACAAC
CCTTAAAAAT
TTATCAAGTC
TALACCCTTTC
GATCGGTGTG
GTATTACGGC
TTCTGATGTG
AACCACCAAA
TTCATGGCTG
ALATGAATGTG
GAATAAGAA
CCCCACGATC
GTATAGCGTG
420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1920 1980 2007
INFORMATION FOR SEQ ID NO:1295: SEQUENCE CHARACTERISTICS: LENGTH: 1914 base pairs TYPE: nucleic acid STRANDEDNESS: double TOPOLOGY: circular (ii) MOLECULE TYPE: DNA (genomic) (iii) HYPOTHETICAL: NO (iv) ANTI-SENSE: NO (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...1914 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1295:
ATGAAAAGAC
GACAACGGCT
AACACCGGCG
CAACTGGCTT
GCTTTAAGCG
AATTCTACTC TCTCTCTCTC TCTCTCGCTT
CATCGCTCTT
TTTTTGTGAG CGCGGGCTAT CAAATCGGCG
AAGCGGTGCA
AATTGApJA CTTGAACGAC AAATACGAGC
AGTTAAGCCA
CGTTAAAAAA AAGCATTCAA ACGGCGAACA
ACATTCAGGC
ATTTAAAAAG CTTTGCGAGT AACAACCACA
CAAACAAAGA
GCATGCTGAA
AATGGTCAAA
ATCTTTAGCC
TGTCAACAAT
AACATCGCCC
120 180 240 300
ATCTACAACA
GGGAACGCTC
AGAATCCATA
TATGATAAAA
TTATCAGAAT
CAAACCGCGC
AATATCGTCA
GGTCATGTAA
CAAGCGCTTA
ATGGGATCTC
AAGCAAATCC
CAACTTAAGT
AACCCTTACA
GCTAATTATG
AAATCCAATC
ATTTCTAAAC
TCTACAGCGG
GCGGCGATGA
GGCGCTTTGA
AGATGGGGGT
TTTTTTAATT
TTTATCAACG
ATCCAACTAG
AACCCTTACA
AGGACGAATC
GAACTGGGCA
CTAGAATACC
CCGCGCAAkGC
TCAGTTTTCA
GAGATGGGAA
TGAAGACACT
GCTCTAGCAA
AACAGCTCAT
TCGCAGGTGT
CCGACTATGC
CGCTTTCTCA
AAACAAATCG
TTTCTAACGC
ATTTGGAGAA
GACAGALATGT
GTAATCGTTT
AAACAGAGAT
TTCCCTATAA
GCCAATACCA
GCAATAACCC
ACGGGCTTGG
TAAGGTATTA
CTTCTTCTGA
ATAGCATCAC
CAGGGACTAC
GCGCGAAAGT
TCGCTACAGC
TTAAAATCCC
GAAGGCTTTA
TGTTATCACT
TGTGACCGGT
CTGCACAGGA
TGCCGAAAAC
TCAATCAAAT
GGACTTAATC
TACAAACAAA
GGTGTTTAAC
AAGTAACCAC
TGAATTCGCT
TTCAAGTATC
CGCTTACTTG
GAATTTGAAT
GGATTCGGCT
CGTALACCACT
CCAAGTCAAT
AATCAACCCA
CTTTAAAAAA
CGTGCAAGTG
TGGTTTCTTT
TATATGGACT
AAGAAAGAAC
ATGGCTTAAT
CAATGCTTCC
TAAGAAAAAA
TACCATTAAC
TAGCGTGTAT
TCAGTATTGG
TTGAATGATG
TTACAACAAT
CTCCAAAALAG
GGAGGCAAAA
GAACAGACCA
CCCAATGGTG
AACATCAAGG
ACCCTATCCA
AAAGACATCT
TTCAATCTCT
AAAGTGCCAC
AAAGAAATTA
TTALAGCGTGG
TATAACGATG
GTAACAAACA
GAGCAGCAAT
GTGGGCATGA
GGTTATAAAC
GATTACAACC
TATGGCGGTG
AACAAGCTTT
TCTCAATACA
AATTTCCAAT
GACAGCGAAC
ACCAATTATT
CTCAATTATG
CTTTTTGGAG
GATCTAATTC
GTTTTATGAG
CTCAAGGCAA
CTTCCATGAC
AGGTTTCTAT
CTGGCGCTAT
CGATGCTACC
CTCAGTTGCA
ACGCTTTAGC
TTAATTCCAT
ATTTGGGTAA
ATGCGGTTCA
CTAAAGATGT
CTAAGAATTT
TCGTTATGTC
CCAATCTTAA
TCAGCTCTCA
AATTCTTTGG
ACGGCTATAT
GGAGCGATTT
CTGTGGGTCT
TGAATTTAAC
TTTTGTTCAA
GTTCCGCGCA
ATTCTTTTCT
TGTTTGCTTA
TCTTTATGCA
TCCTTTAGGA
CAAAGAAACT
TCTCTGTGCC
TACAGCTCTT
GGTGTGGAAA
CACATCCACT
TATCTTGCAA
AGCTCGAGCT
TCAAkACCAA
TCCTAAAGAC
AACCCCTACT
AGACAATGTA
TTATAACCTA
GAGCGAAGAG
GCCTAAAGAT
CCAAGCTTTA
AAACAATAAC
CGAAAGCAAA
CAAATCCAGC
GTTAGTGAAT
TTTTGGTGGT
AGCGTTCAAT
TCTCGGCTTG
ACATGGCGTT
AGGCACTAAG
TTAA
360 420 480 540 600 660 720 780 840 900 960 1020 1080 1140 1200 1260 1320 1380 1440 1500 1560 1620 1680 1740 1800 1860 1914
INFORMATION FOR SEQ ID NO:1296: SEQUENCE CHARACTERISTICS: LENGTH: 897 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...897 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1296:
Met Glu Gln Pro Val Ile Lys Glu Gly Thr Leu Ala Leu Ile Asp Thr 1 5 10
Phe Ala Tyr Leu Phe Arg Ser Tyr Tyr Met Ser Ala Lys Asn Lys Pro 25
Leu Thr Asn Asp Lys Gly Phe Pro Thr Gly Leu Leu Thr Gly Leu Val 40
Gly Met Val Lys Lys Phe Tyr Lys Asp Arg Lys Asn Met Pro Phe Ile 55
Val Phe Ala Leu Glu Ser Gln Thr Lys Thr Lys Arg Ala Glu Lys Leu
Gly Gln Val Thr Asn 145 Phe Phe Gly Leu Leu 225 Ala Glu Lys Asp Ser 305 Lys Leu Leu Gln Leu 385 Ile Tyr Ala Glu Lys 465 Asn Glu Val Glu Ile Glu Leu 130 Gln Leu Thr Val Gly 210 Leu Phe Phe Ile Leu 290 Thr Ser Glu Asp Gly 370 Glu Ile Gln Phe Tyr 450 Thr Ala Asp Leu Tyr Pro Val 115 Ser Leu Ala Asp Lys 195 Ser Ser Leu Asp Lys 275 Glu Pro Arg Lys Lys 355 Tyr Phe Gly Val Leu 435 Leu Lys Leu Leu Met 515 Lys Ile 100 Gly Pro Leu Lys Tyr 180 Gly Leu Pro Ser Phe 260 Asp Asn Ile Met Leu 340 Asp Phe Leu His Pro 420 Lys Lys Ser Lys Leu 500 Gly Gln Ala Gly Tyr Ser Asp 165 Gln Ile Glu Lys Lys 245 Leu Glu Ser Leu Ile 325 Glu Lys Leu Gln Asp 405 Leu Asn Glu Lys Arg 485 Thr Met 70 Asn Leu Phe Lys Asp 150 Cys Gly Gly Lys Met 230 Glu Ser Leu Pro Asp 310 Val Asn Lys Pro Asn 390 Leu Glu Pro Asp Ala 470 Leu Leu Glu Arg Glu Glu Thr 135 Lys Val Ile Ser Ile 215 Tyr Leu Cys Lys Phe 295 Asn Leu Pro Ile Leu 375 Ala Lys Asn Glu Leu 455 Glu Cys Ala Phe Lys Trp Ala 120 Arg Ile Glu Val Lys 200 Tyr Gln Ala Ala Glu 280 Ile Thr Glu Asn Leu 360 Glu Phe Pro Ile Lys 440 Ile Lys Glu Arg Gln 520 Asp Leu 105 Asp Ile Ala Lys Gly 185 Asn Glu Ala Thr Phe 265 Tyr Val Pro Ser Ala 345 Ala Glu Ser Leu Arg 425 Val Pro Ser
Tyr Asp 505 Gly Ala 90 Gln Asp Tyr Leu Tyr 170 Asp Ala Asn Leu Leu 250 Pro Gly Glu Ala Ala 330 Arg Leu Ala Gln Leu 410 Ile Gly His Glu Phe 490 Ile Phe 75 Pro Lys Val Ser Phe 155 Gly Ser Lys Leu Ile 235 Glu Ser Phe Asn Leu 315 Glu Val Ala Leu Met 395 Ser Gln Phe Glu Leu 475 Glu Glu Lys Lys Met Ile Lys 140 Asp Ile Ser Glu Asp 220 Gln Arg Glu Ile Val 300 Asp Pro Phe Phe Phe 380 Leu Phe Asp Asp Lys 460 Leu Lys Thr Ile Glu Gly Ala 125 Asp Gly Leu Asp Leu 205 Leu Asp Gly Asn Ser 285 Pro Asn Leu Met Leu 365 Ser Gln Leu Thr Glu 445 Ile Ser Gly Pro Asp 525 Met Phe 110 Ser Lys Lys Pro Asn 190 Leu Ala Lys Cys Pro 270 Thr Ile Ala Ser Arg 350 Leu Pro His Lys Gln 430 Val Lys Met Gly Phe 510 Ala Leu Thr Leu Asp Thr Ser 175 Tyr Gln Lys Gly Ile 255 Leu Leu Leu Pro Met 335 Leu Gln Phe Ala Ala 415 Ile Leu Asp Glu Leu 495 Val Pro Leu Cys Ala Phe Glu 160 Gln Lys Arg Asn Ser 240 Lys Leu Arg Asn Lys 320 Phe Val Asp Ser Cys 400 Lys Leu Lys Phe Leu 480 Glu Lys Tyr
Phe Lys Arg Leu Glu Gln Glu Phe Lys Asn Glu Leu Asn Val Leu Glu 530 535 540
Arg Gln Ile Leu Asp Leu Ile Gly Val Asp Phe Asn Leu Asn Ser Pro 545 550 555 560
Lys Gln Leu Gly Glu Val Leu Tyr Asp Lys Leu Gly Leu Pro Lys Asn 565 570 575
Lys Ser His Ser Thr Asp Glu Lys Asn Leu Leu Lys Ile Leu Asp Lys 580 585 590
His Pro Ser Ile Pro Leu Ile Leu Glu Tyr Arg Glu Leu Asn Lys Leu 595 600 605
Phe Asn Thr Tyr Thr Thr Pro Leu Leu Arg Leu Lys Asp Lys Asp Asp 610 615 620
Lys Ile His Thr Thr Phe Ile Gln Thr Gly Thr Ala Thr Gly Arg Leu 625 630 635 640
Ser Ser His Ser Pro Asn Leu Gln Asn Ile Pro Val Arg Ser Pro Lys 645 650 655
Gly Leu Leu Ile Arg Lys Gly Phe Ile Ala Ser Ser Lys Glu Tyr Cys 660 665 670
Leu Leu Gly Val Asp Tyr Ser Gln Ile Glu Leu Arg Leu Leu Ala His 675 680 685
Phe Ser Gln Asp Lys Asp Leu Met Glu Ala Phe Leu Lys Gly Arg Asp 690 695 700
Ile His Leu Glu Thr Ser Lys Ala Leu Phe Gly Glu Asp Leu Ala Lys 705 710 715 720
Glu Lys Arg Ser Ile Ala Lys Ser Ile Asn Phe Gly Leu Val Tyr Gly 725 730 735
Met Gly Ser Lys Lys Leu Ser Glu Thr Leu Ser Ile Pro Leu Ser Glu 740 745 750
Ala Lys Ser Tyr Ile Glu Ala Tyr Phe Lys Arg Phe Pro Ser Ile Lys 755 760 765
Asp Tyr Leu Asn Gly Met Arg Glu Glu Ile Leu Lys Thr Ser Lys Ala 770 775 780
Phe Thr Leu Leu Gly Arg Tyr Arg Val Phe Asp Phe Thr Gly Val Asn 785 790 795 800
Asp Tyr Val Lys Gly Asn Tyr Leu Arg Glu Gly Val Asn Ala Ile Phe 805 810 815
Gln Gly Ser Ala Ser Asp Leu Leu Lys Leu Gly Met Leu Lys Val Ser 820 825 830
Glu Arg Phe Lys Asn Asn Pro Ser Val Arg Leu Leu Leu Gln Val His 835 840 845
Asp Glu Leu Ile Phe Glu Ile Glu Glu Lys Asn Ala Pro Glu Leu Gln 850 855 860
Gln Glu Ile Gln Arg Ile Leu Asn Asp Glu Val Tyr Pro Leu Arg Val 865 870 875 880
Pro Leu Glu Thr Ser Ala Phe Ile Ala Lys Arg Trp Asn Glu Leu Lys 885 890 895
Gly
INFORMATION FOR SEQ ID NO:1297: SEQUENCE CHARACTERISTICS: LENGTH: 668 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES
(vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: miscfeature LOCATION 1...668 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1297: Lys Lys Pro Phe Tyr Ser Leu Ser Leu Ala Ser Ser Met 1 Ala Ala Thr Gin Asp Ser Leu Gly Ser 145 Leu Gin Gly Lys Leu I 225 Pro 9 Leu I Ala V Gin S 2 Pro T 305 Ala G Gl Al Ty Al Asr Pr Tr Glr 130 Iie Ser G1n Lys Ser 210 Leu rrp sp al :er 90 'yr In 5 u Asp Asn Gly a Gin Met Val r Glu Asn Leu a Val Thr Asn n Leu Lys Ala o Ala Tyr Gin 100 p Asn Val Ile 115 1 Gin Ser Val Asn Cys Asn Ile Glu Asn 165 Ala Leu Lys 180 Gin Val Thr 195 Glu Thr Thr Gin Glu Ala S 2 Val Asn His A 245 Thr Ala Gly A 260 Thr Ser Met I 275 Leu Asn Gin G Thr Ser Ala A 3 Ala Gin Ala L 325 Ph Ly Se: Al 70 As Al Ala Thr Leu 150 Phe 31n Ile ?hr er !30 isn .sn le in sp 10 ys ro e Ph s Asi r As] 55 a Sej I Th i Val Tyr Phe 135 Thr SLys Asp Thr Thr 215 Lys Gin Val Lys Asn 295 Arg Ile Ser e Ile n Thr 40 n Leu r Ser r Gin Tyr Asn 120 Glu Gly Lys Ser Ile 200 Thr Met Gly C Cys C 2 Asn A 280 Asn G Ala P Leu G Gin P 10 Ser Ala Gly Tyr Gin Leu Leu Asn Ile Gly Glu Gly Glu Le Leu Thr As Pr Gl Leu 105 Val Gly Tyr Leu Gly 185 Thr Phr Ile G1n ;ln 65 \la in he lu he o Ser y Leu 90 Ala 5 SGin Gin Asn Asn 170 Phe Thr Thr Ser Asn 250 Val Gin Asn Ala Leu 330 Ile I Gl 75 Ii.
Lei Cys Prc Asr 155 G1r Pro Gin Thr Val 235 Gly Phe Glu la Sin 315 la 'hr u Lys Lys n Phe Asn u Ile Asn e Gly Glu u Asn Ala s Gly Pro 125 o Gly His 140 i Gly Val Ala Tyr SVal Leu Thr Asn 205 Asn Asp 220 Leu Thr Gly Ala I Ala Thr C 2 Ile Val T 285 Pro Gin A 300 Asn Met L Asp Gin M Asn Tyr L Lei Asi Al Lys Ala 11I Gly Asn Ser Gin Asp 190 Gly Ala rhr Pro lu 370 'hr sp ,eu et eu u Se n Le I Ali STh Val Asr Ser Gly Thr 175 Ser Ala Gin Asn Trp 255 Phe Gin Phe Asn Lys 335 Ala r Asp u Asn a Ile r Asn L Gly n Ser Ser SPro 160 Ile Ala Asn Thr Cys 240 Gly Ser Ala Asn His 320 Lys Ala Asp Leu Asn Thr Ile 340
P:
Ie n Cys His Asn Gly Gly Gly Thr Leu Pro Asp Ala Gly Val Thr Asn Asn WO 97/37044 PCT/US97/0522 3 1107 355 Thr Trp Gly 370 Ala Gly Cys Ala 360 Tyr Asi 385 Sei Asr Pro Asn Tyr 465 Arg Gly Arg Ile Val 545 Asn Thr Phe Leu His 625 Asn Leu n Asn Glu 1Leu Asn Asn 450 Ser Arg Ile Trp Lys 530 Gly Ser Ser Tyr Gly I 610 Ala I Thr I Tyr S Ser Leu Asn Ser 435 Pro Gin Val Gly Gly 515 Ser Thr Lys Trp Ser 595 Leu la Asn ;er Lei SLet Asr 420 Pro Gly Leu Gly Val 500 Leu Ser Asp Ile Leu 580 Ala Arg Gin Tyr Val 660 u Ala u Ala 405 SThr Phe Gly Leu Leu 485 Gin Arg Phe Val Ser 565 Asn Lys Met His C Tyr E 645 Tyr L His 39C Arg Tyr Leu Leu Ser 470 Ile Val Tyr Phe Leu 550 Phe 3er Met Asn Gly 630 Ser beu 375 Phe Gly Thr Ile Asn Ser Lys Asn 440 Gin Ala 455 Ala Thr Ser Ser Gly Tyr Tyr Gly 520 Asn Ser 535 Tyr Asn Gly Val Gin Tyr Asn Val 600 Leu Ala 615 Val Glu Leu Leu Asn Tyr Val Glu Thr Gin Leu Asp 410 Ile Thr 425 Leu Ile Val Tyr Gin Glu Gin Thr 490 Lys Gin 505 Phe Phe Ala Ser Phe Ile Phe Gly 570 Val Asn I 585 Ala Asn I Lys Asn I Leu Gly 6 3ly Thr G 650 Val Phe A Gl Al 39 Phe Thr Ser Gin Leu 475 Asn Phe Asp Asp Asn 555 Gly Leu ?he Lys al i35 ;1n ila u Thr 380 a Glu SArg SThr Gin Val 460 Gly Asn Phe Tyr Val 540 Asp Ile Ala Gin I Lys I 620 Lys I Leu C Tyr 365 Ile Gin Gly Ala Ser 445 Asn His Gly Gly Asn 525 Phe Lys Ala rhr Phe 505 .ys Ile l1n Th Slil Ser Ser 43C Thr Gin Asn Ala Glu 510 His Thr Thr Leu Phe 590 Leu Ala Pro Tyr r Ala e Lys SLeu 415 Asn Asn SSer Pro Met 495 Lys Ala Tyr Thr Ala 575 Asn Phe I Ser I Thr
E
Arg A 655 Leu Gln 400 Ser Thr Pro Ala Phe 480 Asn Arg Tyr Gly Lys 560 Gly Asn Asn Asp :le %rg 665
INFORMATION FOR SEQ ID NO:1298: SEQUENCE CHARACTERISTICS: LENGTH: 637 amino acids TYPE: amino acid TOPOLOGY: linear (ii) MOLECULE TYPE: protein (iii) HYPOTHETICAL: YES (vi) ORIGINAL SOURCE: ORGANISM: Helicobacter pylori (ix) FEATURE: NAME/KEY: misc_feature LOCATION 1...637 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1298:
Met Lys Arg Gln Phe Tyr Ser Leu Ser Leu Ser Leu Ala Ser Ser 1 Leu Gly Asn Leu Ala Glu Leu Thr Asp 145 Tyr Asn Lys Leu Ala 225 Gly I Pro Ser Phe 2 Ser Z 305 Gln I Lys T Ile A Ser A 3 Thr G 385 Ile S Ser P Hi.
Gli As Ly5 Lei Th2 AlF- Gly 130 Gly Asp Leu Thr Ile 210 G1y His lie rhr la sn 1 eu hr isn la 70 lu er ro s A! i Al P Ly s Ly: I Se Se PhE lE Lei Asr Lys Cys Ser 195 Glu Val Val Leu Gin 275 Lys Ala Lys Pro Ala 355 Leu Ile Lys Lys a Glu a Val s Tyr Ser r Asp Pro 100 Trp a Asn 1 Cys Met Ala 180 Met Gin Thr Thr Gin 260 Leu Asp I Ser Tyr I 3 Thr P 340 Val G Ser V Val T Leu P 4 Asp S 420 5 As Gl Gil IlE LeL Ile Ser Asp Thr Lys 165 Leu Thr Thr Asn ksp 245 Mln 31n le jer ~eu 25 ~sn ;ln 'a1 hr ro 05 er p Asn n Met a Gin Gin 70 1 Lys Tyr Leu Gly Gly 150 Thr Ser Thr Lys Lys 230 Tyr Ala Ala Tyr 2 Ile I 310 Glu I Pro Asp I Ala L 3 Thr T 390 Tyr A Thr A
GL
Va Lei 55 Th Sei Asr TyT Ser 135 Leu Leu Glu Ala Va1 215 Pro Ala Leu %rg kla 295 ?he ~sn ryr ~sn lys ;75 'yr isn la y Ph 1 Ly 40 Se Al T PhE I Thx Ala 120 Asn Gin Ala Cys Leu 200 Ser Asn Va1 Thr Ala 280 Leu Asn Ala Arg Va1 360 Asp Asn Gin Gly Phe 25 Asn r Gin i Asn Ala Ala 105 Gly Ser Gin Glu Ser 185 Gin Met Gly Phe 2 Leu 265 Met C Ala C Leu F Tyr L 3 Gin A 345 Ala A Val T Asp A Val A 4 Gin T 425 10 Va Th Sea Asr Ser 90 Gli Asn Pro Cys Asn 170 Ser Thr Val kla ksn 250 ,er ly ln 'he leu 30 sn ~sn yr .la sn 10 yr I Se Gi Lei 1 IL 75 Asa Alz Alz Let Phe 155 Leu Asn Ala Trp Gly 235 Asn Gin Ser Asn Asn 315 Lys Va1 Tyr Asn Lys 395 Val Gin r Ala Y Glu u Ala e Gin Asn 1 Vai Leu Gly 140 Met Gin Gin Gin Lys 220 Ala Ile Ser 2 Gin 1 Gin I 300 Ser I Val P Asn L Gly A 3 Leu L 380 Asn L Thr A Ile A 01 Le 01 Al Hi:
IL
Se 12E Arc Sei Lys Ser 3Gl 205 Asn lie -ys ksn hr ys le 'ro leu sn 'ys eu sn sn y Tyr u Lys n Leu a Val s Thr e Thr 110 Phe SIle Lys Ala Asn 190 Leu Ile Thr Ala I His 270 Asn 2 Gin Pro I His I Asn I 350 Arg L Ser A Ser G Ile V 4 Pro G 430 Gin Asn Ala Asn Asn Ser His His Glu Gin 175 Gly Met Val Ser Met 255 Thr I krg C Ile I 1 ys P eu G 135 lys G leu A sn G lu G 4 al M iu G Leu Ile Leu Ser Asn Lys Va1 Va1 Arg Thr 160 Gly Gly Asp lie rhr 240 Leu eu lu Ieu ~sp ;ly ;lu sp in lu 00 et in WO 97/37044 PCT/US97/05223 Gin Ser Asn Leu Asn Gin Ala Leu Ala 431; AhCn Lys Gly 465 Arg Ile Gly Lys Gly 545 Asn Asn Giu Ile Arg Lys 450 Leu Trp Lys Gly Asn 530 Thr Pro Leu Arg Asn 610 Leu Vali Giy Gly Ser Ser 515 Asn Thr Tyr Gly Ser 595 Thr Tyr Gly Val1 Leu Ser 500 Asp Lys Trp Ser Leu 580 Ala Asn Ser Met Gin Arg 485 Phe Leu Leu Leu Aia 565 Arg Gin Tyr Val1 Ile Val1 470 Tyr Phe Leu Ser Asni 550 Lys Thr His Tyr Ser 455 Giy Tyr Asn Val1 Val1 535 Ser Val1 Asn Gly Ser 615 Ser Tyr Giy Ser Asn 520 Gly Gin Asn Leu Val1 600 Phe Gin Lys Phe Ser 505 Phe Leu Tyr Al a Ala 585 Glu Leu 1109 Ala Met Asn Asn Gin Phe 475 Phe Asp 490 Ser Asp Ile Asn Phe Gly Met Asn 555 Ser Asn 570 Thr Ala Leu Gly Gly Thr S er Asn 460 Phe Tyr Ile Asp Giy 540 Leu Phe Lys Ile Lys Asn 445 Gly Gly Asn Trp Ser 525 Ile Thr Gin Lys Lys 605 Leu Asn Ala Giu His Thr 510 Ile Gin Ala Phe Lys 590 Ile Pro Leu Ser Gly 495 Tyr Thr Leu Phe Leu 575 Asp Pro Tyr Phe Asn Lys 480 Tyr Giy Arg Ala Asn 560 Phe Ser Thr Arg Tyr Leu Asn Tyr Vai 625 630 Phe Ala Tyr 635
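As a rough consistency check on the listing above: the declared lengths of SEQ ID NO:1293, 1294 and 1295 (2694, 2007 and 1914 base pairs) are each three times the declared length of SEQ ID NO:1296, 1297 and 1298 (897, 668 and 637 amino acids) plus one stop codon. The Python sketch below illustrates that arithmetic and translates the first three codons of SEQ ID NO:1293 with the standard genetic code. The pairing of each nucleotide sequence with a protein sequence is an assumption inferred from these lengths, not something stated in the listing, and the helper names `translate` and `expected_residues` are hypothetical.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

# One-letter to three-letter amino acid codes, as used in the sequence listing.
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(dna: str) -> list[str]:
    """Translate a coding strand codon by codon, stopping at the first stop codon."""
    dna = dna.upper().replace(" ", "")
    residues = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the open reading frame
            break
        residues.append(THREE_LETTER[aa])
    return residues


def expected_residues(n_base_pairs: int) -> int:
    """Residue count for an open reading frame that ends in one stop codon."""
    if n_base_pairs % 3 != 0:
        raise ValueError("length is not a whole number of codons")
    return n_base_pairs // 3 - 1


if __name__ == "__main__":
    # Declared lengths taken from the listing; the pairing itself is assumed.
    for seq_id, n_bp in [(1293, 2694), (1294, 2007), (1295, 1914)]:
        print(seq_id, n_bp, "bp ->", expected_residues(n_bp), "residues")
    # First nine bases of SEQ ID NO:1293 as printed in the listing.
    print(translate("ATGGAGCAG"))
```

Because the nucleotide sequences in this extraction are laid out column by column and contain OCR damage, only the leading codons are translated here; a full translation would need a cleaned copy of each sequence.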
Claims (5)
1. An isolated nucleic acid comprising a nucleotide sequence encoding an immunogenic amino acid sequence that is effective against H. pylori and is at least 60% homologous to an amino acid sequence selected from SEQ ID NO: 795, SEQ ID NO: 745, SEQ ID NO: 929, SEQ ID NO: 890, SEQ ID NO: 956, SEQ ID NO: 904, SEQ ID NO: 777, SEQ ID NO: 938, SEQ ID NO: 940, SEQ ID NO: 945, SEQ ID NO: 925, SEQ ID NO: 844, SEQ ID NO: 754, SEQ ID NO: 960, SEQ ID NO: 810, SEQ ID NO: 817, SEQ ID NO: 809, SEQ ID NO: 816, SEQ ID NO: 855, SEQ ID NO: 802 and SEQ ID NO: 943.
2. An isolated nucleic acid comprising a nucleotide sequence encoding an immunogenic amino acid sequence that is effective against H. pylori and is at least 60% homologous to an amino acid sequence selected from SEQ ID NO: 492-SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765-SEQ ID NO: 818, SEQ ID NO: 820-SEQ ID NO: 846, SEQ ID NO: 848-SEQ ID NO: 896, SEQ ID NO: 898-SEQ ID NO: 963, SEQ ID NO: 966-SEQ ID NO: 982, SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041-SEQ ID NO: 1087, SEQ ID NO: 1090 and SEQ ID NO: 1296-SEQ ID NO: 1298.
3. An isolated nucleic acid comprising a nucleotide sequence that encodes an immunogenic amino acid sequence effective against H. pylori, the nucleotide sequence being at least 60% homologous to a nucleotide sequence selected from SEQ ID NO: 304, SEQ ID NO: 254, SEQ ID NO: 438, SEQ ID NO: 399, SEQ ID NO: 465, SEQ ID NO: 413, SEQ ID NO: 286, SEQ ID NO: 447, SEQ ID NO: 449, SEQ ID NO: 454, SEQ ID NO: 434, SEQ ID NO: 353, SEQ ID NO: 263, SEQ ID NO: 469, SEQ ID NO: 319, SEQ ID NO: 326, SEQ ID NO: 318, SEQ ID NO: 325, SEQ ID NO: 364, SEQ ID NO: 311 and SEQ ID NO: 452.
4. An isolated nucleic acid comprising a nucleotide sequence that encodes an immunogenic amino acid sequence effective against H. pylori, the nucleotide sequence being at least 60% homologous to a nucleotide sequence selected from SEQ ID NO: 1-SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274-SEQ ID NO: 327, SEQ ID NO: 329-SEQ ID NO: 364, SEQ ID NO: 366-SEQ ID NO: 405, SEQ ID NO: 407-SEQ ID NO: 472, SEQ ID NO: 475-SEQ ID NO: 491, SEQ ID NO: 983, SEQ ID NO: 984, SEQ ID NO: 987-SEQ ID NO: 1033, SEQ ID NO: 1036 and SEQ ID NO: 1293-SEQ ID NO: 1295.
5. An isolated nucleic acid comprising a nucleotide sequence of at least 8 nucleotides in length which encodes an immunogenic amino acid sequence effective against H. pylori, the nucleotide sequence being hybridizable to a nucleic acid having a nucleotide sequence selected from SEQ ID NO: 1-SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274-SEQ ID NO: 327, SEQ ID NO: 329-SEQ ID NO: 364, SEQ ID NO: 366-SEQ ID NO: 405, SEQ ID NO: 407-SEQ ID NO: 472, SEQ ID NO: 475-SEQ ID NO: 491, SEQ ID NO: 983, SEQ ID NO: 984, SEQ ID NO: 987-SEQ ID NO: 1033, SEQ ID NO: 1036 and SEQ ID NO: 1293-SEQ ID NO: 1295 or the complement thereof.
6. The isolated nucleic acid of claim 4 or 5, wherein the amino acid sequence encoded by said nucleotide sequence is the amino acid sequence of an H. pylori cell envelope polypeptide or a fragment thereof, said selected nucleotide sequence being selected from SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 1010, SEQ ID NO: 1012, SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, SEQ ID NO: 411, SEQ ID NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 417, SEQ ID NO: 430, SEQ ID NO: 992, SEQ ID NO: 1025, SEQ ID NO: 477,
SEQ ID NO: 414, SEQ ID NO: 253, SEQ ID NO: 293, SEQ ID NO: 334, SEQ ID NO: 343, SEQ ID NO: 418, SEQ ID NO: 424 and SEQ ID NO: 443.
7. The isolated nucleic acid of claim 6, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, SEQ ID NO: 1032, SEQ ID NO: 259, SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, SEQ ID NO: 1295, SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, SEQ ID NO: 1031, SEQ ID NO: 254, SEQ ID NO: 352, SEQ ID NO: 415, SEQ ID NO: 1019, SEQ ID NO: 381, SEQ ID NO: 389, SEQ ID NO: 1010, and SEQ ID NO: 1012.
8. The isolated nucleic acid of claim 7, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 255, SEQ ID NO: 263, SEQ ID NO: 266, SEQ ID NO: 277, SEQ ID NO: 280, SEQ ID NO: 285, SEQ ID NO: 292, SEQ ID NO: 294, SEQ ID NO: 299, SEQ ID NO: 311, SEQ ID NO: 312, SEQ ID NO: 313, SEQ ID NO: 321, SEQ ID NO: 327, SEQ ID NO: 329, SEQ ID NO: 331, SEQ ID NO: 353, SEQ ID NO: 364, SEQ ID NO: 366, SEQ ID NO: 368, SEQ ID NO: 375, SEQ ID NO: 384, SEQ ID NO: 391, SEQ ID NO: 392, SEQ ID NO: 397, SEQ ID NO: 398, SEQ ID NO: 402, SEQ ID NO: 404, SEQ ID NO: 409, SEQ ID NO: 410, SEQ ID NO: 412, SEQ ID NO: 427, SEQ ID NO: 433, SEQ ID NO: 434, SEQ ID NO: 441, SEQ ID NO: 444, SEQ ID NO: 445, SEQ ID NO: 449, SEQ ID NO: 450, SEQ ID NO: 452, SEQ ID NO: 453, SEQ ID NO: 466, SEQ ID NO: 468, SEQ ID NO: 469, SEQ ID NO: 983, SEQ ID NO: 989, SEQ ID NO: 1008, SEQ ID NO: 1011, SEQ ID NO: 1014, SEQ ID NO: 1015, SEQ ID NO: 1029, and SEQ ID NO: 1032.
9. The isolated nucleic acid of claim 7, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a C-terminal tyrosine cluster or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 286, SEQ ID NO: 326, SEQ ID NO: 374, SEQ ID NO: 399, SEQ ID NO: 422, SEQ ID NO: 454, SEQ ID NO: 465, SEQ ID NO: 998, SEQ ID NO: 1009, SEQ ID NO: 1023, SEQ ID NO: 1294, and SEQ ID NO: 1295.
10. The isolated nucleic acid of claim 7, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 319, SEQ ID NO: 325, SEQ ID NO: 425, SEQ ID NO: 437, SEQ ID NO: 438, SEQ ID NO: 447, SEQ ID NO: 448, SEQ ID NO: 467, SEQ ID NO: 996, SEQ ID NO: 1027, and SEQ ID NO: 1031.
11. The isolated nucleic acid of claim 6, wherein said H.
pylori cell envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 354, SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, SEQ ID NO: 1022, SEQ ID NO: 463, SEQ ID NO: 281, SEQ ID NO: 988, SEQ ID NO: 411, SEQ ID NO: 407, SEQ ID NO: 1017, SEQ ID NO: 290, SEQ ID NO: 417, SEQ ID NO: 430, SEQ ID NO: 992, and SEQ ID NO: 1025. 12. The isolated nucleic acid of claim 11, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in outer membrane and cell wall synthesis or a fragment thereof and the selected nucleotide sequence is SEQ ID NO: 354. 13. The isolated nucleic acid of claim 11, wherein said H. pylori inner membrane polypeptide or fragment thereof is an H. pylori polypeptide involved in i energy conversion or a fragment thereof and said selected nucleotide sequence is 15 selected from SEQ ID NO: 372, SEQ ID NO: 400, SEQ ID NO: 421, and SEQ ID NO: 1022. 14. The isolated nucleic acid of claim 11, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof and the selected nucleotide S 20 sequence is SEQ ID NO: 463. 15. The isolated nucleic acid of claim 11, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 281 and SEQ ID NO: 988. 16. The isolated nucleic acid of claim 11, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in transport or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 407 and SEQ ID NO: 1017. CD/00369646.5 1115 17. The isolated nucleic acid of claim 6, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H. 
pylori flagellar polypeptide or a fragment thereof and said selected nucleotide sequence is SEQ ID NO: 477.
18. The isolated nucleic acid of claim 6, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a fragment thereof and said selected nucleotide sequence is SEQ ID NO: 414.
19. The isolated nucleic acid of claim 4 or 5, wherein the amino acid sequence encoded by said nucleotide sequence is the amino acid sequence of an H. pylori secreted polypeptide or a fragment thereof, said selected nucleotide sequence being selected from SEQ ID NO: 355, SEQ ID NO: 1006, SEQ ID NO: 257, SEQ ID NO: 258, SEQ ID NO: 260, SEQ ID NO: 261, SEQ ID NO: 264, SEQ ID NO: 265, SEQ ID NO: 268, SEQ ID NO: 270, SEQ ID NO: 272, SEQ ID NO: 274, SEQ ID NO: 275, SEQ ID NO: 276, SEQ ID NO: 279, SEQ ID NO: 283, SEQ ID NO: 284, SEQ ID NO: 287, SEQ ID NO: 288, SEQ ID NO: 289, SEQ ID NO: 291, SEQ ID NO: 295, SEQ ID NO: 296, SEQ ID NO: 297, SEQ ID NO: 298, SEQ ID NO: 300, SEQ ID NO: 301, SEQ ID NO: 302, SEQ ID NO: 303, SEQ ID NO: 304, SEQ ID NO: 305, SEQ ID NO: 314, SEQ ID NO: 315, SEQ ID NO: 323, SEQ ID NO: 338, SEQ ID NO: 342, SEQ ID NO: 348, SEQ ID NO: 349, SEQ ID NO: 356, SEQ ID NO: 358, SEQ ID NO: 359, SEQ ID NO: 360, SEQ ID NO: 361, SEQ ID NO: 362, SEQ ID NO: 363, SEQ ID NO: 367, SEQ ID NO: 370, SEQ ID NO: 371, SEQ ID NO: 373, SEQ ID NO: 377, SEQ ID NO: 378, SEQ ID NO: 379, SEQ ID NO: 380, SEQ ID NO: 388, SEQ ID NO: 390, SEQ ID NO: 394, SEQ ID NO: 395, SEQ ID NO: 396, SEQ ID NO: 401, SEQ ID NO: 403, SEQ ID NO: 405, SEQ ID NO: 408, SEQ ID NO: 420, SEQ ID NO: 426, SEQ ID NO: 428, SEQ ID NO: 429, SEQ ID NO: 432, SEQ ID NO: 439, SEQ ID NO: 442, SEQ ID NO: 451, SEQ ID NO: 471, SEQ ID NO: 478, SEQ ID NO: 488, SEQ ID NO: 987, SEQ ID NO: 990, SEQ ID NO: 991, SEQ ID NO: 993, SEQ ID NO: 1001, SEQ ID NO: 1002, SEQ ID NO: 1007, SEQ ID NO: 1013, SEQ ID NO: 1016, SEQ ID NO: 1018, SEQ ID NO: 1021, and SEQ ID NO: 1026.
20. The isolated nucleic acid of claim 19, wherein said H. pylori secreted polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 355 and SEQ ID NO: 1006.
21. The isolated nucleic acid of claim 4 or 5, wherein the amino acid sequence encoded by said nucleotide sequence is the amino acid sequence of an H. pylori cytoplasmic polypeptide or a fragment thereof, said selected nucleotide sequence being selected from SEQ ID NO: 470, SEQ ID NO: 1033, SEQ ID NO: 357, SEQ ID NO: 457, SEQ ID NO: 461, SEQ ID NO: 1030, SEQ ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, SEQ ID NO: 1003, SEQ ID NO: 351, SEQ ID NO: 416, SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, SEQ ID NO: 1293, SEQ ID NO: 318, SEQ ID NO: 322, SEQ ID NO: 324, SEQ ID NO: 330, SEQ ID NO: 347, SEQ ID NO: 440, SEQ ID NO: 446, SEQ ID NO: 464, SEQ ID NO: 490, SEQ ID NO: 491, SEQ ID NO: 995, SEQ ID NO: 997, SEQ ID NO: 1005, SEQ ID NO: 1028.
22. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 470 and SEQ ID NO: 1033.
23. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 357 and SEQ ID NO: 457.
24. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H.
pylori polypeptide involved in nucleotide metabolism and transport or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 461 and SEQ ID NO: 1030.
25. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 345, SEQ ID NO: 383, SEQ ID NO: 387, SEQ ID NO: 455, and SEQ ID NO: 1003.
26. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 351 and SEQ ID NO: 416.
27. The isolated nucleic acid of claim 21, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in genome replication, transcription, recombination and repair or a fragment thereof and said selected nucleotide sequence is selected from SEQ ID NO: 278, SEQ ID NO: 335, SEQ ID NO: 346, SEQ ID NO: 350, SEQ ID NO: 419, SEQ ID NO: 460, SEQ ID NO: 472, SEQ ID NO: 1000, SEQ ID NO: 1004, SEQ ID NO: 1020, and SEQ ID NO: 1293.
28. The isolated nucleic acid of claim 4 or 5, wherein the amino acid sequence encoded by said nucleotide sequence is the amino acid sequence of an H.
pylori cellular polypeptide or a fragment thereof, said selected nucleotide sequence being selected from SEQ ID NO: 256, SEQ ID NO: 267, SEQ ID NO: 282, SEQ ID NO: 306, SEQ ID NO: 307, SEQ ID NO: 308, SEQ ID NO: 309, SEQ ID NO: 310, SEQ ID NO: 316, SEQ ID NO: 317, SEQ ID NO: 332, SEQ ID NO: 333, SEQ ID NO: 336, SEQ ID NO: 337, SEQ ID NO: 339, SEQ ID NO: 340, SEQ ID NO: 341, SEQ ID NO: 344, SEQ ID NO: 369, SEQ ID NO: 376, SEQ ID NO: 382, SEQ ID NO: 386, SEQ ID NO: 423, SEQ ID NO: 431, SEQ ID NO: 435, SEQ ID NO: 436, SEQ ID NO: 458, SEQ ID NO: 462, SEQ ID NO: 475, SEQ ID NO: 476, SEQ ID NO: 479, SEQ ID NO: 480, SEQ ID NO: 481, SEQ ID NO: 482, SEQ ID NO: 483, SEQ ID NO: 484, SEQ ID NO: 485, SEQ ID NO: 486, SEQ ID NO: 487, SEQ ID NO: 489, SEQ ID NO: 984, SEQ ID NO: 994, SEQ ID NO: 1024, and SEQ ID NO: 1036.
29. The isolated nucleic acid of any preceding claim, wherein the nucleotide sequence comprised by the nucleic acid is at least 70% homologous to the selected nucleotide sequence.
30. The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is at least 80% homologous to the selected nucleotide sequence.
31. The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is at least 90% homologous to the selected nucleotide sequence.
32. The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is at least 95% homologous to the selected nucleotide sequence.
33. The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is at least 98% homologous to the selected nucleotide sequence.
34. The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is at least 99% homologous to the selected nucleotide sequence.
35.
The isolated nucleic acid of any one of claims 1 to 28, wherein the nucleotide sequence comprised by the nucleic acid is identical to the selected nucleotide sequence.
36. A recombinant expression vector comprising the nucleic acid of any preceding claim operably linked to a transcription regulatory element.
37. A cell comprising a recombinant expression vector of claim 36.
38. A method for producing a polypeptide comprising culturing a cell of claim 37 under conditions that permit expression of the polypeptide.
39. A composition comprising an isolated nucleic acid according to any one of claims 1 to 35 and a pharmaceutically acceptable carrier.
40. The composition of claim 39, wherein the pharmaceutically acceptable carrier is an adjuvant.
41. The composition of claim 39, including a genetically engineered attenuated live virus or bacteria, or a recombinant virus-like particle.
42. Use of the nucleic acid of any one of claims 1 to 35, in the manufacture of a composition, for treating a subject for Helicobacter pylori infection.
43. The use according to claim 42, wherein the treatment is a prophylactic treatment.
44. The use according to claim 42, wherein the treatment is a therapeutic treatment.
45. A purified polypeptide comprising an immunogenic amino acid sequence that is effective against H. pylori and is at least 60% homologous to an amino acid sequence selected from SEQ ID NO: 795, SEQ ID NO: 745, SEQ ID NO: 929, SEQ ID NO: 890, SEQ ID NO: 956, SEQ ID NO: 904, SEQ ID NO: 777, SEQ ID NO: 938, SEQ ID NO: 940, SEQ ID NO: 945, SEQ ID NO: 925, SEQ ID NO: 844, SEQ ID NO: 754, SEQ ID NO: 960, SEQ ID NO: 810, SEQ ID NO: 817, SEQ ID NO: 809, SEQ ID NO: 816, SEQ ID NO: 855, SEQ ID NO: 802 and SEQ ID NO: 943.
46. A purified polypeptide comprising an immunogenic amino acid sequence that is effective against H.
pylori and is at least 60% homologous to an amino acid sequence selected from SEQ ID NO: 492-SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765-SEQ ID NO: 818, SEQ ID NO: 820-SEQ ID NO: 846, SEQ ID NO: 848-SEQ ID NO: 896, SEQ ID NO: 898-SEQ ID NO: 963, SEQ ID NO: 966-SEQ ID NO: 982, SEQ ID NO: 1037, SEQ ID NO: 1038, SEQ ID NO: 1041-SEQ ID NO: 1087, SEQ ID NO: 1090 and SEQ ID NO: 1296-SEQ ID NO: 1298.
47. The purified polypeptide of claim 46, wherein said amino acid sequence comprised by the purified polypeptide is the amino acid sequence of an H. pylori cell envelope polypeptide or a fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, SEQ ID NO: 1066, SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 1046, SEQ ID NO: 1079, SEQ ID NO: 968, SEQ ID NO: 905, SEQ ID NO: 744, SEQ ID NO: 784, SEQ ID NO: 825, SEQ
ID NO: 834, SEQ ID NO: 909, SEQ ID NO: 915, and SEQ ID NO: 934.
48. The purified polypeptide of claim 47, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori outer membrane polypeptide or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, SEQ ID NO: 1086, SEQ ID NO: 750, SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298, SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, SEQ ID NO: 1085, SEQ ID NO: 745, SEQ ID NO: 843, SEQ ID NO: 906, SEQ ID NO: 1073, SEQ ID NO: 872, SEQ ID NO: 880, SEQ ID NO: 1064, and SEQ ID NO: 1066.
49. The purified polypeptide of claim 48, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H.
pylori polypeptide having a terminal phenylalanine residue or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 746, SEQ ID NO: 754, SEQ ID NO: 757, SEQ ID NO: 768, SEQ ID NO: 771, SEQ ID NO: 776, SEQ ID NO: 783, SEQ ID NO: 785, SEQ ID NO: 790, SEQ ID NO: 802, SEQ ID NO: 803, SEQ ID NO: 804, SEQ ID NO: 812, SEQ ID NO: 818, SEQ ID NO: 820, SEQ ID NO: 882, SEQ ID NO: 844, SEQ ID NO: 855, SEQ ID NO: 857, SEQ ID NO: 859, SEQ ID NO: 866, SEQ ID NO: 875, SEQ ID NO: 882, SEQ ID NO: 883, SEQ ID NO: 888, SEQ ID NO: 889, SEQ ID NO: 893, SEQ ID NO: 895, SEQ ID NO: 900, SEQ ID NO: 901, SEQ ID NO: 903, SEQ ID NO: 918, SEQ ID NO: 924, SEQ ID NO: 925, SEQ ID NO: 932, SEQ ID NO: 935, SEQ ID NO: 936, SEQ ID NO: 940, SEQ ID NO: 941, SEQ ID NO: 943, SEQ ID NO: 944, SEQ ID NO: 957, SEQ ID NO: 959, SEQ ID NO: 960, SEQ ID NO: 1037, SEQ ID NO: 1043, SEQ ID NO: 1062, SEQ ID NO: 1065, SEQ ID NO: 1068, SEQ ID NO: 1069, SEQ ID NO: 1083, and SEQ ID NO: 1086.
50. The purified polypeptide of claim 48, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a C-terminal tyrosine cluster or fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 777, SEQ ID NO: 817, SEQ ID NO: 865, SEQ ID NO: 890, SEQ ID NO: 913, SEQ ID NO: 945, SEQ ID NO: 956, SEQ ID NO: 1052, SEQ ID NO: 1063, SEQ ID NO: 1077, SEQ ID NO: 1297, SEQ ID NO: 1298.
51. The purified polypeptide of claim 48, wherein said H. pylori outer membrane polypeptide or a fragment thereof is an H. pylori polypeptide having a terminal phenylalanine residue and a C-terminal tyrosine cluster or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 810, SEQ ID NO: 816, SEQ ID NO: 916, SEQ ID NO: 928, SEQ ID NO: 929, SEQ ID NO: 938, SEQ ID NO: 939, SEQ ID NO: 958, SEQ ID NO: 1050, SEQ ID NO: 1081, and SEQ ID NO: 1085.
52. The purified polypeptide of claim 47, wherein said H.
pylori cell envelope polypeptide or a fragment thereof is an H. pylori inner membrane polypeptide or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 845, SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, SEQ ID NO: 1076, SEQ ID NO: 954, SEQ ID NO: 772, SEQ ID NO: 1042, SEQ ID NO: 902, SEQ ID NO: 898, SEQ ID NO: 1071, SEQ ID NO: 781, SEQ ID NO: 908, SEQ ID NO: 921, SEQ ID NO: 1046, and SEQ ID NO: 1079.
53. The purified polypeptide of claim 52, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in outer membrane and cell wall synthesis or a fragment thereof and said selected amino acid sequence is SEQ ID NO: 845.
54. The purified polypeptide of claim 52, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 863, SEQ ID NO: 891, SEQ ID NO: 912, and SEQ ID NO: 1076.
55. The purified polypeptide of claim 52, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof and said selected amino acid sequence is SEQ ID NO: 954.
56. The purified polypeptide of claim 52, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 772 and SEQ ID NO: 1042.
57. The purified polypeptide of claim 52, wherein said H. pylori inner membrane polypeptide or a fragment thereof is an H. pylori polypeptide involved in transport or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 898 and SEQ ID NO: 1071.
58. The purified polypeptide of claim 47, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H.
pylori flagellar polypeptide or a fragment thereof and said selected amino acid sequence is SEQ ID NO: 968.
59. The purified polypeptide of claim 47, wherein said H. pylori cell envelope polypeptide or a fragment thereof is an H. pylori transport polypeptide or a fragment thereof and said selected amino acid sequence is SEQ ID NO: 905.
60. The purified polypeptide of claim 46, wherein said amino acid sequence comprised by the purified polypeptide is the amino acid sequence of an H. pylori cellular polypeptide or a fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 747, SEQ ID NO: 758, SEQ ID NO: 773, SEQ ID NO: 797, SEQ ID NO: 798, SEQ ID NO: 799, SEQ ID NO: 800, SEQ ID NO: 801, SEQ ID NO: 807, SEQ ID NO: 808, SEQ ID NO: 823, SEQ ID NO: 824, SEQ ID NO: 827, SEQ ID NO: 828, SEQ ID NO: 830, SEQ ID NO: 831, SEQ ID NO: 832, SEQ ID NO: 835, SEQ ID NO: 860, SEQ ID NO: 867, SEQ ID NO: 873, SEQ ID NO: 877, SEQ ID NO: 914, SEQ ID NO: 922, SEQ ID NO: 926, SEQ ID NO: 927, SEQ ID NO: 949, SEQ ID NO: 953, SEQ ID NO: 966, SEQ ID NO: 967, SEQ ID NO: 970, SEQ ID NO: 971, SEQ ID NO: 972, SEQ ID NO: 973, SEQ ID NO: 974, SEQ ID NO: 975, SEQ ID NO: 976, SEQ ID NO: 977, SEQ ID NO: 978, SEQ ID NO: 980, SEQ ID NO: 1038, SEQ ID NO: 1048, SEQ ID NO: 1078, and SEQ ID NO: 1090.
61. The purified polypeptide of claim 46, wherein said amino acid sequence comprised by the purified polypeptide is the amino acid sequence of an H.
pylori secreted polypeptide or a fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 846, SEQ ID NO: 1060, SEQ ID NO: 748, SEQ ID NO: 749, SEQ ID NO: 751, SEQ ID NO: 752, SEQ ID NO: 755, SEQ ID NO: 756, SEQ ID NO: 759, SEQ ID NO: 761, SEQ ID NO: 763, SEQ ID NO: 765, SEQ ID NO: 766, SEQ ID NO: 767, SEQ ID NO: 770, SEQ ID NO: 774, SEQ ID NO: 775, SEQ ID NO: 778, SEQ ID NO: 779, SEQ ID NO: 780, SEQ ID NO: 782, SEQ ID NO: 786, SEQ ID NO: 787, SEQ ID NO: 788, SEQ ID NO: 789, SEQ ID NO: 791, SEQ ID NO: 792, SEQ ID NO: 793, SEQ ID NO: 794, SEQ ID NO: 795, SEQ ID NO: 796, SEQ ID NO: 805, SEQ ID NO: 806, SEQ ID NO: 814, SEQ ID NO: 829, SEQ ID NO: 833, SEQ ID NO: 839, SEQ ID NO: 840, SEQ ID NO: 849, SEQ ID NO: 850, SEQ ID NO: 851, SEQ ID NO: 852, SEQ ID NO: 853, SEQ ID NO: 854, SEQ ID NO: 858, SEQ ID NO: 861, SEQ ID NO: 862, SEQ ID NO: 864, SEQ ID NO: 868, SEQ ID NO: 869, SEQ ID NO: 870, SEQ ID NO: 871, SEQ ID NO: 879, SEQ ID NO: 881, SEQ ID NO: 885, SEQ ID NO: 886, SEQ ID NO: 887, SEQ ID NO: 892, SEQ ID NO: 894, SEQ ID NO: 896, SEQ ID NO: 899, SEQ ID NO: 911, SEQ ID NO: 917, SEQ ID NO: 919, SEQ ID NO: 920, SEQ ID NO: 923, SEQ ID NO: 930, SEQ ID NO: 933, SEQ ID NO: 942, SEQ ID NO: 962, SEQ ID NO: 969, SEQ ID NO: 979, SEQ ID NO: 1041, SEQ ID NO: 1044, SEQ ID NO: 1045, SEQ ID NO: 1047, SEQ ID NO: 1055, SEQ ID NO: 1056, SEQ ID NO: 1061, SEQ ID NO: 1067, SEQ ID NO: 1070, SEQ ID NO: 1072, SEQ ID NO: 1075, and SEQ ID NO: 1080.
62. The purified polypeptide of claim 61, wherein said H. pylori secreted polypeptide or a fragment thereof is an H. pylori polypeptide involved in secretion and adhesion or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 846 and SEQ ID NO: 1060.
63. The purified polypeptide of claim 46, wherein said amino acid sequence comprised by the purified polypeptide is the amino acid sequence of an H.
pylori cytoplasmic polypeptide or a fragment thereof, said selected amino acid sequence being selected from SEQ ID NO: 961, SEQ ID NO: 1087, SEQ ID NO: 848, SEQ ID NO: 948, SEQ ID NO: 952, SEQ ID NO: 1084, SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, SEQ ID NO: 1057, SEQ ID NO: 842, SEQ ID NO: 907, SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, SEQ ID NO: 1296, SEQ ID NO: 809, SEQ ID NO: 813, SEQ ID NO: 815, SEQ ID NO: 821, SEQ ID NO: 838, SEQ ID NO: 931, SEQ ID NO: 937, SEQ ID NO: 955, SEQ ID NO: 981, SEQ ID NO: 982, SEQ ID NO: 1049, SEQ ID NO: 1051, SEQ ID NO: 1059, and SEQ ID NO: 1082.
64. The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in energy conversion or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 961 and SEQ ID NO: 1087.
65. The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in amino acid metabolism and transport or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 848 and SEQ ID NO: 948.
66. The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in nucleotide metabolism and transport or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 952 and SEQ ID NO: 1084.
67. The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in cofactor metabolism or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 836, SEQ ID NO: 874, SEQ ID NO: 878, SEQ ID NO: 946, and SEQ ID NO: 1057.
68.
The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in lipid metabolism or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 842 and SEQ ID NO: 907.
69. The purified polypeptide of claim 63, wherein said H. pylori cytoplasmic polypeptide or a fragment thereof is an H. pylori polypeptide involved in genome replication, transcription, recombination and repair or a fragment thereof and said selected amino acid sequence is selected from SEQ ID NO: 769, SEQ ID NO: 826, SEQ ID NO: 837, SEQ ID NO: 841, SEQ ID NO: 910, SEQ ID NO: 951, SEQ ID NO: 963, SEQ ID NO: 1054, SEQ ID NO: 1058, SEQ ID NO: 1074, and SEQ ID NO: 1296.
70. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 70% homologous to the selected amino acid sequence.
71. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 80% homologous to the selected amino acid sequence.
72. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 90% homologous to the selected amino acid sequence.
73. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 95% homologous to the selected amino acid sequence.
74. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 98% homologous to the selected amino acid sequence.
75. The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is at least 99% homologous to the selected amino acid sequence.
76.
The purified polypeptide of any one of claims 45 to 69, wherein the amino acid sequence comprised by the purified polypeptide is identical to the selected amino acid sequence.
77. A composition comprising a purified polypeptide according to any one of claims 45 to 76 and a pharmaceutically acceptable carrier.
78. The composition of claim 77, wherein the pharmaceutically acceptable carrier is an adjuvant.
79. The composition of claim 77, including a substance selected from cholera toxin, a non-toxic derivative of cholera toxin, procholeragenoid, a fungal polysaccharide, muramyl dipeptide, a muramyl dipeptide derivative, a phorbol ester, E. coli labile toxin, a non-H. pylori bacterial lysate, a block polymer, a saponin, biodegradable microcapsules, ISCOMs, cochleates, liposomes, a genetically engineered attenuated live virus or bacteria, and a recombinant virus-like particle.
80. Use of a purified polypeptide according to any one of claims 45 to 76, in the manufacture of a composition, for treating a subject for Helicobacter pylori infection.
81. The use according to claim 80, wherein the treatment is a prophylactic treatment.
82. The use according to claim 80, wherein the treatment is a therapeutic treatment.
83. A method for detecting the presence of a Helicobacter nucleic acid in a sample comprising: contacting the sample with a nucleic acid according to any one of claims 1 to 35 under conditions in which a hybrid can form between the probe and a Helicobacter nucleic acid in the sample; and detecting the hybrid formed in step wherein detection of a hybrid indicates the presence of a Helicobacter nucleic acid in the sample.
84. A compound screening assay comprising the steps of: contacting a test compound with a polypeptide comprising an amino acid sequence that is at least 60% homologous to an amino acid sequence shown in the sequence listing, and determining if the compound binds to the amino acid sequence of the polypeptide.
85.
The assay of Claim 84, wherein the assay is a cell-free assay.
86. The assay of Claim 84 or 85, wherein the amino acid sequence comprised by the polypeptide is at least 70%, 80%, 90%, 95%, 98% or 99% homologous to the selected amino acid sequence, or identical to the selected sequence.
87. A compound screening assay comprising the steps of: contacting a test compound with a nucleic acid comprising a nucleotide sequence that is at least 60% homologous to a nucleotide sequence shown in the sequence listing, and determining if the compound binds to the nucleotide sequence of the nucleic acid.
88. The assay of Claim 87, wherein the assay is a cell-free assay.
89. The assay of Claim 87 or 88, wherein the nucleotide sequence comprised by the nucleic acid is at least 70%, 80%, 90%, 95%, 98% or 99% homologous to the selected nucleotide sequence, or identical to the selected sequence.
90. An isolated nucleic acid according to any one of claims 1 to substantially as hereinbefore described with reference to the examples.
91. A purified polypeptide according to claim 45 or 46, substantially as hereinbefore described with reference to any one of the examples.
ASTRA AKTIEBOLAG
By their Registered Patent Attorneys
Freehills Carter Smith Beadle
10 July 2000
Applications Claiming Priority (11)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US62581196A | 1996-03-29 | 1996-03-29 | |
| US08/625811 | 1996-03-29 | ||
| US75873196A | 1996-04-02 | 1996-04-02 | |
| US08/758731 | 1996-04-02 | ||
| US73690596A | 1996-10-25 | 1996-10-25 | |
| US08/736905 | 1996-10-25 | ||
| US73885996A | 1996-10-28 | 1996-10-28 | |
| US08/738859 | 1996-10-28 | ||
| US76131896A | 1996-12-06 | 1996-12-06 | |
| US08/761318 | 1996-12-06 | ||
| PCT/US1997/005223 WO1997037044A1 (en) | 1996-03-29 | 1997-03-27 | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| AU2598497A (en) | 1997-10-22 |
| AU726892B2 (en) | 2000-11-23 |
Family
ID=27542009
Family Applications (1)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| AU25984/97A Ceased AU726892B2 (en) | 1996-03-29 | 1997-03-27 | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
Country Status (18)
| Country | Link |
|---|---|
| EP (1) | EP0901530A1 (en) |
| JP (1) | JP2000501621A (en) |
| CN (1) | CN1220703A (en) |
| AU (1) | AU726892B2 (en) |
| BR (1) | BR9708456A (en) |
| CA (1) | CA2248985A1 (en) |
| CZ (1) | CZ297698A3 (en) |
| EE (1) | EE9800334A (en) |
| HU (1) | HUP0100267A3 (en) |
| ID (1) | ID18542A (en) |
| IL (1) | IL125808A0 (en) |
| IS (1) | IS4831A (en) |
| NO (1) | NO984517L (en) |
| NZ (1) | NZ332565A (en) |
| PL (1) | PL329045A1 (en) |
| SK (1) | SK130598A3 (en) |
| TR (1) | TR199801939T2 (en) |
| WO (1) | WO1997037044A1 (en) |
Families Citing this family (16)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| NZ304756A (en) * | 1995-04-21 | 2000-02-28 | Univ New South Wales | Protective Helicobacter antigens and vaccine compositions containing them |
| WO1998021225A1 (en) * | 1996-11-14 | 1998-05-22 | Merieux Oravax | Helicobacter polypeptides and corresponding polynucleotide molecules |
| SE9702240D0 (en) * | 1997-06-12 | 1997-06-12 | Astra Ab | Vaccine compositions III |
| RU2227043C2 (en) * | 1998-05-01 | 2004-04-20 | Чирон Корпорейшн | Neisseria meningitidis antigens and compositions |
| EP2261338A3 (en) | 1998-05-01 | 2012-01-04 | Novartis Vaccines and Diagnostics, Inc. | Neisseria meningitidis antigens and compositions |
| WO2000000614A2 (en) * | 1998-06-26 | 2000-01-06 | American Cyanamid Company | NOVEL ANTIGENS OF HELICOBACTER PYLORI |
| DE19847628C2 (en) * | 1998-10-15 | 2000-10-19 | Chiron Behring Gmbh & Co | Helicobacter pylori vaccine |
| GB9825184D0 (en) * | 1998-11-17 | 1999-01-13 | Cortecs Uk Ltd | Antigen |
| JP2002542821A (en) * | 1999-04-30 | 2002-12-17 | ハイブリジェニックス・ソシエテ・アノニム | Prokaryotic DNA collection for two-hybrid systems, Helicobacter pylori protein-protein interaction and uses thereof |
| SE0001988D0 (en) * | 2000-05-29 | 2000-05-29 | A & Science Invest Ab | Novel polypeptides and use thereof |
| WO2002040516A2 (en) * | 2000-11-15 | 2002-05-23 | Ludwig Deml | Helicobacter cysteine rich protein a (hcpa) and uses thereof |
| US7033790B2 (en) | 2001-04-03 | 2006-04-25 | Curagen Corporation | Proteins and nucleic acids encoding same |
| TW200303919A (en) * | 2001-12-05 | 2003-09-16 | Hiroyuki Ohno | Cytotoxic protein and the use |
| US10828358B2 (en) * | 2015-12-14 | 2020-11-10 | Technische Universität München | Helicobacter pylori vaccines |
| CN106868024B (en) * | 2017-04-01 | 2020-03-17 | 山东新创生物科技有限公司 | Clostridium goeri specific PCR detection primer and method |
| CN115991745B (en) * | 2022-07-19 | 2024-12-17 | 四川大学华西医院 | Helicobacter pylori recombinant antigen protein TatB, and preparation method and application thereof |
Citations (1)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| AU6327896A (en) * | 1995-06-07 | 1996-12-30 | Astra Aktiebolag | Nucleic acid and amino acid sequences relating to helicobacter pylori for diagnostics and therapeutics |
| Filing date | Country | Application number | Publication | Status |
|---|---|---|---|---|
| 1997-03-27 | TR | TR1998/01939T | TR199801939T2 (en) | Unknown |
| 1997-03-27 | CZ | CZ982976A | CZ297698A3 (en) | Unknown |
| 1997-03-27 | PL | PL97329045A | PL329045A1 (en) | Unknown |
| 1997-03-27 | CN | CN97195113A | CN1220703A (en) | Pending (active) |
| 1997-03-27 | EE | EE9800334A | EE9800334A (en) | Unknown |
| 1997-03-27 | BR | BR9708456A | BR9708456A (en) | IP Right Cessation (not active) |
| 1997-03-27 | NZ | NZ332565A | NZ332565A (en) | Unknown |
| 1997-03-27 | WO | PCT/US1997/005223 | WO1997037044A1 (en) | Ceased (not active) |
| 1997-03-27 | JP | JP9529649A | JP2000501621A (en) | Pending (active) |
| 1997-03-27 | EP | EP97917731A | EP0901530A1 (en) | Withdrawn (not active) |
| 1997-03-27 | HU | HU0100267A | HUP0100267A3 (en) | Unknown |
| 1997-03-27 | AU | AU25984/97A | AU726892B2 (en) | Ceased (not active) |
| 1997-03-27 | ID | IDP971050A | ID18542A (en) | Unknown |
| 1997-03-27 | SK | SK1305-98A | SK130598A3 (en) | Unknown |
| 1997-03-27 | CA | CA002248985A | CA2248985A1 (en) | Abandoned (not active) |
| 1997-03-27 | IL | IL12580897A | IL125808A0 (en) | Unknown |
| 1998-08-21 | IS | IS4831A | IS4831A (en) | Unknown |
| 1998-09-28 | NO | NO984517A | NO984517L (en) | Unknown |
Also Published As
| Publication number | Publication date |
|---|---|
| IL125808A0 (en) | 1999-04-11 |
| IS4831A (en) | 1998-08-21 |
| HUP0100267A2 (en) | 2001-06-28 |
| JP2000501621A (en) | 2000-02-15 |
| ID18542A (en) | 1998-04-16 |
| WO1997037044A1 (en) | 1997-10-09 |
| PL329045A1 (en) | 1999-03-01 |
| BR9708456A (en) | 1999-08-03 |
| EE9800334A (en) | 1999-04-15 |
| CN1220703A (en) | 1999-06-23 |
| CA2248985A1 (en) | 1997-10-09 |
| HUP0100267A3 (en) | 2002-09-30 |
| EP0901530A1 (en) | 1999-03-17 |
| TR199801939T2 (en) | 1999-02-22 |
| SK130598A3 (en) | 1999-06-11 |
| NZ332565A (en) | 2000-03-27 |
| CZ297698A3 (en) | 1999-02-17 |
| AU2598497A (en) | 1997-10-22 |
| NO984517L (en) | 1998-11-25 |
| NO984517D0 (en) | 1998-09-28 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| AU745787B2 (en) | | Enterococcus faecalis polynucleotides and polypeptides |
| AU762606B2 (en) | | Chlamydia pneumoniae genomic sequence and polypeptides, fragments thereof and uses thereof, in particular for the diagnosis, prevention and treatment of infection |
| AU726892B2 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
| AU2016219667B2 (en) | | Antibacterial phage, phage peptides and methods of use thereof |
| AU754264B2 (en) | | Chlamydia trachomatis genomic sequence and polypeptides, fragments thereof and uses thereof, in particular for the diagnosis, prevention and treatment of infection |
| KR100923598B1 (en) | | Surface protein of Streptococcus piogenes |
| WO1998018931A2 (en) | | Streptococcus pneumoniae polynucleotides and sequences |
| JPH09322781A (en) | | Staphylococcus aureus polynucleotide and sequence |
| WO1997037044A9 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
| AU756010B2 (en) | | Identification of polynucleotides encoding novel helicobacter polypeptides in the helicobacter genome |
| AU2015327511B2 (en) | | Biomarkers for rheumatoid arthritis and usage thereof |
| WO1998058943A1 (en) | | Borrelia burgdorferi polynucleotides and sequences |
| RU2673715C2 (en) | | Haemophilus parasuis vaccine serovar type 4 |
| KR20200044134A (en) | | Selection and use of lactic acid bacteria preventing bone loss in mammals |
| AU734052B2 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
| AU739641B2 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori and vaccine compositions thereof |
| EP0977864A2 (en) | | Antigenic composition and method of detection for helicobacter pylori |
| AU777190B2 (en) | | Streptococcus pneumoniae polynucleotides and sequences |
| AU710880B2 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori for diagnostics and therapeutics |
| AU713692B2 (en) | | Nucleic acid and amino acid sequences relating to helicobacter pylori for therapeutics |
| AU2021240230B2 (en) | | Vaccines and vaccine components for inhibition of microbial cells |
| AU3796099A (en) | | Assays using nucleic acid and amino acid sequences relating to helicobacter pylori |
| AU1546202A (en) | | Enterococcus faecalis polynucleotides and polypeptides |
| AU8938601A (en) | | Lyme disease polynucleotides |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| | FGA | Letters patent sealed or granted (standard patent) | |
| | MK14 | Patent ceased section 143(a) (annual fees not paid) or expired | |