Deprecated: The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456 Steven Salzberg's home page
Genomics research papers (click
on the
titles to download a reprint)
Genome
analysis linking recent European and African influenza (H5N1) viruses.
Steven L. Salzberg,
Carl Kingsford, Giovanni Cattoli, David J. Spiro, Daniel A. Janies,
Mona Mehrez Aly, Ian H. Brown, Emmanuel Couacy-Hymann, Gian Mario
De Mia, Do Huu Dung, Annalisa Guercio, Tony Joannis, Ali Safar
Maken Ali, Azizullah Osmani, Iolanda Padalino, Magdi D.
Saad, Vladimir Savić, Naomi A. Sengamalay, Samuel Yingst,
Jennifer Zaborsky, Olga Zorman-Rojs, Elodie Ghedin, and Ilaria
Capua. Emerging Infectious Diseases
13:5 (May 2007).
The
Sequence of the Human Genome. (free at the Science
website) J. Craig Venter et al. (274 authors), Science 291 (2001),
1304-1351. Get the figures showing genome-scale duplications in
PDF format here: [Page 1]
[Page 2]
Genome Sequence of the Radioresistant Bacterium Deinococcus
radiodurans R1 (get
abstract). Owen White et al. , Science 286 (1999), 1571-1577.
DNA uptake signal sequences in naturally transformable
bacteria. H.O. Smith, M.L. Gwinn, and S.L. Salzberg. Research
in Microbiology, 150 (1999), 603-616.
Evidence for lateral gene transfer between Archaea and Bacteria
from genome sequence of Thermotoga maritima (get
abstract). Karen E. Nelson et al. , Nature
399 (1999), 323-329.
Chromosome 2 sequence of the human malaria parasite Plasmodium
falciparum (get
abstract). Malcolm J. Gardner et al., Science282 (1998), 1126-1132.
Complete Genomic Sequence of Treponema pallidum, the
Syphilis Spirochete. C.M. Fraser et al., Science 281 (1998), 375-388.
Genomic Sequence of a Lyme Disease Spirochaete, Borrelia
burgdorferi. C.M. Fraser et al., Nature 390
(1997), 580-586.
Hierarchical
scaffolding with Bambus. M. Pop, D. Kosack, and S.L.
Salzberg. Genome Research
14(2004), 149-159. This describes our open source system for the
scaffolding phase of genome assembly.
Computational
gene prediction using multiple sources of evidence. J.E.
Allen, M. Pertea, and S.L. Salzberg. Genome Research 14(2004),
142-148. This describes our open source system for producing a
gene prediction based on multiple gene finders, alignment programs, and
other evidence.
Book: Computational Methods in Molecular Biology (1998;
in paperback since 1999) edited by S.L. Salzberg, D.B. Searls, and S.
Kasif. See
the table
of contents here.
Alignment of
Whole Genomes A.L. Delcher, S. Kasif, R.D. Fleischmann, J.
Peterson, O. White, and S.L. Salzberg. Nucleic Acids Research,
27:11 (1999), 2369-2376. Note that Figure
6 is
supposed to be in color, and was mistakenly printed as black and
white. Click
here for the color figure (PDF). This describes the MUMmer
system, available below.
Best-Case
Results for Nearest-Neighbor Learning. S.L. Salzberg, A.L.
Delcher, D. Heath, and S. Kasif.IEEE Transactions on Pattern
Analysis
and Machine Intelligence17:6, June 1995, 599-608. (Earlier version,
"Learning with a Helpful Teacher," appeared in the IJCAI-91
proceedings.)
Glimmer, a
system that uses interpolated Markov models to find genes in microbial
DNA. March 2003: New release, version 2.1, automatically optimizes ORF
length for training.
TWAIN,
a Generalized Pair HMM to predict genes simultaneously in two closely
related eukaryotic organisms.
GlimmerHMM
(formerly GlimmerM), an
interpolated Markov Model system for finding genes in many eukaryotes,
including P.
falciparum, A. thaliana,
rice (O. sativa), mosquito (A. aegypti), B. malayi, C. neoformans, and others.
GeneZilla, a generalized
HMM for eukaryotic gene finding that improves upon and replaces TigrScan,
a 2003-vintage generalized HMM with a
design similar to Genscan.
JIGSAW
(formerly called Combiner) a program that
predicts gene models using
the output from multiple sources of evidence, including other gene
finders, Blast searches, and other alignment data.
GeneSplicer,
a fast
system for detecting splice sites in genomic DNA of various eukaryotes.
PIRATE,
a
website collecting many links to our gene finders and others.
Genome assembly and large-scale genome alignment
MUMmer, a
system for aligning whole genomes, chromosomes, and other very long DNA
sequences. Since April 2003:
MUMmer 3.0 and later releases are open source.
The AMOS Assembler
project is a set of tools, libraries, and freestanding genome
assemblers, all open source. AMOS is also an open consortium that
includes TIGR, the University of Maryland, The Karolinska Institutet,
and the Marine Biological Laboratory.
Hawkeye, a
flexible graphical interface to genome assemblies from a variety of
assemblers.
AMOScmp
is a comparative genome assembler, which uses one genome as a reference
on which to assemble another, closely related species. See the journal
paper here.
MINIMUS
(new in August 2004) is a
small, lightweight assembler for small jobs such as assembling a viral
genome, assembling a set of reads that match a single gene, or other
tasks that don't require the complex infrastructure of a large-genome
assembler.
BAMBUS the first
publicly available, standalone genome sequence
scaffolding program. It orders and orients contigs into scaffolds based
on various types of linking information.
AutoEditor, a tool
for
correcting sequencing and basecaller errors using sequence assembly
and chromatogram data. On average AutoEditor corrects 80% of erroneous
base calls, with an accuracy of 99.99%.
Transcription terminators, operons, and motif analysis tools
ELPH, a
motif
finder that can find ribosome binding sites, exon splicing enhancers,
or regulatory sites.
SeeESE,
an online tool for identifying exon splicing enhancers (ESEs) in
Arabidopsis, Drosophila, and other species.
TransTermHP (new release, November 2005), a program that finds rho-independent transcription
terminators in bacterial genomes.
OperonDB (new release, fall 2006),
results from our operon-finding software on a large number of
prokaryotic genomes. (Originally described in Ermolaeva et al., Prediction
of operons in microbial genomes.)
My father, Herman
Salzberg, has a home page at the University of South Carolina.
My brother Alan Salzberg is president of Quantitative Analysis,
Inc., a statistical consulting company.
My wife Claudia and I have two daughters, Annika and Alyssa.