Deprecated : The each() function is deprecated. This message will be suppressed on further calls in /home/zhenxiangba/zhenxiangba.com/public_html/phproxy-improved-master/index.php on line 456
List of Selected Papers
Sparsity/Bebop
Performance Optimizations and Bounds for Sparse Matrix-Vector Multiply
Richard Vuduc, James W. Demmel, Katherine A. Yelick,
Shoaib Kamil, Rajesh Nishtala, Benjamin Lee.
SC 2002 (High Performance
Networking and Computing, commonly called "Supercomputing").
Baltimore, November 2002.
Available in pdf (834k)
| Gzip'd PostScript (2.7M)
Automatic Performance Tuning and Analysis of Sparse
Triangular Solve
Richard Vuduc, Shoaib Kamil, Jen Hsu, Rajesh Nishtala,
James W. Demmel, Katherine A. Yelick.
ICS 2002 :
Workshop
on Performance Optimization via High-Level Languages and
Libraries . New York, June 22-26, 2002.
Available in pdf (548k)
| Gzip'd PostScript (1.2M)
Optimizing Sparse Matrix-Vector Multiplication for Register Reuse
E. Im and K. A. Yelick
International Conference on Computational Science, San Francisco,
California, May 2001.
(Postscript)
Optimizing Sparse Matrix Kernels for Data Mining
E. Im and K. A. Yelick
Proceedings of the Text Mine Workshop
Chicago, IL, April 2001.
(Postscript)
Optimizing Sparse Matrix Vector Multiplication on SMPs
E. Im and K. A. Yelick
SIAM Conf. Parallel Processing for Scientific Computing, San
Antonio, TX, March 1999.
(Postscript)
Model-based Memory Hierarchy Optimizations for Sparse Matrices
E. Im and K. A. Yelick
Workshop on Profile and Feedback-Directed Compilation, Paris,
France, October 1998.
(Postscript)
Intelligent RAM (IRAM)
Memory-Intensive Benchmarks: IRAM vs. Cache-Based Machines,
Brian R. Gaeke, Parry Husbands, Xiaoye S. Li, Leonid Oliker,
Katherine A. Yelick, and Rupak Biswas.
Proceedings of the International Parallel and Distributed Processing
Symposium (IPDPS). Ft. Lauderdale, FL.
April, 2002.
Available in PDF .
Hardware/Compiler Co-development for an Embedded Media Processor,
C. Kozyrakis, D. Judd, J. Gebis, S. Williams, D. Patterson, K. Yelick,
Proceedings of the IEEE, vol. 89, no. 11, November 2001 (p. 1694-709).
Draft available in PDF .
Exploiting On-Chip Memory Bandwidth in the VIRAM Compiler,
D. Judd, K. Yelick, C. Kozyraki, D. Martin, and D. Patterson,
Second Workshop on Intelligent Memory Systems, Cambridge,
November 2000.
Available in Postscript .
Performance Analysis of an H.263 Video Encoder on VIRAM,
T. Nguyen, A. Zakhor and K. Yelick
International Conference on Image Processing (ICIP) ,
Vancouver, B.C., Canada, September 2000.
Available in PDF
Efficient FFTs on IRAM
Thomas, R. and Yelick, K.
First Workshop on Media Processors and DSPs ,
November 15, 1999.
Postscript available.
Scalable processors in the billion-transistor era: IRAM
Kozyrakis, C.E., Perissakis, S., Patterson, D., Anderson, T.,
Asanovic, K., Cardwell, N., Fromm, R., Golbus, J., Gribstad, B.,
Keeton, K., Thomas, R., Treuhaft, N., Yelick, K.
Computer , vol.30, (no.9), IEEE Comput. Soc,
Sept. 1997. p.75-8.
Available in PDF .
The Energy Efficiency of IRAM Architectures
R. Fromm, S. Perissakis, N. Cardwell, D. Patterson, T. Anderson,
and K. Yelick
Proceedings of the 24th Annual International Conference on
Computer Architecture, June 1997.
Available in Postscript .
A Case for Intelligent DRAM: IRAM
D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton,
C. Kozyrakis, R. Thomas, and K. Yelick.
IEEE Micro , April 1997, pp. 34-44.
Also appeared as an Award Paper, Hot Chips VIII , August 1996.
Available in PDF or
Postscript .
Intelligent RAM (IRAM): Chips that remember and compute
D. Patterson, T. Anderson, N. Cardwell, R. Fromm, K. Keeton,
C. Kozyrakis, R. Thomas, and K. Yelick.
Proceedings of the 1997 IEEE International Solid-State Circuits
Conference, February 1997, pp. 224-225.
Available in PDF or
Postscript .
Clusters (includes ISTORE and ROC)
ROC-1: Hardware Support for Recovery-Oriented Computing.
Oppenheimer, D., A. Brown, J. Beck, D. Hettena, J. Kuroda,
N. Treuhaft, D.A. Patterson, and K. Yelick.
IEEE Transactions on Computers Special Issue on
Embedded Fault-Tolerant Computer Systems, Jul.-Aug., 2001.
Available in PDF
Cluster I/O with River: Making the Fast Case Common
R. H. Arpaci-Dusseau, E. Anderson, N. Treuhaft, D. E. Culler,
J. M. Hellerstein, D. A. Patterson, and K. A. Yelick
Workshop on I/O in Parallel and Distributed Systems, Atlanta, GA,
May 1999.
Postscript available.
Parallel Applications
Performance Modeling and Composition: A Case Study in
Cell Simulation
Seve G. Steinberg, Jun Yang, and Katherine Yelick, IPPS '96
April 1996.
Abstract ,
Postscript available.
Parallelizing the Phylogeny Problem
J. Jones and K. Yelick, Supercomputing '95
December 1995.
Abstract ,
Postscript available.
Connected Components on Distributed Memory Machines
A. Krishnamurthy, S. Lumetta, D. Culler, and K. Yelick,
June 1994.
Abstract ,
Postscript available.
Parallel Timing Simulation on a Distributed Memory Multiprocessor
Chih-Po Wen and Katherine Yelick, International Conference on
Computer Aided Design, Santa Clara, California, November 1993.
Abstract ,
Postscript available.
Implementing an Irregular Application on a Distributed Memory
Multiprocessor
Soumen Chakrabarti and Katherine Yelick, ACM Symposium on
Principles and Practice of Parallel Programming, San Diego,
California, June 1993.
Abstract ,
Postscript available.
A Parallel Completion Procedure for Term Rewriting Systems
Katherine Yelick and Stephen J. Garland, Conference on
Automated Deduction , June 1992.
Abstract ,
Postscript available.
Compilation
Analyses and Optimizations for Shared Address Space Programs
A. Krishnamurthy and K. Yelick
Journal of Parallel and Distributed Computation , 1996.
Postscript available.
Optimizing Parallel Programs with
Explicit Synchronization
Arvind Krishnamurthy and Katherine Yelick,
Programming Language Design and Implementation,
La Jolla, California, June 1995.
Abstract ,
Postscript available.
Optimizing Parallel SPMD Programs
Arvind Krishnamurthy and Katherine Yelick,
Seventh Annual Workshop on Languages and Compilers for Parallel
Computing, Ithaca, New York, August 1994.
Abstract ,
Postscript available.
Compiling Sequential Programs for Speculative Parallelism
Chih-Po Wen and Katherine Yelick, International Conference on
Parallel and Distributed Systems, National Taiwan University,
Taiwan, December 1993.
Abstract ,
Postscript available.
Scheduling and Load Balancing
Models and Scheduling Algorithms for Mixed Data and Task Parallel
Programs
S. Chakrabarti, J. Demmel, and K. Yelick
Journal of Parallel and Distributed Computing ,
Vol. 47, pp. 168--184. 1997.
Modeling the Benefits of Mixed Data and Task Parallelism
Soumen Chakrabarti, James Demmel, and Katherine Yelick,
Symposium on Parallel Algorithms and Architectures, Santa Barbara,
California, July 1995.
Abstract ,
Postscript available.
Randomized Load Balancing for Tree Structured Computation
Soumen Chakrabarti, Abhiram Ranade, and Katherine Yelick,
IEEE Scalable High Performance Computing Conference, Knoxville,
Tennessee, May 1994.
Abstract ,
Postscript available.
Distributed Data Structures & the Multipol Library
Portable Parallel Irregular Applications.
K. Yelick, C.-P. Wen, S. Chakrabarti, E. Deprit,
J. Jones, A. Krishnamurthy, Workshop on Parallel Symbolic
Languages and Systems, Beaune, France, October 1995.
To appear in Lecture Notes in Computer Science.
Abstract ,
Postscript available.
Multipol: A Distributed Data Structure Library.
S. Chakrabarti, E. Deprit, J. Jones, A. Krishnamurthy,
E.-J. Im, C.-P. Wen, and K. Yelick, UCB//CSD-95-879, July 1995.
Abstract ,
Postscript available.
Portable Runtime Support for Asynchronous Simulation
Chih-Po Wen and Katherine Yelick, International Conference on
Parallel Processing, August 1995.
Abstract ,
Postscript available.
Portable Runtime Support for Asynchronous Simulation
C.-P. Wen, S. Chakrabarti, E. Deprit,
A. Krishnamurthy and K. Yelick,
``Runtime Support for Portable Distributed Data Structures,''
Workshop on Languages, Compilers, and Runtime Systems for
Scalable Computers, May 1995.
Postscript available.
Distributed Data Structures and Algorithms for Gröbner Basis
Computation
Soumen Chakrabarti and Katherine Yelick, Lisp and Symbolic
Computation, Vol. 7, 1994.
Abstract available.
Data Structures for Irregular Applications
K. Yelick, S. Chakrabarti, E. Deprit, J. Jones, A. Krishnamurthy,
and C.-P. Wen, DIMACS Workshop on Parallel Algorithms for
Unstructured and Dynamic Problems, Piscataway, New Jersey, June 1993.
Abstract ,
Postscript available.
Programming Models for Irregular Applications
Katherine Yelick.
Workshop on Languages and Compilers and Run-Time Environments
for Distributed Memory Multiprocessors , October 1992.
Also appeared in SIGPLAN Notices, January 1993.
Postscript available.
A Survey of Portable Message Passing Libraries
Chih-Po Wen and Katherine Yelick, unpublished manuscript,
October 15, 1992.
Postscript available.
Parallel Languages: Split-C, Titanium, and UPC
An Evaluation of Current High Performance Networks ,
C. Bell, D. Bonachea, Y. Cote, J. Duell, P. Hargrove,
P. Husbands, C. Iancu, M. Welcome, K. Yelick,
International Parallel and Distributed Processing Symposium,
Nice, France, April 22-26, 2003.
Available in PDF
Introduction to UPC and Language Specification ,
W. Carlson, J. Draper, D. Culler, K. Yelick, E. Brooks, and K. Warren,
CCS-TR-99-157, IDA Center for Computing Sciences, 1999.
Available in PDF
Titanium: A High-Performance Java Dialect
K. A. Yelick, L. Semenzato, G. Pike, C. Miyamoto, B. Liblit, A.
Krishnamurthy, P. N. Hilfinger, S. L. Graham, D. Gay, P. Colella,
and A. Aiken
Concurrency: Practice and Experience ,
Vol. 10, No. 11-13, September-November 1998. An earlier version was
presented at the Workshop on Java for High-Performance Network Computing,
Palo Alto, CA, Feb. 1998.
Postscript available.
Empirical Evaluation of Global Memory Support on the Cray-T3D and
Cray-T3E
A. Krishnamurthy, D. Culler, and K. Yelick
UCB//CSD-98-991, 1998.
Postscript available.
Evaluation of Architectural Support for Global
Address-Based Communication in Large-Scale Parallel Machines
Arvind Krishnamurthy, Klaus E. Schauser, Chris Scheiman, Randy Wang,
David Culler, and Katherine Yelick,
Proceedings of Architecture Support on Programming Languages and
Operating Systems, Cambridge, MA, November 1996.
Postscript available.
Empirical Evaluation of the CRAY-T3D: A Compiler Perspective
Remzi H. Arpaci, David E. Culler, Arvind Krishnamurthy,
Steve G. Steinberg, and Katherine Yelick,
International Symposium on Computer Architecture,
Santa Margherita Ligure, Italy, June 1995.
Abstract ,
Postscript available.
Parallel Programming in Split-C
D. Culler, A. Dusseau, S. Goldstein, A. Krishnamurthy, S. Lumetta,
T. von Eicken, and K. Yelick, Supercomputing, Portland, Oregon,
November 1993.
Abstract ,
Postscript available.
Symbolic Computation
On the Correctness of a Distributed Memory Gröbner Basis
Algorithm
Soumen Chakrabarti and Katherine Yelick, International Conference
on Rewriting Techniques and Applications, Montreal, Canada, June 1993.
Abstract ,
Postscript available.
Compiling Verilog into Finite State Machines
S.-T. Cheng, R. Brayton, G. York, K. Yelick, A. Saldanha.
International Verilog Conference, 1995.
Abstract ,
Postscript available.
Using Abstraction in Explicitly Parallel Programs
Katherine A. Yelick, MIT Laboratory for Computer Science,
July 1991, TR-507. (Revised from PhD Thesis, December 1990.)
Abstract ,
Postscript available.
A Generalized Approach to Equational Unification
Katherine A. Yelick, MIT Laboratory for Computer Science,
August 1985, TR-344.