University of Houston
Department of Computer Science


In partial fulfillment of the Requirements for the Degree of

Doctor of Philosophy



Ling-Hua Chang

will defend her dissertation


Efficient Compiler Techniques for Data Driven Languages





Abstract


Production system programs have been notorious for their inability to handle large data sets. The primary cause of their poor scalability is the combinatorial explosion in the number of possible matches which arises from the need to match conjunctive conditions where each conjunct can match the whole data set. The primary limitation on parallelism in production system programs is the data-dependent nature of the computation combined with a lack of information about the run-time contents of working memory and what instantiations might be chosen. This dissertation investigates two approaches for handling large data sets in production system programs -- scalable parallelism and scalable match algorithms. The Rete match algorithm was parallelized for the match phase, and three different approaches were studied for the select phase: sequential firings, partial parallelized firings, and fully parallelized firings. Our experiments on an IBM SP2 demonstrate that the approach fully parallelized firings improves significantly the performance over partial parallelized firings, which in turn was better than sequential firings. We obtained super-linear scalability on several benchmark programs (when executing them on between 1 and 32 nodes). We also ported this compiler to an HP Itanium2 cluster to demonstrate the platform independence of our MPI-based approach. Both versions show similar speed-ups.




Date: Monday, March 8, 2004
Time: 10:00 AM
Place: 550-PGH



Faculty, students, and the general public are invited.
Thesis Advisor: Dr. Ernst L. Leiss