Next: About this document ... Up: Summary of Recent Research Previous: 2. Environments for Combinatorial

3. Combinatorial Algorithms and Data Structures

Geometric Reconstruction Problems. CRC Handbook of Discrete and Computational Geometry, ed. J. E. Goodman and J. O'Rourke, CRC Press, 481-490, 1997.
Interactive Reconstruction via Probing, (invited paper) Proceedings of the IEEE, 80 (1992) 1364-1383.

Geometric probing considers problems of determining a geometric structure or some aspect of that structure from the results of a mathematical or physical measuring device, a probe. A variety of problems from robotics, medical instrumentation, mathematical optimization, integral and computational geometry, graph theory and other areas fit into this paradigm. Beginning with my dissertation, I have worked to develop a field of geometric probing. The emphasis is on interactive reconstruction, where the results of all previous measurements are used to determine the orientation of the next probe so it provides the maximum amount of information about the structure. Through interactive reconstruction, we have developed finite determination strategies for such diverse models as finger, x-ray, and half-plane probes.

Efficient Data Structures for Maintaining Set Partitions (with Michael Bender and Saurabh Sethia). In preparation.

Efficiently maintaining the partition induced by a set of features is an important problem in building decision-tree classifiers. In order to identify a small set of discriminating features, we need the capability of efficiently adding and removing specific features and determining the effect of these changes on the induced classification, or partition.

We introduce a variety of efficient (randomized and deterministic) data structures to support these operations on both general and geometrically-induced set partitions. We give both Monte Carlo and Las Vegas algorithms which realize near-optimal time bounds and are practical to implement. We provide an efficient suffix-tree based algorithm for a more general problem on maintaining the sorted order of strings under character insertion/deletion and introduce an interesting related problem on simultaneously sorting a binary matrix by both rows and columns.

Matching for Run-Length Encoded Strings (with A. Apostolico and G. Landau). Journal of Complexity, 15 (1999) 4-16. Special issue for papers from Sequences '97, Positano Italy, June 11-13, 1997.
Probe Trees for Touching Character Recognition (with G. Sazaklis, E. Arkin, and J. Mitchell) Proc. International Conference on Imaging Science, Systems and Technology, (CISST) Las Vegas, NV July 6-9, 1998, pp. 282-289.

A well-known dynamic programming algorithm computes the longest common subsequence of strings X and Y in $O(\vert X\vert \cdot \vert Y\vert)$ time. In this paper, we develop significantly faster algorithms for a special class of strings which emerge frequently in pattern matching problems. In particular, we present the first algorithm which finds the longest common subsequence of strings X and Y in time polynomial in the size of the compressed strings. Our final algorithm runs in $O(k l \log ( k l ) )$ time, where k and l are the compressed lengths of strings X and Y, and is a substantial improvement on the previously best algorithm of Bunke and Csirik, which runs in O(l|Y|+k|X|) time.

The need to approximately match run-length encoded strings emerged during development of an optical character recognition (OCR) system. This system, built in association with Data Capture Systems Inc. has been designed to achieve a low substitution error-rate via fixed-font character recognition.

Next: About this document ... Up: Summary of Recent Research Previous: 2. Environments for Combinatorial

Steve Skiena
1999-12-04