Next: Defining Data Types Up: Getting Started with Shore Previous: Contents

Subsections

Introduction

This tutorial explains, through the use of detailed examples, how to write application programs in C++ that store and manipulate complex data structures in Shore.

What this Tutorial Is

This tutorial illustrates many aspects of Shore, including

how to use the SDL data-definition language to define data types of persistent data,
how to use the SDL compiler to translate these definitions into C++ class definitions,
how to implement the member functions of these classes in C++,
how to write a driver program that creates, looks up, and destroys objects,
how to compile, build, and run the application program,
how to perform an "inventory" of persistent objects that have been created, and
how to combine Shore application programs with "legacy" Unix programs that access Shore objects as if they were Unix files.

What this Tutorial Is Not

This tutorial does not try to do everything. In particular:

It is not a general introduction to Shore, its goals, structure or status. You should read An Overview of Shore before reading any further.
It is not a reference manual. Reference manuals exist or are being written for all aspects of Shore. See The Shore Release for an index to the rest of the documentation.
It does not even attempt to demonstrate all the features of Shore, just a subset sufficient to get you started.
Most importantly, it is not a tutorial on how to write high-quality, efficient applications. It uses a rather contrived example designed to show off several features, but it does not claim the algorithms presented are a good way to manipulate persistent data.

The example program stree uses an unbalanced binary search tree as an inverted index to a set of documents. The tree contains one node for each distinct word appearing in any of the documents. Associated with each word is a set of citations, each of which indicates a document and the offset within the document of the start of a line containing the word. Each tree node, citation, and document is a separate object stored in the Shore database. Although the documents are actually stored in the Shore database, they are stored in such a way that existing Unix programs can manipulate them as if they were ordinary files.

The program has options to add and remove documents from the database and to list the lines containing a given word. It also has a debugging option that dumps all the objects in the database. This option illustrates how to write a maintenance program that iterates through the objects in the database in a "raw" form.

The tutorial then shows how to modify the example to use the index facility of Shore to accomplish the same task in a different way.

What the Examples Demonstrate

The example programs illustrate how to define objects with methods ("member functions" in C++) linked together with pointers (called "references" in Shore). They show how to use relationships, which generalize pointers, adding the ability to represent "1-to-N" and "M-to-N" associations and automatically maintain inverse "pointers". They also illustrate the Unix-compatibility features of Shore.

What the Examples are Not

First and foremost, the binary search tree program does not illustrate the best-or even a good-way to build inverted indices. The second example, which uses Shore's built-in index feature, is closer to the way a real application would accomplish this task. A binary search tree is not a very good data structure for disk-based data structures, since fetching each node requires a disk access. In fact, even this program was designed to run in main memory, a hash table would be better! The example was chosen to illustrate how a program that manipulates linked data structures can easily be adapted to make those structures persistent, but converting a main-memory program to run efficiently with persistent data generally requires a careful re-design of data structures and algorithms.

The examples do not illustrate all the features of Shore or the SDL data-definition language. They do not use all the available pre-defined data structures (such sequences or bags), the "module" facility for managing large complex designs, or inheritance (SDL supports multiple inheritance). See The SDL Reference Manual) for more details.

Reading the Example Program

This tutorial walks through the sources of the search-tree program in detail. These sources, as well as associated test programs and data, may be found in the src/examples/stree sub-directory of the distribution. They are also included in the Appendix. The next few sections walk through this example in detail. It will be useful to keep a copy of the program sources close to hand. Throughout this tutorial, we will assume (as does the Shore Software Installation Manual), that the environment variable $SHROOT contains the absolute path name of the root directory of the installed Shore software.

Next: Defining Data Types Up: Getting Started with Shore Previous: Contents

This page was generated from LaTeX sources
10/27/1997