Description

A Multipol program starts by executing the function MultipolMain on every processor. MultipolMain then calls Rts_start(firstThread), which transfers control to the Multipol top-level sheduler after some initialization steps. firstThread is the first thread to execute, and it normally starts other threads in the programs. The runtime system keeps scheduling threads until some thread calls Rts_exit to exit the runtime system. The call transfers control back to the remainder of MultipolMain.

Multipol threads appear to be atomic. All threads execute until completion. Long-live threads can be built on top of atomic threads via a sequence of continuation thread. Threads do not migrate - they are used latency hiding, not for load balance.

Multipol threads can be scheduled by schedulers supplied by the programmer (called customized schedulers). A scheduler is a data structure with two operations: deposit and select. When a thread is created, the user specifies the scheduler to use. When a thread is enabled for execution, the runtime system calls its designated deposit operation to store the thread handle. The top level scheduler calls the select operation periodically to choose the next thread to execute. Multipol provides a variety of default schedulers such as FIFO queues. It also provides a preemptive scheduler which allows threads to preempt running threads to reduce overhead. The preemptive scheduler must be used with caution so that atomicity is maintained and no starvation is incurred.

Different threads on the same processor share the same local memory. Threads running on different processors communicate via asynchronous messages. A thread can transfer local data to a preallocated buffer on a remote processor and invoke a handler thread to process the message. It can also invoke a remote thread with an arbitrary number of arguments. The Multipol runtime system attempts to optimize communication performance, using knowledge of the machine architecture and the semantics of the schedulers. For example, a preemptive thread can usually execute as an active message if its arguments fit in a machine packet. For machines with high communication start-up overheads, the runtime system automatically aggregates small messages into large physical messages to improve bandwidth.

For data structures and applications that are not multi-threaded, the runtime system provides a SPMD execution environment between the Rts_enable and Rts_disable calls. The user must call Rts_pollRts periodically to ensure progress of all data structures, including those that are multithreaded.

Rts_Store, Rts_Store1, Rts_Get, Rts_Put are split-phase operations. Split-phase operations return OK if it has completed upon return (i.e., no need to create a thread to wait for the the result), and WAIT otherwise. Non-split-phase primitives whose return type is Flag return OK if they succeed, and FAIL if exceptions occur.

Except for Rts_start, Rts_enable, Rts_disable, Rts_registerInit, Rts_registerCleanUp, and Rts_registerSelect, all primitives must be called only when the top-level scheduler is running.

`Rts_numProc, Rts_myProc`

Rts_numProc and Rts_myProc are constants that are defined as the number of processors in the system and the identity of the local processor (ranging from 0 to Rts_numProc - 1), respectively.

`Rts_outputStats`

Setting Rts_outputStats to 1 on a processor enables its statistics output. Setting it to zero disable the statistics output. The default is 0.