The REXX/imc Technical Reference This reference contains information for programmers of functions, subcommand interfaces and other utilities to be called by or to call the Rexx interpreter. Information for Rexx programmers can be found in rexx.summary, rexx.info and rexx.ref. ______________________________________________________________________________ 1. The Application Programming Interface REXX/imc has an implementation of the "SAA API" based on that of OS/2. This API is at present partially implemented and "mostly" compatible with OS/2, but it will be standardised and completed in a future release. Programs using the API are typically written in C. The supplied header file and the descriptions below use C declarations to describe the available functions and their parameters. The API allows an application to start the REXX interpreter under program control, to add subcommand environments and external functions, to access the REXX variable pool, and to install system exits to process certain interpreter activities. A C program using the API should include the header file "rexxsaa.h". Before including this file, the program should define appropriate preprocessor symbols to indicate which parts of the API are required. By default, the header file includes only enough information for the application to start the interpreter. The symbols which may be defined are: INCL_RXSUBCOM include definitions for the subcommand interface INCL_RXSHV include definitions for the variable pool interface INCL_RXFUNC include definitions for the external function interface INCL_RXSYSEXIT include definitions for the system exit interface. If all of the above are required, the program may just define: INCL_REXXSAA include definitions for the whole API. When an application is compiled, it should be linked with the REXX/imc library using the compiler flag "-lrexx". The compiler may need to be told the location of the library file using the "-L/path/" flag. Either a statically linked library or a dynamically linked library may be used. Details of how to make the libraries are contained in the installation instructions for REXX/imc. 1(a). RXSTRINGs The rexxsaa.h header file defines the RXSTRING datatype as follows to hold a REXX string: typedef struct { ULONG strlength; /* length of the string */ char *strptr; /* pointer to the string */ } RXSTRING; typedef RXSTRING *PRXSTRING; /* a pointer to an RXSTRING */ All the API functions which deal with REXX strings use this structure. Any RXSTRING which has a value will have the strptr field set to a valid character pointer and the strlength field set to a length value. The empty string is represented by having the strlength field set to zero. An RXSTRING may be NULL, in which case the strptr field will be set to a NULL pointer. This represents an undefined string, for example the second parameter in the function call foo(1,,2), and is not the same as an empty string, for example the second parameter in the function call foo(1,'',2). Associated with the RXSTRING structure are the following helper macros: MAKERXSTRING(rxstring,ptr,len) - sets the strptr field of the string to the given a pointer and the strlength field to the given length. RXNULLSTRING(x) - returns true if x is a NULL string. RXSTRLEN(x) - returns the length of the string x. This function returns zero if x is a NULL string. RXSTRPTR(x) - returns the strptr field of the string x. RXVALIDSTRING(x) - returns true if x is neither NULL nor of length zero. RXZEROLENSTRING(x) - returns true if x is not NULL but is of length zero. Note that since these are macros they should not be trusted with arguments that have side-effects. 1(b). Starting the REXX interpreter The function RexxStart invokes the interpreter to execute a REXX program. long RexxStart(long argc, PRXSTRING argv, PSZ *name, PRXSTRING instore, PSZ envname, long calltype, PRXSYSEXIT exits, PSHORT rc, PRXSTRING result); The parameters are: argc: the number of arguments passed to the program argv: an array of argc RXSTRINGs containing the arguments passed to the program. Any omitted arguments should be passed as NULL strings. name: a name for the program. This will usually be the file name of the program on disk (which will be searched for in the usual way as described in the REXX programming reference, but will not have any file extension added to it). However, if the instore parameter contains the source of the program, then the name parameter is used only for the PARSE SOURCE instruction. instore: either NULL or a pointer to the first of two RXSTRINGs which define an in-storage REXX program. If instore is NULL, then the program is loaded from a disk file named by the name parameter. Otherwise, instore[0] must be a valid RXSTRING which contains the source to be executed and instore[1] must be a NULL string. The source contained in instore[0] should be in the same format as it would be if it had just been loaded from disk. Note that REXX/imc will not, at present, return a tokenised image of the program in instore[1]. envname: either NULL or a null-terminated string which contains the name of the initial subcommand environment for the REXX program. The environment name must be at most 30 characters in length. If envname is NULL then a default will be chosen. This will be the uppercased file extension, if this is present and suitable, otherwise the string "UNIX". calltype:one of the three integer values RXCOMMAND, RXSUBROUTINE or RXFUNCTION as defined in rexxsaa.h. This determines the invocation method as given by the instruction "PARSE SOURCE . method .". In addition, if the calltype is RXFUNCTION then any RETURN instruction which causes the program to end will be required to return a result. exits: either NULL or a pointer to the first in an array of RXSYSEXIT structures defining the exits which will be used (see the section on system exits for more information). The last item in the array must have an exit code of RXENDLST. Exits will not be used if the exits parameter is NULL. rc: a pointer to a short which will receive the numeric form of the result string. If the result string is a whole number in the range -32767 to +32767 then it will be converted to an integer and stored in rc. Otherwise, rc will be set to (short)(1<<15), but note that this is an extension which is present in REXX/imc only. This parameter is documented as PLONG, but it seems to be PSHORT in most versions so for the time being it is a PSHORT in REXX/imc. result: a pointer to an RXSTRING which will receive the result string returned by a RETURN or EXIT instruction from the REXX program, or NULL to indicate that the return value should be discarded. If the program did not return a result, then this parameter will be set to a NULL string. If result points to an RXSTRING then the caller of RexxStart may set it to either a NULL string or a valid string. If result is a valid string which is long enough to hold the result, then the result will be copied into that string and result->strlength will be set to indicate its length. Otherwise, REXX will allocate (using malloc) a string to hold the result and the caller of RexxStart is responsible for freeing this storage. The possible return values from RexxStart are: negative: a syntax error (or other REXX error) occurred. The absolute value of the return value will be the same as the REXX error number. zero: the program ended normally. positive: some error occurred which made it impossible to execute the program. A return value of 3 means that the program could not be read in (although this number may change in the future). A return value of 1 means that the parameters were incorrect or that the interpreter could not initialise properly. Note that the interpreter will never return a result string to the caller if an error occurred, and in this case the value of rc is undefined when RexxStart returns. RexxStart is recursive - that is, the application may call it from within a subcommand handler, external function call or system exit even though a previous call to RexxStart has still not finished. It is not, however, re-entrant, so if the operating system supports threads or lightweight processes (LWPs), the SAA API should only be called from within a single thread. 1(c). The Subcommand Interface A subcommand handler is a function supplied by the application which processes commands (that is, REXX instructions which consist only of expressions, or occurrences of "command" in instructions of the form "ADDRESS environment command"). Each subcommand handler must be registered under an environment name, and these names are selected by the ADDRESS instruction in REXX. A subcommand handler must be declared as a function with the following prototype: ULONG command_handler( PRXSTRING command, PUSHORT flags, PRXSTRING result); where: command is the command string created by REXX. The character just after the end of the string is guaranteed to be a null, but the command may also contain other null characters. flags is a pointer to a short integer to receive the completion status of the command. The subcommand handler should set this to one of RXSUBCOM_OK, RXSUBCOM_ERROR or RXSUBCOM_FAILURE in order to indicate that the command completed successfully, completed with ERROR status, or completed with FAILURE status respectively. result is a pointer to an RXSTRING which will receive the return code from the command in the form of a string (note that any numeric return code must be converted to a string before it can be passed to REXX). This string will be assigned to the variable RC when the subcommand handler returns. If result is a NULL string, then REXX will assign the string "0" to RC. REXX will set result to a 256-byte string before calling the subcommand handler. The handler may store the result in this string (and set result->strlength to the length of the string), or (if the result is longer than that) it may allocate another using malloc. If the handler allocates a new string to hold the result, then REXX will call free to release the storage after the handler returns. The only valid return value from a subcommand handler seems to be 0. The following functions are available to register, deregister and query subcommand handlers: ULONG RexxRegisterSubcomExe( PSZ envname, RexxSubcomHandler *handler, PUCHAR userarea); The RexxRegisterSubcomExe function registers the subcommand handler whose address is given by the parameter handler under the environment name given by the parameter envname. The userarea parameter may be NULL or it may point to a user-defined eight byte area of memory which will be associated with the subcommand handler. The user area information may be retrieved with the RexxQuerySubcom function. The possible return values from RexxRegisterSubcomExe are: RXSUBCOM_OK The handler was registered. RXSUBCOM_NOTREG The handler was not registered because there was another already registered under the same name. RXSUBCOM_NOEMEM The handler was not registered due to lack of memory. RXSUBCOM_BADTYPE The handler was not registered due to a parameter error. ULONG RexxDeregisterSubcom(PSZ envname, PSZ module); The RexxDeregisterSubcom function deregisters the subcommand handler which was registered under the name given by the parameter envname. The module parameter is not used by REXX/imc. The possible return values from RexxDeregisterSubcom are: RXSUBCOM_OK The handler was deregistered. RXSUBCOM_NOTREG The handler was not found. RXSUBCOM_BADTYPE The handler was not deregistered due to a parameter error. ULONG RexxQuerySubcom(PSZ envname, PSZ module, PUSHORT flag, PUCHAR userarea); The RexxQuerySubcom function queries any subcommand handler which might have been registered under the name given by the parameter envname. The module parameter is not used by REXX/imc. The result is both stored in the short integer pointed to by the flag parameter and returned as the result of the function. If the userarea parameter is supplied, it must point to an eight-byte area which will receive the user area which was given when the subcommand handler was registered. The possible return values from RexxQuerySubcom are: RXSUBCOM_OK The handler is registered. RXSUBCOM_NOTREG The handler is not registered. RXSUBCOM_BADTYPE The parameters were in error. 1(d). External Functions An external function may be registered with REXX/imc using the SAA API. After it has been registered, it may be called by a Rexx program as usual, but only using the name under which it was registered. The names of functions are case-sensitive and unquoted function names are always translated into upper case, so that a function to be called with the REXX function call foo() must be registered under the name FOO. This is a difference from OS/2. Functions registered using the SAA interface of REXX/imc are searched just after the built-in functions, and at the same time as functions loaded from ".rxfn" files (see section 2). Function names registered under the SAA interface of REXX/imc must not contain the slash character (/). Note that function names in REXX function calls have all characters up to and including the last slash character removed before they are matched against function names which have been loaded from ".rxfn" files or registered under the SAA interface, so that "/tmp/FOO" matches a function registered under the name "FOO". [This was done so that the path name of an auto-registering function may be specified, but will not prevent the function from being found on subsequent calls when it has already been registered. This behaviour might be removed in a future release.] An external function to be registered using the SAA interface must be a function with the following prototype: LONG function_handler( PSZ name, long argc, PRXSTRING argv, PSZ queuename, PRXSTRING result); where: name is a null-terminated string giving the name by which REXX called the function. argc is the number of arguments passed to the function. argv is an array of argc RXSTRINGs giving the arguments passed to the function. Any omitted arguments will be passed as NULL strings. The character just after the end of each argument which is present is guaranteed to be a null, but the arguments may also contain other null characters. queuename is the null-terminated name of the current queue, which is currently meaningless in REXX/imc and will equal the string "SESSION". result is a pointer to an RXSTRING which will receive the result from the function. If the function does not return a result, then it should set this RXSTRING to a NULL string. If the function handler was called as a function, then the result will be used directly (and an error will be raised if the function did not return a result). If it was called as a subroutine, the result (if any) will be assigned to the REXX variable RESULT. REXX will set result to a 256-byte string before calling the function. The function may copy the result into this string (and set result->strlength to the length of the result), or (if the result is longer than 256 bytes) it may allocate a new string using malloc and set the result string to that. If the function handler allocates a new string, REXX will use free to release it when the function returns. The valid return values from a function handler are: 0 The function completed successfully. 40 An error occurred. The REXX interpreter will raise error 40 (incorrect call to routine) when the function returns. Note that if the function returns this value, it must not return a result. The following functions are available to register, deregister and query external functions: ULONG RexxRegisterFunctionExe(PSZ funcname, RexxFunctionHandler *handler); The RexxRegisterFunctionExe function registers the function given by the handler parameter under the name given by the funcname parameter, which is a null-terminated string. The possible return values from RexxRegisterFunctionExe are: RXFUNC_OK The function was registered. RXFUNC_DEFINED The function was not registered because it has already been registered or loaded from a ".rxfn" file. RXFUNC_NOMEM The function was not registered due to lack of memory. ULONG RexxRegisterFunctionDll(PSZ funcname, PSZ dllname, PSZ entryname); The RexxRegisterFunctionDll call registers a function stored in a shared object file (or DLL). The name of the shared object is given in the dllname parameter (see below). The name of the function handler within the shared object is given in the entryname parameter, and the name by which Rexx will call the function is given by the funcname parameter. The dllname parameter may be either the exact path name of a file or a relative file name which will be searched for along a path given by one of the following: the REXXLIB environment variable; if that is not set the REXXFUNC environment variable; or if neither is set the compile-time default. If the name of the shared object ends with ".rxfn" then this may be omitted from the dllname parameter. This version of the RexxRegisterFunctionDll function differs from the OS/2 version in that it attempts to load the function before returning control to the caller, and it returns an indication of whether this succeeded. The possible return values are: RXFUNC_OK The function was registered. RXFUNC_NOTREG The function was not registered because the shared object could not be loaded or did not contain the required entry point. RXFUNC_DEFINED The function was not registered because it has already been registered or loaded from a ".rxfn" file. RXFUNC_NOMEM The function was not registered due to lack of memory. ULONG RexxDeregisterFunction(PSZ funcname); The RexxDeregisterFunction function deregisters the function which was previously registered under the name given by the funcname parameter, which is a null-terminated string. Note that this function is also able to deregister functions which were loaded from ".rxfn" files. The possible return values from RexxDeregisterFunction are: RXFUNC_OK The function was deregistered. RXFUNC_NOTREG The function was not found. ULONG RexxQueryFunction(PSZ funcname); The RexxQueryFunction function queries whether or not a function has been registered or loaded from a ".rxfn" file under the name given by the funcname parameter, which is a null-terminated string. The possible return values from RexxQueryFunction are: RXFUNC_OK The function is registered. RXFUNC_NOTREG The function is not registered. 1(e). System Exits The application may define functions which REXX will call instead of using its own system interface in certain circumstances, such as whenever it needs to write out a line as a result of the SAY instruction. The application registers its system exit functions under names (which are similar to environment names, although they occupy a separate name space) before executing a REXX program. When the application starts REXX, it specifies the system exits which it wishes to handle and associates each with a named exit handler. Each system exit has a major function code and a subfunction code; these are listed later in this section. A system exit handler must be a function with the following prototype: LONG exit_handler(long exitcode, long subcode, PEXIT parmblock); where: exitcode is the major function code of the exit which the handler has been called to handle. subcode is the subfunction code of the exit which the handler has been called to handle. parmblock is a pointer to a parameter block whose contents depend on the exit being handled. The exit descriptions below give details on the parameter block required for each exit. Some exits may not require a parameter block at all, in which case the parmblock parameter will be a NULL pointer. Since each exit requires a different kind of parameter block, the PEXIT data type has been declared as a union data type. It is probably most convenient for each exit handler to convert the parameter block pointer into a pointer of the required type for the exit (which is allowed in C (I think!)). The valid return values from a system exit handler are: RXEXIT_HANDLED The exit handler processed the system exit as required. RXEXIT_NOT_HANDLED The exit handler did not process the system exit, and therefore REXX should behave as if the exit handler had not been called. RXEXIT_RAISE_ERROR An error occurred. REXX will raise error 48 (failure in system service) when the handler returns this value. A system exit handler should begin by checking the function and subfunction codes, and return immediately with RXEXIT_NOT_HANDLED if they correspond to an exit which the handler was not supposed to handle. The following functions are available to register, deregister and query system exit handlers. ULONG RexxRegisterExitExe(PSZ name, RexxExitHandler *handler, UCHAR *userarea); The RexxRegisterExitExe function registers the system exit handler given by the handler parameter under the name given by the name parameter, which is a null-terminated string. The userarea parameter may be NULL or it may point to a user-defined eight byte area of memory which will be associated with the system exit handler. The user area information may be retrieved with the RexxQueryExit function. The possible return values from RexxRegisterExitExe are: RXEXIT_OK The handler was registered. RXEXIT_NOTREG The handler was not registered because there is already a system exit handler registered under the given name. RXEXIT_NOEMEM The handler was not registered due to lack of memory. RXEXIT_BADTYPE The handler was not registered due to a parameter error. ULONG RexxDeregisterExit(PSZ name, PSZ module); The RexxDeregisterExit function deregisters the system exit handler which was registered under the name given by the name parameter, which is a null-terminated string. The module parameter is not used by REXX/imc. The possible return values from RexxDeregisterExit are: RXEXIT_OK The handler was deregistered. RXEXIT_NOTREG The handler was not found. RXEXIT_BADTYPE The parameters were in error. ULONG RexxQueryExit(PSZ name, PSZ module, PUSHORT flag, PUCHAR userarea); The RexxQueryExit function queries any system exit handler which may have been registered under the name given by the name parameter, which is a null-terminated string. The module parameter is not used by REXX/imc. The flag parameter is a pointer to a short integer which will be set to the return value from RexxQueryExit. If the userarea parameter is not NULL, it must point to an eight-byte area of memory which will receive the user area which was given when the exit handler was registered. The possible return values from RexxQueryExit are: RXEXIT_OK The handler is registered. RXEXIT_NOTREG The handler is not registered. RXEXIT_BADTYPE The parameters were in error. Once the required system exit handlers have been registered, they may be used by passing them to RexxStart in the "exits" parameter. This parameter is an array of structures of the form: typedef struct { char *sysexit_name; /* name of exit handler */ short sysexit_code; /* major function code of system exit */ } RXSYSEXIT; Each of these structures associates a system exit handler which was previously registered under the name given by the sysexit_name field of the structure with the system exits whose major function codes agree with the sysexit_code field of the structure. Note that an exit handler must handle all the subfunctions of any particular system exit; it may do this by immediately returning RXEXIT_NOT_HANDLED as described earlier. The last element of the array of RXSYSEXIT structures must have sysexit_code set to RXENDLST. The possible system exits are as follows. Each major exit code is given by a name of the form RXxxx, and each subfunction code is given by a name of the form RXxxxyyy, where the RXxxx is the same as the major exit code. RXCMD: Call a subcommand handler RXCMDHST: This exit is called when REXX is about to carry out a command. By default, REXX will call the appropriate subcommand handler to execute the command, but the exit may instead choose to process the command. The parameter block for this exit is: typedef struct { struct { unsigned int rxfcfail:1; /* completed with FAILURE */ unsigned int rxfcerr:1; /* completed with ERROR */ } rxcmd_flags; char *rxcmd_address; /* environment name */ unsigned short rxcmd_addressl;/* environment length */ char *rxcmd_dll; /* not used by REXX/imc */ unsigned short rxcmd_dll_len; /* not used by REXX/imc */ RXSTRING rxcmd_command; /* the command to be executed */ RXSTRING rxcmd_retc; /* the return string */ } RXCMDHST_PARM; The handler may set either rxfcfail or rxfcerr to raise a FAILURE or ERROR condition, respectively. The rxcmd_address parameter is the null-terminated name of the environment to which the command is being sent. The rxcmd_command is an RXSTRING containing the command to be executed. The character just after the end of the string is guaranteed to be a null, but the command may also contain other null characters. The rxcmd_retc is an RXSTRING which is to receive the return code from the command as a string. Any numeric return code must be translated into a string before it is returned. REXX will provide a 256-byte buffer into which the return string may be copied. The handler may allocate another using malloc, in which case REXX will use free to release it after the handler returns. RXSIO: Perform input or output RXSIOSAY: This exit is called when REXX is about to write a line as a result of a SAY instruction. The exit may choose to deal with the line as it requires; otherwise the interpreter will write the line to the standard output stream. The parameter block for this exit is: typedef struct { RXSTRING rxsio_string; } RXSIOSAY_PARM; The rxsio_string is the string to be written out. The character just after the end of the string is guaranteed to be a null, but the string may also contain other null characters. The string will not contain an end-of-line character unless one was present in the SAY instruction. RXSIOTRC: This exit is called when REXX is about to write a line during tracing. The exit may choose to deal with the line as it requires; otherwise the interpreter will write the line to the trace output stream (usually the standard error). The parameter block for this exit is: typedef struct { RXSTRING rxsio_string; } RXSIOTRC_PARM; The rxsio_string is the string to be written out. The character just after the end of the string is guaranteed to be a null. The string will not contain an end-of-line character. RXSIOTRD: This exit is called when REXX is about to read a line of input as a result of the PARSE PULL instruction when the REXX stack is empty. The exit may choose to supply a line of its own; otherwise the interpreter will read the line from the standard input stream. The parameter block for this exit is: typedef struct { RXSTRING rxsiotrd_retc; } RXSIOTRD_PARM; The input line should be returned in the rxsiotrd_retc string. It should not contain any end-of-line character which might have been used to terminate the line of input (but it may contain other end-of-line characters). REXX will provide a 256-byte string in which the handler may place the input line. The handler may allocate its own using malloc, in which case REXX will use free to release it when the handler returns. RXSIODTR: This exit is called when REXX is about to read a line of input at a pause during interactive tracing. The exit may choose to supply a line of its own; otherwise the interpreter will read the line from the standard input stream. The parameter block for this exit is: typedef struct { RXSTRING rxsiodtr_retc; } RXSIODTR_PARM; The input line should be returned in the rxsiotrd_retc string. It should not contain any end-of-line character which might have been used to terminate the line of input. REXX will provide a 256-byte string in which the handler may place the input line. The handler may allocate its own using malloc, in which case REXX will use free to release it when the handler returns. Note: the PARSE LINEIN instruction and the built-in functions LINEIN, LINEOUT, CHARIN and CHARS do not call the RXSIO exit handler. RXINI: Initialisation processing RXINIEXT: This exit is called when initialisation is complete, just before execution of the REXX program starts. The exit handler may use this exit to perform initialisation functions such as setting REXX variables. There is no parameter block for this exit. RXTER: Termination processing RXTEREXT: This exit is called just after the REXX program has finished, and just before termination processing. The exit handler may use this exit to perform terminatino functions such as retrieving REXX variables. There is no parameter block for this exit. 1(f). The Variable Pool Interface Variables in the REXX program may be accessed using the RexxVariablePool function, which processes one or more request blocks arranged in a linked list. Each request is a structure of the form: typedef struct shvnode { struct shvnode *shvnext; RXSTRING shvname; RXSTRING shvvalue; ULONG shvnamelen; ULONG shvvaluelen; UCHAR shvcode; UCHAR shvret; } SHVBLOCK; where: shvnext is the address of the next request block, or NULL if this is the last request block. shvcode is the request code. Valid request codes are listed below. shvname is an RXSTRING containing a REXX variable name. It may be an input or an output parameter according to the request code. shvnamelen is the length of the buffer which the shvname RXSTRING points to. shvvalue is an RXSTRING containing the value of a REXX variable. It may be an input or an output parameter according to the request code. shvvaluelen is the length of the buffer which the shvvalue RXSTRING points to. shvret is a return code which will be set when the request has been processed. It will be set to the OR of zero or more of the following flags: RXSHV_NEWV the named variable was uninitialised before the request was processed RXSHV_LVAR no more variables are available for an RXSHV_NEXTV request. RXSHV_TRUNC A returned variable name or value was truncated because the supplied RXSTRING was too short to hold the name or value. RXSHV_BADN An invalid variable name was given. RXSHV_MEMFL The request was not processed due to lack of memory. RXSHV_BADF An invalid function code was given. The valid request codes and their meanings are: RXSHV_SYSET: Symbolic set. The name given in the shvname field is interpreted as a variable as if it were present in a REXX program. It is set to the value given in the shvvalue field. RXSHV_SET: Direct set. The variable whose name matches that given in the shvname field is set to the value given in the shvvalue field. RXSHV_SYFET: Symbolic fetch. The name given in the shvname field is interpreted as a variable as if it were present in a REXX program. Its value is fetched and stored in the shvvalue RXSTRING. If the application has set this to a NULL string, then REXX allocates one using malloc. The application is responsible for freeing this storage. Otherwise, shvvaluelen gives the maximum length of a string which REXX can store in the shvvalue RXSTRING. If the value is longer than this then it is truncated. The RXSHV_TRUNC flag will then be set in shvret. RXSHV_FETCH: Direct fetch. The value of the variable whose name matches that given in the shvname field is fetched and stored in the shvvalue RXSTRING, as in the RXSHV_SYFET request. RXSHV_SYDRO: Symbolic drop. The name given in the shvname field is interpreted as a variable as if it were present in a REXX program. It is dropped (that is, it becomes undefined). RXSHV_DROPV: Direct drop. The variable whose name matches that given in the shvname field is dropped (that is, it becomes undefined). RXSHV_NEXTV: Fetch next variable. Each time this request is processed the interpreter returns the name and value of a variable which is currently accessible by the executing REXX program. The interpreter keeps track of which variables have been returned, and successive RXSHV_NEXTV requests will return different variables (in no particular order) until all the accessible variables have been returned. When there is no variable left to return, the RXSHV_NEXTV request will set the RXSHV_LVAR flag in shvret and will leave the returned variable and value undefined. The information about which variables have been returned is reset every time the REXX program resumes execution, and also at every set, fetch or drop request from the variable pool. The name of each returned variable will be stored in the shvname field of the request block. If the application has set this to a NULL string then REXX allocates one using malloc. The application is responsible for freeing this storage. Otherwise, shvnamelen gives the maximum length of a string which REXX can store in the shvname RXSTRING. If the name is longer than this then it is truncated. The RXSHV_TRUNC flag will then be set in shvret. The value of each variable is returned in an analogous manner in the shvvalue field (in exactly the same way as for the RXSHV_SYFET and RXSHV_FETCH requests). In the current release of REXX/imc, simple symbols and stems are returned in (approximately) the order in which they became defined. The compound symbols of any one stem are returned consecutively in a similar order, just after the stem (if any) is returned. Note that it is in general not possible to tell the difference between a stem being returned (for example, after the instruction foo.=3) and a compound variable whose tail has length zero (for example, after the instructions bar=''; foo.bar=3). However, if "foo." is returned after any other variable with that stem (including "foo." itself), it is always the case that this occurrence of "foo." is a compound variable. The converse is unfortunately not true. That is, if "foo." is the first variable with that stem, it is not necessarily a stem variable unless another copy of "foo." appears, in which case the first one is a stem and the second is a compound variable. A symbolic variable name is one which is interpreted according to REXX rules. That is, it must contain only letters, digits, dots, and a small number of other characters that are valid in symbols and must not start with a digit or dot; it is uppercased before use, and each simple symbol component of a compound symbol (except the stem) is replaced by its value before use. A direct variable name is the exact name of a variable which appears in the variable pool (and it is the kind of variable name that RXSHV_NEXTV returns). It is the kind of name that a symbolic variable name turns into after it has been processed as described in the above paragraph. That part of the name up to the first dot must satisfy the conditions for a symbolic name and must be in upper case. After the first dot, the name is allowed to contain any character. Variable pool requests are processed by the function: ULONG RexxVariablePool(PSHVBLOCK RequestBlockList); The parameter is a pointer to the first request block in the linked list. The return value may be RXSHV_NOAVL, meaning that the variable pool API is not available. In this case, none of the requests will have been processed. The API is made available just before execution of a program begins (just before the RXINIEXT exit is called) and may be used until just after the execution finishes (just after the RXTEREXT exit is called). If the return value is not RXSHV_NOAVL, then it will be the OR of all the return codes which have been returned individually in the shvret fields of the request blocks. ______________________________________________________________________________ 2. Writing external functions for REXX/imc There are four ways to write external functions for REXX/imc: (a) in REXX (b) using the SAA application programming interface (c) as a ".rxfn" file, and (d) as a Unix program. External functions in REXX are covered in the REXX programming reference. The SAA application programming interface for external functions is covered in section 1(d). The remaining methods are described in this section. 2(c). Writing a ".rxfn" file A function written as a ".rxfn" file will be searched for and linked in with the interpreter at the time when it is first requested. This gives an advantage over using the SAA API in that SAA functions must be registered by the application before they may be used by a REXX program - though this is made easier by the inclusion of the RxFuncAdd REXX function in the interpreter. In general, the function(s) in a ".rxfn" file may either be SAA function handlers as described above or specially written functions for REXX/imc. However, the latter type is the only one supported for stand-alone functions; files containing only SAA functions must be supplied as a library with a ".rxlib" file (see section 3). This is because it is the ".rxlib" file which tells the interpreter that the functions must be called using the SAA calling sequence (either that, or a REXX/imc-style function must use the SAA interface to register all the SAA functions). A ".rxfn" file should be compiled and then linked in the normal way for shared objects. This differs between operating systems, but if the "Make" program knows how to compile shared objects on your system then try making rxmathfn.rxfn to see what flags it uses. For a stand-alone function the name of the output file should be the name by which the function is to be called with ".rxfn" appended, all in lower case. If this file is in the rexx function search order (see the section on function or subroutine invocation in the REXX/imc programming reference) then the file will be loaded and linked in with the interprter automatically the first time that the function is called. Future calls to the same function will not have to load the function from a file. The REXX/imc-style function is described in the remainder of this section. It is not really recommended for new functions because of portability issues, so skip to the next section if this is not of interest. A REXX/imc function will need access to several of the interpreter's functions, so its source must include "functions.h". It may also need access to certain global variables, in which case it should include "globals.h". Useful functions and global variables are listed below. In addition, the function may call any routine from the API, in which case it should include "rexxsaa.h" as described in section 1. A REXX/imc function should be declared as: int rxfunction(char *name, int argc); (including the name rxfunction). The name parameter gives the name by which the function was called, excluding any path name. That is, any part of the name up to and including the last slash (/) will have been removed. This may be used to distinguish between several function calls that have all been implemented by the same routine (this is more common in a function library; see the section on function libraries). The argc parameter gives the number of arguments which were passed to the function. These will be placed on the calculator stack in LIFO order. Any missing parameters will have been stacked as strings of length -1. The function will be expected to remove all its parameters from the calculator stack and either (i) place a result on the calculator stack and return 1, or (ii) return 0 without placing anything on the calculator stack. This means that the function has not returned a result. However, the function may choose to return an error. In this case, it need not remove all arguments from the calculator stack. A valid error code is a negative number whose absolute value is the number of a REXX error. When the function returns, that error will be reported. REXX/imc contains a function declared as follows: int funccall(RexxFunctionHandler *func,char *name,int argc); This function enables a function that has a REXX/imc-style calling sequence to call a function that has an SAA-style calling sequence. For example, if function_handler is an external function designed for use with the SAA API, then it can be saved as a REXX/imc-style ".rxfn" file by appending the following function: int rxfunction(char *name, int argc) { return funccall(function_handler,name,argc); } This gives the SAA function the advantage of being able to be loaded on demand instead of having to be registered. A single function stored in a REXX/imc file should not contain a public symbol called rxdictionary; this is reserved for function libraries. Useful functions, including functions to retrieve arguments and stack the answer, are the following: char *delete(len) int *len; Deletes a string from the calculator stack, and returns an address where it can be found. On return, len will be set to the length of the argument. The string is guaranteed to be followed by two bytes of available memory, so that the application may teminate it with a null character if necessary, and the string may be written to. The string is not guaranteed to be null-terminated when it is unstacked. The string is not actually moved in memory, so it will be invalidated whenever the calculator stack is changed (that means that it may not be used as an argument to stack()). However, further calls to delete() will not corrupt the string. int getint(flag) int *flag; Deletes a string from the calculator stack, interprets it as an integer, and returns the result. If the string is not a number which can be held in an int variable, the function will die. If the string is not an integer and the flag is non-zero, the function will die, otherwise the number will be converted into the nearest integer. void stack(string,len) char *string; int len; Copies the given string of the given length on to the calculator stack. void stackint(i) int i; Stacks a string representing the integer i on the calculator stack. int isnull() Returns 1 if the next value to be unstacked is null (that is, it has length -1 indicating an omitted parameter), and zero otherwise. void die(rc) int rc; Raise the error whose code is rc. The interpreter will then either halt with diagnostics or signal to an appropriate label, according to the current settings of "SIGNAL ON". This function never returns to its caller. int num(minus,exp,zero,len) int *minus,*exp,*zero,*len; Attempts to extract a number from the top value on the calculator stack, returning it as a sequence of digits. The value is always left stacked. If unsuccessful, num returns a negative number (except when the top value is null, in which case num dies). Otherwise, num stores a sequence of digits in the workspace (see below), and returns the offset from the start of the workspace to the start of the sequence of digits (which always equals the old value of eworkptr). The value eworkptr is updated to point past the end of the sequence of digits. The length of the sequence is returned in len. If the number is zero, then zero is set to 1, and otherwise it is set to 0. If it is negative, then minus is set to 1, and otherwise 0. The exponent stored in exp is such that if a decimal point were placed between the first two digits of the sequence and the result were multiplied by ten to the power exp, then the original number would be recovered. void stacknum(num,len,exp,minus) char *num; int exp,len,minus; A sequence of digits starting at address num and of length len is formatted according to the Rexx rules for numerics and stacked. The exponent "exp" and sign "minus" are interpreted according to the rules in "num" above. The following functions from the variable interface should probably not be used. Use the SAA API instead. char *varget(name,namelen,len) char *name; int varlen; int *len; The variable whose name is "name" and contains "namelen" characters is searched for in the current symbol table. If the name is a compound symbol or a stem, then the first character of the name must have bit 7 set. The name is used as-is, that is, it is not translated to upper case, and if it is a compound variable then no substitution occurs in the tail. A pointer to the variable's value is returned, and "len" is set to its length. If the variable has not been assigned a value then "len" and the result are zero and the null pointer, respectively. The copy of the value which is returned must not be changed in any way. It may be invalidated whenever the symbol table is changed (and may not be used as a parameter to varset()). void varset(name,namelen,value,len) char *name,*value; int namelen,len; The parameters "name" and "namelen" name a variable and satisfy the same conditions as in varget above. The parameters "value" and "len" describe the position and length respectively of a string which is to be assigned to the variable. The assignment always succeeds unless there is insufficient memory available, in which case the routine dies. Note: the variable name and the string may contain arbitrary characters, and neither is validated. If you assign a value to a simple symbol called, e.g. "blah" (in lower case) or "XYZ.123" (containing a dot), etc. then the symbol will not be accessible to the Rexx program, but it may be recovered using varget(), provided it has not been hidden or destroyed by a PROCEDURE instruction. Any other function from the Rexx source may be used, but they are [or perhaps are not!] described elsewhere. Try in the source itself... A function may use global data from Rexx. It may either include the header file "globals.h" (which defines all the global variables of the Rexx interpreter) or simply include declarations for the particular variables it uses. Some useful variables are: char *workptr; unsigned eworkptr,worklen; The workspace, which may be used freely by a function. The space starts at workptr, and is of maximum length worklen. The variable eworkptr may be used to hold the length of the data currently stored in the workspace. int precision,fuzz; The setting of "NUMERIC DIGITS", and precision minus the setting of "NUMERIC FUZZ", respectively. That is, "precision" holds the precision of ordinary calculations, and "fuzz" holds the precision of comparison operations. char numform; Zero if "NUMERIC FORM SCIENTIFIC" is in effect, and one if "NUMERIC FORM ENGINEERING". int ppc; The number of the instruction being interpreted. int rxstacksock; A file descriptor which is connected to the Rexx stack process via a socket. FILE *ttyin,*ttyout; Streams which may be used to communicate with the terminal. A Rexx memory structure (for example the workspace) may be extended using the macros mtest or dtest. These are called like functions, but note that some arguments may be evaluated more than once or not at all. mtest(memptr,alloc,length,extend) Check that the desired length for the area, "length", is not greater than the actual length, "alloc". If it is, then attempt to reallocate the area, pointed to by "memptr", with "extend" more bytes than its previous length. The parameters memptr and alloc will be updated to reflect the new position and length of the area. The macro will die if not enough memory is available. dtest(memptr,alloc,length,extend) Perform mtest on the four arguments, but return 0 if the area did not move, and non-zero otherwise. In order to use this macro, the current function must contain variables: char *mtest_old; long mtest_diff; If the memory pointer has moved, then mtest_diff contains the difference (the new pointer minus the old). 2(d). Writing an external function as a Unix program If a REXX program calls a function which is not internal or built-in and cannot be found in any of the other categories of external function, then REXX/imc searches for a file having the same name as the function in lower case without a file extension, and this is assumed to be a Unix program. The program may be written in any language supported by Unix such as C or perl (even REXX, but note that although the programs will share the same stack, a fresh copy of the interpreter will be loaded and some of the parameter information will be lost as usual for a REXX program called as a command). If the program is interpreted then it must start with a "#!" comment that names its interpreter, and the Unix kernel must be able to recognise this form of comment. REXX/imc will execute the Unix program with argv[0] containing the name by which the function was called, excluding any path name (that part of the name up to and including the last slash character), and the rest of argv[] containing the function's arguments. The program will thus have argc equal to one more than the number of arguments passed by REXX. The arguments will be null-terminated, which means that the arguments must not contain null characters. The program will be expected to print its result on its standard output, followed by a newline character. The program returns an empty string by printing a single newline character. The program returns no result by printing nothing. If the program finishes with an exit code of 255 (or -1), then REXX/imc will raise error 50 (error in called routine). ______________________________________________________________________________ 3. Writing Function Libraries for REXX/imc Any of the four methods used in section 2 may be used to write a function library, with some minor changes in certain circumstances. A function library is a single file which implements several functions (although there is nothing to say that a library must contain more than one function). Each function library is accompanied by a ".rxlib" file, described below, which lists the functions contained in the library. If the file is a REXX program or a Unix program, then the same entry point (the start of the program) is used for all the functions in the library, and the program must examine the name by which it was called in order to determine which function to carry out. In a Unix program, this is argv[0] as described in section 2(d). In a REXX program, the name may be found from PARSE SOURCE (in the fourth word). If the file is a ".rxfn" file, then the functions have separate entry points and must be listed in a dictionary within the file. [The only time when a ".rxfn" file would not have a dictionary is when it contains a single function whose entry point is called rxfunction; in that case the only difference between a stand-alone function and a library is the presence of a ".rxlib" file, described below. The name by which Rexx calls the function will be the name listed in the ".rxlib" file.] A ".rxfn" dictionary is a public symbol whose name is rxdictionary and whose type is an array of dictionary elements, thus: typedef struct _dictionary { char *name; int (*function)(); } dictionary; dictionary rxdictionary[]; Each name field must be initialised to the null-terminated name of a function, as called by a REXX program. The names are case-sensitive and should usually be in upper case. Each function field must be initialised to the entry point of the function, which will have either the calling sequence described in section 2(c) or the SAA calling sequence (whichever choice is made, all functions should have the same calling sequence). The last element of the dictionary array must have both fields set to NULL. When a ".rxfn" file is opened, all the functions in the dictionary are registered automatically and will not need to be loaded again from the file on subsequent references. Once the functions have been collected together in one file as described above, the library should be placed in a directory which REXX/imc will search for function libraries (see the section on function or subroutine invocation in the REXX/imc programming reference). REXX/imc also needs to be made aware of the library's contents. This is done by writing a text file whose name is the same as that of the library (but without the file extension such as ".rxfn" or ".rexx") with ".rxlib" appended. The text file contains the names of the functions which may be found in the library, separated by blanks, newlines or tab characters. The names are case-sensitive. REXX/imc will only load a function library if the required function name is found within this text file. The name of the function library itself is irrelevant; REXX/imc will read all ".rxlib" files found in the appropriate directories and store the names found inside. If the function library is a set of functions using the SAA interface then the first token in the ".rxlib" file must be "rxsaa:". If this is missing then the functions will be called using the wrong protocol and the interpreter will crash. The set of mathematical functions, rxmathfn.*, is an example of a function library. It is supplied in two forms: a REXX program and a ".rxfn" file. Either will work. The rxmathfn.rexx function library demonstrates a kludge in the library interface: if the ".rxlib" file contains the token "rxmathfn:" then the REXX interpreter will skip the usual procedure of resetting NUMERIC DIGITS to 9 whenever it starts an external function. The setting is still preserved, and restored when the function returns, but if the ".rxlib" file contains the token "rxmathfn:" then the function library can inherit the value. This token is only meaningful when the function library is a REXX program. It is ignored for the other forms of library. ______________________________________________________________________________ 4. Accessing the REXX stack (a) Initialising When the application which requires access to the stack initialises, or when it first wants to use the stack, it should do the following: i. If the environment variable RXSTACK is set, then a stack already exists and uses a socket whose name is the value of RXSTACK. This should always happen if the application is called from within REXX. The application may choose to use this stack, or create another. ii. In order to create a new stack, the application should run "rxque" and store the output. If the application prefers a particular filename for the stack socket, it should give that as a parameter to "rxque" and also set the environment variable RXSTACK to that name. The output from "rxque filename" will be the stack's process number in the format "%d\n". The output from "rxque" will be in the form "RXSTACK=%s RXSTACKPROC=%d". The "%s" in this format is the stack's socket name, and the "%d" is the stack's process number. The RXSTACK variable must be exported to the environment. The process number must be remembered, so that the stack can be terminated later. iii.When a socket name has been obtained, the application should connect to it a socket of type SOCK_STREAM. This socket will be used in all further communication with the stack. The stack obtained by the application in this way can be accessed by any other process, providing it knows the socket name and has permission to access it. This will usually be limited to descendants of the application. In particular, if the application executes the REXX interpreter, the interpreter will use the same stack. (b) Terminating If the application did not create the stack, it has nothing further to do. Otherwise it must kill the stack process (using the pid which was remembered earlier) with signal 15 (SIGTERM). In case the application crashes, the stack process will terminate after five minutes of inactivity if it finds that its parent no longer exists (that means that when the stack server is executed it should be executed as a child of the application, and not a grandchild or other descendant). (c) Communications There are three data types which may be communicated: commands, lengths and data. A command is a single character. A length is a sequence of six hex digits followed by a newline character: seven bytes in all. Data is an arbitrary sequence of characters of a pre-determined length. All communications are written unbuffered. Each communication with the stack Each is initiated by the application writing a command character down the socket. The possible command characters are as follows: N (for Number): The stack responds by writing a length down the socket, representing the number of items on the stack. S (for Stack): The stack expects the application to write a length value followed by a data item of that length. It stacks that data item in lifo order. Q (for Queue): The stack expects the application to write a length value followed by a data item of that length. It queues that data item in fifo order. G (for Get): The stack responds by writing the length of the top stack entry followed by its data, and removes the entry from the stack. P (for Peek): The stack responds by writing the length of the top stack entry followed by its data, but leaves the stack unchanged. D (for Drop): No further communication occurs, but the stack deletes its top entry. K (for Kill me): The stack expects the application to write a length value followed by one more byte. The length value is interpreted as the pid of a process, and the byte as a signal number. On receipt of this information, the stack server will (attempt to) kill the given process with the given signal. This command may be used to make an application allow the stack to catch up, in case some of the communications are still en route to the server (the Num command could also be used for this purpose). If the stack process receives an incorrect command value or encounters an error while reading from the socket, then the connection to the offending application will be broken immediately. ______________________________________________________________________________ 5. The REXX/imc internal data structures REXX/imc has several data structures including the program, the label table, the symbol table, and the program stack. Each is kept as a block of "malloc"ed memory which is grown dynamically as necessary (the "growing" is usually performed by the mtest and dtest macros, mentioned above). (a) The Source The source code is stored in memory as it appears in the REXX program on disk, except that newline characters are translated into zero bytes. The address of each line (numbered from 1 to the number of lines in the program) is stored in an array (which is another block of "malloc"ed memory), and accessed using the construction "source[num]". The zeroth element of source stores the file name of the source, which is stored in a separately allocated block of memory. The block allocated for the source itself is guaranteed to start at source[1]. (b) The Program The "program", or preprocessed source, is stored as a list of instructions, and prog[num] is a structure containing details about the "num"th instruction. There are "stmts" instructions in this list. Note that "THEN", "ELSE" and "OTHERWISE" are counted as individual instructions, and that null clauses do not appear in the list of instructions. prog[0] contains some pointers to the starts of things (namely the block allocated for the tokenised program itself, the block allocated for the source, and the beginning of the first comment in the source). prog[stmts] is a null instruction which points to the end of the source. Each instruction is obtained from the source by removing all comments and labels; stripping all unnecessary blanks and collapsing multiple spaces into one; concatenating lines together whenever the continuation character "," appears; tokenising keywords into special characters, and translating "^" (the variant NOT operator) into "\" (the real NOT operator). The source is also checked for mismatched quotes, invalid labels and invalid characters during preprocessing. Whereas "source" always contains the source of the program being interpreted, "prog" can sometimes contain the list of instructions obtained from an "interpret" instruction. In this case, the source pointer in prog[0] is used instead of source[1] when freeing the source. Also, certain instructions (such as "call") are required to save the current program temporarily and go back to the original program, by searching for it on the program stack. The current instruction number within the program is stored in the global variable ppc. The current character position within the program line is stored in a local variable, often called lineptr. Each prog[num] consists of the following structure, called "program": int num; The line number in the source where the instruction originates - used when tracing source instructions. char *source; The start of this instruction in the source char *sourcend; The end of this instruction in the source int related; Reserved for future use char *line; The address of the tokenised instruction The source between "source" and "sourcend" is the current instruction including any comments on the same line but not including labels. Comments on separate lines and labels will appear after "sourcend" or before "source". (c) The Labels Labels within a program are separated off and stored in a table while the program is being preprocessed. The table is stored in a block of memory at address labptr and of length lablen. While the table is being built up the length of data so far is elapbtr. The table consists of the concatenation of elements in the following format: int total length of this element int instruction number of label char[] name of label, terminated with a zero character and padded [*] The last element in the table is followed by the integer zero. [*] In all the data structures used by REXX/imc, variable-length string entries are padded (with random bytes) to a multiple of 4 bytes, in order that later data is aligned to the Sun SPARCstation's requirements. The alignment is done by macros in const.h and may be changed. (d) The Calculator Stack The calculator stack is simply a list of values in temporary storage during the evaluation of an expression. Expressions are in effect translated to reverse polish notation (e.g. 1+2 becomes 1 2 +), with each operation stacking and removing values as appropriate. The stack is stored in a block of memory addressed by cstackptr and of cstacklen bytes. The total size of the data on the stack is ecstackptr. Each element on the stack consists of a string (padded as necessary [*]) followed by its integer length. (e) The Workspace The workspace is a block of memory which is used by various parts of the interpreter to store transient data. It is addressed by workptr and is of length worklen. Some routines (especially num()) maintain the length of the currently stored data in eworkptr. (f) The Symbol Table The symbol table is a multiple-level associative array containing the definitions of all the REXX symbols which have been assigned values. It is stored in a block of memory addressed by vartab and of length varlen. Each level of the table is a separate entity, referring to the active variables of a program fragment which used the PROCEDURE instruction to hide the earlier levels, except that each level may contain pointers to earlier levels as a result of the PROCEDURE EXPOSE instruction. The start of each level is stored in the array varstk[] as an integer offset from the start of the entire table. The number of levels is varstkptr, and the number of elements in the array is varstklen. The total length of all current levels of the symbol table is varstk[varstkptr+1]. Each level of the symbol table contains all currently active simple symbols and stems as a binary search tree. The elements of the tree are in the following format (as described by the varent data type): int total length of this element int position of the left child (as offset from start of level) int position of the right child (as offset from start of level) int length of the symbol's name int amount of memory allocated to hold the symbol's value int actual length of the symbol's value char[] the symbol's name, padded [*] char[] space for the symbol's value If the symbol is a stem, the terminating dot of the name is not stored, but the most significant bit of the first character of the name is set. If the length of the value is negative, then the symbol is considered to be undefined; it has been deleted by DROP or some other process. If the "amount of memory" element of the structure is negative, then this symbol is actually located in an earlier level; the negative value gives the level number, starting at -1 which means the earliest level (i.e. the first in the memory block). Each stem entry in the symbol table has a value containing a default value and a mini-symbol table. The default value is stored first, in the format: int the amount of space allocated to the default value int the actual length of the default value char[] space for the default value Immediately following that are entries for all the tails, in exactly the same format as for the main symbol table. (g) The Program Stack The program stack holds information relating to currently active control structures, e.g. function calls and DO-END blocks. It is stored in a block of memory addressed by pstackptr and of length pstacklen. The length of all data currently on the program stack is epstackptr. The stack holds a sequence of various entries in the following formats, and the number of entries in the current function is held in pstacklev: for a simple DO-END block (type 0), a SELECT block (type 2), a DO WHILE or DO FOREVER block or (type 8), a "struct minstack" entry: int stmt the statement number where the block started char *pos the character where the block started (used for finding the WHILE or UNTIL part of a repetitive DO instruction) int len the length of this structure (16) int type the type value (as mentioned above) for a repetitive DO with a control variable, an extended "struct minstack" entry: int stmt the statement number where the block started char *pos the character where the block started char[] the limit value, padded [*] (empty string for "no limit") int the length of the limit value char[] the step value, padded [*] int the length of the step value char[] the name of the control variable, padded [*] int the length of the control variable name int the value which appeared after FOR, or -1 for "no value" int len the total length of this structure int type the number 10. for a repetitive block introduced by "DO count", a "struct forstack" entry: int stmt the statement number where the block started char *pos the character where the block started int fornum the value specified after "DO" int len the length of this structure (20) int type the number 15. for an internal function call, a "struct procstack2" entry: int stmt the statement number where the function call ocurred char *csp the caller's cstackptr int ecsp the caller's ecstaclptr int csl the caller's cstacklen char trc the caller's trace flag char tim the caller's timestamp flag char form the caller's NUMERIC FORM (0 for SCIENTIFIC) the compiler will probably insert one byte here int digits the caller's NUMERIC DIGITS int fuzz the caller's "NUMERIC DIGITS minus NUMERIC FUZZ" long mic the caller's timestamp microseconds value long sec the caller's timestamp seconds value int address1 the caller's environment number (for ADDRESS) int address2 the caller's alternate environment number program *prg the program which was being interpreted at the time int stmts the number of statements in that program int len the length of this structure (60) int type the number 11, becoming 12 when the PROCEDURE instruction has been executed Note: the program is stored in case it was generated by "interpret"; if that is so then the "real" program will be reinstated before the call. for an INTERPRET instruction, a "struct interpstack" entry: int stmt the statement number where the INTERPRET occurred program *prg the program which was being interpreted at the time int stmts the number of instructions in that program int len the length of this structure (20) int type the number 14. for a command typed during interactive trace mode, a "struct interactstack" entry (note that a "struct interpstack" entry also appears above this): int stmt the statement number where the interruption occurred char *csp the interrupted program's cstackptr int ecs the interrupted program's ecstackptr int csl the interrupted program's cstacklen int len the length of this structure (16) int type the number 16. Occasionally, the interpreter stacks a program line with the sole intent of having it appear in the traceback. Such an entry would be in the format of a "struct errorstack": int stmt the statement number where the error occurred program *prg the program where the error occurred int stmts the number of statements in this program int len the length of this structure (20) int type the number 20 Note that external calls are no longer stored on the program stack. For most purposes, they can be seen to be executed by a separate incarnation of the interpreter. (h) The "Hash" Tables There are three "hash" tables, each arranged in a similar way to one level of the symbol table; however the value of each symbol in the hash table is a single void* pointer rather than a string of characters. There are three hash tables, each occupying a separate block of memory addressed by hashptr[i] and of length hashlen[i] (for i=0,1,2). They are used to store environment variables, details of open files, and details about loaded functions respectively. Each element of a hash table is arranged as follows (and as indicated in the "hashent" data type): int length of this element int position of the left child int position of the right child void* value of the element char[] name of the element, terminated by a zero byte and padded [*] The value of each element may be one of the following: In hash table 0: a pointer to the string "NAME=VALUE" which has been used in a putenv() call. In hash table 1: either a null pointer, or a pointer to a block of memory which contains a "struct fileinfo" followed by a zero-terminated filename (which may be empty if the file name is unknown). The "struct fileinfo" is described in const.h. In hash table 2: a pointer to a "funcinfo" structure containing the address of a loaded function. One of the functions loaded from each function package also contains the handle which was returned from dlopen(). (i) The Signal Stack REXX stores the context of each invocation of the interpreter() function, so that it can be restored whenever an EXIT or a caught signal is encountered. The context is stored in the array sigstack[], which at any time has sigstacklen elements allocated to it. The number of the highest used element of sigstack[] is interplev. Each entry of the signal stack contains: short bits The combined bits for all SIGNAL ON traps which are on short bitson ...all SIGNAL ON traps which were executed at this level short callon ...all CALL ON traps which are on short delay ...all conditions which are delayed char type 1 if a "signal" trap just occurred, 2 if "call", 0 if none char which The number for the condition which just occurred (if any) char *data A description for for the condition which just occurred int ppc[6] The statement numbers to jump to for all the conditions or minus the statement number to flag with "label not found" if a condition occurs jmp_buf jmp The context of this level of the interpreter. Note: a negative number in ppc[i] means that the trap instruction for this condition named a non-existent label. -ppc[i] will be the statement number of this trap instruction. This means that we can delay the "label not found" error until it actually happens: moreover, we can include the condition trap instruction in the traceback. However, an interpreted condition trap instruction is not guaranteed to stay around, and therefore it is reported as an error immediately if it names a non-existent label. (j) The Shell's Hash Table The builtin shell keeps a record of where it found each command in a hash table (yes, a real one ;-) ). The variable "hashtable" points to an array of bucket pointers, and the variable "hashbuckets" holds the number of pointers in the table. A bucket pointer is null if there are no entries in the bucket; otherwise it points to a hashitem structure, representing the first item in the bucket. That item may in turn point to another item in the bucket. The items in each bucket are kept in alphabetical order, to reduce the average time taken for an unsuccessful search (or an insertion). Each item contains the following information: struct _hashitem *next The next item in the bucket, or null int hits Number of times this has been found int expense Position within $PATH int dot Whether dot occurred in the path before this int data Offset from end of header to data char[] The key; a string of characters ending with 0. char[] The data; another nul-terminated string.