Ganymede Metadata Tables

Ganymede metadata tables allow users to query and reference details about Flow runs, tables, and files stored within the Ganymede platform. Typical uses include debugging previous runs, locating the files or table records associated with past runs, and listing the tags that contextualize captured data and keep it traceable.

The tables described below are found in the Ganymede database, and can be referenced from Data Explorer, within Flow code, or in Dashboards.

Flow Runs

The flow_run_metadata table captures a record for each flow execution.

Field Name	Description
flow_run_id	Unique identifier for the Flow run, stored as epoch time in milliseconds
flow_id	Flow name
inputs_dict	Run context for the Flow; corresponds to the run context described in the Ganymede Class overview
initiator	Flow run initiator; agent name + MAC address for agent-initiated Flows, user email for user-initiated Flows, event name for event-triggered Flows, or Flow name if triggered from another Flow
initiator_type	Type of initiator for the Flow run (AGENT, USER, EVENT, FLOW)
flow_version	Commit hash associated with the Flow execution
state	Final disposition of Flow run (Success, Failed)
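Because flow_run_id is stored as epoch time in milliseconds, it can be read as a timestamp when reviewing past runs. The following is a minimal sketch, assuming rows from flow_run_metadata have already been fetched as Python dictionaries and that state values are stored exactly as listed above. The helper names and the sample rows are hypothetical; how the rows are queried is not shown.

```python
from datetime import datetime, timedelta, timezone

EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)


def flow_run_time(flow_run_id: int) -> datetime:
    """Interpret a flow_run_id (epoch milliseconds) as a UTC datetime."""
    return EPOCH + timedelta(milliseconds=flow_run_id)


def failed_runs(rows: list[dict]) -> list[dict]:
    """Keep only rows whose state is 'Failed', oldest run first."""
    failed = [row for row in rows if row["state"] == "Failed"]
    return sorted(failed, key=lambda row: row["flow_run_id"])


# Hypothetical rows shaped like flow_run_metadata records (illustrative values only).
sample_rows = [
    {"flow_run_id": 1700000060500, "flow_id": "example_flow", "state": "Failed"},
    {"flow_run_id": 1700000000000, "flow_id": "example_flow", "state": "Success"},
]

for run in failed_runs(sample_rows):
    print(run["flow_id"], flow_run_time(run["flow_run_id"]).isoformat())
```

Using a timedelta rather than dividing by 1000 keeps the millisecond component exact.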

Table Writes

The public_flow_run_output table contains 1 record per output table written in Ganymede.

Field Name	Description
id	ID for table save
node	Name of node that wrote to table
name	Name of table written to Ganymede table storage
timestamp	Time of the table save; stored as epoch time in fractional seconds
datastream_metadata	JSON containing the UUID associated with the table write transaction and Flow run ID
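Note that timestamp in this table is in fractional seconds, whereas flow_run_id in flow_run_metadata is in milliseconds. The sketch below shows one way to normalize a public_flow_run_output record once it has been fetched as a dictionary. The function name is hypothetical, and the key names inside datastream_metadata ("uuid", "flow_run_id") are assumptions made for illustration; check an actual record for the real keys.

```python
import json
from datetime import datetime, timezone


def parse_table_write(record: dict) -> dict:
    """Convert the epoch-seconds timestamp and decode datastream_metadata."""
    saved_at = datetime.fromtimestamp(record["timestamp"], tz=timezone.utc)
    metadata = record["datastream_metadata"]
    if isinstance(metadata, str):
        # The JSON may arrive as text depending on how the table was queried.
        metadata = json.loads(metadata)
    return {
        "node": record["node"],
        "name": record["name"],
        "saved_at": saved_at,
        "metadata": metadata,
    }


# Hypothetical record; the metadata key names are assumed, not documented.
sample_record = {
    "id": 1,
    "node": "example_node",
    "name": "example_table",
    "timestamp": 1700000000.5,
    "datastream_metadata": '{"uuid": "00000000-0000-0000-0000-000000000000", "flow_run_id": 1700000000000}',
}

parsed = parse_table_write(sample_record)
print(parsed["name"], parsed["saved_at"].isoformat())
```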

File Writes

The file_metadata table contains 1 record per file written in Ganymede.

Field Name	Description
bucketType	Either "Input" or "Output"; indicates whether the file was input into Ganymede or an output from processing performed in Ganymede
uri	Full URI to the file
path	Path within storage bucket to file
size	Size of the file in bytes
createdAt	Time of the file save, stored as epoch time in milliseconds
creator	Name of the file creator (e.g., email address for users, agent name + MAC address for Agents)
fileTags	List of JSON objects containing Tags associated with the file
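As an illustration of how these fields might be used together, the sketch below totals file sizes by bucketType. It assumes file_metadata rows have already been fetched as dictionaries keyed by the field names above; the function name, paths, and sizes are hypothetical.

```python
from collections import defaultdict


def bytes_by_bucket_type(files: list[dict]) -> dict[str, int]:
    """Sum the size field (bytes) separately for each bucketType value."""
    totals: dict[str, int] = defaultdict(int)
    for file_record in files:
        totals[file_record["bucketType"]] += file_record["size"]
    return dict(totals)


# Hypothetical rows shaped like file_metadata records (illustrative values only).
sample_files = [
    {"bucketType": "Input", "path": "example/raw_1.csv", "size": 1024},
    {"bucketType": "Input", "path": "example/raw_2.csv", "size": 2048},
    {"bucketType": "Output", "path": "example/summary.csv", "size": 512},
]

print(bytes_by_bucket_type(sample_files))
```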

Python Packages

The ganymede_env_python_package_list table lists all Python packages installed in the workflow environment.

Field Name	Description
package_name	Name of the Python package
version_number	Version of the Python package
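One possible use of this table is to reproduce the workflow environment locally. The sketch below turns rows from ganymede_env_python_package_list into pip-style version pins. It assumes the rows have already been fetched as dictionaries; the function name and sample versions are hypothetical.

```python
def to_requirement_pins(rows: list[dict]) -> list[str]:
    """Format each package row as 'name==version', sorted by package name."""
    return sorted(f'{row["package_name"]}=={row["version_number"]}' for row in rows)


# Hypothetical rows; the versions are illustrative, not the actual environment.
sample_packages = [
    {"package_name": "pandas", "version_number": "2.2.2"},
    {"package_name": "numpy", "version_number": "1.26.4"},
]

print("\n".join(to_requirement_pins(sample_packages)))
```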

Pandas <-> Ganymede DB Field Reference

SQL column names are subject to character restrictions that do not apply to Pandas column names. __TABLE_SCHEMA contains the mapping between the table field names referenced in NodeReturn and the corresponding SQL column names.

The retrieve_tables method of the Ganymede class provides a convenient way to retrieve tables in notebooks with their original Pandas DataFrame column names.

This table contains 1 record per table write transaction.

Field Name	Description
pd_field_name	Pandas field name
pd_field_type	Pandas field column type
bq_table_name	Table name in Ganymede data lake
bq_field_name	Column name in Ganymede data lake
flow_run_id	Flow run that the table write is associated with
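When a table is queried directly rather than through retrieve_tables, the mapping can be applied by hand. The sketch below builds a dictionary from SQL column names to original Pandas names, assuming __TABLE_SCHEMA rows have already been fetched as dictionaries. The function name is hypothetical, and the sample column names are invented; the actual renaming rules applied by Ganymede are not described here.

```python
from typing import Optional


def sql_to_pandas_names(
    schema_rows: list[dict],
    bq_table_name: str,
    flow_run_id: Optional[int] = None,
) -> dict[str, str]:
    """Map bq_field_name -> pd_field_name for one table, optionally for one Flow run."""
    mapping: dict[str, str] = {}
    for row in schema_rows:
        if row["bq_table_name"] != bq_table_name:
            continue
        if flow_run_id is not None and row["flow_run_id"] != flow_run_id:
            continue
        mapping[row["bq_field_name"]] = row["pd_field_name"]
    return mapping


# Hypothetical rows; the SQL-side names are invented for illustration.
sample_schema = [
    {"pd_field_name": "OD 600 (AU)", "pd_field_type": "float64",
     "bq_table_name": "example_table", "bq_field_name": "OD_600__AU_",
     "flow_run_id": 1700000000000},
    {"pd_field_name": "Well", "pd_field_type": "object",
     "bq_table_name": "example_table", "bq_field_name": "Well",
     "flow_run_id": 1700000000000},
    {"pd_field_name": "Sample ID", "pd_field_type": "object",
     "bq_table_name": "other_table", "bq_field_name": "Sample_ID",
     "flow_run_id": 1700000000000},
]

column_map = sql_to_pandas_names(sample_schema, "example_table")
print(column_map)
```

The resulting dictionary can be passed to pandas DataFrame.rename(columns=column_map) to restore the original column names on a queried table.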
