Using YAML

In addition to CLI parameters, Grid supports the use of YML files so you don't have to pass in many parameters in all occasions. If you don't change compute parameters often, we suggest you create a Grid config file and use that instead.

Grid Run Spec Overview

The following YML file contains a commented version of every YML key. When writing config file you only need to provide keys and values for the parameters that you actually want to change to what you need. There is no need to have all keys in the config, the ones missing will have the default CLI values.

note

Use either cluster context or make sure to place your cluster ID in the cluster field, replacing the prod-2 value.

Config can be used to specify arguments that will be used to run experiments. Apart from parametrization config allows to define custom actions executed in given moments of the experitment lifecycle, and pass environment variables. Tutorial on how to create runs from config file and fully leverage its capabilities is available [here](../1_Creating Runs/2_Adv Runs/3_creating-runs-from-config.md) To check the defaults please refer to the CLI docs here.

# Main compute configuration. Defines parameters for `grid run` command. 
compute:

  # For BYOC users, we can specify directly which cluster to run experiments on.
  provider:
    cluster: prod-2                         # [str] --cluster

  # Train section defines the training environment each experiement eg. instance type, datastore access, custom actions.
  train:
    framework: lightning                    # [enum] --framework
    environment:
      MY_ENVIRONMENT_VARIABLE: "example"    # Pass in environment variables
    dependency_file_info:
      path: null                            # [str] --dependency_file
    instance: m5a.large                     # [enum] --instance_type
    use_spot: False                         # [str] --use_spot
    storage_gb: 100                         # [int] --memory [should be greater than 100Gb]
    localdir: False                         # [bool] --localdir
    dockerfile: null                        # [str] --dockerfile
    auto_resume: bool                       # [bool] --auto_resume (flag)
    datastore_name: null                    # [str] --datastore_name
    datastore_version: null                 # [int] --datastore_version
    datastore_mount_dir: null               # [str] by default mounts to /datastores/{datastore_name}  --datastore_mount_dir
    actions:                                # Pass list of commands to be executed at given moment.
      on_build: null                        # List[str]
      on_build_start: null
      on_build_end: null
      on_experiment_end: null
      on_experiment_start: null
    scratch:
      size_gb: 100                          # [int] --scratch_size default 100 int
      mount_path: /tmp/scratch              # [str] --scratch_mount_path default

# This section defines the hyperparameters settings. 
# We currently have 3 strategies: grid_search, random_search and none.
# strategy_options are only for the random_search strategy.
hyper_params:
  settings:
    strategy: grid_search                   # [enum] One of `['grid_search', 'random_search', 'none']`
    strategy_options:                       # For 'random_search' we must at least define num_trials 
      num_trials: 1                         # [int] we must specify the num_trials (via CLI or config) if we use random_search strategy (--num_trials)
      seed: 0                               # [int] --seed
  
  # Additional script parameters appended as "{key} {value}"
  params: [dict[str, str]]                  # Dict of key: value entries appended to as script parameters
    --foo: 1                                # [str: str]
    -bar: something

Creating Runs With a Config File

You can create a Run with a config file instead of passing CLI arguments. For example:

grid run --config my_config.yml script.py

Grid Session Spec Overview

It's also possible to use config files to define parameters for Grid Sessions.

# Main compute configuration. Defines parameters for `grid session` command. 
compute:

  # For BYOC users, we can specify directly which cluster to run experiments on.
  provider:
    cluster: prod-2                         # [str] --cluster
  train:
    instance_type: m5a.large                # [str] --instance_type
    disk_size: 200                          # [int] --disk_size
    use_spot: False                         # [bool] --use_spot
    datastore_name: null                    # [str] --datastore_name
    datastore_version: null                 # [int] --datastore_version
    datastore_mount_dir: null               # [str] --datastore_mount_dir

Creating Sessions With a Config File

To create a new Session defined in my_session_config.yml file, just run:

grid session create --config my_session_config.yml

Grid Run Spec Overview​

note

Creating Runs With a Config File​

Grid Session Spec Overview​

Creating Sessions With a Config File​

Grid Run Spec Overview

Creating Runs With a Config File

Grid Session Spec Overview

Creating Sessions With a Config File