Skip to content

Contribute An Experimentalist

AutoRA experimentalists are meant to return novel experimental conditions based on prior experimental conditions, prior observations, and/or prior models. Such conditions may serve as a basis for new, informative experiments conducted by an experiment runner. Experimentalists are generally implemented as functions that can be integrated into an Experimentalist Pipeline.

Experimentalist Module

Experimentalists can be implemented as poolers or as samplers. - Poolers return a pool of candidate experimental conditions, which can be passed to a sampler that selects a subset of conditions from the pool to be used in the next experiment. - Samplers directly return a subset of experimental conditions from a pool of candidate experimental conditions that already exist.

Repository Setup

We recommend using the cookiecutter template to set up a repository for your experimentalist. Alternatively, you can use the unguided template. If you choose the cookiecutter template, you can set up your repository using

cookiecutter https://github.com/AutoResearch/autora-template-cookiecutter

Make sure to select the experimentalist option when prompted. You may also select whether you want to implement an experimentalist as a sampler, pooler, or custom function. You can skip all other prompts pertaining to other modules (e.g., experiment runners) by pressing enter.

Implementation

Irrespective of whether you are implementing a pooler or a sampler, you should implement a function that returns a set of experimental conditions. This set may be a numpy array, iterator variable or other data format.

Hint

We generally recommend using 2-dimensional numpy arrays as outputs in which each row represents a set of experimental conditions. The columns of the array correspond to the independent variables.

Implementing Poolers

Once you've created your repository, you can implement your experimentalist pooler by editing the init.py file in src/autora/experimentalist/pooler/name_of_your_experimentalist/. You may also add additional files to this directory if needed. It is important that the init.py file contains a function called name_of_your_experimentalist which returns a pool of experimental conditions (e.g., as an iterator object or numpy array).

The following example init.py illustrates the implementation of a simple experimentalist pooler that generates a grid of samples within the specified bounds of each independent variable (IV):

"""
Example Experimentalist Pooler
"""

from itertools import product
from typing import List
from autora.variable import IV


def grid_pool(ivs: List[IV]):
    """
    Creates exhaustive pool from discrete values using a Cartesian product of sets

    Arguments:
        ivs {List[IV]}:  List of independent variables

    Returns:
        pool: An iterator over all possible combinations of IV values
    """

    l_iv_values = []
    for iv in ivs:
        assert iv.allowed_values is not None, (
            f"gridsearch_pool only supports independent variables with discrete allowed values, "
            f"but allowed_values is None on {iv=} "
        )
        l_iv_values.append(iv.allowed_values)

    # Return Cartesian product of all IV values
    return product(*l_iv_values)

Implementing Samplers

Once you've created your repository, you can implement your experimentalist sampler by editing the init.py file in src/autora/experimentalist/sampler/name_of_your_experimentalist/. You may also add additional files to this directory if needed. It is important that the init.py file contains a function called name_of_your_experimentalist which returns a set of experimental conditions (e.g., as a numpy array) given a pool of candidate experimental conditions.

The following example init.py illustrates the implementation of a simple experimentalist sampler that uniformly samples without replacement from a pool of candidate conditions.

"""
Example Experimentalist Sampler
"""

import random
from typing import Iterable, Sequence, Union

random_sample(conditions: Union[Iterable, Sequence], n: int = 1):
    """
    Uniform random sampling without replacement from a pool of conditions.
    Args:
        conditions: Pool of conditions
        n: number of samples to collect

    Returns: Sampled pool

    """

    if isinstance(conditions, Iterable):
        conditions = list(conditions)
    random.shuffle(conditions)
    samples = conditions[0:n]

    return samples

Next Steps: Testing, Documentation, Publishing

For more information on how to test, document, and publish your experimentalist, please refer to the general guideline for module contributions .