Asynchronous Bayesian optimization with Trieste
In this notebook we demonstrate Trieste’s ability to perform asynchronous Bayesian optimisation, which suits scenarios where the objective function can be evaluated at several points in parallel but the observations may come back at different times. To avoid wasting resources while waiting for a whole batch to finish, we request the next point as soon as any observation arrives, taking into account the points that are still being evaluated. Besides saving resources, the asynchronous approach can also improve sample efficiency compared with synchronous batch strategies, although this depends heavily on the use case.
To contrast this approach with regular batch optimization, this notebook also shows how to run a parallel synchronous batch approach.
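The resource argument can be made concrete with a toy scheduling simulation. This is only a sketch and not part of Trieste: the function names `sync_makespan` and `async_makespan` and the example durations are assumptions made for illustration, and the time spent by the optimizer itself is ignored.

```python
import heapq


def sync_makespan(durations, num_workers):
    """Total time when every batch waits for its slowest evaluation."""
    total = 0.0
    for start in range(0, len(durations), num_workers):
        batch = durations[start:start + num_workers]
        total += max(batch)
    return total


def async_makespan(durations, num_workers):
    """Total time when a freed worker immediately takes the next evaluation."""
    # min-heap holding the time at which each worker becomes free
    free_at = [0.0] * num_workers
    heapq.heapify(free_at)
    for duration in durations:
        start = heapq.heappop(free_at)
        heapq.heappush(free_at, start + duration)
    return max(free_at)


# made-up evaluation times, in seconds
durations = [1.0, 4.0, 2.0, 1.0, 1.0, 3.0]
print(sync_makespan(durations, num_workers=2))   # 9.0
print(async_makespan(durations, num_workers=2))  # 7.0
```

In a real run the two strategies would query different points, so the durations themselves would differ. The sketch isolates only the scheduling effect.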
[1]:
# silence TF warnings and info messages, only print errors
# https://stackoverflow.com/questions/35911252/disable-tensorflow-debugging-information
import os
os.environ["TF_CPP_MIN_LOG_LEVEL"] = "3"
import tensorflow as tf
tf.get_logger().setLevel("ERROR")
import numpy as np
import time
import timeit
First, let’s define a simple objective that emulates evaluations taking a variable amount of time. We use the Branin function, a classic Bayesian optimisation benchmark, with a sleep call inserted into the calculation to emulate a delay. The delay is a scaled sum of the input values, which makes the delays uneven.
[2]:
from trieste.objectives import ScaledBranin


def objective(points, sleep=True):
    if points.shape[1] != 2:
        raise ValueError(
            f"Incorrect input shape, expected (*, 2), got {points.shape}"
        )

    observations = []
    for point in points:
        observation = ScaledBranin.objective(point)
        if sleep:
            # insert some artificial delay that grows linearly
            # with the sum of the point's coordinates,
            # so evaluations take different amounts of time
            delay = 3 * np.sum(point)
            pid = os.getpid()
            print(
                f"Process {pid}: Objective: pretends like it's doing something for {delay:.2}s",
                flush=True,
            )
            time.sleep(delay)
        observations.append(observation)

    return np.array(observations)


# test the defined objective function
objective(np.array([[0.1, 0.5]]), sleep=False)
[2]:
array([[-0.42052567]])
As always, we need to prepare the model and some initial data to kick-start the optimization process.
[3]:
from trieste.space import Box
from trieste.data import Dataset
search_space = Box([0, 0], [1, 1])
num_initial_points = 3
initial_query_points = search_space.sample(num_initial_points)
initial_observations = objective(initial_query_points.numpy(), sleep=False)
initial_data = Dataset(
    query_points=initial_query_points,
    observations=tf.constant(initial_observations, dtype=tf.float64),
)
import gpflow
from trieste.models.gpflow import GaussianProcessRegression, build_gpr
# We set the likelihood variance to a small number because
# we are dealing with a noise-free problem.
gpflow_model = build_gpr(initial_data, search_space, likelihood_variance=1e-7)
model = GaussianProcessRegression(gpflow_model)
# these imports will be used later for optimization
from trieste.acquisition import LocalPenalization
from trieste.acquisition.rule import (
    AsynchronousGreedy,
    EfficientGlobalOptimization,
)
from trieste.ask_tell_optimization import AskTellOptimizer
Multiprocessing setup
To keep this notebook as reproducible as possible, we will only be using Python’s multiprocessing package here. In this section we will explain our setup and define some common code to be used later.
In both the synchronous and asynchronous scenarios we will have a fixed set of worker processes performing observations. We will also have a main process responsible for running the optimization with Trieste. When Trieste suggests a new point, it is inserted into a points queue. One of the workers picks this point from the queue, performs the observation, and inserts the output into the observations queue. The main process then picks up the observation from the queue, at which point it either waits for the rest of the points in the batch to come back (synchronous scenario) or immediately suggests a new point (asynchronous scenario). This process continues either for a certain number of iterations or until we have accumulated the necessary number of observations.
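The two-queue protocol described above can be sketched in miniature. To stay compact and portable, this sketch uses threads and the standard-library `queue.Queue` rather than the processes and manager queues used in the rest of the notebook. The names `toy_objective` and `worker` are hypothetical stand-ins, not the notebook's functions.

```python
import queue
import threading


def toy_objective(x):
    # stand-in for an expensive evaluation
    return x * x


def worker(points_queue, observations_queue):
    while True:
        point = points_queue.get()
        if point is None:  # sentinel value: no more work
            return
        observations_queue.put((point, toy_objective(point)))


points_queue = queue.Queue()
observations_queue = queue.Queue()
workers = [
    threading.Thread(target=worker, args=(points_queue, observations_queue))
    for _ in range(3)
]
for w in workers:
    w.start()

# the "main" side: submit points, then collect one observation per point
submitted = [1, 2, 3, 4, 5]
for p in submitted:
    points_queue.put(p)
results = dict(observations_queue.get() for _ in submitted)

# shut the workers down by sending one sentinel each
for _ in workers:
    points_queue.put(None)
for w in workers:
    w.join()

print(sorted(results.items()))
```

Observations may arrive in any order, which is why the results are collected into a dictionary keyed by the point rather than assumed to match the submission order.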
The overall setup is illustrated in this diagram:
[4]:
# Necessary multiprocessing primitives
from multiprocessing import Manager, Process
We now define several common functions to implement the described setup. First we define a worker function that runs observations in a separate process. The worker takes both queues as input, reads the next point from the points queue, makes an observation, and inserts the observed data into the observations queue.
[5]:
def observer_proc(points_queue, observations_queue):
    pid = os.getpid()

    while True:
        point_to_observe = points_queue.get()
        if point_to_observe is None:
            return

        print(
            f"Process {pid}: Observer : observing data at point {point_to_observe}",
            flush=True,
        )
        new_observation = objective(point_to_observe, sleep=enable_sleep_delays)
        new_data = (point_to_observe, new_observation)

        print(f"Process {pid}: Observer : observed data {new_data}", flush=True)

        observations_queue.put(new_data)
Next we define two helper functions: one creates a given number of worker processes, and the other terminates them once we are done.
[6]:
def create_worker_processes(n_workers, points_queue, observations_queue):
    observer_processes = []
    for i in range(n_workers):
        worker_proc = Process(
            target=observer_proc, args=(points_queue, observations_queue)
        )
        worker_proc.daemon = True
        worker_proc.start()

        observer_processes.append(worker_proc)

    return observer_processes


def terminate_processes(processes):
    for prc in processes:
        prc.terminate()
        prc.join()
        prc.close()
Finally, we set some common parameters. See the comments below for an explanation of each one.
[7]:
# Number of worker processes to run simultaneously
# Setting this to 1 will turn both setups into non-batch sequential optimization
num_workers = 3
# Number of iterations to run the synchronous scenario for
num_iterations = 10
# Number of observations to collect in the asynchronous scenario
num_observations = num_workers * num_iterations
# Set this flag to False to disable sleep delays in case you want the notebook to execute quickly
enable_sleep_delays = True
Asynchronous optimization
This section runs the asynchronous optimization routine. We first set up the ask/tell optimizer, as we cannot hand over the evaluation of the objective to Trieste. Next we create queues for points and observations that can be shared between processes, and run the optimization loop.
Crucially, even though we are using the batch acquisition function Local Penalization, we specify a batch size of 1. This is because we don’t really want a batch. Since the number of workers we have is fixed, whenever we see a new observation we only need one point back. However, this can only be done with acquisition functions that implement greedy batch collection strategies, because they are able to take into account points that are currently being observed (in Trieste we call them “pending”). Trieste currently provides two such functions: Local Penalization and GIBBON. Notice that we use the AsynchronousGreedy rule, which is specifically designed for using greedy batch acquisition functions in asynchronous scenarios.
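The role of pending points can be illustrated with a toy one-dimensional sketch. This is not Trieste’s Local Penalization implementation: the made-up acquisition values, the Gaussian-shaped penalizer, and the names `penalized_argmax` and `length_scale` are assumptions chosen only to show that candidates close to pending points are down-weighted.

```python
import math


def penalized_argmax(candidates, acquisition, pending, length_scale=0.1):
    """Return the candidate with the highest acquisition value after
    multiplying by a penalty that vanishes at each pending point."""
    best_x, best_score = None, -math.inf
    for x, a in zip(candidates, acquisition):
        score = a
        for p in pending:
            # penalty is 0 at the pending point and approaches 1 far from it
            score *= 1.0 - math.exp(-((x - p) ** 2) / (2 * length_scale**2))
        if score > best_score:
            best_x, best_score = x, score
    return best_x


candidates = [0.0, 0.25, 0.5, 0.75, 1.0]
acquisition = [0.1, 0.6, 1.0, 0.7, 0.2]  # made-up non-negative values

# each new request treats all previously requested points as pending
first = penalized_argmax(candidates, acquisition, pending=[])
second = penalized_argmax(candidates, acquisition, pending=[first])
third = penalized_argmax(candidates, acquisition, pending=[first, second])
print(first, second, third)  # 0.5 0.75 0.25
```

Without the penalty every request would return the same maximizer, 0.5, so all workers would evaluate the same point.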
[8]:
# exception raised by get_nowait() when the queue is empty
from queue import Empty

# setup Ask Tell BO
local_penalization_acq = LocalPenalization(search_space, num_samples=2000)
local_penalization_rule = AsynchronousGreedy(builder=local_penalization_acq) # type: ignore

async_bo = AskTellOptimizer(
    search_space, initial_data, model, local_penalization_rule
)

# retrieve process id for nice logging
pid = os.getpid()
# create point and observation queues
m = Manager()
pq = m.Queue()
oq = m.Queue()
# keep track of all workers we have launched
observer_processes = []
# counter to keep track of collected observations
points_observed = 0

start = timeit.default_timer()
try:
    observer_processes = create_worker_processes(num_workers, pq, oq)

    # init the queue with first batch of points
    for _ in range(num_workers):
        point = async_bo.ask()
        pq.put(np.atleast_2d(point.numpy()))

    while points_observed < num_observations:
        # keep asking queue for new observations until one arrives
        try:
            new_data = oq.get_nowait()
            print(
                f"Process {pid}: Main : received data {new_data}",
                flush=True,
            )
        except Empty:
            continue

        # new_data is a tuple of (point, observation value)
        # here we turn it into a Dataset and tell Trieste about it
        points_observed += 1
        new_data = Dataset(
            query_points=tf.constant(new_data[0], dtype=tf.float64),
            observations=tf.constant(new_data[1], dtype=tf.float64),
        )
        async_bo.tell(new_data)

        # now we can ask Trieste for one more point
        # and feed that back into the points queue
        point = async_bo.ask()
        print(f"Process {pid}: Main : acquired point {point}", flush=True)
        pq.put(np.atleast_2d(point))
finally:
    terminate_processes(observer_processes)
stop = timeit.default_timer()

# Collect the observations, compute the running time
async_lp_observations = (
    async_bo.to_result().try_get_final_dataset().observations
    - ScaledBranin.minimum
)
async_lp_time = stop - start
print(f"Got {len(async_lp_observations)} observations in {async_lp_time:.2f}s")
Process 2651: Observer : observing data at point [[0.17653843 0.69629403]]
Process 2651: Objective: pretends like it's doing something for 2.6s
Process 2655: Observer : observing data at point [[0.23254628 0.57564957]]
Process 2655: Objective: pretends like it's doing something for 2.4s
Process 2658: Observer : observing data at point [[0.05729346 0.63342206]]
Process 2658: Objective: pretends like it's doing something for 2.1s
Process 2651: Observer : observed data (array([[0.17653843, 0.69629403]]), array([[-0.9926945]]))
Process 2614: Main : received data (array([[0.17653843, 0.69629403]]), array([[-0.9926945]]))
Process 2658: Observer : observed data (array([[0.05729346, 0.63342206]]), array([[-0.4211653]]))
Process 2655: Observer : observed data (array([[0.23254628, 0.57564957]]), array([[-0.85157757]]))
Process 2614: Main : acquired point [[0.08764324 0.80981755]]
Process 2614: Main : received data (array([[0.05729346, 0.63342206]]), array([[-0.4211653]]))
Process 2651: Observer : observing data at point [[0.08764324 0.80981755]]
Process 2651: Objective: pretends like it's doing something for 2.7s
Process 2614: Main : acquired point [[0.27877215 0.67023365]]
Process 2614: Main : received data (array([[0.23254628, 0.57564957]]), array([[-0.85157757]]))
Process 2658: Observer : observing data at point [[0.27877215 0.67023365]]
Process 2658: Objective: pretends like it's doing something for 2.8s
Process 2651: Observer : observed data (array([[0.08764324, 0.80981755]]), array([[-0.97898575]]))
Process 2614: Main : acquired point [[0.19366093 0.6478782 ]]
Process 2614: Main : received data (array([[0.08764324, 0.80981755]]), array([[-0.97898575]]))
Process 2655: Observer : observing data at point [[0.19366093 0.6478782 ]]
Process 2655: Objective: pretends like it's doing something for 2.5s
Process 2614: Main : acquired point [[0.0497703 1. ]]
Process 2651: Observer : observing data at point [[0.0497703 1. ]]
Process 2651: Objective: pretends like it's doing something for 3.1s
Process 2658: Observer : observed data (array([[0.27877215, 0.67023365]]), array([[-0.59959542]]))
Process 2614: Main : received data (array([[0.27877215, 0.67023365]]), array([[-0.59959542]]))
Process 2655: Observer : observed data (array([[0.19366093, 0.6478782 ]]), array([[-0.95444084]]))
Process 2614: Main : acquired point [[0.09498329 0.9117591 ]]
Process 2658: Observer : observing data at point [[0.09498329 0.9117591 ]]
Process 2614: Main : received data (array([[0.19366093, 0.6478782 ]]), array([[-0.95444084]]))
Process 2658: Objective: pretends like it's doing something for 3.0s
Process 2614: Main : acquired point [[0.13067861 0.78088743]]
Process 2655: Observer : observing data at point [[0.13067861 0.78088743]]
Process 2655: Objective: pretends like it's doing something for 2.7s
Process 2651: Observer : observed data (array([[0.0497703, 1. ]]), array([[-0.94422236]]))
Process 2614: Main : received data (array([[0.0497703, 1. ]]), array([[-0.94422236]]))
Process 2614: Main : acquired point [[0.11264332 0.88837479]]
Process 2651: Observer : observing data at point [[0.11264332 0.88837479]]
Process 2658: Observer : observed data (array([[0.09498329, 0.9117591 ]]), array([[-1.02812565]]))
Process 2651: Objective: pretends like it's doing something for 3.0s
Process 2614: Main : received data (array([[0.09498329, 0.9117591 ]]), array([[-1.02812565]]))
Process 2655: Observer : observed data (array([[0.13067861, 0.78088743]]), array([[-1.0444855]]))
Process 2614: Main : acquired point [[0.64710938 0.31605182]]
Process 2614: Main : received data (array([[0.13067861, 0.78088743]]), array([[-1.0444855]]))
Process 2658: Observer : observing data at point [[0.64710938 0.31605182]]
Process 2658: Objective: pretends like it's doing something for 2.9s
Process 2651: Observer : observed data (array([[0.11264332, 0.88837479]]), array([[-1.03684969]]))
Process 2614: Main : acquired point [[0.84535024 0. ]]
Process 2655: Observer : observing data at point [[0.84535024 0. ]]
Process 2614: Main : received data (array([[0.11264332, 0.88837479]]), array([[-1.03684969]]))
Process 2655: Objective: pretends like it's doing something for 2.5s
Process 2614: Main : acquired point [[0.91207017 0.61179124]]
Process 2651: Observer : observing data at point [[0.91207017 0.61179124]]
Process 2651: Objective: pretends like it's doing something for 4.6s
Process 2658: Observer : observed data (array([[0.64710938, 0.31605182]]), array([[-0.64502695]]))
Process 2614: Main : received data (array([[0.64710938, 0.31605182]]), array([[-0.64502695]]))
Process 2655: Observer : observed data (array([[0.84535024, 0. ]]), array([[-0.79306441]]))
Process 2614: Main : acquired point [[0.93815424 0.27878362]]
Process 2614: Main : received data (array([[0.84535024, 0. ]]), array([[-0.79306441]]))
Process 2658: Observer : observing data at point [[0.93815424 0.27878362]]
Process 2658: Objective: pretends like it's doing something for 3.7s
Process 2614: Main : acquired point [[1. 0.42809993]]
Process 2655: Observer : observing data at point [[1. 0.42809993]]
Process 2655: Objective: pretends like it's doing something for 4.3s
Process 2651: Observer : observed data (array([[0.91207017, 0.61179124]]), array([[0.01536883]]))
Process 2614: Main : received data (array([[0.91207017, 0.61179124]]), array([[0.01536883]]))
Process 2658: Observer : observed data (array([[0.93815424, 0.27878362]]), array([[-0.95995401]]))
Process 2614: Main : acquired point [[0.67247371 0. ]]
Process 2614: Main : received data (array([[0.93815424, 0.27878362]]), array([[-0.95995401]]))
Process 2651: Observer : observing data at point [[0.67247371 0. ]]
Process 2651: Objective: pretends like it's doing something for 2.0s
Process 2655: Observer : observed data (array([[1. , 0.42809993]]), array([[-0.79269351]]))
Process 2614: Main : acquired point [[1. 0.05484506]]
Process 2614: Main : received data (array([[1. , 0.42809993]]), array([[-0.79269351]]))
Process 2658: Observer : observing data at point [[1. 0.05484506]]
Process 2658: Objective: pretends like it's doing something for 3.2s
Process 2651: Observer : observed data (array([[0.67247371, 0. ]]), array([[-0.76498905]]))
Process 2614: Main : acquired point [[1. 0.13977834]]
Process 2614: Main : received data (array([[0.67247371, 0. ]]), array([[-0.76498905]]))
Process 2655: Observer : observing data at point [[1. 0.13977834]]
Process 2655: Objective: pretends like it's doing something for 3.4s
Process 2614: Main : acquired point [[1. 0.19056766]]
Process 2651: Observer : observing data at point [[1. 0.19056766]]
Process 2651: Objective: pretends like it's doing something for 3.6s
Process 2658: Observer : observed data (array([[1. , 0.05484506]]), array([[-0.92614506]]))
Process 2614: Main : received data (array([[1. , 0.05484506]]), array([[-0.92614506]]))
Process 2614: Main : acquired point [[1. 0.22306645]]
Process 2658: Observer : observing data at point [[1. 0.22306645]]
Process 2658: Objective: pretends like it's doing something for 3.7s
Process 2655: Observer : observed data (array([[1. , 0.13977834]]), array([[-1.00183856]]))
Process 2614: Main : received data (array([[1. , 0.13977834]]), array([[-1.00183856]]))
Process 2614: Main : acquired point [[0.48869902 0.17826542]]
Process 2655: Observer : observing data at point [[0.48869902 0.17826542]]
Process 2655: Objective: pretends like it's doing something for 2.0s
Process 2651: Observer : observed data (array([[1. , 0.19056766]]), array([[-1.01724728]]))
Process 2614: Main : received data (array([[1. , 0.19056766]]), array([[-1.01724728]]))
Process 2614: Main : acquired point [[0.50232246 0.06278467]]
Process 2651: Observer : observing data at point [[0.50232246 0.06278467]]
Process 2651: Objective: pretends like it's doing something for 1.7s
Process 2655: Observer : observed data (array([[0.48869902, 0.17826542]]), array([[-0.987901]]))
Process 2614: Main : received data (array([[0.48869902, 0.17826542]]), array([[-0.987901]]))
Process 2658: Observer : observed data (array([[1. , 0.22306645]]), array([[-1.01538369]]))
Process 2614: Main : acquired point [[0.52550337 0. ]]
Process 2614: Main : received data (array([[1. , 0.22306645]]), array([[-1.01538369]]))
Process 2655: Observer : observing data at point [[0.52550337 0. ]]
Process 2655: Objective: pretends like it's doing something for 1.6s
Process 2651: Observer : observed data (array([[0.50232246, 0.06278467]]), array([[-0.94823752]]))
Process 2614: Main : acquired point [[0.53917509 0.04877306]]
Process 2658: Observer : observing data at point [[0.53917509 0.04877306]]
Process 2614: Main : received data (array([[0.50232246, 0.06278467]]), array([[-0.94823752]]))
Process 2658: Objective: pretends like it's doing something for 1.8s
Process 2655: Observer : observed data (array([[0.52550337, 0. ]]), array([[-0.92229289]]))
Process 2614: Main : acquired point [[0.55096393 0.14824658]]
Process 2614: Main : received data (array([[0.52550337, 0. ]]), array([[-0.92229289]]))
Process 2651: Observer : observing data at point [[0.55096393 0.14824658]]
Process 2651: Objective: pretends like it's doing something for 2.1s
Process 2658: Observer : observed data (array([[0.53917509, 0.04877306]]), array([[-0.9987134]]))
Process 2614: Main : acquired point [[0.54759658 0.1682307 ]]
Process 2655: Observer : observing data at point [[0.54759658 0.1682307 ]]
Process 2655: Objective: pretends like it's doing something for 2.1s
Process 2614: Main : received data (array([[0.53917509, 0.04877306]]), array([[-0.9987134]]))
Process 2651: Observer : observed data (array([[0.55096393, 0.14824658]]), array([[-1.04596563]]))
Process 2614: Main : acquired point [[0.54369449 0.16069791]]
Process 2614: Main : received data (array([[0.55096393, 0.14824658]]), array([[-1.04596563]]))
Process 2658: Observer : observing data at point [[0.54369449 0.16069791]]
Process 2658: Objective: pretends like it's doing something for 2.1s
Process 2655: Observer : observed data (array([[0.54759658, 0.1682307 ]]), array([[-1.04512884]]))
Process 2614: Main : acquired point [[0.95326664 0.15430506]]
Process 2614: Main : received data (array([[0.54759658, 0.1682307 ]]), array([[-1.04512884]]))
Process 2651: Observer : observing data at point [[0.95326664 0.15430506]]
Process 2651: Objective: pretends like it's doing something for 3.3s
Process 2658: Observer : observed data (array([[0.54369449, 0.16069791]]), array([[-1.04696464]]))
Process 2614: Main : acquired point [[0.95588106 0.16330272]]
Process 2614: Main : received data (array([[0.54369449, 0.16069791]]), array([[-1.04696464]]))
Process 2655: Observer : observing data at point [[0.95588106 0.16330272]]
Process 2655: Objective: pretends like it's doing something for 3.4s
Process 2614: Main : acquired point [[0.95231677 0.14574362]]
Process 2658: Observer : observing data at point [[0.95231677 0.14574362]]
Process 2658: Objective: pretends like it's doing something for 3.3s
Process 2651: Observer : observed data (array([[0.95326664, 0.15430506]]), array([[-1.0458726]]))
Process 2614: Main : received data (array([[0.95326664, 0.15430506]]), array([[-1.0458726]]))
Process 2614: Main : acquired point [[0.96559621 0.17939837]]
Process 2651: Observer : observing data at point [[0.96559621 0.17939837]]
Process 2651: Objective: pretends like it's doing something for 3.4s
Process 2655: Observer : observed data (array([[0.95588106, 0.16330272]]), array([[-1.04666006]]))
Process 2614: Main : received data (array([[0.95588106, 0.16330272]]), array([[-1.04666006]]))
Process 2658: Observer : observed data (array([[0.95231677, 0.14574362]]), array([[-1.04500693]]))
Process 2614: Main : acquired point [[0.12184377 0.82364613]]
Process 2614: Main : received data (array([[0.95231677, 0.14574362]]), array([[-1.04500693]]))
Process 2655: Observer : observing data at point [[0.12184377 0.82364613]]
Process 2655: Objective: pretends like it's doing something for 2.8s
Process 2614: Main : acquired point [[0.12202188 0.82284247]]
Process 2658: Observer : observing data at point [[0.12202188 0.82284247]]
Got 33 observations in 52.35s
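The asynchronous loop in this section polls the observations queue with `get_nowait()`, which keeps the main process busy while it waits. One possible alternative is a blocking `get` with a timeout. The sketch below uses the standard-library `queue.Queue` and a thread standing in for a worker process. The helper name `wait_for_observation` and the timeout values are assumptions, and the sketch relies on manager queues exposing the same `get(timeout=...)` interface as `queue.Queue`.

```python
import queue
import threading
import time


def wait_for_observation(observations_queue, poll_timeout=0.05, max_wait=5.0):
    """Block until an observation arrives, waking up periodically.

    Raises TimeoutError if nothing arrives within max_wait seconds.
    """
    deadline = time.monotonic() + max_wait
    while time.monotonic() < deadline:
        try:
            # the process sleeps inside get() instead of spinning
            return observations_queue.get(timeout=poll_timeout)
        except queue.Empty:
            continue
    raise TimeoutError("no observation received")


obs_queue = queue.Queue()


def slow_worker():
    time.sleep(0.2)  # emulate an evaluation delay
    obs_queue.put(("point", "observation"))


threading.Thread(target=slow_worker).start()
result = wait_for_observation(obs_queue)
print(result)
```

The periodic wake-up leaves room for other checks in the main loop, such as detecting a crashed worker, without consuming a full CPU core while idle.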
Synchronous parallel optimization
This section runs the synchronous parallel optimization with Trieste. We again use the Local Penalization acquisition function, but this time with a batch size equal to the number of workers we have available. Once Trieste suggests the batch, we add all points to the points queue, and the workers immediately pick them up, one point per worker. Therefore all points in the batch are evaluated in parallel.
[9]:
# setup Ask Tell BO
gpflow_model = build_gpr(initial_data, search_space, likelihood_variance=1e-7)
model = GaussianProcessRegression(gpflow_model)

local_penalization_acq = LocalPenalization(search_space, num_samples=2000)
local_penalization_rule = EfficientGlobalOptimization(  # type: ignore
    num_query_points=num_workers, builder=local_penalization_acq
)

sync_bo = AskTellOptimizer(
    search_space, initial_data, model, local_penalization_rule
)

# retrieve process id for nice logging
pid = os.getpid()
# create point and observation queues
m = Manager()
pq = m.Queue()
oq = m.Queue()
# keep track of all workers we have launched
observer_processes = []

start = timeit.default_timer()
try:
    observer_processes = create_worker_processes(num_workers, pq, oq)

    # BO loop starts here
    for i in range(num_iterations):
        print(f"Process {pid}: Main : iteration {i} starts", flush=True)

        # get a batch of points from Trieste, send them to points queue
        # each worker picks up a point and processes it
        points = sync_bo.ask()
        for point in points.numpy():
            pq.put(point.reshape(1, -1))  # reshape is to make point a 2d array

        # now we wait for all workers to finish
        # we create an empty dataset and wait
        # until we collected as many observations in it
        # as there were points in the batch
        all_new_data = Dataset(
            tf.zeros((0, initial_data.query_points.shape[1]), tf.float64),
            tf.zeros((0, initial_data.observations.shape[1]), tf.float64),
        )
        while len(all_new_data) < num_workers:
            # this line blocks the process until new data is available in the queue
            new_data = oq.get()
            print(
                f"Process {pid}: Main : received data {new_data}",
                flush=True,
            )

            new_data = Dataset(
                query_points=tf.constant(new_data[0], dtype=tf.float64),
                observations=tf.constant(new_data[1], dtype=tf.float64),
            )
            all_new_data = all_new_data + new_data

        # tell Trieste of new batch of observations
        sync_bo.tell(all_new_data)
finally:
    terminate_processes(observer_processes)
stop = timeit.default_timer()

# Collect the observations, compute the running time
sync_lp_observations = (
    sync_bo.to_result().try_get_final_dataset().observations
    - ScaledBranin.minimum
)
sync_lp_time = stop - start
print(f"Got {len(sync_lp_observations)} observations in {sync_lp_time:.2f}s")
Process 2614: Main : iteration 0 starts
Process 3086: Observer : observing data at point [[0.05731238 0.63303423]]
Process 3082: Observer : observing data at point [[0.23273984 0.57722276]]
Process 3078: Observer : observing data at point [[0.17653843 0.69629402]]
Process 3082: Objective: pretends like it's doing something for 2.4s
Process 3086: Objective: pretends like it's doing something for 2.1s
Process 3078: Objective: pretends like it's doing something for 2.6s
Process 3086: Observer : observed data (array([[0.05731238, 0.63303423]]), array([[-0.42017528]]))
Process 2614: Main : received data (array([[0.05731238, 0.63303423]]), array([[-0.42017528]]))
Process 3082: Observer : observed data (array([[0.23273984, 0.57722276]]), array([[-0.85110062]]))
Process 2614: Main : received data (array([[0.23273984, 0.57722276]]), array([[-0.85110062]]))
Process 3078: Observer : observed data (array([[0.17653843, 0.69629402]]), array([[-0.9926945]]))
Process 2614: Main : received data (array([[0.17653843, 0.69629402]]), array([[-0.9926945]]))
Process 2614: Main : iteration 1 starts
Process 3086: Observer : observing data at point [[0.19949945 0.65451424]]
Process 3078: Observer : observing data at point [[0.18898222 0.63851184]]
Process 3082: Observer : observing data at point [[0.21783596 0.67336004]]
Process 3082: Objective: pretends like it's doing something for 2.7s
Process 3078: Objective: pretends like it's doing something for 2.5s
Process 3086: Objective: pretends like it's doing something for 2.6s
Process 3078: Observer : observed data (array([[0.18898222, 0.63851184]]), array([[-0.96175301]]))
Process 2614: Main : received data (array([[0.18898222, 0.63851184]]), array([[-0.96175301]]))
Process 3086: Observer : observed data (array([[0.19949945, 0.65451424]]), array([[-0.94053742]]))
Process 2614: Main : received data (array([[0.19949945, 0.65451424]]), array([[-0.94053742]]))
Process 3082: Observer : observed data (array([[0.21783596, 0.67336004]]), array([[-0.87474186]]))
Process 2614: Main : received data (array([[0.21783596, 0.67336004]]), array([[-0.87474186]]))
Process 2614: Main : iteration 2 starts
Process 3078: Observer : observing data at point [[0.06564085 1. ]]
Process 3086: Observer : observing data at point [[0.53485058 0.44588956]]
Process 3082: Observer : observing data at point [[0.15195812 0.7041801 ]]
Process 3086: Objective: pretends like it's doing something for 2.9s
Process 3078: Objective: pretends like it's doing something for 3.2s
Process 3082: Objective: pretends like it's doing something for 2.6s
Process 3082: Observer : observed data (array([[0.15195812, 0.7041801 ]]), array([[-1.02118059]]))
Process 2614: Main : received data (array([[0.15195812, 0.7041801 ]]), array([[-1.02118059]]))
Process 3086: Observer : observed data (array([[0.53485058, 0.44588956]]), array([[-0.68704465]]))
Process 2614: Main : received data (array([[0.53485058, 0.44588956]]), array([[-0.68704465]]))
Process 3078: Observer : observed data (array([[0.06564085, 1. ]]), array([[-0.97587444]]))
Process 2614: Main : received data (array([[0.06564085, 1. ]]), array([[-0.97587444]]))
Process 2614: Main : iteration 3 starts
Process 3078: Observer : observing data at point [[0.76103935 0.06964228]]
Process 3082: Observer : observing data at point [[0.80153633 0.35858277]]
Process 3086: Observer : observing data at point [[0.79406236 0.66462767]]
Process 3082: Objective: pretends like it's doing something for 3.5s
Process 3086: Objective: pretends like it's doing something for 4.4s
Process 3078: Objective: pretends like it's doing something for 2.5s
Process 3078: Observer : observed data (array([[0.76103935, 0.06964228]]), array([[-0.67927059]]))
Process 2614: Main : received data (array([[0.76103935, 0.06964228]]), array([[-0.67927059]]))
Process 3082: Observer : observed data (array([[0.80153633, 0.35858277]]), array([[-0.38899693]]))
Process 2614: Main : received data (array([[0.80153633, 0.35858277]]), array([[-0.38899693]]))
Process 3086: Observer : observed data (array([[0.79406236, 0.66462767]]), array([[0.77719976]]))
Process 2614: Main : received data (array([[0.79406236, 0.66462767]]), array([[0.77719976]]))
Process 2614: Main : iteration 4 starts
Process 3078: Observer : observing data at point [[0.58622862 0.16637797]]
Process 3086: Observer : observing data at point [[1. 0.]]
Process 3082: Observer : observing data at point [[0.5699846 0.05814076]]
Process 3082: Objective: pretends like it's doing something for 1.9s
Process 3086: Objective: pretends like it's doing something for 3.0s
Process 3078: Objective: pretends like it's doing something for 2.3s
Process 3082: Observer : observed data (array([[0.5699846 , 0.05814076]]), array([[-1.00865879]]))
Process 2614: Main : received data (array([[0.5699846 , 0.05814076]]), array([[-1.00865879]]))
Process 3078: Observer : observed data (array([[0.58622862, 0.16637797]]), array([[-1.00074998]]))
Process 2614: Main : received data (array([[0.58622862, 0.16637797]]), array([[-1.00074998]]))
Process 3086: Observer : observed data (array([[1., 0.]]), array([[-0.84406373]]))
Process 2614: Main : received data (array([[1., 0.]]), array([[-0.84406373]]))
Process 2614: Main : iteration 5 starts
Process 3086: Observer : observing data at point [[0.46410346 0.2588423 ]]
Process 3082: Observer : observing data at point [[0.48347455 0.20073793]]
Process 3078: Observer : observing data at point [[0.49391118 0.142604 ]]
Process 3082: Objective: pretends like it's doing something for 2.1s
Process 3086: Objective: pretends like it's doing something for 2.2s
Process 3078: Objective: pretends like it's doing something for 1.9s
Process 3078: Observer : observed data (array([[0.49391118, 0.142604 ]]), array([[-0.98831364]]))
Process 2614: Main : received data (array([[0.49391118, 0.142604 ]]), array([[-0.98831364]]))
Process 3082: Observer : observed data (array([[0.48347455, 0.20073793]]), array([[-0.97890336]]))
Process 2614: Main : received data (array([[0.48347455, 0.20073793]]), array([[-0.97890336]]))
Process 3086: Observer : observed data (array([[0.46410346, 0.2588423 ]]), array([[-0.92800356]]))
Process 2614: Main : received data (array([[0.46410346, 0.2588423 ]]), array([[-0.92800356]]))
Process 2614: Main : iteration 6 starts
Process 3086: Observer : observing data at point [[0.53909973 0.16520308]]
Process 3082: Observer : observing data at point [[0.54043642 0.15632245]]
Process 3078: Observer : observing data at point [[1. 0.24717253]]
Process 3078: Objective: pretends like it's doing something for 3.7s
Process 3082: Objective: pretends like it's doing something for 2.1s
Process 3086: Objective: pretends like it's doing something for 2.1s
Process 3082: Observer : observed data (array([[0.54043642, 0.15632245]]), array([[-1.04724587]]))
Process 2614: Main : received data (array([[0.54043642, 0.15632245]]), array([[-1.04724587]]))
Process 3086: Observer : observed data (array([[0.53909973, 0.16520308]]), array([[-1.04662257]]))
Process 2614: Main : received data (array([[0.53909973, 0.16520308]]), array([[-1.04662257]]))
Process 3078: Observer : observed data (array([[1. , 0.24717253]]), array([[-1.00809151]]))
Process 2614: Main : received data (array([[1. , 0.24717253]]), array([[-1.00809151]]))
Process 2614: Main : iteration 7 starts
Process 3082: Observer : observing data at point [[0.94769512 0.16169429]]
Process 3078: Observer : observing data at point [[0.940581 0.13026355]]
Process 3086: Observer : observing data at point [[0.94991018 0.19278587]]
Process 3082: Objective: pretends like it's doing something for 3.3s
Process 3086: Objective: pretends like it's doing something for 3.4s
Process 3078: Objective: pretends like it's doing something for 3.2s
Process 3078: Observer : observed data (array([[0.940581 , 0.13026355]]), array([[-1.03686261]]))
Process 2614: Main : received data (array([[0.940581 , 0.13026355]]), array([[-1.03686261]]))
Process 3082: Observer : observed data (array([[0.94769512, 0.16169429]]), array([[-1.04307479]]))
Process 2614: Main : received data (array([[0.94769512, 0.16169429]]), array([[-1.04307479]]))
Process 3086: Observer : observed data (array([[0.94991018, 0.19278587]]), array([[-1.03846888]]))
Process 2614: Main : received data (array([[0.94991018, 0.19278587]]), array([[-1.03846888]]))
Process 2614: Main : iteration 8 starts
Process 3082: Observer : observing data at point [[0.11167913 0.89022522]]
Process 3078: Observer : observing data at point [[0.11387566 0.87492846]]
Process 3086: Observer : observing data at point [[0.11642047 0.85973345]]
Process 3082: Objective: pretends like it's doing something for 3.0s
Process 3086: Objective: pretends like it's doing something for 2.9s
Process 3078: Objective: pretends like it's doing something for 3.0s
Process 3086: Observer : observed data (array([[0.11642047, 0.85973345]]), array([[-1.04387606]]))
Process 2614: Main : received data (array([[0.11642047, 0.85973345]]), array([[-1.04387606]]))
Process 3078: Observer : observed data (array([[0.11387566, 0.87492846]]), array([[-1.04078541]]))
Process 2614: Main : received data (array([[0.11387566, 0.87492846]]), array([[-1.04078541]]))
Process 3082: Observer : observed data (array([[0.11167913, 0.89022522]]), array([[-1.03656965]]))
Process 2614: Main : received data (array([[0.11167913, 0.89022522]]), array([[-1.03656965]]))
Process 2614: Main : iteration 9 starts
Process 3086: Observer : observing data at point [[0.96912604 0.17199467]]
Process 3078: Observer : observing data at point [[0.96866126 0.169513 ]]
Process 3082: Observer : observing data at point [[0.9696072 0.17446955]]
Process 3078: Objective: pretends like it's doing something for 3.4s
Process 3086: Objective: pretends like it's doing something for 3.4s
Process 3082: Objective: pretends like it's doing something for 3.4s
Process 3078: Observer : observed data (array([[0.96866126, 0.169513 ]]), array([[-1.04636352]]))
Process 2614: Main : received data (array([[0.96866126, 0.169513 ]]), array([[-1.04636352]]))
Process 3086: Observer : observed data (array([[0.96912604, 0.17199467]]), array([[-1.04623203]]))
Process 2614: Main : received data (array([[0.96912604, 0.17199467]]), array([[-1.04623203]]))
Process 3082: Observer : observed data (array([[0.9696072 , 0.17446955]]), array([[-1.04604936]]))
Process 2614: Main : received data (array([[0.9696072 , 0.17446955]]), array([[-1.04604936]]))
Got 33 observations in 51.86s
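The log above shows the cost of synchronisation: a new iteration starts only after all three workers have returned, so every iteration lasts as long as its slowest evaluation. As a rough sketch, the snippet below takes the delays printed for iterations 5 to 9 and estimates how long workers sat idle. The helper names `sync_wall_time` and `idle_worker_time` are hypothetical and not part of Trieste, and the estimate ignores model fitting, acquisition optimisation and inter-process communication.

```python
# Delays (in seconds) printed by the objective for iterations 5-9 in the log
# above, one tuple per synchronous batch of three workers.
batch_delays = [
    (2.1, 2.2, 1.9),
    (3.7, 2.1, 2.1),
    (3.3, 3.4, 3.2),
    (3.0, 2.9, 3.0),
    (3.4, 3.4, 3.4),
]


def sync_wall_time(batches):
    """Hypothetical helper: each batch lasts as long as its slowest evaluation."""
    return sum(max(batch) for batch in batches)


def idle_worker_time(batches):
    """Hypothetical helper: total time workers wait for the slowest in their batch."""
    return sum(max(batch) - delay for batch in batches for delay in batch)


print(f"evaluation wall time: {sync_wall_time(batch_delays):.1f}s")
print(f"idle worker time:     {idle_worker_time(batch_delays):.1f}s")
```

Most of the idle time in this sample comes from iteration 6, where one point took 3.7s and the other two 2.1s. An asynchronous loop could have used that waiting time to start new evaluations.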
Comparison
To compare the outcomes of the sync and async runs, let’s plot their respective regrets side by side and report each run’s running time in the plot title. For this toy problem we expect the async scenario to run a little faster on machines with multiple CPUs.
[10]:
from trieste.experimental.plotting import plot_regret
import matplotlib.pyplot as plt
fig, ax = plt.subplots(1, 2)
# index of the best (lowest) observation in each run
sync_lp_min_idx = tf.squeeze(tf.argmin(sync_lp_observations, axis=0))
async_lp_min_idx = tf.squeeze(tf.argmin(async_lp_observations, axis=0))
# left panel: synchronous batch run
plot_regret(
    sync_lp_observations.numpy(),
    ax[0],
    num_init=len(initial_data),
    idx_best=sync_lp_min_idx,
)
ax[0].set_yscale("log")
ax[0].set_ylabel("Regret")
ax[0].set_ylim(0.0000001, 100)
ax[0].set_xlabel("# evaluations")
ax[0].set_title(
    f"Sync LP, {len(sync_lp_observations)} points, time {sync_lp_time:.2f}s"
)
# right panel: asynchronous run, same axes limits for a fair comparison
plot_regret(
    async_lp_observations.numpy(),
    ax[1],
    num_init=len(initial_data),
    idx_best=async_lp_min_idx,
)
ax[1].set_yscale("log")
ax[1].set_ylabel("Regret")
ax[1].set_ylim(0.0000001, 100)
ax[1].set_xlabel("# evaluations")
ax[1].set_title(
    f"Async LP, {len(async_lp_observations)} points, time {async_lp_time:.2f}s"
)
fig.tight_layout()
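To make the plotted quantity concrete, here is a minimal NumPy sketch of how a regret curve can be derived from raw observations: subtract the known minimum, then take a running minimum to get the best regret found so far. The sample values are copied from the log output earlier in this notebook. The constant `assumed_minimum` is an assumption used only for illustration; in practice the value exposed by the benchmark itself should be preferred.

```python
import numpy as np

# A few objective values copied from the log output, in order of arrival.
observations = np.array(
    [-0.84406373, -0.98831364, -0.97890336, -0.92800356, -1.04724587]
)

# ASSUMPTION: approximate global minimum of the scaled Branin function,
# hard-coded here only to keep the sketch self-contained.
assumed_minimum = -1.047393

# Regret of each evaluation: how far it is from the (assumed) optimum.
regret = observations - assumed_minimum

# Best regret found so far after each evaluation (non-increasing curve).
best_so_far = np.minimum.accumulate(regret)

for i, (r, b) in enumerate(zip(regret, best_so_far)):
    print(f"evaluation {i}: regret {r:.6f}, best so far {b:.6f}")
```

Because regret spans several orders of magnitude as the optimiser converges, a logarithmic y-axis, as used in the plots above, makes late-stage improvements visible.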