Task Rehearsal#
Task Rehearsal is a powerful feature of UPSTAGE, but one that requires some care to implement. The problem that Rehearsal is trying to solve is to reduce the amount of excess code needed to plan the usage of Actors.
For example, you may have an Actor that is an Airplane, and you’d like to estimate how long it can fly for. In UPSTAGE, it is possible to rehearse that actor through its flight path, using planning factors, and see if the final state is feasible.
Rehearsing a Single Task#
Define an actor and a task where some states change:
from upstage_des.utils import waypoint_time_and_dist
class Plane(UP.Actor):
speed = UP.State[float]()
location: UP.CartesianLocation = UP.CartesianLocationChangingState()
fuel: float = UP.LinearChangingState()
fuel_burn = UP.State[float]()
class Fly(UP.Task):
def task(self, *, actor: Plane):
fly_to: list[UP.CartesianLocation] = self.get_actor_knowledge(actor, "destination", must_exist=True)
time, dist = waypoint_time_and_dist(actor.location, fly_to, actor.speed)
print(f"Rehearsing the task: {self._rehearsing}")
print(f"\tFlying {dist:.2f} units over {time:.2f} hrs")
actor.activate_linear_state(
state="fuel",
rate=-actor.fuel_burn,
task=self,
)
actor.activate_location_state(
state="location",
speed=actor.speed,
waypoints=fly_to,
task=self,
)
yield UP.Wait(time)
actor.deactivate_all_states(task=self)
Then rehearse a version of it to get a “cloned” actor with new state:
with UP.EnvironmentContext() as env:
plane = Plane(
name="Plane",
speed=2.0,
location=UP.CartesianLocation(0, 0),
fuel=120,
fuel_burn=1.5,
)
point_1 = UP.CartesianLocation(100, 50)
point_2_options = [
UP.CartesianLocation(50, 50),
UP.CartesianLocation(100, 75),
]
task = Fly()
for point_2 in point_2_options:
fake_plane = task.rehearse(
actor=plane,
knowledge={"destination": [point_1, point_2]},
)
print(f"Final fuel: {fake_plane.fuel:.2f}\n")
print(f"The original plane's fuel: {plane.fuel}")
>>> Rehearsing the task: True
>>> Flying 161.80 units over 80.90 hrs
>>> Final fuel: -1.35
>>>
>>> Rehearsing the task: True
>>> Flying 136.80 units over 68.40 hrs
>>> Final fuel: 17.40
>>>
>>> The original plane's fuel: 120
The key feature is that you call rehearse
on an instance of the task, provide it the actor, and optionally provide any knowledge to give to the actor. Then UPSTAGE runs the task
on a fake environment.
Limits of Rehearsal#
Rehearsal currently only works for one Actor at a time, and while the Actor is clone-able without affecting the rest of the sim, the stage
is not cloned.
If a task references stage
, or looks to other actors, events, stores, etc. the rehearsal may cause side-effects in the actual sim.
The actor states and knowledge are shallow copies during rehearsal, which is one part of the risk of side effects. Since UPSTAGE only knows what you tell it to do
through the yield
, any effects not going through the yield
will likely cause problems for rehearsal.
Rehearsing is best for helping planning code determine which actors are capable of doing a series of tasks that have easily separable side-effects.
Rehearsing Events, Gets, and Puts#
When rehearsing UPSTAGE events, we need to tell UPSTAGE how long to run the fake clock for all non-Wait
events. We do this by setting planning_time_to_complete
in the event initialization.
class ExampleTask(UP.Task):
def task(self, *, actor: UP.Actor):
# Wait for a timeout or an event to succeed
# Pretend 'event' is saved and another process can succeed it
# This is what actor.create_knowledge_event() does (more later)
event = UP.Event(planning_time_to_complete=3.0)
wait = UP.Wait(3.5)
yield UP.Any(event, wait)
# When planning, UP.Any will use the earliest planning time
If the planning time for the event were larger than 3.5, then 3.5 would be the time that passes during rehearsal of the Any
event.
Get
events generally provide a value or object from the container or store. For rehearsal purposes, UPSTAGE sends a special object to the task:
1import simpy as SIM
2
3class OtherTask(UP.Task):
4 def task(self, *, actor: UP.Actor):
5 shelf: SIM.FilterStore = self.stage.a_shelf
6 # Find an item, which is an object that has a `value` attribute
7 item = yield UP.FilterGet(
8 get_location=shelf,
9 filter=lambda x: x.value >= 10,
10 rehearsal_time_to_complete=1.0,
11 )
12 time_to_work: float
13 if item is UP.PLANNING_FACTOR_OBJECT:
14 time_to_work = 3.0
15 else:
16 time_to_work = item.value / 3.14
17 yield UP.Wait(time_to_work)
18
19class Item:
20 def __init__(self, value:float):
21 self.value = value
22
23class Worker(UP.Actor):
24 ...
25
26with UP.EnvironmentContext() as env:
27 store = SIM.FilterStore(env)
28 UP.add_stage_variable("a_shelf", store)
29
30 actor = Worker(name="example")
31
32 task = OtherTask()
33 new_actor = task.rehearse(actor=actor)
34 print(f"Time of completion: {new_actor.env.now}")
35
36 def proc():
37 yield env.timeout(1.0)
38 yield store.put(Item(value=8))
39 yield env.timeout(1.0)
40 yield store.put(Item(value=314))
41
42 env.process(proc())
43 task.run(actor=actor)
44 env.run()
45 print(f"Actual runtime: {env.now}")
46
47>>> Time of completion: 4.0
48>>> Actual runtime: 102.0
Testing if a returned item is a PLANNING_FACTOR_OBJECT
is the only approved way to know if the task is being rehearsed. If there are no
Get
events (everything is time-based)
Put
events have a planning time to complete as well, and do not touch the actual stores/containers given to those events.
Rehearsing a Task Network#
You can rehearse paths through a task network as well, to allow more complicated decision making tests.
In this example, the plane is part of a search and rescue team for natural disaster aid. The plane will fly to as many locations as it can, perform a search, and then fly somewhere else. At the end, it needs to contingency plan for a landing spot that is as far away as possible. Here we’ll use Decision Tasks as a way to do task network planning for both running and rehearsing.
The full example can be found here.
Here is the planning portion of the TaskNetwork that lets us plan a long route to rehearse on, using rehearse_decision
from DecisionTask
. The some_preference_function
is
just a stub for example purposes, showing how to separate the runtime decision logic from the planning logic.
class Planner(UP.DecisionTask):
def make_decision(self, *, actor:Plane):
go_to_loc = some_preference_function(self.stage.search_spots)
if go_to_loc is None: # implies we are done with searching
self.set_actor_task_queue(actor, ["Fly", "Land"])
else:
self.set_actor_knowledge(actor, "destination", go_to_loc, overwrite=True)
self.set_actor_task_queue(actor, ["Fly", "Search"])
def rehearse_decision(self, *, actor:Plane):
# Pop off a destination from the queue, or go "home"
next_dests:list[list[UP.CartesianLocation]] | None= self.get_actor_knowledge(actor, "destination_plan", must_exist=False)
dests: list[UP.CartesianLocation]
task_queue: list[str]
if not next_dests: # fly home
dests = [UP.CartesianLocation(0, 0)]
task_queue = ["Fly", "Land"]
else: # pop a location from the plan
dests = next_dests.pop(0)
self.set_actor_knowledge(actor, "destination_plan", next_dests, overwrite=True)
task_queue = ["Fly", "Search"]
self.set_actor_knowledge(actor, "destination", dests, overwrite=True)
self.set_actor_task_queue(actor, task_queue)
When we run the rehearsal, we make sure to set end_task
to be Land
, so that the network looping takes over from the initial task queue we gave it. If
we hadn’t given end_task
, the rehearsal would have stopped after the 3 tasks in the task_name_list
.
with UP.EnvironmentContext() as env:
search_locs = [
[UP.CartesianLocation(x, y)]
for x, y in [
(10, 20),
(30, 10),
(15, 15),
]
]
plane = Plane(
name="searcher",
speed=2,
fuel=200,
fuel_burn=5.0,
location=UP.CartesianLocation(20, 10),
debug_log=True,
)
net = search_network.make_network()
plane.add_task_network(net)
new_plane = plane.rehearse_network(
net.name,
task_name_list=["Planner", "Fly", "Search"],
knowledge={"destination_plan": search_locs},
end_task="Land",
)
print(f"Fuel left: {new_plane.fuel}")
print(f"Time passed: {new_plane.env.now}")
print(f"Actual time passed: {env.now}")
>>> Rehearsing the task: True
>>> Flying 14.14 units over 7.07 hrs
>>> Rehearsing the task: True
>>> Flying 0.00 units over 0.00 hrs
>>> Rehearsing the task: True
>>> Flying 22.36 units over 11.18 hrs
>>> Rehearsing the task: True
>>> Flying 15.81 units over 7.91 hrs
>>> Rehearsing the task: True
>>> Flying 21.21 units over 10.61 hrs
>>> Fuel left: 6.181482162082084
>>> Time passed: 38.76370356758358
>>> Actual time passed: 0.0