atomica.plotting.PlotData¶

class atomica.plotting.PlotData(results, outputs=None, pops=None, output_aggregation=None, pop_aggregation=None, project=None, time_aggregation=None, t_bins=None, accumulate=None)[source]¶

Bases: object

Process model outputs into plottable quantities

This is what gets passed into a plotting function, which displays a View of the data Conceptually, we are applying visuals to the data. But we are performing an extraction step rather than doing it directly because things like labels, colours, groupings etc. only apply to plots, not to results, and there could be several different views of the same data.

Operators for - and / are defined to faciliate looking at differences and relative differences of derived quantities (quantities computed using PlotData operations) across individual results. To keep the implementation tractable, they don’t generalize further than that, and operators + and * are not implemented because these operations rarely make sense for the data being operated on.

Parameters:

results –
Specify which results to plot. Can be
- a Result,
- a list of Results,
- a dict/odict of results (the name of the result is taken from the Result, not the dict)
outputs – The name of an output compartment, characteristic, or parameter, or list of names. Inside a list, a dict can be given to specify an aggregation e.g. outputs=['sus',{'total':['sus','vac']}] where the key is the new name. Or, a formula can be given which will be evaluated by looking up labels within the model object. Links will automatically be summed over
pops – The name of an output population, or list of names. Like outputs, can specify a dict with a list of pops to aggregate over them
output_aggregation –
If an output aggregation is requested, combine the outputs listed using one of
’sum’ - just add values together. If summing together characteristics with a denominator, then
all of the items being included in the sum must have the same denominator. This denominator will then continue to be tracked for the sum.

’average’ - unweighted average of quantities. If averaging characteristics with a denominator, then
the denominator will be dropped afterwards (so subsequent population aggregation will be performed based on total population size)

’weighted’ - weighted average where the weight is the

Link source compartment size, or

Characteristic denominator size, or

Compartment size

depending on the quantities being aggregated. If one quantity has a denominator, then all quantities must have denominators. This method cannot be used with non-transition parameters and a KeyError will result in that case. Note that if using this method to aggregate characteristics, there should generally not be any overlap in the compartments contributing to the denominators.
For aggregating characteristics, the behaviour for combining characteristics a/b and c/d is as follows:
’sum’ will give (a+c)/b and raise an error if b!=d. The denominator b can be used for weighted population aggregation.

’average’ will give ((a/b)+(c/d))/2. No denominator is tracked for weighted population aggregation, so such aggregation would use the total population size.

’weighted’ will give (a+c)/(b+d). The denominator b+d can be used for weighted population aggregation.
The default is ‘weighted’ for dimensionless units, probabilities, rates, and fractions, and ‘sum’ for everything else, with the exception of characteristics with denominators, where
- ’sum’ is used if the characteristics all have the same denominator
- ’weighted’ is used if the characteristics all have different denominators
- ’average’ is used otherwise
pop_aggregation – Same options as output_aggregation, except - ‘weighted’ uses population sizes OR characteristic denominator size. Note that output aggregation is performed before population aggregation. This also means that population aggregation can be used to combine already aggregated outputs (e.g. can first sum ‘sus’+’vac’ within populations, and then take weighted average across populations). This weighting is principally intended for aggregating nondimensional quantities (fractions, proportions etc.). - By default, quantities in ‘duration’ units are averaged across populations rather than summed.
project – Optionally provide a Project object, which will be used to convert names to labels in the outputs for plotting.
time_aggregation – Optionally specify time aggregation method. Supported methods are ‘integrate’ and ‘average’ (no weighting). When aggregating times, non-annualized flow rates will be used.
t_bins –
Optionally specify time bins, which will enable time aggregation. Supported inputs are
- A vector of bin edges. Time points are included if the time is >= the lower bin value and < upper bin value.
- A scalar bin size (e.g. 5) which will be expanded to a vector spanning the data
- The string ‘all’ will maps to bin edges [-inf, inf] aggregating over all time
accumulate – Optionally accumulate outputs over time. Can be ‘sum’ or ‘integrate’ to either sum quantities or integrate by multiplying by the timestep. Accumulation happens after time aggregation. The logic is extremely simple - the quantities in the Series pass through cumsum. If ‘integrate’ is selected, then the quantities are multiplied by dt and the units are multiplied by years

Returns:

A PlotData instance that can be passed to plot_series() or plot_bars()

__getitem__(key)[source]¶

Implement custom indexing

The Series objects stored within PlotData are each bound to a single result, population, and output. This operator makes it possible to easily retrieve a particular Series instance. For example,

>>> d = PlotData(results)
... d['default','0-4','sus']

Parameters:: key (tuple) – A tuple of (result,pop,output)
Returns:: A Series instance

Methods

`accumulate`	Accumulate values over time
`interpolate`	Interpolate all `Series` onto new time values
`programs`	Constructs a PlotData instance from program values
`set_colors`	Assign colors to quantities
`time_aggregate`	Aggregate values over time
`tvals`	Return vector of time values

accumulate(accumulation_method)[source]¶

Accumulate values over time

Accumulation methods are

Parameters:

accumulation_method –

Select whether to add or integrate. Supported methods are: - ‘sum’ : runs cumsum on all quantities - should not be used if units are flow rates (so will check for a timescale).

Summation should be used for compartment-based quantities, such as DALYs

’integrate’integrate using trapezoidal rule, assuming initial value of 0
Note that here there is no concept of ‘dt’ because we might have non-uniform time aggregation bins Therefore, we need to use the time vector actually contained in the Series object (via cumtrapz())

Return type:

None

interpolate(new_tvec)[source]¶

Interpolate all Series onto new time values

This will modify all of the contained Series objects in-place. The modified PlotData instance is also returned, so that interpolation and construction can be performed in one line. i.e. both

>>> d = PlotData(result)
... d.interpolate(tvals)

and

>>> vals = PlotData(result).interpolate(tvals)

will work as intended.

Parameters:: new_tvec – Vector of new time values
Returns:: The modified PlotData instance

static programs(results, outputs=None, t_bins=None, quantity='spending', accumulate=None, nan_outside=False)[source]¶

Constructs a PlotData instance from program values

This alternate constructor can be used to plot program-related quantities such as spending or coverage.

Parameters:

results – single Result, or list of Results
outputs –
specification of which programs to plot spending for. Can be: - the name of a single program - a list of program names - aggregation dict e.g. {‘treatment’:[‘tx-1’,’tx-2’]} or list of such dicts. Output aggregation type is automatically ‘sum’ for

program spending, and aggregation is NOT permitted for coverages (due to modality interactions)
t_bins – aggregate over time, using summation for spending and number coverage, and average for fraction/proportion coverage. Notice that unlike the PlotData() constructor, this function does _not_ allow the time aggregation method to be manually set.
quantity – can be ‘spending’, ‘coverage_number’, ‘coverage_eligible’, or ‘coverage_fraction’. The ‘coverage_eligible’ is the sum of compartments reached by a program, such that coverage_fraction = coverage_number/coverage_eligible
accumulate – can be ‘sum’ or ‘integrate’
nan_outside – If True, then values will be NaN outside the program start/stop year

Returns:

A new PlotData instance

set_colors(colors=None, results='all', pops='all', outputs='all', overwrite=False)[source]¶

Assign colors to quantities

This function facilitates assigned colors to the Series objects contained in this PlotData instance.

Parameters:

colors – Specify the colours to use. This can be - A list of colours that applies to the list of all matching items - A single colour to use for all matching items - The name of a colormap to use (e.g., ‘Blues’)
results – A list of results to set colors for, or a dict of results where the key names the results (e.g. PlotData.results)
pops – A list of pops to set colors for, or a dict of pops where the key names the pops (e.g. PlotData.pops

:param outputs:A list of outputs to set colors for, or a dict of outputs where the key names the outputs (e.g. PlotData.outputs) :type overwrite: :param overwrite: False (default) or True. If True, then any existing manually set colours will be overwritten :return: The PlotData instance (also modified in-place)

Essentially, the lists of results, pops, and outputs are used to filter the Series resulting in a list of Series to operate on. Then, the colors argument is applied to that list.

time_aggregate(t_bins, time_aggregation=None, interpolation_method=None)[source]¶

Aggregate values over time

Note that accumulation is a running total, whereas aggregation refers to binning. The two can be both be applied (aggregation should be performed prior to accumulation).

Normally, aggregation is performed when constructing a PlotData instance and this method does not need to be manually called. However, in rare cases, it may be necessary to explicitly set the interpolation method. Specifically, the interpolation method needs to match the underlying assumption for parameter values. For parameter scenarios, this may require that the ‘previous’ method is used (to match the assumption in the parameter overwrite) rather than relying on the standard assumption that databook quantities can be interpolated directly.

This method modifies the PlotData object in-place. However, the modified object is also returned, so that time aggregation can be chained with other operations, the same as PlotData.interpolate().

Parameters:

t_bins – Vector of bin edges OR a scalar bin size, which will be automatically expanded to a vector of bin edges
time_aggregation – can be ‘integrate’ or ‘average’. Note that for quantities that have a timescale, flow parameters in number units will be adjusted accordingly (e.g. a parameter in units of ‘people/day’ aggregated over a 1 year period will display as the equivalent number of people that year)
interpolation_method – Assumption on how the quantity behaves in between timesteps - in general, ‘linear’ should be suitable for most dynamic quantities, while ‘previous’ should be used for spending and other program-related quantities.

Returns:

The same modified PlotData instance

tvals()[source]¶

Return vector of time values

This method returns a vector of time values for the PlotData object, if all of the series have the same time axis (otherwise it will throw an error). All series must have the same number of timepoints. This will always be the case for a PlotData object unless the instance has been manually modified after construction.

Returns:: Tuple with (array of time values, array of time labels)