The 3ML workflow

Generally, an analysis in 3ML is performed in 3 steps:

1. Load the data: one or more datasets are loaded and then listed in a DataList object.
2. Define the model: a model for the data is defined by including one or more PointSource, ExtendedSource or ParticleSource instances.
3. Perform a likelihood or a Bayesian analysis: the data and the model are used together in either a maximum likelihood analysis or a Bayesian analysis.

Loading data

3ML is built around the concept of plugins. A plugin is used to load a particular type of data, or the data from a particular instrument. There is a plugin for optical data, one for X-ray data, one for Fermi/LAT data and so on. Plugin instances can be added and removed at the loading stage without changing any other stage of the analysis (but of course, you need to rerun all stages to update the results).

First, let’s import 3ML:

[1]:

import warnings

warnings.simplefilter("ignore")
import numpy as np

np.seterr(all="ignore")

[1]:

{'divide': 'warn', 'over': 'warn', 'under': 'ignore', 'invalid': 'warn'}

[2]:

%%capture
from threeML import *
import matplotlib.pyplot as plt

[3]:

from jupyterthemes import jtplot

%matplotlib inline
jtplot.style(context="talk", fscale=1, ticks=True, grid=False)
set_threeML_style()
silence_warnings()

Let’s start by loading one dataset, which in the 3ML workflow means creating an instance of the appropriate plugin:

[4]:

# Get some example data
from threeML.io.package_data import get_path_of_data_file

data_path = get_path_of_data_file("datasets/xy_powerlaw.txt")

# Create an instance of the XYLike plugin, which allows analyzing simple x,y points
# with error bars
xyl = XYLike.from_text_file("xyl", data_path)

# Let's plot it just to see what we have loaded
fig = xyl.plot(x_scale="log", y_scale="log")

../_images/notebooks_The_3ML_workflow_6_0.png

Now we need to create a DataList object, which in this case contains only one instance:

[5]:

data = DataList(xyl)

The DataList object can receive one or more plugin instances on initialization. So for example, to use two datasets we can simply do:

[6]:

# Create the second instance, this time of a different type

pha = get_path_of_data_file("datasets/ogip_powerlaw.pha")
bak = get_path_of_data_file("datasets/ogip_powerlaw.bak")
rsp = get_path_of_data_file("datasets/ogip_powerlaw.rsp")

ogip = OGIPLike("ogip", pha, bak, rsp)

# Now use both plugins
data = DataList(xyl, ogip)

FILTER is not set. This is not a compliant OGIP file. Assuming no FILTER.
The default choice for MATRIX extension failed:KeyError("Extension ('MATRIX', 1) not found.")available: None 'EBOUNDS' 'SPECRESP MATRIX'
FILTER is not set. This is not a compliant OGIP file. Assuming no FILTER.

The DataList object accepts any number of plugins as input.

You can also create a list of plugins, and then create a DataList using the argument unpacking (“expansion”) feature of the Python language ('*'), like this:

[7]:

# This is equivalent to writing data = DataList(xyl, ogip)

my_plugins = [xyl, ogip]
data = DataList(*my_plugins)

This is useful if you need to create the list of plugins at runtime, for example when looping over many files.
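
The unpacking pattern can be tried without any data files. The following is a minimal pure-Python sketch: FakePlugin and collect_plugins are hypothetical stand-ins (not part of 3ML) for a plugin class and for the DataList constructor, and the file names are made up for illustration.

```python
# Hypothetical stand-ins so the sketch runs without 3ML installed.
class FakePlugin:
    def __init__(self, name, path):
        self.name = name
        self.path = path


def collect_plugins(*plugins):
    """Accept any number of positional arguments, like DataList."""
    return {p.name: p for p in plugins}


# Build the plugin list at runtime by looping over (made-up) file names
paths = ["obs_0.txt", "obs_1.txt", "obs_2.txt"]
my_plugins = [FakePlugin(f"xyl_{i}", path) for i, path in enumerate(paths)]

# The * operator unpacks the list into separate positional arguments, so this
# is equivalent to collect_plugins(my_plugins[0], my_plugins[1], my_plugins[2])
data = collect_plugins(*my_plugins)
print(sorted(data))
```

With real data, the list comprehension would create one plugin instance per file, and the final call would be DataList(*my_plugins).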

Define the model

After you have loaded your data, you need to define a model for them. A model is a collection of one or more sources. A source represents an astrophysical object, such as a star, a galaxy, or a molecular cloud. There are 3 kinds of sources: PointSource, ExtendedSource and ParticleSource. The latter is used only in special situations. The models are defined using the astromodels package. Here we will only go through the basics. You can find a lot more information here: astromodels.readthedocs.org

Point sources

A point source is characterized by a name, a position, and a spectrum. These are some examples:

[8]:

# A point source with a power law spectrum

source1_sp = Powerlaw()
source1 = PointSource("source1", ra=23.5, dec=-22.7, spectral_shape=source1_sp)

# Another source with a log-parabolic spectrum plus a power law

source2_sp = Log_parabola() + Powerlaw()
source2 = PointSource("source2", ra=30.5, dec=-27.1, spectral_shape=source2_sp)

# A third source defined in terms of its Galactic latitude and longitude
source3_sp = Cutoff_powerlaw()
source3 = PointSource("source3", l=216.1, b=-74.56, spectral_shape=source3_sp)

Extended sources

An extended source is characterized by its spatial shape and its spectral shape:

[9]:

# An extended source with a Gaussian shape centered on R.A., Dec = (30.5, -27.1)
# and a sigma of 3.0 degrees
ext1_spatial = Gaussian_on_sphere(lon0=30.5, lat0=-27.1, sigma=3.0)
ext1_spectral = Powerlaw()

ext1 = ExtendedSource("ext1", ext1_spatial, ext1_spectral)

# An extended source with a 3D function
# (i.e., the function defines both the spatial and the spectral shape)
ext2_spatial = Continuous_injection_diffusion()
ext2 = ExtendedSource("ext2", ext2_spatial)

NOTE: not all plugins support extended sources. For example, the XYLike plugin we used above does not, as it is meant for data without spatial resolution.

Create the likelihood model

Now that we have defined our sources, we can create a model simply as:

[10]:

model = Model(source1, source2, source3, ext1, ext2)

# We can see a summary of the model like this:
model.display(complete=True)

Model summary:

	N
Point sources	3
Extended sources	2
Particle sources	0

Free parameters (19):

	value	min_value	max_value	unit
source1.spectrum.main.Powerlaw.K	1.0	0.0	1000.0	keV-1 s-1 cm-2
source1.spectrum.main.Powerlaw.index	-2.01	-10.0	10.0
source2.spectrum.main.composite.K_1	1.0	0.0	100000.0	keV-1 s-1 cm-2
source2.spectrum.main.composite.alpha_1	-2.0	None	None
source2.spectrum.main.composite.beta_1	1.0	None	None
source2.spectrum.main.composite.K_2	1.0	0.0	1000.0	keV-1 s-1 cm-2
source2.spectrum.main.composite.index_2	-2.01	-10.0	10.0
source3.spectrum.main.Cutoff_powerlaw.K	1.0	0.0	1000.0	keV-1 s-1 cm-2
source3.spectrum.main.Cutoff_powerlaw.index	-2.0	-10.0	10.0
source3.spectrum.main.Cutoff_powerlaw.xc	10.0	1.0	None	keV
ext1.Gaussian_on_sphere.lon0	30.5	0.0	360.0	deg
ext1.Gaussian_on_sphere.lat0	-27.1	-90.0	90.0	deg
ext1.Gaussian_on_sphere.sigma	3.0	0.0	20.0	deg
ext1.spectrum.main.Powerlaw.K	1.0	0.0	1000.0	keV-1 s-1 cm-2
ext1.spectrum.main.Powerlaw.index	-2.01	-10.0	10.0
ext2.Continuous_injection_diffusion.lon0	0.0	0.0	360.0	deg
ext2.Continuous_injection_diffusion.lat0	0.0	-90.0	90.0	deg
ext2.Continuous_injection_diffusion.rdiff0	1.0	0.0	20.0	deg
ext2.spectrum.main.Constant.k	0.0	None	None

Fixed parameters (16):

	value	min_value	max_value	unit
source1.position.ra	23.5	0.0	360.0	deg
source1.position.dec	-22.7	-90.0	90.0	deg
source1.spectrum.main.Powerlaw.piv	1.0	None	None	keV
source2.position.ra	30.5	0.0	360.0	deg
source2.position.dec	-27.1	-90.0	90.0	deg
source2.spectrum.main.composite.piv_1	1.0	None	None	keV
source2.spectrum.main.composite.piv_2	1.0	None	None	keV
source3.position.l	216.1	0.0	360.0	deg
source3.position.b	-74.56	-90.0	90.0	deg
source3.spectrum.main.Cutoff_powerlaw.piv	1.0	None	None	keV
ext1.spectrum.main.Powerlaw.piv	1.0	None	None	keV
ext2.Continuous_injection_diffusion.rinj	100.0	0.0	200.0
ext2.Continuous_injection_diffusion.delta	0.5	0.3	0.6
ext2.Continuous_injection_diffusion.b	3.0	1.0	10.0
ext2.Continuous_injection_diffusion.piv	20000000000.0	0.0	None	keV
ext2.Continuous_injection_diffusion.piv2	1000000000.0	0.0	None	keV

Properties (0):

(none)

Linked parameters (0):

(none)

Independent variables:

(none)

Linked functions (0):

(none)

You can easily interact with the model. For example:

[11]:

# Fix a parameter
model.source1.spectrum.main.Powerlaw.K.fix = True
# or
model.source1.spectrum.main.Powerlaw.K.free = False

# Free it again
model.source1.spectrum.main.Powerlaw.K.free = True
# or
model.source1.spectrum.main.Powerlaw.K.fix = False

# Change the value
model.source1.spectrum.main.Powerlaw.K = 2.3
# or using physical units (they need to be compatible with what is shown
# in the table above)
model.source1.spectrum.main.Powerlaw.K = 2.3 * 1 / (u.cm**2 * u.s * u.TeV)

# Change the boundaries for the parameter
model.source1.spectrum.main.Powerlaw.K.bounds = (1e-10, 1.0)
# you can use units here as well, like:
model.source1.spectrum.main.Powerlaw.K.bounds = (
    1e-5 * 1 / (u.cm**2 * u.s * u.TeV),
    10.0 * 1 / (u.cm**2 * u.s * u.TeV),
)

# Link two parameters so that they are forced to have the same value
model.link(
    model.source2.spectrum.main.composite.K_1, model.source1.spectrum.main.Powerlaw.K
)

# Link two parameters with a law. The parameters of the law become free
# parameters in the fit. In this case we impose a linear relationship
# between the index of the log-parabolic spectrum and the index of the
# power law in source2: index_2 = b * alpha_1 + a.

law = Line()
model.link(
    model.source2.spectrum.main.composite.index_2,
    model.source2.spectrum.main.composite.alpha_1,
    law,
)

# If you want to force them to be in a specific relationship,
# say index_2 = alpha_1 + 1, just fix a and b to the corresponding values
# after the linking, like:
# model.source2.spectrum.main.composite.index_2.Line.a = 1.0
# model.source2.spectrum.main.composite.index_2.Line.a.fix = True
# model.source2.spectrum.main.composite.index_2.Line.b = 1.0
# model.source2.spectrum.main.composite.index_2.Line.b.fix = True

# Now display() will show the links
model.display(complete=True)

We have set the min_value of source1.spectrum.main.Powerlaw.K to 1e-99 because there was a postive transform
We have set the min_value of source1.spectrum.main.Powerlaw.K to 1e-99 because there was a postive transform

Model summary:

	N
Point sources	3
Extended sources	2
Particle sources	0

Free parameters (19):

	value	min_value	max_value	unit
source1.spectrum.main.Powerlaw.K	0.0	0.0	0.0	keV-1 s-1 cm-2
source1.spectrum.main.Powerlaw.index	-2.01	-10.0	10.0
source2.spectrum.main.composite.alpha_1	-2.0	None	None
source2.spectrum.main.composite.beta_1	1.0	None	None
source2.spectrum.main.composite.K_2	1.0	0.0	1000.0	keV-1 s-1 cm-2
source2.spectrum.main.composite.index_2.Line.a	0.0	None	None
source2.spectrum.main.composite.index_2.Line.b	1.0	None	None
source3.spectrum.main.Cutoff_powerlaw.K	1.0	0.0	1000.0	keV-1 s-1 cm-2
source3.spectrum.main.Cutoff_powerlaw.index	-2.0	-10.0	10.0
source3.spectrum.main.Cutoff_powerlaw.xc	10.0	1.0	None	keV
ext1.Gaussian_on_sphere.lon0	30.5	0.0	360.0	deg
ext1.Gaussian_on_sphere.lat0	-27.1	-90.0	90.0	deg
ext1.Gaussian_on_sphere.sigma	3.0	0.0	20.0	deg
ext1.spectrum.main.Powerlaw.K	1.0	0.0	1000.0	keV-1 s-1 cm-2
ext1.spectrum.main.Powerlaw.index	-2.01	-10.0	10.0
ext2.Continuous_injection_diffusion.lon0	0.0	0.0	360.0	deg
ext2.Continuous_injection_diffusion.lat0	0.0	-90.0	90.0	deg
ext2.Continuous_injection_diffusion.rdiff0	1.0	0.0	20.0	deg
ext2.spectrum.main.Constant.k	0.0	None	None

Fixed parameters (18):

	value	min_value	max_value	unit
source1.position.ra	23.5	0.0	360.0	deg
source1.position.dec	-22.7	-90.0	90.0	deg
source1.spectrum.main.Powerlaw.piv	1.0	None	None	keV
source2.position.ra	30.5	0.0	360.0	deg
source2.position.dec	-27.1	-90.0	90.0	deg
source2.spectrum.main.composite.K_1.Line.a	0.0	None	None	keV-1 s-1 cm-2
source2.spectrum.main.composite.K_1.Line.b	1.0	None	None
source2.spectrum.main.composite.piv_1	1.0	None	None	keV
source2.spectrum.main.composite.piv_2	1.0	None	None	keV
source3.position.l	216.1	0.0	360.0	deg
source3.position.b	-74.56	-90.0	90.0	deg
source3.spectrum.main.Cutoff_powerlaw.piv	1.0	None	None	keV
ext1.spectrum.main.Powerlaw.piv	1.0	None	None	keV
ext2.Continuous_injection_diffusion.rinj	100.0	0.0	200.0
ext2.Continuous_injection_diffusion.delta	0.5	0.3	0.6
ext2.Continuous_injection_diffusion.b	3.0	1.0	10.0
ext2.Continuous_injection_diffusion.piv	20000000000.0	0.0	None	keV
ext2.Continuous_injection_diffusion.piv2	1000000000.0	0.0	None	keV

Properties (0):

(none)

Linked parameters (2):

	source2.spectrum.main.composite.K_1
linked to	source1.spectrum.main.Powerlaw.K
function	Line
current value	0.0
unit	1 / (keV s cm2)

	source2.spectrum.main.composite.index_2
linked to	source2.spectrum.main.composite.alpha_1
function	Line
current value	-2.0
unit

Independent variables:

(none)

Linked functions (0):

(none)
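
The effect of a Line link can be checked with plain arithmetic. The sketch below assumes the law has the form y = b * x + a, which is what the displayed defaults (a = 0, b = 1) and the linked value of -2.0 for index_2 suggest. The line function is a hypothetical stand-in, not the astromodels implementation.

```python
def line(x, a=0.0, b=1.0):
    """Assumed form of the linear law: intercept a, slope b."""
    return b * x + a


alpha_1 = -2.0  # value of the parameter index_2 is linked to

# Defaults (a = 0, b = 1): the linked parameter simply follows alpha_1
index_2_default = line(alpha_1)

# Slope 1 and intercept 1: index_2 = alpha_1 + 1
index_2_shifted = line(alpha_1, a=1.0, b=1.0)

print(index_2_default, index_2_shifted)
```

Under this assumption the default link reproduces the "current value" shown in the Linked parameters table, and fixing both coefficients to 1 gives an offset of one between the two indices.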

Now, for the following steps, let’s keep it simple and use a single point source:

[12]:

new_model = Model(source1)

source1_sp.K.bounds = (0.01, 100)

We have set the min_value of source1.spectrum.main.Powerlaw.K to 1e-99 because there was a postive transform
The current value of the parameter K (2.300000000000001e-09) was below the new minimum 0.01.
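
The tiny value in the message above comes from the earlier assignment of K in TeV units: K is stored in keV-1 s-1 cm-2, and 1 TeV = 1e9 keV. A quick check of the arithmetic in plain Python (no 3ML or astropy needed; the variable names are illustrative):

```python
KEV_PER_TEV = 1e9  # 1 TeV = 1e9 keV

# K was set to 2.3 in units of 1 / (cm2 s TeV) in an earlier cell
k_per_tev = 2.3

# Expressed per keV, the same normalization is 1e9 times smaller
k_per_kev = k_per_tev / KEV_PER_TEV

new_min = 0.01  # lower bound requested with source1_sp.K.bounds = (0.01, 100)
print(k_per_kev, k_per_kev < new_min)
```

The converted value lies far below the new lower bound, which is what the message reports.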

A model can be saved to disk and reloaded later:

[13]:

new_model.save("new_model.yml", overwrite=True)

new_model_reloaded = load_model("new_model.yml")

The output is in YAML format, a human-readable text-based format.

Perform the analysis

Maximum likelihood analysis

Now that we have the data and the model, we can perform an analysis very easily:

[14]:

data = DataList(ogip)

jl = JointLikelihood(new_model, data)

best_fit_parameters, likelihood_values = jl.fit()

Best fit values:

	result	unit
parameter
source1.spectrum.main.Powerlaw.K	(9.0 -3.0 +5) x 10^-1	1 / (keV s cm2)
source1.spectrum.main.Powerlaw.index	-1.98 +/- 0.07

Correlation matrix:

1.00	-0.99
-0.99	1.00

Values of -log(likelihood) at the minimum:

	-log(likelihood)
ogip	181.766598
total	181.766598

Values of statistical measures:

	statistical measures
AIC	367.629196
BIC	373.237257
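
The statistical measures in the table can be reproduced from the -log(likelihood) value. The sketch below uses the standard definitions of the small-sample-corrected AIC and of the BIC. The number of data points, n = 128, is an assumption (it is not stated in the text) chosen because it reproduces the printed numbers, and the function names are illustrative rather than 3ML API.

```python
import math


def aic_corrected(minus_log_like, n_free, n_data):
    """AIC including the small-sample correction term."""
    aic = 2.0 * n_free + 2.0 * minus_log_like
    correction = 2.0 * n_free * (n_free + 1) / (n_data - n_free - 1)
    return aic + correction


def bic(minus_log_like, n_free, n_data):
    """Bayesian information criterion."""
    return n_free * math.log(n_data) + 2.0 * minus_log_like


minus_log_like = 181.766598  # total -log(likelihood) from the fit
n_free = 2                   # free parameters: K and index
n_data = 128                 # ASSUMPTION: not given in the text

print(round(aic_corrected(minus_log_like, n_free, n_data), 6))
print(round(bic(minus_log_like, n_free, n_data), 6))
```

With these inputs the two values agree with the AIC and BIC printed above to the displayed precision, which suggests (but does not prove) that these are the definitions in use.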

The output of the fit() method of the JointLikelihood object consists of two pandas DataFrame objects, which can be queried, saved to disk, reloaded and so on. Refer to the pandas manual for details.

After the fit, the JointLikelihood instance has a .results attribute that contains the results of the fit.

[15]:

jl.results.display()

Best fit values:

	result	unit
parameter
source1.spectrum.main.Powerlaw.K	(9.0 -3.0 +5) x 10^-1	1 / (keV s cm2)
source1.spectrum.main.Powerlaw.index	-1.98 +/- 0.07

Correlation matrix:

1.00	-0.99
-0.99	1.00

Values of -log(likelihood) at the minimum:

	-log(likelihood)
ogip	181.766598
total	181.766598

Values of statistical measures:

	statistical measures
AIC	367.629196
BIC	373.237257

This object can be saved to disk in a FITS file:

[16]:

jl.results.write_to("my_results.fits", overwrite=True)

The produced FITS file contains the complete definition of the model and of the results, so it can be reloaded in a separate session as:

[17]:

results_reloaded = load_analysis_results("my_results.fits")

results_reloaded.display()

Best fit values:

	result	unit
parameter
source1.spectrum.main.Powerlaw.K	(9.0 -3.0 +5) x 10^-1	1 / (keV s cm2)
source1.spectrum.main.Powerlaw.index	-1.98 +/- 0.07

Correlation matrix:

1.00	-0.99
-0.99	1.00

Values of -log(likelihood) at the minimum:

	-log(likelihood)
ogip	181.766598
total	181.766598

Values of statistical measures:

	statistical measures
AIC	367.629196
BIC	373.237257

The flux of the source can be computed from the ‘results’ object (even in another session by reloading the FITS file), as:

[18]:

fluxes = jl.results.get_flux(100 * u.keV, 1 * u.MeV)

# The same results would be obtained from the reloaded object:
# fluxes = results_reloaded.get_flux(100 * u.keV, 1 * u.MeV)

We can change the energy range on the fly… even from the reloaded fit!

[19]:

fluxes = jl.results.get_flux(100 * u.eV, 1 * u.TeV)
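
For a power law, the photon flux between two energies has a closed form, which makes a handy sanity check for any flux calculation. The sketch below is not the 3ML implementation: it assumes the power-law shape K (E / piv)^index with the rounded best-fit values from the table above (K = 0.9, index = -1.98, piv = 1 keV), and compares the analytic integral with a simple numerical one. The function names are illustrative.

```python
import math


def powerlaw(energy, k=0.9, index=-1.98, piv=1.0):
    """Differential photon flux K * (E / piv)**index, in 1 / (keV s cm2)."""
    return k * (energy / piv) ** index


def photon_flux_analytic(e_min, e_max, k=0.9, index=-1.98, piv=1.0):
    """Closed-form integral of the power law (valid for index != -1)."""
    g = index + 1.0
    return k * piv / g * ((e_max / piv) ** g - (e_min / piv) ** g)


def photon_flux_numeric(e_min, e_max, n=2000):
    """Trapezoidal integration on a logarithmic energy grid."""
    log_lo, log_hi = math.log(e_min), math.log(e_max)
    grid = [math.exp(log_lo + (log_hi - log_lo) * i / n) for i in range(n + 1)]
    total = 0.0
    for lo, hi in zip(grid[:-1], grid[1:]):
        total += 0.5 * (powerlaw(lo) + powerlaw(hi)) * (hi - lo)
    return total


# 100 keV to 1 MeV, expressed in keV
analytic = photon_flux_analytic(100.0, 1000.0)
numeric = photon_flux_numeric(100.0, 1000.0)
print(analytic, numeric)  # photon flux in 1 / (cm2 s)
```

Note that this is a photon flux from point estimates only. The results object additionally propagates the parameter uncertainties, and the flux type and units it reports depend on how it is called.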

We can also plot the spectrum with its error region, as:

[20]:

fig = plot_spectra(
    jl.results, ene_min=0.1, ene_max=1e6, num_ene=500, flux_unit="erg / (cm2 s)"
)

../_images/notebooks_The_3ML_workflow_42_2.png

Bayesian analysis

In a very similar way, we can also perform a Bayesian analysis. As a first step, we need to define the priors for all parameters:

[21]:

# A prior can be set using the currently defined boundaries
new_model.source1.spectrum.main.Powerlaw.index.set_uninformative_prior(Uniform_prior)

# or a uniform prior can be defined directly (this replaces the one set above):
new_model.source1.spectrum.main.Powerlaw.index.prior = Uniform_prior(
    lower_bound=-3, upper_bound=0
)

# The same works for the Log_uniform prior
new_model.source1.spectrum.main.Powerlaw.K.prior = Log_uniform_prior(
    lower_bound=1e-3, upper_bound=100
)
# or (again replacing the prior set just above)
new_model.source1.spectrum.main.Powerlaw.K.set_uninformative_prior(Log_uniform_prior)

new_model.display(complete=True)

Model summary:

	N
Point sources	1
Extended sources	0
Particle sources	0

Free parameters (2):

	value	min_value	max_value	unit
source1.spectrum.main.Powerlaw.K	0.900259	0.01	100.0	keV-1 s-1 cm-2
source1.spectrum.main.Powerlaw.index	-1.976928	-10.0	10.0

Fixed parameters (4):

	value	min_value	max_value	unit
source1.position.ra	23.5	0.0	360.0	deg
source1.position.dec	-22.7	-90.0	90.0	deg
source1.spectrum.main.Powerlaw.piv	1.0	None	None	keV
cons_ogip	1.0	0.8	1.2

Properties (0):

(none)

Linked parameters (0):

(none)

Independent variables:

(none)

Linked functions (0):

(none)
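
To see why a log-uniform prior suits a normalization such as K, which can span several orders of magnitude, consider the normalized log-uniform density p(x) = 1 / (x ln(b / a)) on [a, b]. The sketch below is a plain-Python illustration of that textbook density, not the astromodels Log_uniform_prior class. It uses the bounds from the explicit Log_uniform_prior example above and checks that every decade inside the bounds receives the same prior probability.

```python
import math


def log_uniform_pdf(x, lower, upper):
    """Normalized log-uniform density on [lower, upper]; zero outside."""
    if x < lower or x > upper:
        return 0.0
    return 1.0 / (x * math.log(upper / lower))


def prior_mass(x_lo, x_hi, lower, upper):
    """Prior probability of [x_lo, x_hi], from the analytic CDF."""
    x_lo, x_hi = max(x_lo, lower), min(x_hi, upper)
    if x_hi <= x_lo:
        return 0.0
    return math.log(x_hi / x_lo) / math.log(upper / lower)


lower, upper = 1e-3, 100.0  # bounds of the explicit Log_uniform_prior example

# Five decades lie between the bounds, so each carries 1/5 of the prior mass
for lo in (1e-3, 1e-2, 1e-1, 1.0, 10.0):
    print(f"[{lo:g}, {10 * lo:g}]: {prior_mass(lo, 10 * lo, lower, upper):.3f}")
```

A uniform prior over the same range would instead place about 90% of its mass in the top decade, which is rarely what is intended for a scale parameter.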

Then, we can perform our Bayesian analysis like:

[22]:

bs = BayesianAnalysis(new_model, data)
bs.set_sampler("ultranest")
bs.sampler.setup()
# This uses the ultranest sampler
samples = bs.sample(quiet=True)

External parameter cons_ogip already exist in the model. Overwriting it...

[ultranest] Sampling 400 live points from prior ...

[ultranest] Explored until L=-2e+02
[ultranest] Likelihood function evaluations: 6415
[ultranest]   logZ = -188.7 +- 0.1008
[ultranest] Effective samples strategy satisfied (ESS = 1582.7, need >400)
[ultranest] Posterior uncertainty strategy is satisfied (KL: 0.46+-0.09 nat, need <0.50 nat)
[ultranest] Evidency uncertainty strategy is satisfied (dlogz=0.10, need <0.5)
[ultranest]   logZ error budget: single: 0.12 bs:0.10 tail:0.01 total:0.10 required:<0.50
[ultranest] done iterating.

The BayesianAnalysis object now has a “results” member, which works exactly as explained for the maximum likelihood analysis above:

[23]:

bs.results.display()

Maximum a posteriori probability (MAP) point:

	result	unit
parameter
source1.spectrum.main.Powerlaw.K	(7.7 -1.6 +6) x 10^-1	1 / (keV s cm2)
source1.spectrum.main.Powerlaw.index	-1.95 -0.10 +0.04

Values of -log(posterior) at the minimum:

	-log(posterior)
ogip	-181.580406
total	-181.580406

Values of statistical measures:

	statistical measures
AIC	367.256812
BIC	372.864872
DIC	365.258889
PDIC	-0.086437
log(Z)	-81.942186
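
The log(Z) entry in this table and the logZ reported in the ultranest log appear to differ only in the base of the logarithm: ultranest reports a natural logarithm, and multiplying the tabulated value by ln(10) recovers it. This is an inference from the printed values, not a statement taken from the 3ML documentation.

```python
import math

log10_z_table = -81.942186  # log(Z) from the results table
ln_z_ultranest = -188.7     # logZ from the ultranest log (rounded there)

# Convert from base 10 to natural logarithm: ln Z = log10 Z * ln 10
ln_z_from_table = log10_z_table * math.log(10.0)

print(round(ln_z_from_table, 1), ln_z_ultranest)
```

Keeping track of the base matters when comparing evidences between models, since a difference in log10(Z) corresponds to a difference about 2.3 times larger in ln(Z).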

[24]:

fluxes_bs = bs.results.get_flux(100 * u.keV, 1 * u.MeV)

[25]:

fig = plot_spectra(
    bs.results, ene_min=0.1, ene_max=1e6, num_ene=500, flux_unit="erg / (cm2 s)"
)

../_images/notebooks_The_3ML_workflow_50_2.png

We can also easily produce a “corner plot”:

[26]:

bs.results.corner_plot()

[26]:

../_images/notebooks_The_3ML_workflow_52_0.png

../_images/notebooks_The_3ML_workflow_52_1.png
