About Sherpa
Sherpa is the CIAO modeling and fitting application made available by the Chandra X-ray Center (CXC). It can be used for analysis of images, spectra and time series from many telescopes, including optical telescopes such as Hubble. Sherpa is flexible, modular and extensible. It has an IPython user interface and it is also an importable Python module. Sherpa models, optimization and statistic functions are available via both C++ and Python for software developers wishing to link such functions directly to their own compiled code.
Sherpa supports fitting of 1-D X-ray spectra from Chandra and other X-ray missions, as well as 1-D non-X-ray data, including ASCII data arrays, radial profiles, and lightcurves. The options for grating data analysis include fitting the spectrum with multiple response files required for overlapping orders in LETG observations. Modeling of 2-D spatial data is fully supported, including the PSF and exposure maps. User specified models can be added to Sherpa with advanced "user model" functionality.
A list of recent publications featuring the various uses of Sherpa may be found here.
Features
- Python environment
-
Sherpa is an importable module for the Python scripting language, providing users with the ability to write their own Python scripts for use in Sherpa. This easily allows for creating complex analysis and modeling functions, building batch-mode analysis, and extending the provided functionality to meet required needs.
- Multiple startup options
-
There are five command-line options which may be specified when Sherpa is started:
- -x : launch chips/sherpa shell in separate display terminal.
- -b : run in batch mode
- -n : do not print banner
- -rcfile : specify a specific .sherpa.rc file to use; refer to "ahelp sherparc" for details.
- -norcfile : do not load any .sherpa.rc file; overrides "-rcfile" if both are set.
The same options may be used with ChIPS.
-
The Sherpa and ChIPS applications use the $HOME/.ipython-ciao/ directory to store configuration information. The first time Sherpa or ChIPS is started you will see a message asking whether you want to update your ipythonrc-sherpa or ipythonrc-chips file.
unix% ciao unix% sherpa ATTENTION: Out of date IPython profile for Sherpa found in: /home/username/.ipython-ciao Local version (40201) vs latest (40301). Update to latest? [Y/N] :
Unless you have manually changed your $HOME/.ipython-ciao/ipythonrc-sherpa(or -chips) file, it is safe to answer "Y" here. The outdated file is renamed with a timestamp to preserve it.
- Visualization
-
Sherpa uses ChIPS and DS9 as default packages for 1- and 2-D visualization in CIAO, respectively, however the Sherpa plotting functions are also compatible with the Python "matplotlib" package which is not included in CIAO. Users can specify in the "plot_pkg" field of the .sherpa.rc preferences file which plotting package should be used in Sherpa when plot commands are issued; it is "chips" by default, but may be changed to "pylab".
- Parallelization
-
The following computationally intensive functions are parallelized to make use of multi-core systems (i.e., laptops or desktops with 2 or 4 cores), providing users with such systems significant improvements in efficiency compared to previous releases of Sherpa. Each function has a "numcores" option to specify the number of cores to use in parallelization; the default is to use all cores available.
- Access to data structures
-
In addition to the normal mode of loading data from files, Sherpa allows you to manipulate the data arrays internally at the Python prompt and then "re-load" them before fitting and modeling further. The resulting data may be manipulated without writing it to a file, making it easy to incorporate into scripts.
-
The analysis of data from modern instruments requires a flexible modeling and fitting system. The Sherpa infrastructure provides the user with an environment to develop complex and sophisticated analysis that goes beyond the default Sherpa functionality. Examples of user-contributed Python packages which extend Sherpa's functionality are Cosmocalc, Datastack, Deproject, and the standalone Sherpa application.
- User Interface
-
There are many functions available for accessing internal data, model data, fit and projection results, plotting objects, etc., in a Sherpa session. These provide the user with easy access and interaction with the system required by modern data analysis. Examples of such functions include:
show_data show_bkg show_model show_filter show_fit show_all get_arf get_rmf get_response get_bkg_scale get_backscal get_conf_results get_data_plot_prefs get_delchi_plot get_curve_xrange get_curve_yrange set_plot_title set_plot_xlabel set_plot_ylabel
The much longer list of get*, set*, show*, etc., functions may be accessed by typing, e.g., "*get*?" at the Sherpa prompt.
- Optimization methods
-
There are four optimization methods available in Sherpa: "levmar", a modification of the Levenberg-Marquardt algorithm which uses the LMDIF algorithm; "neldermead", A Nelder-Mead Simplex direct search; "moncar", a Monte Carlo method - differential evolution - based on the paper by Hedar & Fukushima; and "gridsearch"
, a grid searching method available for fitting data with Sherpa template models. These algorithms are robust and show very good convergence required by modern data analysis.
The Primini and Sigma-rejection iterative fitting methods in Sherpa allow the chosen optimization method to be called multiple times during a fit, instead of just once, until the fit can no longer be improved. The Primini method re-calculates statistical errors using the best-fit model parameters from the previous fit, while the Sigma-rejection method excludes poorly fit data points at each successive fit.
- Statistics
-
There are several choices of fit statistic in Sherpa, including cash (maximum likelihood) and many options of weighted χ2 statistics. The full list includes:
cash cstat χ2 statistics: chi2/leastsq chi2constvar (CHI PARENT | CVAR) chi2datavar (CHI DVAR) chi2gehrels (CHI GEHRELS) chi2modvar (CHI MVAR) chi2xspecvar
Descriptions of all available fit statistics are provided on the Sherpa Statistics page.
-
The Sherpa interface also provides an easy way to define new statistics required for analysis by the user. The load_user_stat function accommodates user-defined functions for a statistic and statistical errors, in addition to defining a list of model parameters and hyperparameters for prior distributions (if prior desired).
- Parameter Uncertainty and Confidence
-
Confidence and covariance calculate the confidence range for the best-fit model parameters; interval-projection and region-projection provide a way to visualize the model constraints on the best-fit parameter values. The results are easily accessible with the user functions get_conf_results and get_covar_results.
The
PPP statistic function plot_pvalue (with associated get_pvalue_results and get_pvalue_plot functions) performs a likelihood ratio test to compare a fit to data done with a simple, null model versus a more complex, alternative model. This function helps users determine if the fitted model is statistically significant compared to the null model.
- Models
-
Sherpa allows the user to define customized models and provides several flexible options for defining complex model expressions for fitting data.
-
Multi-component, multi-response expressions
Complex model expressions which apply response matrices, or PSFs, to some model components, while not applying the response or the PSF to the rest of the model expression, may be defined with the set_full_model and set_bkg_full_model functions. This allows for including both convolved and unconvolved model components in a single model expression for fitting source and/or background data.
There are plotting functions available for visualizing the contribution of individual model components to a fit of a multi-component model to data. These functions can accept one or multiple Sherpa model components. Similarly, users can view 2D model components in DS9.
- plot_model_component()
- plot_source_component()
- get_model_component_plot()
- get_source_component_plot()
- image_model_component()
- image_source_component()
- get_model_component_image()
- get_source_component_image()
- get_model_component()
It is possible to simultaneously fit multiple data sets with each independent source model convolved by a different PSF model.
-
User models
The add_model function may be used to register a user-defined Sherpa model class as a Sherpa model type, to create instances of models that are recognized automatically in Sherpa model expressions.
The Sherpa table model reads data from a file or crate and assigns it to an instance of the table model class that can be combined arithmetically with the source model. It supports additive and multiplicative XSpec-style table models, as well as linear interpolation of data points on the data set grid from the model grid supplied from file.
The Sherpa template model
can read in a collection of templates from a directory full of template files, and can compare a data set to all the templates in that collection. Sherpa finds the template that best matches the data, and reports back the parameter values associated with that template.
Both the table and template models support linear, nearest-neighbor, and polynomial interpolation.
-
Optical models
The Sherpa model library in CIAO 4.4 has been extended to include 24 optical models that are designed for use with spectra or SEDs at optical wavelengths. These include continuum models, absorption and emission features, and extinction curves for the ISM at optical and UV wavelengths. The position, line width, and other similar model parameters have units of Angstroms where relevant.
- Fitting
-
The fit_bkg function is available for fitting only background data sets when fitting with PHA spectral data. The function's list of arguments is the same as the fit function.
Iterative fitting can be done with the Primini and Sigma-rejection methods using the following functions:
calc_stat_info and get_stat_info give the full statistical information based on the current models.
- Filtering and Grouping Data
-
The following grouping functions are avaible for grouping PHA channel data to match functionality provided in Sherpa 3.4. These functions use the CIAO group module to group data in various ways:
- group_bins - Group into a specified number of bins
- group_width - Group so there are N channels in each group
- group_counts - Group by a number of counts
- group_snr - Group by a signal-to-noise value
- group_adapt - Adaptively bin based on the number of counts
- group_adapt_snr - Adaptively bin based on a signal-to-noise value
- Saving the Sherpa Session
-
save_all command saves all the settings which define the current Sherpa session to an ASCII file. The resulting file can be read into a future Sherpa session to recreate the data, models, filter, grouping and quality flags, statistic and method used at that time.
-
The save command creates a binary file that can be restored in Sherpa with the restore.
- Advanced options
-
Bayesian analysis with pyBLoCXS
pyBLoCXS, a sophisticated Markov chain Monte Carlo based algorithm designed to carry out Bayesian Low-Count X-ray Spectral (BLoCXS) analysis in the Sherpa environment, is available via the functions listed below. (For more information on BLoCXS, see van Dyk, D.A., Connors, A., Kashyap, V.L., & Siemiginowska, A. 2001, Ap.J., 548, 224, "Analysis of Energy Spectra with Low Photon Counts via Bayesian Posterior Simulation")
- get_draws()
- list_priors()
- set_prior()
- get_prior()
- list_samplers()
- set_sampler()
- set_sampler_opt()
- get_sampler()
- get_sampler_opt()
- get_sampler_name()
The code is a Python extension to Sherpa that explores parameter space at a suspected minimum using a predefined Sherpa model to high-energy X-ray spectral data. The get_draws function runs a pyBLoCXS chain using fit information associated with the specified data set(s), and the currently set sampler and parameter priors, for a specified number of iterations. It returns an array of statistic values, an array of acceptance Booleans, and a 2-D array of associated parameter values.