J/A+A/615/A56 Code for iterative clustering method - iSRS (Pacciani, 2018)
Identification of activity peaks in time-tagged data with a scan-statistics
driven clustering method and its application to gamma-ray data samples.
Pacciani L.
<Astron. Astrophys. 615, A56 (2018)>
=2018A&A...615A..56P 2018A&A...615A..56P (SIMBAD/NED BibCode)
ADC_Keywords: Models ; Gamma rays
Keywords: methods: statistical - methods: data-analysis -
techniques: photometric - gamma-ray: general -
Abstract:
The investigation of activity periods in time-tagged data samples is a
topic of large interest. Among Astrophysical samples, gamma-ray
sources are widely studied, due to the huge quasi-continuum data set
available today from the FERMI-LAT and AGILE-GRID gamma-ray
telescopes.
I developed a general temporal-unbinned method to identify flaring
periods in time-tagged data and discriminate statistically-significant
flares: I propose an event clustering method in one-dimension to
identify flaring episodes, and Scan-statistics to evaluate the flare
significance within the whole data sample.
This is a photometric algorithm. The comparison of the photometric
results (e.g., photometric flux, gamma-ray spatial distribution) for
the identified peaks with the standard likelihood analysis for the
same period is mandatory to establish if source-confusion is spoiling
results. The result of the proposed method is similar to a photometric
light-curve, but peaks are resolved, they are statistically
significant within the whole period of investigation, and peak
detection capability does not suffer time-binning related issue.
The method can be applied to reveal flares in any time-tagged data
sample. I will show results for gamma-ray sources of known celestial
position, e.g., from a catalog. Furthermore it can be used when it is
necessary to assess the statistical significance within the whole
period of investigation of a flare from an unknown gamma-ray source.
Description:
The C code implementing the iSRS clustering (isrs.c) and the removal
of random clusters (rmrndm.c) from a generic data sample is reported
here. The "isrs" executable builds the set
of candidate clusters Ci. (no removal of random clusters is
applied).
The "rmrndm" executable performs the
removal of random clusters. It produced the list of survived clusters
(the unbinned light curve) and the list of flares which includes peak
flux, peak time and flare FWHM.
A first release (version 1) of scan statistics tables (stab.txt)
filled using 32bit Marsaglia-Zaman RANMAR random generator are
reported. The frequency of false positive samples (fcoinc) discussed
in the paper (section 5) is reported too (ctab.txt).
The iSRS clustering is a method of event clustering in 1D iterated to
obtain all the conceivable clusters from the original sample. It
depends on 1 parameter: the Ntol parameter (its meaning is
explained in section 3 of the paper.
In appendix A of the paper it is explained how to correctly choose it.
Clustering method: The clustering method (SRS) has two parameters
($Ntol and Δthr. $Ntol parameter is kept constant.
Δthr naively corresponds to the maximum allowed distance
among the elements belonging to a certain cluster.
The clustering procedure is iterated (iSRS) scanning on the
Δthr parameter. It is finely decreased starting from the
largest spacing among contiguous events of the data sample under
investigation. The Δthr decreasing procedure stops when only
clusters of size 2 (of two events) remain. At the end of the scanning,
the Δthr space is fully explored. We can obtain the same
cluster from a sub-set of Δthr. Duplicate clusters are
removed.
Instructions:
A "Makefile" builds the executables
"isrs" and "<&getCatFile
/rmrndm|rmrndm>" for linux based systems with gcc installed.
To prepare the executables, put all the files provided here in the same
directory, and at prompt type:
make clean
make
The algorithm has two steps.
1) production of candidate clusters, command syntax:
./isrs <N_tol>