Statistical inference for multiple change‐point models

Wang, Wu; He, Xuming; Zhu, Zhongyi

Statistical inference for multiple change‐point models

dc.contributor.author	Wang, Wu
dc.contributor.author	He, Xuming
dc.contributor.author	Zhu, Zhongyi
dc.date.accessioned	2020-12-02T14:36:32Z
dc.date.available	WITHHELD_13_MONTHS
dc.date.available	2020-12-02T14:36:32Z
dc.date.issued	2020-12
dc.identifier.citation	Wang, Wu ; He, Xuming; Zhu, Zhongyi (2020). "Statistical inference for multiple change‐point models." Scandinavian Journal of Statistics 47(4): 1149-1170.
dc.identifier.issn	0303-6898
dc.identifier.issn	1467-9469
dc.identifier.uri	https://hdl.handle.net/2027.42/163546
dc.description.abstract	In this article, we propose a new technique for constructing confidence intervals for the mean of a noisy sequence with multiple change‐points. We use the weighted bootstrap to generalize the bootstrap aggregating or bagging estimator. A standard deviation formula for the bagging estimator is introduced, based on which smoothed confidence intervals are constructed. To further improve the performance of the smoothed interval for weak signals, we suggest a strategy of adaptively choosing between the percentile intervals and the smoothed intervals. A new intensity plot is proposed to visualize the pattern of the change‐points. We also propose a new change‐point estimator based on the intensity plot, which has superior performance in comparison with the state‐of‐the‐art segmentation methods. The finite sample performance of the confidence intervals and the change‐point estimator are evaluated through Monte Carlo studies and illustrated with a real data example.
dc.publisher	Cambridge University Press
dc.publisher	Wiley Periodicals, Inc.
dc.subject.other	copy number variation
dc.subject.other	multiple change‐points
dc.subject.other	bootstrap
dc.subject.other	binary segmentation
dc.subject.other	bagging estimator
dc.title	Statistical inference for multiple change‐point models
dc.type	Article
dc.rights.robots	IndexNoFollow
dc.subject.hlbsecondlevel	Statistics (Mathematical)
dc.subject.hlbtoplevel	Science
dc.description.peerreviewed	Peer Reviewed
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/163546/3/sjos12456.pdf	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/163546/2/sjos12456_am.pdf	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/163546/1/SJOS_12456_supplement.pdf	en_US
dc.identifier.doi	10.1111/sjos.12456
dc.identifier.source	Scandinavian Journal of Statistics
dc.identifier.citedreference	Perron, P. ( 2006 ). Dealing with structural breaks. In K. Patterson & T. Mills (Eds.), Palgrave handbook of econometrics (Vol. 1, pp. 278 – 352 ). New York, NY: Palgrave‐Macmillan.
dc.identifier.citedreference	Killick, R., Fearnhead, P., & Eckley, I. ( 2012 ). Optimal detection of changepoints with a linear computational cost. Journal of the American Statistical Association, 107 ( 500 ), 1590 – 1598.
dc.identifier.citedreference	Killick, R., Haynes, K. & Eckley, I. A. ( 2016 ). Changepoint: An R package for changepoint analysis. R package version 2.2.2. https://CRAN.R‐project.org/package=changepoint.
dc.identifier.citedreference	Kim, C., Suh, M.‐S., & Hong, K.‐O. ( 2009 ). Bayesian changepoint analysis of the annual maximum of daily and subdaily precipitation over South Korea. Journal of Climate, 22 ( 24 ), 6741 – 6757.
dc.identifier.citedreference	Kirch, C. ( 2007 ). Block permutation principles for the change analysis of dependent data. Journal of Statistical Planning and Inference, 137 ( 7 ), 2453 – 2474.
dc.identifier.citedreference	Kunsch, H. R. ( 1989 ). The jackknife and the bootstrap for general stationary observations. The Annals of Statistics, 17 ( 3 ), 1217 – 1241.
dc.identifier.citedreference	Muggeo, V. M. ( 2012 ). Cumseg: Change point detection in genomic sequences. R package version 1.1. https://CRAN.R‐project.org/package=cumSeg.
dc.identifier.citedreference	Muggeo, V. M., & Adelfio, G. ( 2011 ). Efficient change point detection for genomic sequences of continuous measurements. Bioinformatics, 27 ( 2 ), 161 – 166.
dc.identifier.citedreference	Niu, Y. S., & Zhang, H. ( 2012 ). The screening and ranking algorithm to detect DNA copy number variations. The Annals of Applied Statistics, 6 ( 3 ), 1306 – 1326.
dc.identifier.citedreference	Olshen, A. B., Venkatraman, E., Lucito, R., & Wigler, M. ( 2004 ). Circular binary segmentation for the analysis of array‐based DNA copy number data. Biostatistics, 5 ( 4 ), 557 – 572.
dc.identifier.citedreference	Pein, F., Hotz, T., Sieling, H. & Aspelmeier, T. ( 2017 ). Stepr: Multiscale change‐point inference. R package version 2.0‐1. https://CRAN.R‐project.org/package=stepR.
dc.identifier.citedreference	Pein, F., Sieling, H., & Munk, A. ( 2017 ). Heterogeneous change point inference. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 79 ( 4 ), 1207 – 1227.
dc.identifier.citedreference	R Core Team ( 2019 ). R: A language and environment for statistical computing. R foundation for statistical computing. Vienna, Austria. https://www.R‐project.org/.
dc.identifier.citedreference	Rozenholc, Y., & Nuel, G. ( 2013 ). Fast estimation of posterior probabilities in change‐point analysis through a constrained hidden Markov model. Computational Statistics & Data Analysis, 68, 129 – 140.
dc.identifier.citedreference	Shiryaev, A. N. ( 1963 ). On optimum methods in quickest detection problems. Theory of Probability & Its Applications, 8 ( 1 ), 22 – 46.
dc.identifier.citedreference	Siegmund, D. ( 1988 ). Confidence sets in change‐point problems. International Statistical Review/Revue Internationale de Statistique, 56 ( 1 ), 31 – 48.
dc.identifier.citedreference	Snijders, A. M., Nowak, N., Segraves, R., Blackwood, S., Brown, N., Conroy, J., … Kimura, K. ( 2001 ). Assembly of microarrays for genome‐wide measurement of DNA copy number. Nature Genetics, 29 ( 3 ), 263 – 264.
dc.identifier.citedreference	Spokoiny, V., & Zhilova, M. ( 2015 ). Bootstrap confidence sets under model misspecification. The Annals of Statistics, 43 ( 6 ), 2653 – 2675.
dc.identifier.citedreference	Vander Vaart, A. W. ( 2000 ). Asymptotic Statistics. Cambridge, MA: Cambridge University Press.
dc.identifier.citedreference	vander Vaart, A. W., & Weller, J. A. ( 1996 ). Weak convergence and empirical processes: With applications to statistics. New York, NY: Springer‐Verlag.
dc.identifier.citedreference	Vostrikova, L. J. ( 1981 ). Detecting ’disorder’ in multidimensional random process. Soviet Mathematics Doklady, 24, 55 – 59.
dc.identifier.citedreference	Wager, S., Hastie, T., & Efron, B. ( 2014 ). Confidence intervals for random forests: The jackknife and the infinitesimal jackknife. The Journal of Machine Learning Research, 15 ( 1 ), 1625 – 1651.
dc.identifier.citedreference	Yao, Y.‐C. ( 1988 ). Estimating the number of change‐points via Schwarz’criterion. Statistics & Probability Letters, 6 ( 3 ), 181 – 189.
dc.identifier.citedreference	Arnold, T. B. & Tibshirani, R. J. ( 2014 ). Genlasso: Path algorithm for generalized lasso problems. R package version 1.3. https://CRAN.R‐project.org/package=genlasso.
dc.identifier.citedreference	Bai, J. ( 1997 ). Estimating multiple breaks one at a time. Econometric Theory, 13 ( 3 ), 315 – 352.
dc.identifier.citedreference	Baranowski, R. & Fryzlewicz, P. ( 2015 ). Wbs: wild binary segmentation for multiple change‐point detection. R package version 1.3. https://CRAN.R‐project.org/package=wbs
dc.identifier.citedreference	Carlstein, E. ( 1986 ). The use of subseries values for estimating the variance of a general statistic from a stationary sequence. The Annals of Statistics, 14 ( 3 ), 1171 – 1179.
dc.identifier.citedreference	Chatterjee, S., & Bose, A. ( 2005 ). Generalized bootstrap for estimating equations. The Annals of Statistics, 33 ( 1 ), 414 – 436.
dc.identifier.citedreference	Chernoff, H., & Zacks, S. ( 1964 ). Estimating the current mean of a normal distribution which is subjected to changes in time. The Annals of Mathematical Statistics, 35 ( 3 ), 999 – 1018.
dc.identifier.citedreference	Chernozhukov, V., Chetverikov, D., & Kato, K. ( 2013 ). Gaussian approximations and multiplier bootstrap for maxima of sums of high‐dimensional random vectors. The Annals of Statistics, 41 ( 6 ), 2786 – 2819.
dc.identifier.citedreference	Chib, S. ( 1998 ). Estimation and comparison of multiple change‐point models, J. Econometrics, 86 ( 2 ), 221 – 241.
dc.identifier.citedreference	Cleynen, A., Rigaill, G. & Koskas, M. ( 2016 ). Segmentor3IsBack: a fast segmentation algorithm. R package version. 2. https://CRAN.R‐project.org/package=Segmentor3IsBack.
dc.identifier.citedreference	Davison, A. C., & Hinkley, D. V. ( 1997 ). Bootstrap Methods and Their Application. Cambridge, MA: Cambridge University Press.
dc.identifier.citedreference	Du, C., Kao, C.‐L. M., & Kou, S. ( 2016 ). Stepwise signal extraction via marginal likelihood. Journal of the American Statistical Association, 111 ( 513 ), 314 – 330.
dc.identifier.citedreference	Efron, B. ( 1982 ). The jackknife, the bootstrap, and other resampling plans. Philadelphia, PA: Siam.
dc.identifier.citedreference	Efron, B. ( 1992 ). Jackknife‐after‐bootstrap standard errors and influence functions. Journal of the Royal Statistical Society: Series B (Methodological), 54 ( 1 ), 83 – 111.
dc.identifier.citedreference	Efron, B. ( 2014 ). Estimation and accuracy after model selection. Journal of the American Statistical Association, 109 ( 507 ), 991 – 1007.
dc.identifier.citedreference	Freedman, D. ( 1984 ). On bootstrapping two‐stage least‐squares estimates in stationary linear models. The Annals of Statistics, 12 ( 3 ), 827 – 842.
dc.identifier.citedreference	Frick, K., Axel, M., & Hannes, S. ( 2014 ). Multiscale change point inference. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 76 ( 3 ), 495 – 580.
dc.identifier.citedreference	Fryzlewicz, P. ( 2014 ). Wild binary segmentation for multiple change‐point detection. The Annals of Statistics, 42 ( 6 ), 2243 – 2281.
dc.identifier.citedreference	Giordano, R., Stephenson, W., Liu, R., Jordan, M. & Broderick, T. ( 2019 ). A swiss army infinitesimal jackknife, Paper presented at the 22nd International Conference on Artificial Intelligence and Statistics 89, 1139–1147.
dc.identifier.citedreference	Harchaoui, Z., & Lévy‐Leduc, C. ( 2010 ). Multiple change‐point estimation with a total variation penalty. Journal of the American Statistical Association, 105 ( 492 ), 1480 – 1493.
dc.identifier.citedreference	Hastings, P., Lupski, J. R., Rosenberg, S. M., & Ira, G. ( 2009 ). Mechanisms of change in gene copy number. Nature Reviews Genetics, 10 ( 8 ), 551 – 564.
dc.identifier.citedreference	Hlávka, Z., Hušková, M., Kirch, C., & Meintanis, S. G. ( 2016 ). Bootstrap procedures for online monitoring of changes in autoregressive models. Communications in Statistics‐Simulation and Computation, 45 ( 7 ), 2471 – 2490.
dc.identifier.citedreference	Horowitz, J. L. ( 2019 ). Bootstrap methods in econometrics. Annual Review of Economics, 11, 193 – 224.
dc.identifier.citedreference	Hušková, M., & Kirch, C. ( 2010 ). A note on studentized confidence intervals for the change‐point. Economic Record, 25 ( 2 ), 269 – 289.
dc.identifier.citedreference	Jaeckel, L. ( 1972 ). The infinitesimal jackknife Memorandum MM72‐1215‐11. Murray Hill: Bell Laboratories.
dc.owningcollname	Interdisciplinary and Peer-Reviewed

Files in this item

Name:: SJOS_12456_supplement.pdf
Size:: 299.7KB
Format:: PDF

View/Open

Name:: sjos12456_am.pdf
Size:: 2.346MB
Format:: PDF

View/Open

Name:: sjos12456.pdf
Size:: 836.4KB
Format:: PDF

View/Open

Interdisciplinary and Peer-Reviewed

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.